junyeong-nero commited on
Commit
cd4bbe2
·
verified ·
1 Parent(s): ff68e83

Upload diffusion checkpoint

Browse files
Files changed (2) hide show
  1. README.md +5 -7
  2. diffusion.pt +2 -2
README.md CHANGED
@@ -2,7 +2,7 @@
2
  license: mit
3
  tags:
4
  - tiny-stable-diffusion
5
- - vae
6
  - image-generation
7
  - diffusion
8
  library_name: pytorch
@@ -10,17 +10,15 @@ library_name: pytorch
10
 
11
  # tiny-sd-models
12
 
13
- This is a **VAE** model trained with [tiny-stable-diffusion](https://github.com/your-username/tiny-stable-diffusion).
14
 
15
  ## Model Description
16
 
17
- This is a Variational Autoencoder (VAE) trained to compress images into a latent space.
18
- The VAE follows the SD3 architecture with 16 latent channels and f8 compression ratio.
19
 
20
  ### Architecture
21
- - **Type**: AutoencoderKL
22
- - **Latent Channels**: 16
23
- - **Compression**: f8 (64x64 → 8x8)
24
 
25
  ## Usage
26
 
 
2
  license: mit
3
  tags:
4
  - tiny-stable-diffusion
5
+ - diffusion
6
  - image-generation
7
  - diffusion
8
  library_name: pytorch
 
10
 
11
  # tiny-sd-models
12
 
13
+ This is a **DIFFUSION** model trained with [tiny-stable-diffusion](https://github.com/your-username/tiny-stable-diffusion).
14
 
15
  ## Model Description
16
 
17
+ This is a Diffusion Transformer (DiT/MMDiT) trained for text-to-image generation in latent space.
 
18
 
19
  ### Architecture
20
+ - **Type**: DiT or MMDiT (Multi-Modal Diffusion Transformer)
21
+ - **Conditioning**: CLIP text embeddings
 
22
 
23
  ## Usage
24
 
diffusion.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0911a70d13fdeb0b20507332f14796b46b897262caaa7f5d5ce7aca7711ff33d
3
- size 3498240588
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2e08a4bbd081c1377a26404009ab45d7af816aa81b0cb130db5c818ae62b5ebc
3
+ size 1758572077