|
|
--- |
|
|
license: apache-2.0 |
|
|
library_name: diffusers |
|
|
pipeline_tag: image-to-image |
|
|
base_model: shallowdream204/BitDance-Tokenizer |
|
|
language: |
|
|
- en |
|
|
tags: |
|
|
- bitdance |
|
|
- tokenizer |
|
|
- autoencoder |
|
|
- custom-architecture |
|
|
- diffusers |
|
|
--- |
|
|
|
|
|
# BitDance-Tokenizer (Diffusers) |
|
|
|
|
|
Diffusers-formatted BitDance tokenizer autoencoders (AE) converted from the upstream BitDance tokenizer checkpoints. |
|
|
|
|
|
## Available Autoencoders |
|
|
|
|
|
- `ae_d16c32` (`z_channels=32`, `gan_decoder=false`) |
|
|
- `ae_d32c128` (`z_channels=128`, `gan_decoder=true`) |
|
|
- `ae_d32c256` (`z_channels=256`, `gan_decoder=true`) |
|
|
|
|
|
Each subfolder includes: |
|
|
|
|
|
- `config.json` with the autoencoder architecture |
|
|
- `conversion_metadata.json` documenting the source checkpoint and config |
|
|
|
|
|
## Test (load tokenizer only) |
|
|
|
|
|
This repo is self-contained: it includes `bitdance_diffusers` (copied from BitDance-14B-64x-diffusers) for the `BitDanceAutoencoder` class. Run the test to verify loading and encode/decode: |
|
|
|
|
|
The test loads all three autoencoders and runs a quick encode/decode check with `ae_d16c32` (no full image generation). |
|
|
|
|
|
## Loading tokenizer autoencoders |
|
|
|
|
|
```python |
|
|
import sys |
|
|
from pathlib import Path |
|
|
|
|
|
# Self-contained: add local path so bitdance_diffusers is found |
|
|
BASE_DIR = Path(__file__).resolve().parent |
|
|
sys.path.insert(0, str(BASE_DIR)) |
|
|
|
|
|
from bitdance_diffusers import BitDanceAutoencoder |
|
|
|
|
|
# Load any tokenizer autoencoder (use repo path or local path) |
|
|
ae = BitDanceAutoencoder.from_pretrained( |
|
|
"BiliSakura/BitDance-Tokenizer-diffusers", # or str(BASE_DIR) for local |
|
|
subfolder="ae_d16c32", |
|
|
) |
|
|
# ae_d16c32: z_channels=32, patch_size=16 |
|
|
# ae_d32c128: z_channels=128, patch_size=32 |
|
|
# ae_d32c256: z_channels=256, patch_size=32 |
|
|
``` |
|
|
|
|
|
## Using with a BitDance pipeline (full inference) |
|
|
|
|
|
To swap a tokenizer into a BitDance diffusers pipeline for image generation: |
|
|
|
|
|
```python |
|
|
import torch |
|
|
from diffusers import DiffusionPipeline |
|
|
|
|
|
# Load a BitDance diffusers pipeline first (provides BitDanceAutoencoder class). |
|
|
pipe = DiffusionPipeline.from_pretrained( |
|
|
"BiliSakura/BitDance-14B-16x-diffusers", |
|
|
trust_remote_code=True, |
|
|
torch_dtype=torch.bfloat16, |
|
|
).to("cuda") |
|
|
|
|
|
# Swap in a tokenizer autoencoder from this repository. |
|
|
pipe.autoencoder = pipe.autoencoder.__class__.from_pretrained( |
|
|
"BiliSakura/BitDance-Tokenizer-diffusers", |
|
|
subfolder="ae_d16c32", |
|
|
).to("cuda") |
|
|
|
|
|
image = pipe( |
|
|
prompt="A watercolor painting of a red fox in a snowy forest.", |
|
|
height=1024, |
|
|
width=1024, |
|
|
).images[0] |
|
|
image.save("bitdance_with_custom_tokenizer.png") |
|
|
``` |
|
|
|
|
|
> Note: this repository stores tokenizer autoencoder components; use `trust_remote_code=True` with a BitDance runtime repo when loading custom classes. |
|
|
|
|
|
## Citation |
|
|
|
|
|
If you use this model, please cite BitDance and Diffusers: |
|
|
|
|
|
```bibtex |
|
|
@article{ai2026bitdance, |
|
|
title = {BitDance: Scaling Autoregressive Generative Models with Binary Tokens}, |
|
|
author = {Ai, Yuang and Han, Jiaming and Zhuang, Shaobin and Hu, Xuefeng and Yang, Ziyan and Yang, Zhenheng and Huang, Huaibo and Yue, Xiangyu and Chen, Hao}, |
|
|
journal = {arXiv preprint arXiv:2602.14041}, |
|
|
year = {2026} |
|
|
} |
|
|
|
|
|
@inproceedings{von-platen-etal-2022-diffusers, |
|
|
title = {Diffusers: State-of-the-art diffusion models}, |
|
|
author = {Patrick von Platen and Suraj Patil and Anton Lozhkov and Damar Jablonski and Hernan Bischof and Thomas Wolf}, |
|
|
booktitle = {GitHub repository}, |
|
|
year = {2022}, |
|
|
url = {https://github.com/huggingface/diffusers} |
|
|
} |
|
|
``` |
|
|
|
|
|
## License |
|
|
|
|
|
This repository is distributed under the Apache-2.0 license, consistent with the upstream BitDance release. |
|
|
|