diffutron
/

DiffutronLM-0.3B-1st-Stage

Text Generation

Model card Files Files and versions

suayptalha commited on 13 days ago

Commit

386ea89

·

verified ·

1 Parent(s): 946cd63

Update README.md

Files changed (1) hide show

README.md +25 -8

README.md CHANGED Viewed

@@ -61,14 +61,31 @@ Despite being an intermediate checkpoint, the 1st-Stage model demonstrates highl
 ## 💻 Usage
-Inference requires generating text via a discrete diffusion process rather than causal next-token prediction. We recommend using the `dllm` library.
-**Recommended Generation Parameters:**
-* **Steps:** 64 to 128
-* **Temperature:** 0.1
-* **Block Length:** 32
-* **Repetition Penalty:** 1.2
-* **Remask Strategy:** `low_conf`
 ## ⚠️ Limitations

 ## 💻 Usage
+Because Diffutron is a Masked Diffusion Language Model, it requires inference strategies distinct from standard causal generation. We recommend using the `dllm` library or custom generation loops tailored for discrete diffusion.
+### 1. Install the dllm Library:
+```bash
+git clone https://github.com/Diffutron/dllm.git
+cd dllm
+pip install -e .
+```
+### 2. Chat via Interaction Mode:
+```bash
+python -u examples/bert/chat.py \
+    --model_name_or_path "diffutron/DiffutronLM-0.3B-1st-Stage" \
+    --chat True \
+    --steps 64 \
+    --max_new_tokens 64 \
+    --temperature 0.1 \
+    --block_length 32 \
+    --repetition_penalty 1.2 \
+    --remasking "low_confidence" \
+    --stochastic_transfer False \
+    --cfg_scale 0.0
+```
+For other inference modes, see [dllm](https://github.com/Diffutron/dllm) library.
 ## ⚠️ Limitations