jky594176 nielsr HF Staff committed on
Commit 9acd1d0 · 1 Parent(s): eff992d

Add model card for GDSD (#1)

- Add model card for GDSD (c3d984ec86a63206bf2061755f07adefe3bdd350)


Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>

Files changed (1)
  1. README.md +29 -0
README.md ADDED
@@ -0,0 +1,29 @@
+ ---
+ pipeline_tag: text-generation
+ library_name: transformers
+ ---
+
+ # GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models
+
+ This repository contains the model weights for GDSD, as presented in the paper [GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models](https://huggingface.co/papers/2605.29398).
+
+ Guided Denoiser Self-Distillation (GDSD) is a reinforcement learning framework for diffusion language models (dLLMs). It improves the denoiser of dLLMs by distilling from an advantage-guided self-teacher, bypassing the biases associated with evidence lower bound (ELBO) surrogates used in prior methods. GDSD provides a more stable and effective RL procedure, achieving significant performance gains on planning, math, and coding benchmarks.
+
+ ## Resources
+
+ - **Paper:** [GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models](https://arxiv.org/abs/2605.29398)
+ - **GitHub Repository:** [https://github.com/GaryBall/GDSD](https://github.com/GaryBall/GDSD)
+
+ ## Citation
+
+ ```bibtex
+ @misc{tang2026gdsdreinforcementlearningguided,
+ title={GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models},
+ author={Xiaohang Tang and Keyue Jiang and Che Liu and Qifang Zhao and Xiaoxiao Xu and Sangwoong Yoon and Ilija Bogunovic},
+ year={2026},
+ eprint={2605.29398},
+ archivePrefix={arXiv},
+ primaryClass={cs.LG},
+ url={https://arxiv.org/abs/2605.29398},
+ }
+ ```