Add metadata and improve model card
Browse filesHi, I'm Niels, part of the community science team at Hugging Face!
This PR aims to improve the discoverability and documentation of this model. Based on the associated paper and repository files, I've:
- Added the `text-classification` pipeline tag.
- Added `library_name: transformers`.
- Enhanced the model card with a description of the TESSY framework, its role as a boundary predictor, and links to the official paper and GitHub repository.
Please let me know if you have any questions!
README.md
CHANGED
|
@@ -1,4 +1,34 @@
|
|
| 1 |
-
|
| 2 |
-
|
|
|
|
|
|
|
| 3 |
|
| 4 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
pipeline_tag: text-classification
|
| 3 |
+
library_name: transformers
|
| 4 |
+
---
|
| 5 |
|
| 6 |
+
# Boundary Predictor - TESSY
|
| 7 |
+
|
| 8 |
+
This model is a boundary predictor developed as part of the work:
|
| 9 |
+
[*How to Fine-Tune a Reasoning Model? A Teacher–Student Cooperation Framework to Synthesize Student-Consistent SFT Data*](https://huggingface.co/papers/2604.14164).
|
| 10 |
+
|
| 11 |
+
## Model Description
|
| 12 |
+
|
| 13 |
+
This model serves as a critical component of **TESSY** (Teacher–Student Cooperation Data Synthesis), a framework designed to synthesize "on-policy" SFT data for reasoning models like Qwen3.
|
| 14 |
+
|
| 15 |
+
The primary role of this predictor is to identify **reasoning boundaries**—the specific points in a sequence where the model transitions between reasoning steps (capability tokens) and non-reasoning stylistic content (style tokens). By predicting these boundaries, TESSY allows a teacher model (to provide reasoning) and a student model (to provide stylistic consistency) to alternate during the data generation process.
|
| 16 |
+
|
| 17 |
+
## Resources
|
| 18 |
+
|
| 19 |
+
- **Paper:** [How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data](https://huggingface.co/papers/2604.14164)
|
| 20 |
+
- **GitHub Repository:** [CoopReason/TESSY](https://github.com/CoopReason/TESSY/)
|
| 21 |
+
- **Associated Dataset:** [TESSY-Code-80K](https://huggingface.co/datasets/CoopReason/TESSY-Code-80K)
|
| 22 |
+
|
| 23 |
+
## Citation
|
| 24 |
+
|
| 25 |
+
If you find this work or model useful, please cite:
|
| 26 |
+
|
| 27 |
+
```bibtex
|
| 28 |
+
@article{TESSY,
|
| 29 |
+
title={How to Fine-Tune a Reasoning Model? A Teacher--Student Cooperation Framework to Synthesize Student-Consistent SFT Data},
|
| 30 |
+
author={Huang, Zixian and Yang, Kaichen and Huang, Xu and Hao, Feiyang and Qiming, Ge and Li, Bowen and Du, He and Chen, Kai and Guo, Qipeng},
|
| 31 |
+
journal={arXiv preprint arXiv:2604.14164},
|
| 32 |
+
year={2026}
|
| 33 |
+
}
|
| 34 |
+
```
|