nielsr HF Staff commited on
Commit
fc71df8
·
verified ·
1 Parent(s): 095360d

Add metadata and improve model card

Browse files

Hi, I'm Niels, part of the community science team at Hugging Face!

This PR aims to improve the discoverability and documentation of this model. Based on the associated paper and repository files, I've:
- Added the `text-classification` pipeline tag.
- Added `library_name: transformers`.
- Enhanced the model card with a description of the TESSY framework, its role as a boundary predictor, and links to the official paper and GitHub repository.

Please let me know if you have any questions!

Files changed (1) hide show
  1. README.md +33 -3
README.md CHANGED
@@ -1,4 +1,34 @@
1
- This tool is developed as part of our work
2
- [*How to Fine-Tune a Reasoning Model? A Teacher–Student Cooperation Framework to Synthesize Student-Consistent SFT Data*](https://github.com/CoopReason/TESSY/blob/main/paper/TESSY.pdf).
 
 
3
 
4
- For more details, please refer to the [GitHub repository](https://github.com/CoopReason/TESSY/).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ pipeline_tag: text-classification
3
+ library_name: transformers
4
+ ---
5
 
6
+ # Boundary Predictor - TESSY
7
+
8
+ This model is a boundary predictor developed as part of the work:
9
+ [*How to Fine-Tune a Reasoning Model? A Teacher–Student Cooperation Framework to Synthesize Student-Consistent SFT Data*](https://huggingface.co/papers/2604.14164).
10
+
11
+ ## Model Description
12
+
13
+ This model serves as a critical component of **TESSY** (Teacher–Student Cooperation Data Synthesis), a framework designed to synthesize "on-policy" SFT data for reasoning models like Qwen3.
14
+
15
+ The primary role of this predictor is to identify **reasoning boundaries**—the specific points in a sequence where the model transitions between reasoning steps (capability tokens) and non-reasoning stylistic content (style tokens). By predicting these boundaries, TESSY allows a teacher model (to provide reasoning) and a student model (to provide stylistic consistency) to alternate during the data generation process.
16
+
17
+ ## Resources
18
+
19
+ - **Paper:** [How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data](https://huggingface.co/papers/2604.14164)
20
+ - **GitHub Repository:** [CoopReason/TESSY](https://github.com/CoopReason/TESSY/)
21
+ - **Associated Dataset:** [TESSY-Code-80K](https://huggingface.co/datasets/CoopReason/TESSY-Code-80K)
22
+
23
+ ## Citation
24
+
25
+ If you find this work or model useful, please cite:
26
+
27
+ ```bibtex
28
+ @article{TESSY,
29
+ title={How to Fine-Tune a Reasoning Model? A Teacher--Student Cooperation Framework to Synthesize Student-Consistent SFT Data},
30
+ author={Huang, Zixian and Yang, Kaichen and Huang, Xu and Hao, Feiyang and Qiming, Ge and Li, Bowen and Du, He and Chen, Kai and Guo, Qipeng},
31
+ journal={arXiv preprint arXiv:2604.14164},
32
+ year={2026}
33
+ }
34
+ ```