Add model card and metadata for CauGym-GRPO-14B
#1
by
nielsr
HF Staff
- opened
This PR improves the model card for CauGym-GRPO-14B by:
- Adding relevant metadata tags (
pipeline_tag,library_name,base_model, andtags). - Linking to the research paper: "Can Post-Training Transform LLMs into Causal Reasoners?".
- Linking to the official GitHub repository: OpenCausaLab/CauGym.
- Providing a description of the model's performance and training methodology (GRPO).
- Adding the BibTeX citation for researchers.