---
license: apache-2.0
datasets:
- TIGER-Lab/rStar-Critique-Data
language:
- en
metrics:
- accuracy
base_model:
- Qwen/Qwen3-4B
tags:
- code
---

## Model
We release the 4B model trained with [Critique-Coder](https://github.com/TIGER-AI-Lab/Critique-Coder).

## Data
The data construction pipeline is shown in the figure below:

[Figure: data construction pipeline]

## Paper
[Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning](https://huggingface.co/papers/2509.22824)

## Project Page
https://tiger-ai-lab.github.io/Critique-Coder

## Code
https://github.com/TIGER-AI-Lab/Critique-Coder

## Sample Usage

You can download the rStar-Critique-Data dataset using the Hugging Face CLI:

```bash
hf download Critique-Coder/rStar-Critique-Data --local-dir ./data/critique-coder-dataset --repo-type dataset
```
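
The same download can also be scripted from Python. The snippet below is a minimal sketch, assuming the `huggingface_hub` package is installed; the helper names `build_download_kwargs` and `download_dataset` are hypothetical and introduced only for illustration, while the repository ID and target directory are copied from the CLI command above.

```python
def build_download_kwargs(local_dir: str = "./data/critique-coder-dataset") -> dict:
    """Mirror the CLI arguments as keyword arguments for snapshot_download.

    Hypothetical helper: it only assembles arguments and does no network I/O.
    """
    return {
        # Repository ID copied verbatim from the CLI command.
        "repo_id": "Critique-Coder/rStar-Critique-Data",
        # Tells the Hub to look for a dataset repository, not a model.
        "repo_type": "dataset",
        # Directory where the files are written.
        "local_dir": local_dir,
    }


def download_dataset(local_dir: str = "./data/critique-coder-dataset") -> str:
    """Download the dataset snapshot and return the local folder path."""
    # Imported lazily so the argument helper works without huggingface_hub.
    from huggingface_hub import snapshot_download

    return snapshot_download(**build_download_kwargs(local_dir))
```

Calling `download_dataset()` should fetch the files into `./data/critique-coder-dataset` and return that folder's path; pass a different `local_dir` to store them elsewhere.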

## Citation
```bibtex
@article{ruan2025critiquecoder,
  title={Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning},
  author={Ruan, Chi and Jiang, Dongfu and Wang, Yubo and Chen, Wenhu},
  journal={arXiv preprint arXiv:2509.22824},
  year={2025}
}
```