TIGER-Lab
/

Critique-Coder-4B

Model card Files Files and versions

Critique-Coder-4B / README.md

wenhu's picture

Update README.md

5f45ee8 verified 5 months ago

|

history blame contribute delete

1.13 kB

	---
	license: apache-2.0
	datasets:
	- TIGER-Lab/rStar-Critique-Data
	language:
	- en
	metrics:
	- accuracy
	base_model:
	- Qwen/Qwen3-4B
	tags:
	- code
	---

	## Model
	We release the 4B model trained with [Critique-Coder](https://github.com/TIGER-AI-Lab/Critique-Coder).

	## Data
	Data Construction Pipeline is shown:

	![pipeline](https://github.com/TIGER-AI-Lab/Critique-Coder/blob/main/assets/images/dataset.png?raw=true)

	## Paper
	[Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning](https://huggingface.co/papers/2509.22824)

	## Project Page
	https://tiger-ai-lab.github.io/Critique-Coder

	## Code
	https://github.com/TIGER-AI-Lab/Critique-Coder

	## Sample Usage

	You can download this dataset using the Hugging Face CLI:

	```bash
	hf download Critique-Coder/rStar-Critique-Data --local-dir ./data/critique-coder-dataset --repo dataset
	```

	## Citation
	```
	@article{ruan2025critiquecoder,
	title={Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning},
	author={Ruan, Chi and Jiang, Dongfu and Wang, Yubo and Chen, Wenhu},
	journal={ArXiv},
	year={2025},
	volume={2509.22824}
	}
	```