Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
HuggingFaceH4
/
Qwen2.5-Math-1.5B-Instruct-PRM-0.2
like
0
Follow
Hugging Face H4
1.42k
Token Classification
Transformers
Safetensors
HuggingFaceH4/prm800k-trl-dedup
qwen2
Generated from Trainer
trl
prm
text-generation-inference
arxiv:
2211.14275
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Qwen2.5-Math-1.5B-Instruct-PRM-0.2
Commit History
Update README.md
bc44d4a
verified
plaguss
commited on
Jan 9, 2025
Update README.md
b0c5086
verified
plaguss
commited on
Jan 9, 2025
End of training
34c257e
verified
plaguss
commited on
Jan 8, 2025
Model save
ca50afa
verified
plaguss
commited on
Jan 8, 2025
Model save
749de75
verified
plaguss
commited on
Jan 8, 2025
Training in progress, step 721
3af57d2
verified
plaguss
commited on
Jan 8, 2025
Training in progress, step 500
7bd295f
verified
plaguss
commited on
Jan 8, 2025
initial commit
9f17009
verified
plaguss
commited on
Jan 8, 2025