Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
AlignmentResearch
/
pineapple-oskar_005ga_rm_training
like
0
Follow
FAR AI
53
PEFT
Safetensors
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Use this model
main
pineapple-oskar_005ga_rm_training
/
reference
Commit History
Upload trained reward model
fc08669
verified
skar0
commited on
Jul 22, 2025
Upload trained reward model
3e85fec
verified
skar0
commited on
Jul 22, 2025
Upload trained reward model
745597a
verified
skar0
commited on
Jul 21, 2025
Upload trained reward model
9b4e15a
verified
skar0
commited on
Jul 21, 2025
Upload trained reward model
140f50d
verified
skar0
commited on
Jul 21, 2025
Upload trained reward model
8910107
verified
skar0
commited on
Jul 21, 2025
Upload trained reward model
a834a40
verified
skar0
commited on
Jul 21, 2025