Transformers
Safetensors
Generated from Trainer
ppo_with_value14 / policy_model
653 MB
AMindToThink
Model save
8f156bb verified