Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
sohyunan
/
debug
like
0
Text Generation
Transformers
Safetensors
maze_5x5
gemma2
Generated from Trainer
controller-grpo
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
debug
10.7 GB
1 contributor
History:
3 commits
sohyunan
End of training
8e387e6
verified
about 1 year ago
maze
Model save
about 1 year ago
merged
Model save
about 1 year ago
.gitattributes
2.13 kB
Model save
about 1 year ago
README.md
2.15 kB
End of training
about 1 year ago
adapter_config.json
766 Bytes
Model save
about 1 year ago
adapter_model.safetensors
6.42 MB
xet
Model save
about 1 year ago
all_results.json
344 Bytes
Model save
about 1 year ago
config.json
949 Bytes
End of training
about 1 year ago
eval_results.json
168 Bytes
Model save
about 1 year ago
special_tokens_map.json
636 Bytes
Model save
about 1 year ago
tokenizer.json
34.4 MB
xet
Model save
about 1 year ago
tokenizer_config.json
47.3 kB
Model save
about 1 year ago
train_results.json
178 Bytes
Model save
about 1 year ago
trainer_state.json
1.22 kB
Model save
about 1 year ago
training_args.bin
7.16 kB
xet
Model save
about 1 year ago