Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
tphage
/
BeamPERL
like
0
Text Generation
PEFT
Safetensors
English
qwen2
reinforcement-learning
grpo
lora
beam-mechanics
structural-engineering
math
reasoning
conversational
arxiv:
2504.15777
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Use this model
main
BeamPERL
3.57 GB
1 contributor
History:
4 commits
tphage
Add model card
7fcbf30
verified
6 days ago
.gitattributes
1.57 kB
Upload folder using huggingface_hub
6 days ago
README.md
3 kB
Add model card
6 days ago
chat_template.jinja
2.25 kB
Upload folder using huggingface_hub
6 days ago
config.json
703 Bytes
Upload folder using huggingface_hub
6 days ago
generation_config.json
181 Bytes
Upload folder using huggingface_hub
6 days ago
model.safetensors
3.55 GB
xet
Upload folder using huggingface_hub
6 days ago
special_tokens_map.json
485 Bytes
Upload folder using huggingface_hub
6 days ago
tokenizer.json
11.4 MB
xet
Upload folder using huggingface_hub
6 days ago
tokenizer_config.json
4.49 kB
Upload folder using huggingface_hub
6 days ago