Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
LeTue09
/
arithmetic-grpo
like
0
arxiv:
14 papers
Model card
Files
Files and versions
xet
Community
main
arithmetic-grpo
/
examples
/
data_preprocess
73.5 kB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
LeTue09
initial clean commit
1faccd4
18 days ago
aime2024_multiturn_w_tool.py
2.86 kB
initial clean commit
18 days ago
aime_dataset.py
4.48 kB
initial clean commit
18 days ago
aime_history_dataset.py
6.02 kB
initial clean commit
18 days ago
dapo_multiturn_w_tool.py
2.86 kB
initial clean commit
18 days ago
full_hh_rlhf.py
5.99 kB
initial clean commit
18 days ago
geo3k.py
3.55 kB
initial clean commit
18 days ago
geo3k_multiturn_w_tool.py
4.71 kB
initial clean commit
18 days ago
gsm8k.py
3.64 kB
initial clean commit
18 days ago
gsm8k_multiturn_sft.py
3.37 kB
initial clean commit
18 days ago
gsm8k_multiturn_w_interaction.py
4.39 kB
initial clean commit
18 days ago
gsm8k_multiturn_w_tool.py
4.96 kB
initial clean commit
18 days ago
gsm8k_tool_agent_loop.py
5.01 kB
initial clean commit
18 days ago
hellaswag.py
3.92 kB
initial clean commit
18 days ago
math_dataset.py
3.86 kB
initial clean commit
18 days ago
multiturn.py
4.67 kB
initial clean commit
18 days ago
pokemon.py
2.31 kB
initial clean commit
18 days ago
preprocess_search_r1_dataset.py
6.94 kB
initial clean commit
18 days ago