Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
LeTue09
/
arithmetic-grpo
like
0
arxiv:
14 papers
Model card
Files
Files and versions
xet
Community
main
arithmetic-grpo
/
examples
/
ppo_trainer
61.8 kB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
LeTue09
initial clean commit
1faccd4
18 days ago
README.md
6.88 kB
initial clean commit
18 days ago
run_deepseek7b_llm.sh
1.84 kB
initial clean commit
18 days ago
run_deepseek7b_llm_modelscope.sh
1.83 kB
initial clean commit
18 days ago
run_deepseek7b_llm_pfppo.sh
1.98 kB
initial clean commit
18 days ago
run_deepseek7b_llm_sandbox_fusion.sh
2.03 kB
initial clean commit
18 days ago
run_deepseek7b_llm_sp2.sh
1.91 kB
initial clean commit
18 days ago
run_deepseek_full_hh_rlhf.sh
1.91 kB
initial clean commit
18 days ago
run_deepseek_math_gsm8k_megatron.sh
2.07 kB
initial clean commit
18 days ago
run_deepseek_math_gsm8k_megatron_nsys.sh
2.69 kB
initial clean commit
18 days ago
run_gemma.sh
1.7 kB
initial clean commit
18 days ago
run_moonlight16b_a3b_gsm8k_megatron.sh
4.82 kB
initial clean commit
18 days ago
run_qwen1.5_moe_a2.7b-gsm8k_megatron.sh
3.05 kB
initial clean commit
18 days ago
run_qwen2-7b_math_gsm8k_megatron.sh
2.01 kB
initial clean commit
18 days ago
run_qwen2-7b_rm.sh
3.17 kB
initial clean commit
18 days ago
run_qwen2-7b_rm_reward_loop_colocate.sh
2.99 kB
initial clean commit
18 days ago
run_qwen2-7b_rm_seq_balance.sh
2.62 kB
initial clean commit
18 days ago
run_qwen2-7b_rm_seq_balance_fused_kernels.sh
2.84 kB
initial clean commit
18 days ago
run_qwen2-7b_rm_seq_balance_nsys.sh
3.37 kB
initial clean commit
18 days ago
run_qwen2-7b_seq_balance.sh
2.41 kB
initial clean commit
18 days ago
run_qwen2-7b_sglang_seq_balance.sh
2.15 kB
initial clean commit
18 days ago
run_qwen2.5-32b.sh
2.11 kB
initial clean commit
18 days ago
run_qwen2.5-3b_rm_reward_loop_colocate.sh
2.99 kB
initial clean commit
18 days ago
run_qwen3-8b_npu.sh
2.42 kB
initial clean commit
18 days ago