PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-7b-em-ppo-v0.3
8B • Updated • 156k • 1
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-ppo-v0.3
3B • Updated • 47
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-em-ppo-v0.3
3B • Updated • 50.9k
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-7b-em-grpo-v0.3
8B • Updated • 23
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-em-grpo-v0.3
3B • Updated • 37 • 1
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-grpo-v0.3
3B • Updated • 6
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-7b-it-em-grpo-v0.3
8B • Updated • 15 • 1
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-32b-em-grpo-v0.3
33B • Updated • 12
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-14b-it-em-grpo-v0.3
15B • Updated • 7
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-14b-em-grpo-v0.3
15B • Updated • 1
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-14b-em-ppo-v0.3
15B • Updated • 1
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-14b-em-ppo-v0.2
15B • Updated • 4
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-14b-it-em-ppo-v0.2
15B • Updated • 5
PeterJinGo/R1-nq_hotpotqa_train-qwen2.5-3b-em-ppo-v0.2
3B • Updated • 1
PeterJinGo/R1-nq_hotpotqa_train-qwen2.5-3b-it-em-ppo-v0.2
3B • Updated • 2
PeterJinGo/R1-nq_hotpotqa_train-qwen2.5-14b-em-ppo-v0.2
15B • Updated • 3
PeterJinGo/R1-nq_hotpotqa_train-qwen2.5-14b-it-em-ppo-v0.2
15B • Updated • 2
PeterJinGo/R1-nq_hotpotqa_train-qwen2.5-7b-it-em-ppo-v0.2
8B • Updated • 2
PeterJinGo/R1-nq_hotpotqa_train-qwen2.5-7b-em-ppo-v0.2
8B • Updated • 2
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-7b-it-em-grpo-v0.2
8B • Updated • 4
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-7b-em-grpo-v0.2
8B • Updated • 122
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-7b-it-em-ppo-v0.2
8B • Updated • 3
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-grpo-v0.2
3B • Updated • 69 • 1
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-7b-em-grpo-groupsize3-v0.2
8B • Updated • 2
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-7b-em-ppo-v0.2
8B • Updated • 1.26k
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-14b-em-grpo-groupsize1-v0.2
15B • Updated • 2
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-14b-it-em-grpo-groupsize1-v0.2
15B • Updated • 4
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-ppo-v0.2
3B • Updated • 4
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-em-grpo-v0.2
3B • Updated • 61
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-em-ppo-v0.2
3B • Updated • 4 • 1