AI & ML interests
None yet
Organizations
YYF42/opd_r1_distill_qwen1.5b_630_step_alltokens
YYF42/opd_r1_distill_qwen1.5b_1epoch
Text Classification
• 22.7M • Updated • 1
YYF42/Qwen-2.5-1.5B-Simple-RL-3epoch-newprompt
2B • Updated • 1
YYF42/Qwen-2.5-1.5B-Simple-RL-3epoch
Text Generation
• 2B • Updated • 1
YYF42/Qwen-2.5-1.5B-Simple-RL-16response
Text Generation
• 2B • Updated • 1
YYF42/Qwen-2.5-1.5B-Simple-RL-4response
Text Generation
• 2B • Updated • 3
YYF42/Qwen-2.5-1.5B-Simple-RL-2response
Text Generation
• 2B • Updated
YYF42/Qwen-2.5-1.5B-Simple-RL-baseline3
Text Generation
• 2B • Updated
YYF42/Qwen-2.5-1.5B-Simple-RL-baseline2
Text Generation
• 2B • Updated • 1
YYF42/Qwen-2.5-1.5B-Simple-RL
2B • Updated • 1
YYF42/Qwen2.5-1.5B-Open-R1-GRPO
Updated
YYF42/Qwen-2.5-7B-Simple-RL
Updated