AI & ML interests
None yet
Organizations
None yet
models 11
Blancy/Qwen3-1.7B-Open-R1-Code-GRPO
Text Generation
• 2B • Updated Blancy/Qwen3-0.6B-Open-R1-Distill
Text Generation
• 2B • Updated • 5
Blancy/Qwen3-0.6B-Open-R1-GRPO
Text Generation
• 2B • Updated • 1
Blancy/Qwen3-1.7B-Open-R1-GRPO
Text Generation
• 2B • Updated • 4
• • 2
Blancy/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Text Generation
• 2B • Updated Blancy/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated • 4
• 1
Blancy/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 4
Blancy/Qwen-2.5-7B-Simple-RL
Text Generation
• 8B • Updated • 1
• 1
Blancy/Qwen2.5-1.5B-Open-R1-Code-GRPO
Text Generation
• 2B • Updated • 4
Blancy/DeepSeek-R1-Distill-Qwen-0.5B-GRPO
Text Generation
• 0.6B • Updated • 9
datasets 43
Blancy/1ktestfrom10kwithdifficultyclasses_selfguided
Viewer
• Updated • 1k • 7
Blancy/verifiable-coding-problems-SFT
Viewer
• Updated • 1.09k • 66
Blancy/verifiable-coding-problems-CoT
Viewer
• Updated • 1.09k • 10
Blancy/verifiable-coding-problems-python-filtered
Viewer
• Updated • 2k • 15
Blancy/OpenThoughts-114k-Code_fit_code_reward
Viewer
• Updated • 1k • 22
Blancy/OpenThoughts-114k-Code_oj_format
Viewer
• Updated • 1k • 17
Blancy/OpenThoughts-114k-Code_decontaminated_final_verinfo
Viewer
• Updated • 1k • 8
Blancy/OpenThoughts-114k-Code_decontaminated_final
Viewer
• Updated • 1k • 5
Blancy/OpenThoughts-114k-Code_decontaminated_3000to5000_problem_leq400
Viewer
• Updated • 1.82k • 5
Blancy/OpenThoughts-114k-Code_decontaminated_3000to5000
Viewer
• Updated • 2.92k • 16