models 33
jyc0325/Qwen2.5-1.5B-Open-R1-Code-GRPO
Text Generation
• 2B • Updated
jyc0325/Qwen2.5-1.5B-Instruct-gccpSFT-GRPO
Text Generation
• 2B • Updated
jyc0325/Qwen2.5-1.5B-Instruct-gccpSFT
Text Generation
• 2B • Updated • 2
jyc0325/Qwen2.5-7B-Instruct-SFT
Text Generation
• 8B • Updated • 3
jyc0325/Qwen2.5-1.5B-Open-R1-Code-GRPOv2
Text Generation
• 2B • Updated
jyc0325/Qwen2.5-1.5B-SFT-ORPO
Text Generation
• 2B • Updated
jyc0325/Qwen2.5-1.5B-DPO-SFT-code
Text Generation
• 2B • Updated
jyc0325/Qwen2.5-1.5B-SFT-v1
Text Generation
• 2B • Updated • 3
jyc0325/Qwen2.5-1.5B-ORPO-code-hard
Text Generation
• 2B • Updated • 1
jyc0325/Qwen2.5-1.5B-DPO-code-hard
Text Generation
• 2B • Updated
datasets 10
Viewer
• Updated • 35.7k • 13
Viewer
• Updated • 35.7k • 4
jyc0325/vcpp-pref-hard-pairs
Viewer
• Updated • 26.9k • 7
jyc0325/vcpp-pref-code-only
Viewer
• Updated • 32.9k • 4
jyc0325/vezora-pref-code-only
Viewer
• Updated • 52.9k • 4
jyc0325/vezora-pref-clean
Viewer
• Updated • 54k • 4
jyc0325/verifiable-coding-problems-python-pref
Viewer
• Updated • 32.9k • 9
jyc0325/Code-Preference-Pairs
Viewer
• Updated • 54k • 3
jyc0325/Mixture-of-Thoughts-code-8k
Viewer
• Updated • 25.2k • 5
jyc0325/python_decontaminated_OpenR1-Math-220k
Preview
• Updated • 4