IDEALLab/Qwen2.5-Coder-14B-Instruct-GRPO-SDS-Minimalist-seed202 Reinforcement Learning • 15B • Updated 8 days ago • 15
IDEALLab/Qwen2.5-Coder-14B-Instruct-GRPO-SDS-Minimalist-seed303 Reinforcement Learning • 15B • Updated 8 days ago • 16
mykola-lexsi/pricoder-aligntune-qwen3-5-4b-v2-iter-remapped Text Generation • Updated 3 days ago • 17