DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution Paper β’ 2601.13761 β’ Published Jan 20 β’ 16
meta-llama/Llama-3.2-1B-Instruct Text Generation β’ 1B β’ Updated Oct 24, 2024 β’ 4.18M β’ β’ 1.33k
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper β’ 2507.01006 β’ Published Jul 1, 2025 β’ 252
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper β’ 2507.01006 β’ Published Jul 1, 2025 β’ 252