papers
updated
Visual Representation Alignment for Multimodal Large Language Models
Paper
• 2509.07979
• Published
• 84
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
Paper
• 2509.07980
• Published
• 104
Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth
Paper
• 2509.03867
• Published
• 211
Why Language Models Hallucinate
Paper
• 2509.04664
• Published
• 196
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs
via Bi-Mode Annealing and Reinforce Learning
Paper
• 2508.21113
• Published
• 110
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model
Paper
• 2509.00676
• Published
• 85
Towards a Unified View of Large Language Model Post-Training
Paper
• 2509.04419
• Published
• 76
Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task
Arithmetic
Paper
• 2509.01363
• Published
• 59
Does DINOv3 Set a New Medical Vision Standard?
Paper
• 2509.06467
• Published
• 38
Reinforced Visual Perception with Tools
Paper
• 2509.01656
• Published
• 32