PANDO: Efficient Multimodal AI Agents via Online Skill Distillation Paper • 2605.24785 • Published 4 days ago • 2
The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning Paper • 2603.29025 • Published Mar 30 • 13