Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 9 days ago • 138
COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation Paper • 2605.31264 • Published 8 days ago • 105
Imagine Before You Predict: Interleaved Latent Visual Reasoning for Video Event Prediction Paper • 2606.05769 • Published 2 days ago • 5