AffordanceVLA: A Vision-Language-Action Model Empowering Action Generation through Affordance-Aware Understanding Paper • 2606.06155 • Published 3 days ago • 6
UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling Paper • 2604.19734 • Published Apr 21 • 33
DIAL: Decoupling Intent and Action via Latent World Modeling for End-to-End VLA Paper • 2603.29844 • Published Mar 31 • 2