ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning • Paper • 2604.24300 • Published 4 days ago • 60
EgoFun3D: Modeling Interactive Objects from Egocentric Videos using Function Templates • Paper • 2604.11038 • Published 18 days ago
Generalizable Articulated Object Reconstruction from Casually Captured RGBD Videos • Paper • 2506.08334 • Published Jun 10, 2025 • 1
TieBot: Learning to Knot a Tie from Visual Demonstration through a Real-to-Sim-to-Real Approach • Paper • 2407.03245 • Published Jul 3, 2024