3D-VCD: Hallucination Mitigation in 3D-LLM Embodied Agents through Visual Contrastive Decoding Paper • 2604.08645 • Published 24 days ago • 1
mTSBench: Benchmarking Multivariate Time Series Anomaly Detection and Model Selection at Scale Paper • 2506.21550 • Published Jun 26, 2025
DreamPartGen: Semantically Grounded Part-Level 3D Generation via Collaborative Latent Denoising Paper • 2603.19216 • Published Mar 19 • 1
VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs Paper • 2603.23481 • Published Mar 24 • 7
Phantom: Physics-Infused Video Generation via Joint Modeling of Visual and Latent Physical Dynamics Paper • 2604.08503 • Published 24 days ago • 7
RewardFlow: Generate Images by Optimizing What You Reward Paper • 2604.08536 • Published 24 days ago • 5
RewardFlow: Generate Images by Optimizing What You Reward Paper • 2604.08536 • Published 24 days ago • 5
Phantom: Physics-Infused Video Generation via Joint Modeling of Visual and Latent Physical Dynamics Paper • 2604.08503 • Published 24 days ago • 7
VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs Paper • 2603.23481 • Published Mar 24 • 7
PRIMA: Multi-Image Vision-Language Models for Reasoning Segmentation Paper • 2412.15209 • Published Dec 19, 2024 • 1
CoRe3D: Collaborative Reasoning as a Foundation for 3D Intelligence Paper • 2512.12768 • Published Dec 14, 2025 • 4
Toward Cognitive Supersensing in Multimodal Large Language Model Paper • 2602.01541 • Published Feb 2 • 16
DreamPartGen: Semantically Grounded Part-Level 3D Generation via Collaborative Latent Denoising Paper • 2603.19216 • Published Mar 19 • 1