SenseNova-U1 Collection SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-Unify Architecture • 3 items • Updated 3 days ago • 37
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation Paper • 2604.24764 • Published 6 days ago • 115
From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company Paper • 2604.22446 • Published 9 days ago • 116
Fast-dVLM: Efficient Block-Diffusion VLM via Direct Conversion from Autoregressive VLM Paper • 2604.06832 • Published 23 days ago
Feed-Forward 3D Scene Modeling: A Problem-Driven Perspective Paper • 2604.14025 • Published 18 days ago • 15
Feed-Forward 3D Scene Modeling: A Problem-Driven Perspective Paper • 2604.14025 • Published 18 days ago • 15
INSPATIO-WORLD: A Real-Time 4D World Simulator via Spatiotemporal Autoregressive Modeling Paper • 2604.07209 • Published 25 days ago • 36
OneWorld: Taming Scene Generation with 3D Unified Representation Autoencoder Paper • 2603.16099 • Published Mar 17 • 1
Scaling Beyond Context: A Survey of Multimodal Retrieval-Augmented Generation for Document Understanding Paper • 2510.15253 • Published Oct 17, 2025
Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment Paper • 2505.21494 • Published May 27, 2025 • 8
Hyper3D: Efficient 3D Representation via Hybrid Triplane and Octree Feature for Enhanced 3D Shape Variational Auto-Encoders Paper • 2503.10403 • Published Mar 13, 2025
OneWorld: Taming Scene Generation with 3D Unified Representation Autoencoder Paper • 2603.16099 • Published Mar 17 • 1