GenRecon: Bridging Generative Priors for Multi-View 3D Scene Reconstruction Paper • 2605.23888 • Published 5 days ago • 8
Cartridges: Lightweight and general-purpose long context representations via self-study Paper • 2506.06266 • Published Jun 6, 2025 • 8
LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation Paper • 2605.18739 • Published 9 days ago • 109
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 23 days ago • 342
Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation Paper • 2605.03849 • Published 22 days ago • 124
Video Analysis and Generation via a Semantic Progress Function Paper • 2604.22554 • Published Apr 24 • 63
Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation Paper • 2604.18168 • Published Apr 20 • 97
WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published Apr 9 • 247
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published Mar 16 • 154
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper • 2603.03143 • Published Mar 3 • 145
OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer Paper • 2601.14250 • Published Jan 20 • 48
ShapeR: Robust Conditional 3D Shape Generation from Casual Captures Paper • 2601.11514 • Published Jan 16 • 24
Sharp Monocular View Synthesis in Less Than a Second Paper • 2512.10685 • Published Dec 11, 2025 • 30
Reflection Removal through Efficient Adaptation of Diffusion Transformers Paper • 2512.05000 • Published Dec 4, 2025 • 18