Embodied-R1.5: Evolving Physical Intelligence via Embodied Foundation Models Paper • 2606.11324 • Published 2 days ago • 5
InternVideo3: Agentify Foundation Models with Multimodal Contextual Reasoning Paper • 2606.12195 • Published 1 day ago • 13
i1: A Simple and Fully Open Recipe for Strong Text-to-Image Models Paper • 2606.11289 • Published 2 days ago • 1
World Model Self-Distillation: Training World Models to Solve General Tasks Paper • 2606.12072 • Published 1 day ago • 4
Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling Paper • 2606.12370 • Published 1 day ago • 11
SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning Paper • 2606.10804 • Published 2 days ago • 32
Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic tasks in Real-World Professional Fields Paper • 2606.11042 • Published 2 days ago • 17
Test-Time Gradient Guidance of Flow Policies in Reinforcement Learning Paper • 2606.11087 • Published 2 days ago • 3
MilliVid: Hierarchical Latents for Long-Range Consistency in Video Generation Paper • 2606.09056 • Published 3 days ago • 4
Evaluation Cards: An Interpretive Layer for AI Evaluation Reporting Paper • 2606.09809 • Published 3 days ago • 2
OASIS: From Simulation Data Collection to Real-World Humanoid Loco-Manipulation Paper • 2606.08548 • Published 4 days ago • 2
Echo-Memory: A Controlled Study of Memory in Action World Models Paper • 2606.09803 • Published 3 days ago • 30
Streaming Video Generation with Streaming Force Control Paper • 2606.07508 • Published 6 days ago • 1
WorldBench: A Challenging and Visually Diverse Multimodal Reasoning Benchmark Paper • 2606.06538 • Published 7 days ago • 1
Socratic-SWE: Self-Evolving Coding Agents via Trace-Derived Agent Skills Paper • 2606.07412 • Published 6 days ago • 12
World-Language-Action Model for Unified World Modeling, Language Reasoning, and Action Synthesis Paper • 2606.05979 • Published 7 days ago • 8