Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks Paper • 2604.11610 • Published 11 days ago • 6
DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off Paper • 2604.13902 • Published 9 days ago • 59
DCAgent/e1_random_d1_constrain_sandboxes_glm_4.7_traces_jupiter Viewer • Updated 12 days ago • 13.2k • 16
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published 25 days ago • 340
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 342
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling Paper • 2603.25746 • Published 29 days ago • 155
HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions Paper • 2603.15612 • Published Mar 16 • 152