dWorldEval: Scalable Robotic Policy Evaluation via Discrete Diffusion World Model Paper • 2604.22152 • Published 4 days ago • 2
AgentSearchBench: A Benchmark for AI Agent Search in the Wild Paper • 2604.22436 • Published 4 days ago • 8
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond Paper • 2604.22748 • Published 4 days ago • 132
WorldMark: A Unified Benchmark Suite for Interactive Video World Models Paper • 2604.21686 • Published 5 days ago • 36
Seeing Fast and Slow: Learning the Flow of Time in Videos Paper • 2604.21931 • Published 5 days ago • 17
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 6 days ago • 231
MMCORE: MultiModal COnnection with Representation Aligned Latent Embeddings Paper • 2604.19902 • Published 7 days ago • 2
CreativeGame:Toward Mechanic-Aware Creative Game Generation Paper • 2604.19926 • Published 7 days ago • 2
Cortex 2.0: Grounding World Models in Real-World Industrial Deployment Paper • 2604.20246 • Published 6 days ago • 6
SWE-chat: Coding Agent Interactions From Real Users in the Wild Paper • 2604.20779 • Published 6 days ago • 11
SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editing Paper • 2604.19587 • Published 7 days ago • 45
ClawNet: Human-Symbiotic Agent Network for Cross-User Autonomous Cooperation Paper • 2604.19211 • Published 7 days ago • 11
Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items Paper • 2604.19748 • Published 7 days ago • 246
EvoMaster: A Foundational Agent Framework for Building Evolving Autonomous Scientific Agents at Scale Paper • 2604.17406 • Published 9 days ago • 4
Training LLM Agents for Spontaneous, Reward-Free Self-Evolution via World Knowledge Exploration Paper • 2604.18131 • Published 8 days ago • 9
WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models Paper • 2604.18224 • Published 8 days ago • 22