EO-WM: A Physically Informed World Model for Probabilistic Earth Observation Forecasting Paper • 2606.27277 • Published 2 days ago • 2
OpenBioRQ: Unsolved Biomedical Research Questions for Agents Paper • 2606.21959 • Published 7 days ago • 3
ABACUS: Adapting Unified Foundation Model for Bridging Image Count Understanding and Generation Paper • 2606.23835 • Published 5 days ago • 2
COrigami: An AI Pipeline for Co-Designing Flat-Foldable Visually Recognisable Origami Paper • 2606.26299 • Published 3 days ago • 3
When Does Combining Language Models Help? A Co-Failure Ceiling on Routing, Voting, and Mixture-of-Agents Across 67 Frontier Models Paper • 2606.27288 • Published 2 days ago • 3
Information-Aware KV Cache Compression for Long Reasoning Paper • 2606.26875 • Published 2 days ago • 5
CoffeeBench: Benchmarking Long-Horizon LLM Agents in Heterogeneous Multi-Agent Economies Paper • 2606.16613 • Published 12 days ago • 7
Neglected Free Lunch from Post-training: Progress Advantage for LLM Agents Paper • 2606.26080 • Published 3 days ago • 6
Hallucination in World Models is Predictable and Preventable Paper • 2606.27326 • Published 2 days ago • 7
Confidence-Aware Tool Orchestration for Robust Video Understanding Paper • 2606.26904 • Published 2 days ago • 9
PhysiFormer: Learning to Simulate Mechanics in World Space Paper • 2606.27364 • Published 2 days ago • 8
LISA: Likelihood Score Alignment for Visual-condition Controllable Generation Paper • 2606.27192 • Published 2 days ago • 12
Running the Gauntlet: Re-evaluating the Capabilities of Agents Beyond Familiar Environments Paper • 2606.14397 • Published 2 days ago • 14
Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix It Paper • 2606.26027 • Published 3 days ago • 14
AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints Paper • 2606.05622 • Published 23 days ago • 44