Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants Paper • 2604.00842 • Published 3 days ago • 9
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook Paper • 2604.02029 • Published 3 days ago • 113
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization Paper • 2604.02268 • Published 3 days ago • 78
Paper Reconstruction Evaluation: Evaluating Presentation and Hallucination in AI-written Papers Paper • 2604.01128 • Published 3 days ago • 11
Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published 3 days ago • 23
MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome Paper • 2603.28407 • Published 5 days ago • 61
ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers Paper • 2603.24414 • Published 10 days ago • 174
FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration Paper • 2603.29557 • Published 4 days ago • 15
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 15 days ago • 313
INSID3: Training-Free In-Context Segmentation with DINOv3 Paper • 2603.28480 • Published 5 days ago • 5
Marco DeepResearch: Unlocking Efficient Deep Research Agents via Verification-Centric Design Paper • 2603.28376 • Published 5 days ago • 20
ResAdapt: Adaptive Resolution for Efficient Multimodal Reasoning Paper • 2603.28610 • Published 5 days ago • 20
Story2Proposal: A Scaffold for Structured Scientific Paper Writing Paper • 2603.27065 • Published 8 days ago • 21
Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization Paper • 2603.28342 • Published 5 days ago • 24
MuSEAgent: A Multimodal Reasoning Agent with Stateful Experiences Paper • 2603.27813 • Published 6 days ago • 23
Gen-Searcher: Reinforcing Agentic Search for Image Generation Paper • 2603.28767 • Published 5 days ago • 54
Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models? Paper • 2603.22582 • Published 12 days ago • 7