From Model Scaling to System Scaling: Scaling the Harness in Agentic AI Paper • 2605.26112 • Published 17 days ago • 9
Arbitrage: Efficient Reasoning via Advantage-Aware Speculation Paper • 2512.05033 • Published Dec 4, 2025 • 17
Let's (not) just put things in Context: Test-Time Training for Long-Context LLMs Paper • 2512.13898 • Published Dec 15, 2025 • 2
$V_1$: Unifying Generation and Self-Verification for Parallel Reasoners Paper • 2603.04304 • Published Mar 4 • 14
Reward Under Attack: Analyzing the Robustness and Hackability of Process Reward Models Paper • 2603.06621 • Published Feb 20
Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution Paper • 2604.07725 • Published Apr 10
Learning, Fast and Slow: Towards LLMs That Adapt Continually Paper • 2605.12484 • Published about 1 month ago • 18
Grid2Matrix: Revealing Digital Agnosia in Vision-Language Models Paper • 2604.09687 • Published Apr 14 • 8
Combee: Scaling Prompt Learning for Self-Improving Language Model Agents Paper • 2604.04247 • Published Apr 5 • 31
Can AI Agents Answer Your Data Questions? A Benchmark for Data Agents Paper • 2603.20576 • Published Mar 21 • 4
SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing Paper • 2603.08982 • Published Mar 9 • 16
SLA2: Sparse-Linear Attention with Learnable Routing and QAT Paper • 2602.12675 • Published Feb 13 • 59
AgenticPay: A Multi-Agent LLM Negotiation System for Buyer-Seller Transactions Paper • 2602.06008 • Published Feb 5 • 5