AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications Paper • 2602.22769 • Published Feb 26 • 10
Who Prices Cognitive Labor in the Age of Agents? Compute-Anchored Wages Paper • 2605.05558 • Published 25 days ago • 3
The Many Faces of On-Policy Distillation: Pitfalls, Mechanisms, and Fixes Paper • 2605.11182 • Published 22 days ago • 5
Rethinking Memory Mechanisms of Foundation Agents in the Second Half: A Survey Paper • 2602.06052 • Published Jan 14 • 6
Probing the Knowledge Boundary: An Interactive Agentic Framework for Deep Knowledge Extraction Paper • 2602.00959 • Published Feb 1
Agentic AI Systems Should Be Designed as Marginal Token Allocators Paper • 2605.01214 • Published about 1 month ago • 4
OpenTinker: Separating Concerns in Agentic Reinforcement Learning Paper • 2601.07376 • Published Jan 12 • 7
Benchmarking Scientific Understanding and Reasoning for Video Generation using VideoScience-Bench Paper • 2512.02942 • Published Dec 2, 2025 • 5
Fast and Accurate Causal Parallel Decoding using Jacobi Forcing Paper • 2512.14681 • Published Dec 16, 2025 • 43
Multi-Agent Evolve: LLM Self-Improve through Co-evolution Paper • 2510.23595 • Published Oct 27, 2025 • 13
Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs Paper • 2310.16355 • Published Oct 25, 2023
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena Paper • 2306.05685 • Published Jun 9, 2023 • 43
Toward Inference-optimal Mixture-of-Expert Large Language Models Paper • 2404.02852 • Published Apr 3, 2024
LLM360 K2: Building a 65B 360-Open-Source Large Language Model from Scratch Paper • 2501.07124 • Published Jan 13, 2025
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Paper • 2506.14965 • Published Jun 17, 2025 • 50