AutoTrainess: Teaching Language Models to Improve Language Models Autonomously Paper • 2606.31551 • Published 4 days ago • 13
MemoryArena: Benchmarking Agent Memory in Interdependent Multi-Session Agentic Tasks Paper • 2602.16313 • Published Feb 18 • 4
Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models Paper • 2311.09278 • Published Nov 15, 2023 • 9
SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting Paper • 2604.10688 • Published Apr 12 • 27
ADWIN: Adaptive Windows for Horizon-Aware On-Policy Distillation Paper • 2605.28396 • Published May 27 • 1
Teacher-Guided Policy Optimization for On-Policy Reasoning Distillation under Large Policy Divergence Paper • 2605.13230 • Published May 28 • 1
Filter, Then Reweight: Rethinking Optimization Granularity in On-Policy Distillation Paper • 2606.02684 • Published Jun 1 • 17
Uni-OPD: Unifying On-Policy Distillation with a Dual-Perspective Recipe Paper • 2605.03677 • Published May 5 • 28