Training LLM Agents for Spontaneous, Reward-Free Self-Evolution via World Knowledge Exploration Paper • 2604.18131 • Published Apr 20 • 11
FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention Paper • 2606.09079 • Published 3 days ago • 53
Training LLM Agents for Spontaneous, Reward-Free Self-Evolution via World Knowledge Exploration Paper • 2604.18131 • Published Apr 20 • 11
AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration Paper • 2602.03786 • Published Feb 3 • 90
QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Paper • 2512.12967 • Published Dec 15, 2025 • 113
Improving LLMs' Generalized Reasoning Abilities by Graph Problems Paper • 2507.17168 • Published Jul 23, 2025 • 1