Meta-Cognitive Memory Policy Optimization for Long-Horizon LLM Agents Paper • 2605.30159 • Published 8 days ago • 3
Meta-Cognitive Memory Policy Optimization for Long-Horizon LLM Agents Paper • 2605.30159 • Published 8 days ago • 3
ArborKV: Structure-Aware KV Cache Management for Scaling Tree-based LLM Reasoning Paper • 2605.22106 • Published 15 days ago • 1
VLA-Pruner: Temporal-Aware Dual-Level Visual Token Pruning for Efficient Vision-Language-Action Inference Paper • 2511.16449 • Published Nov 20, 2025 • 1
How Much Reasoning Do Retrieval-Augmented Models Add beyond LLMs? A Benchmarking Framework for Multi-Hop Inference over Hybrid Knowledge Paper • 2602.10210 • Published Feb 10 • 1
FaithRL: Learning to Reason Faithfully through Step-Level Faithfulness Maximization Paper • 2602.03507 • Published Feb 12
Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment Paper • 2511.04555 • Published Dec 5, 2025
ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning Paper • 2606.03503 • Published 3 days ago • 24
Meta-Cognitive Memory Policy Optimization for Long-Horizon LLM Agents Paper • 2605.30159 • Published 8 days ago • 3
ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning Paper • 2606.03503 • Published 3 days ago • 24
Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving Paper • 2512.10739 • Published Dec 11, 2025 • 47
VLA-Pruner: Temporal-Aware Dual-Level Visual Token Pruning for Efficient Vision-Language-Action Inference Paper • 2511.16449 • Published Nov 20, 2025 • 1