Boosting Reinforcement Learning with Verifiable Rewards via Randomly Selected Few-Shot Guidance Paper • 2605.15012 • Published 2 days ago • 1
CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis Paper • 2503.23145 • Published Mar 29, 2025 • 35
Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models Paper • 2310.04406 • Published Oct 6, 2023 • 10