TREX: Automating LLM Fine-tuning via Agent-Driven Tree-based Exploration Paper • 2604.14116 • Published 16 days ago • 13
From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space Paper • 2604.14142 • Published 16 days ago • 29
Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents Paper • 2604.14004 • Published 16 days ago • 30
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents Paper • 2604.07429 • Published 23 days ago • 117
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published 18 days ago • 100
SAIRfoundation/equational-theories-selected-problems • Updated 3 days ago • 2.67k • 3.77k • 10
Eye, Robot: Learning to Look to Act with a BC-RL Perception-Action Loop Paper • 2506.10968 • Published Jun 12, 2025 • 1
GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristics Paper • 2602.12617 • Published Feb 13 • 20