Efficient RL Training for LLMs with Experience Replay Paper • 2604.08706 • Published 18 days ago • 19
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability Paper • 2601.18778 • Published Jan 26 • 42