arxiv:2606.20002
Yanxi Chen
yanxi-chen
AI & ML interests
None yet
Recent Activity
authored a paper about 14 hours ago
R$^3$L: Reflect-then-Retry Reinforcement Learning with Language-Guided Exploration, Pivotal Credit, and Positive Amplification
authored a paper about 14 hours ago
SeeUPO: Sequence-Level Agentic-RL with Convergence Guarantees
authored a paper about 14 hours ago
Connect the Dots: Training LLMs for Long-Lifecycle Agents with Cross-Domain Generalization Via Reinforcement Learning
Organizations
None yet