youyou
shiyingcheng
AI & ML interests
None yet
Recent Activity
upvoted a paper about 8 hours ago
EvoTrainer: Co-Evolving LLM Policies and Training Harnesses for Autonomous Agentic Reinforcement Learning submitted a paper about 8 hours ago
EvoTrainer: Co-Evolving LLM Policies and Training Harnesses for Autonomous Agentic Reinforcement Learning upvoted a paper 9 days ago
ESPO: Early-Stopping Proximal Policy OptimizationOrganizations
None yet