Mingzhe Li
Mubuky
ยท
AI & ML interests
RL & Agent
Recent Activity
upvoted
a
paper
16 days ago
ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking