arxiv:2605.31455
mj
mujianijan
ยท
AI & ML interests
RL, LLM Agent
Recent Activity
upvoted a paper about 8 hours ago
DRIFT: Decoupled Rollouts and Importance-Weighted Fine-Tuning for Efficient Multi-Turn Optimization authored a paper about 8 hours ago
GUI-360: A Comprehensive Dataset and Benchmark for Computer-Using Agents authored a paper about 10 hours ago
DRIFT: Decoupled Rollouts and Importance-Weighted Fine-Tuning for Efficient Multi-Turn Optimization