The ToolRL model trained for tool use through GRPO
Cheng Qian
chengq9
AI & ML interests
Agent, Tool Learning
Recent Activity
upvoted a paper about 7 hours ago
AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints upvoted a paper 2 days ago
Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues upvoted a paper 8 days ago
Advancing Creative Physical Intelligence in Large Multimodal Models