BadCat
Foresta
ยท
AI & ML interests
LLMs
Deep learning
Reinforcement learning
Recent Activity
upvoted a paper 1 day ago
XSkill: Continual Learning from Experience and Skills in Multimodal Agents upvoted an article 6 days ago
From GRPO to DAPO and GSPO: What, Why, and How upvoted a paper 2 months ago
AT^2PO: Agentic Turn-based Policy Optimization via Tree Search Organizations
None yet