Aleksandr
akrylov
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper 1 day ago
Trust-Region Behavior Blending for On-Policy Distillation upvoted a paper 4 months ago
F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare published a model about 1 year ago
akrylov/lora-trained-xlOrganizations
None yet