Nathanaël Beau
Nbeau
AI & ML interests
Code generation
Recent Activity
published a model about 4 hours ago
Nbeau/qwen-swan-sig-2b upvoted a paper 8 months ago
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep
Thinking upvoted a paper about 1 year ago
Ignore the KL Penalty! Boosting Exploration on Critical Tokens to
Enhance RL Fine-TuningOrganizations
None yet