Two LoRA cold-start SFT experiments teaching structured think/answer reasoning to Nanbeige4-3B-Base using distilled traces from frontier models
Mrinaal Arora
mrinaalarora
AI & ML interests
None yet
Recent Activity
updated a Space about 13 hours ago
mrinaalarora/textarena-wordle-env updated a Space about 17 hours ago
mrinaalarora/drylabsim published a Space 1 day ago
mrinaalarora/drylabsimOrganizations
None yet