minnesotanlp/Finch-8B-KTO
Text Generation • 8B
NLP group at University of Minnesota
Evolution Fine-Tuning: Learning to Discover Across 371 Optimization Tasks
Abstain-R1: Calibrated Abstention and Post-Refusal Clarification via Verifiable RL