Is this distill model?
#1
by
sergeantson - opened
Is this model distillation from R1?
No, it is a fine-tuned model with GPRO methods to gain reasoning capacity
umarigan changed discussion status to
closed
Is this model distillation from R1?
No, it is a fine-tuned model with GPRO methods to gain reasoning capacity