Is this a distilled model?
#1
by sergeantson - opened
Is this model distilled from R1?
No, it is a model fine-tuned with GRPO to gain reasoning capability.
umarigan changed discussion status to closed