Reasoning during fine-tune?
#1
by nikich340 - opened
Good day, I wonder how did you solve missed reasoning (chains of thoughts) in datasets used for fine-tuning. If there were no any CoTs used, how does it affect general model reasoning?
What format and reasoning effort were used during fine-tuning, and what reasoning effort should be used for inference?
Thank you in advance for the answers.