Reasoning during fine-tune?

by nikich340 - opened Feb 10

Good day. I wonder how you handled the missing reasoning (chains of thought) in the datasets used for fine-tuning. If no CoTs were used, how does that affect the model's general reasoning?
What format and reasoning effort were used during fine-tuning, and what reasoning effort should be used for inference?
Thank you in advance for the answers.
