On the Optimal Reasoning Length for RL-Trained Language Models
Paper
•
2602.09591
•
Published
•
3
Formerly, MDEL, we have renamed ourselves after the model we deployed, Aurora-M. Visit us here: https://huggingface.co/aurora-m