Efficient RL Training for LLMs with Experience Replay • Paper • 2604.08706 • Published 12 days ago • 17
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability • Paper • 2601.18778 • Published Jan 26 • 42