OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration Paper • 2602.05400 • Published 17 days ago • 320
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published Aug 7, 2025 • 183 • 21
Running • Featured • Jupyter Agent 2 🏃 • 252 • Generate and run Jupyter notebooks from natural language tasks
why did 36 people unfollow me 😭 we are back in the hundreds. if you become my 500th follower and have proof I'll give you 5 dollars worth of openrouter credits as an API key