Data Darwinism Part I: Unlocking the Value of Scientific Data for Pre-training Paper • 2602.07824 • Published 22 days ago • 16
Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts Paper • 2602.13367 • Published 16 days ago • 31
Alyah ⭐️: Toward Robust Evaluation of Emirati Dialect Capabilities in Arabic LLMs Article • Jan 27 • 24
Falcon-H1-Tiny: A series of extremely small, yet powerful language models redefining capabilities at small scale • Running • 38 • Generate text using extremely small yet powerful language models
tiiuae/Falcon-H1-Tiny-90M-Instruct-Curriculum-pre-DPO Text Generation • 91.1M • Updated Jan 15 • 8 • 1