Sanity Checks for Sparse Autoencoders: Do SAEs Beat Random Baselines? Paper • 2602.14111 • Published 4 days ago • 55
Space • Porting nanochat to Transformers: an AI modeling history lesson 📝 • Learn about ML and Transformers through nanochat • 45
Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation Paper • 2511.14993 • Published Nov 19, 2025 • 231
Article • CircleGuardBench: New Standard for Evaluating AI Moderation Models • May 7, 2025 • 59
Space • The Ultra-Scale Playbook 🌌 • The ultimate guide to training LLMs on large GPU clusters • 3.7k
Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian Paper • 2405.13929 • Published May 22, 2024 • 55