T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground Paper • 2512.10430 • Published Dec 11, 2025 • 115
Interpretability as Compression: Reconsidering SAE Explanations of Neural Activations with MDL-SAEs Paper • 2410.11179 • Published Oct 15, 2024 • 2
Teach Old SAEs New Domain Tricks with Boosting Paper • 2507.12990 • Published Jul 17, 2025 • 12
Alchemist: Turning Public Text-to-Image Data into Generative Gold Paper • 2505.19297 • Published May 25, 2025 • 84
Train Sparse Autoencoders Efficiently by Utilizing Features Correlation Paper • 2505.22255 • Published May 28, 2025 • 24
You Do Not Fully Utilize Transformer's Representation Capacity Paper • 2502.09245 • Published Feb 13, 2025 • 37
The Differences Between Direct Alignment Algorithms are a Blur Paper • 2502.01237 • Published Feb 3, 2025 • 113
Mechanistic Permutability: Match Features Across Layers Paper • 2410.07656 • Published Oct 10, 2024 • 20
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning Paper • 2406.08973 • Published Jun 13, 2024 • 89
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality Paper • 2405.21060 • Published May 31, 2024 • 68
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models Paper • 2404.02258 • Published Apr 2, 2024 • 107
Learn Your Reference Model for Real Good Alignment Paper • 2404.09656 • Published Apr 15, 2024 • 90