The Depth Ceiling: On the Limits of Large Language Models in Discovering Latent Planning Paper β’ 2604.06427 β’ Published 6 days ago β’ 7
view article Article How I contributed a new model to the Transformers library using Codex 13 days ago β’ 45
Reasoning Shift: How Context Silently Shortens LLM Reasoning Paper β’ 2604.01161 β’ Published 11 days ago β’ 29
mistralai/Voxtral-Mini-4B-Realtime-2602 Automatic Speech Recognition β’ 4B β’ Updated Mar 11 β’ 878k β’ 809
Running 28 Open Source AI Year In Review 2025 π 28 Reviewing Progress of the Open Source Ecosystem
view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand Dec 4, 2025 β’ 68