view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 19 days ago • 867
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 Feb 20 • 503
view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand Dec 4, 2025 • 68
view article Article Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models Nov 19, 2025 • 34
view article Article We’re open-sourcing our text-to-image model and the process behind it Nov 12, 2025 • 97
view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency Jan 30, 2025 • 299
view article Article NVIDIA Releases 8 Million Sample Open Dataset and Tooling for OCR, Image Reasoning, Image and Video QA Tasks Oct 28, 2025 • 17