When Models Manipulate Manifolds: The Geometry of a Counting Task Paper • 2601.04480 • Published Jan 8 • 4
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 Text Generation • 18B • Updated 25 days ago • 566k • 136
Article Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models Jan 6 • 28
meituan-longcat/LongCat-Flash-Thinking-2601 Text Generation • 562B • Updated Jan 23 • 3.43k • 108
unsloth/Qwen3-Next-80B-A3B-Instruct-GGUF Text Generation • 80B • Updated Jan 14 • 11.7k • 174
Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 Sep 11, 2025 • 186
Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 • 124
Article Introducing swift-huggingface: The Complete Swift Client for Hugging Face Dec 5, 2025 • 43