nvidia/nemotron-speech-streaming-en-0.6b Automatic Speech Recognition • Updated 11 days ago • 53.3k • 511
mradermacher/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-i1-GGUF 27B • Updated 16 days ago • 114k • 57
Qwen releases 4 new Qwen3.5 Small models: 0.8B • 2B • 4B • 9B! Run Qwen3.5-0.8B, 2B and 4B on your phone. Run 9B on 6GB RAM. The vision reasoning LLMs perform better than models 4x their size. GGUFs to run: https://huggingface.co/collections/unsloth/qwen35 Guide: https://unsloth.ai/docs/models/qwen3.5