Alibaba-NLP/gme-Qwen2-VL-2B-Instruct Sentence Similarity β’ 2B β’ Updated Jun 9, 2025 β’ 59.1k β’ 133
view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand Dec 4, 2025 β’ 65
RedHatAI/gemma-3-27b-it-FP8-dynamic Image-Text-to-Text β’ 27B β’ Updated Jun 9, 2025 β’ 20.7k β’ 12
Running 11 Jina Embeddings V4 Retrieval Visual π 11 Visualize text-image similarity with interactive heatmaps
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8 Text Generation β’ 235B β’ Updated Jul 30, 2025 β’ 77.2k β’ 82
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8 Text Generation β’ 235B β’ Updated Sep 17, 2025 β’ 739k β’ 146
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12, 2025 β’ 486
view post Post 4772 Qwen 3 can launch very soon. πhttps://github.com/ggml-org/llama.cpp/pull/12828 See translation 3 replies Β· π₯ 16 16 π 9 9 β€οΈ 8 8 + Reply