Ex0bit/Gemma4-26B-A4B-PRISM-PRO-DQ-GGUF Image-Text-to-Text • 25B • Updated Apr 11 • 10.9k • 71
Qwen3.5 Collection Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 21 days ago • 152
Running on Zero Agents Featured 17 Qwen3 VL Video Grounding 🥠17 Text-guided object tracking, point tracking, reasoning.
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding Paper • 2403.15377 • Published Mar 22, 2024 • 29
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 421k • 1.6k