Vision-Language moonshotai/Kimi-VL-A3B-Thinking Image-Text-to-Text • 16B • Updated Jan 30 • 92.7k • 446 OpenGVLab/InternViT-300M-448px-V2_5 Image Feature Extraction • 0.3B • Updated Dec 9, 2024 • 2.85k • 49 LifuWang/DistillT5 0.1B • Updated Apr 11, 2025 • 134 • 29
OpenGVLab/InternViT-300M-448px-V2_5 Image Feature Extraction • 0.3B • Updated Dec 9, 2024 • 2.85k • 49
Vision-Language moonshotai/Kimi-VL-A3B-Thinking Image-Text-to-Text • 16B • Updated Jan 30 • 92.7k • 446 OpenGVLab/InternViT-300M-448px-V2_5 Image Feature Extraction • 0.3B • Updated Dec 9, 2024 • 2.85k • 49 LifuWang/DistillT5 0.1B • Updated Apr 11, 2025 • 134 • 29
OpenGVLab/InternViT-300M-448px-V2_5 Image Feature Extraction • 0.3B • Updated Dec 9, 2024 • 2.85k • 49