Medical
  google/medgemma-1.5-4b-it • Image-Text-to-Text • Updated Jan 23 • 173k • 483
  google/medsiglip-448 • Zero-Shot Image Classification • 0.9B • Updated Jul 10, 2025 • 32.7k • 120
  google/medgemma-27b-it • Image-Text-to-Text • Updated Jul 10, 2025 • 32.6k • 316
  google/medgemma-27b-text-it • Text Generation • Updated Sep 16, 2025 • 45.9k • 408
Audio
  nvidia/audio-flamingo-3-hf • Audio-Text-to-Text • Updated Jan 27 • 176k • 173
  facebook/sam-audio-large • Updated Dec 30, 2025 • 29.2k • 372
  google/medasr • Automatic Speech Recognition • Updated Jan 26 • 37.2k • 288
  FunAudioLLM/Fun-CosyVoice3-0.5B-2512 • Text-to-Speech • Updated 27 days ago • 6.42k • 467
OCR
  lightonai/LightOnOCR-1B-1025 • Image-to-Text • Updated 9 days ago • 159k • 246
  tencent/HunyuanOCR • Image-Text-to-Text • Updated Jan 13 • 683k • 555
  PaddlePaddle/PaddleOCR-VL-1.5 • Image-Text-to-Text • Updated about 1 month ago • 22.6k • 425
  PaddlePaddle/PP-DocLayoutV3 • Image Segmentation • Updated about 1 month ago • 14k • 49
Judge
  ai-forever/pollux-judge-32b • Text Generation • 33B • Updated Jun 27, 2025 • 146 • 5
  ai-forever/pollux-judge-32b-r • Text Generation • 33B • Updated Jun 27, 2025 • 4
Ru text encoders
  ai-forever/ru-en-RoSBERTa • Feature Extraction • 0.4B • Updated Sep 26, 2024 • 134k • 76
  Tochka-AI/ruRoPEBert-e5-base-512 • Feature Extraction • 0.1B • Updated Mar 13, 2024 • 3
  Tochka-AI/ruRoPEBert-e5-base-2k • Feature Extraction • 0.1B • Updated Mar 13, 2024 • 2.33k • 11
VLMs
  Qwen/Qwen2-VL-7B-Instruct • Image-Text-to-Text • Updated Feb 6, 2025 • 1.63M • 1.27k
  NVEagle/Eagle-X5-13B-Chat • Image-Text-to-Text • 15B • Updated Sep 16, 2024 • 4 • 28
  internlm/internlm-xcomposer2d5-7b • Visual Question Answering • Updated Jul 22, 2024 • 2.1k • 209
  AIRI-Institute/OmniFusion • Updated Apr 10, 2024 • 59
Translate
  google/translategemma-12b-it • Image-Text-to-Text • Updated Jan 28 • 551k • 264
  tencent/HY-MT1.5-1.8B • Translation • Updated Jan 1 • 27.1k • 572
  google/translategemma-4b-it • Image-Text-to-Text • Updated Jan 28 • 133k • 659
Video encoders
  google/videoprism-lvt-base-f16r288 • Video Classification • Updated Jul 29, 2025 • 101k • 11
  nvidia/omni-embed-nemotron-3b • Feature Extraction • 5B • Updated Oct 9, 2025 • 2.3k • 91
Datasets for Embodied
  agibot-world/AgiBotWorld-Alpha • Viewer • Updated Sep 29, 2025 • 49.8M • 3.79k • 211
  nvidia/PhysicalAI-Autonomous-Vehicles • Updated Jan 21 • 384k • 765
  genrobot2025/10Kh-RealOmin-OpenData • Updated 1 day ago • 51.1k • 187
Text2Image
  stabilityai/stable-diffusion-3-medium • Text-to-Image • Updated Aug 12, 2024 • 8.3k • 4.91k
  black-forest-labs/FLUX.2-dev • Image-to-Image • Updated 12 days ago • 170k • 1.4k
  fal/FLUX.2-dev-Turbo • Text-to-Image • Updated Dec 30, 2025 • 21.5k • 337
  black-forest-labs/FLUX.2-klein-4B • Image-to-Image • Updated 6 days ago • 218k • 503