ocr tencent/HunyuanOCR Image-Text-to-Text • Updated Jan 13 • 207k • 562 opendatalab/MinerU2.5-2509-1.2B Image-Text-to-Text • 1B • Updated Sep 29, 2025 • 104k • 347 PaddlePaddle/PaddleOCR-VL-1.5 Image-Text-to-Text • 1.0B • Updated 21 days ago • 344k • 530 PaddlePaddle/PaddleOCR-VL Image-Text-to-Text • 1.0B • Updated 12 days ago • 7k • 1.58k
asr FireRedTeam/FireRedASR-AED-L Automatic Speech Recognition • Updated Mar 5, 2025 • 140 • 68 microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 283k • 1.58k
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 283k • 1.58k
ocr tencent/HunyuanOCR Image-Text-to-Text • Updated Jan 13 • 207k • 562 opendatalab/MinerU2.5-2509-1.2B Image-Text-to-Text • 1B • Updated Sep 29, 2025 • 104k • 347 PaddlePaddle/PaddleOCR-VL-1.5 Image-Text-to-Text • 1.0B • Updated 21 days ago • 344k • 530 PaddlePaddle/PaddleOCR-VL Image-Text-to-Text • 1.0B • Updated 12 days ago • 7k • 1.58k
asr FireRedTeam/FireRedASR-AED-L Automatic Speech Recognition • Updated Mar 5, 2025 • 140 • 68 microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 283k • 1.58k
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 283k • 1.58k