Melvin Vivas PRO
-
Running on CPU Upgrade • 989
Open VLM Leaderboard
VLMEvalKit Evaluation Results Collection
-
Running on Zero • Featured • 412
DeepSeek OCR 2 Demo
Try out DeepSeek-OCR-2 on your PDFs or images
-
Running on Zero • MCP • 61
Multimodal OCR3
Demo of a collection of impressive OCR models on the Hub
-
Qwen/Qwen3-VL-30B-A3B-Instruct
Image-Text-to-Text • 31B • Updated • 1.27M • 529
-
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
Paper • 2306.05685 • Published • 39
-
Attention Is All You Need
Paper • 1706.03762 • Published • 111
-
Does AI-Assisted Coding Deliver? A Difference-in-Differences Study of Cursor's Impact on Software Projects
Paper • 2511.04427 • Published
-
openai/whisper-large-v3
Automatic Speech Recognition • Updated • 6.08M • 5.39k
-
openai/whisper-large-v3-turbo
Automatic Speech Recognition • Updated • 3.13M • 2.82k
-
Running on Zero • MCP • Featured • 813
Whisper Large V3
Transcribe or translate audio from mic, file, or YouTube
-
Running on Zero • Featured • 59
Kugel Audio
Generate natural-sounding speech in European languages with voice cloning
-
Sleeping • 1
Qwen-3-VL-8B OCR Receipts
Structured data parser from receipt images
-
Running • Featured • 250
Qwen3 Omni Demo
Chat with AI via text, voice, image or video; get spoken replies
-
Running on Zero • Featured • 113
VLM Object Understanding
Explore object detection, visual grounding, keypoint Detecti
-
Running • 2
Dataset Card Drafter
Create dataset descriptions and open PRs automatically
-
Running on Zero • Featured • 171
VibeVoice-Realtime-0.5B
Generate natural speech from text
-
microsoft/VibeVoice-1.5B
Text-to-Speech • 3B • Updated • 165k • 2.21k
-
Running • Featured • 381
Qwen3 TTS Demo
Generate speech from text with many voices
-
mradermacher/Qwen3-1.7B-Multilingual-TTS-GGUF
2B • Updated • 4.76k • 4
-
NebulaByte/E-Commerce_Customer_Support_Conversations
Viewer • Updated • 1k • 123 • 47
-
Lakshan2003/customer_service_30k_telcom_client_agent_conversations
Viewer • Updated • 195k • 5 • 3
-
lhoestq/E-Commerce_Customer_Support_Conversations_More_Concise
Viewer • Updated • 1k • 3 • 1