Marco
AI & ML interests
Recent Activity
Organizations
-
stepfun-ai/GOT-OCR-2.0-hf
Image-Text-to-Text • 0.6B • Updated • 16.7k • 229 - Runtime error85
GOT OCR Transformers
📷85Demo of GOT-OCR 2.0's Transformers implementation
-
allenai/olmOCR-7B-0225-preview
Image-Text-to-Text • 8B • Updated • 9.99k • 701 -
allenai/olmOCR-mix-0225
Viewer • Updated • 259k • 776 • 170
- Running554
DeepSeek-R1 WebGPU
🧠554Next-generation reasoning model that runs locally in-browser
- Running100
Qwen2.5-1M Demo
💻100Ask questions about your uploaded documents
-
mistralai/Mistral-Small-24B-Base-2501
Updated • 16k • 261 -
deepseek-ai/deepseek-vl2-small
Image-Text-to-Text • 16B • Updated • 11k • 179
- RunningFeatured262
Qwen3 Omni Demo
⚡262Chat with multimodal AI using text, audio, images, and video
- Running62
Qwen3 Omni Captioner Demo
🐠62Generate captions from audio
-
Qwen/Qwen3-Omni-30B-A3B-Thinking
Any-to-Any • 32B • Updated • 31k • 294 -
Qwen/Qwen3-Omni-30B-A3B-Instruct
Any-to-Any • 35B • Updated • 358k • 904
- RunningMCP131
Consilium MCP Server
🏢131Multi-AI Expert Consensus Platform
- Runtime errorMCP2
MCP Hackathon Deepfake Watchdog
🛡2Upload your image and/or voice to scan for deepfake misuse o
- Runtime error35
VulnBuster
🛡35AI Security Agent: Multi-MCP Code Vulnerability Scanner
- RunningMCP196
AI Marketing Content Generator
🎨196An AI-powered tool made for content creators and marketers
-
nvidia/parakeet-tdt-0.6b-v2
Automatic Speech Recognition • Updated • 168k • 1.45k - Running on T4Featured467
Parakeet-TDT-0.6b-V2
467Transcribe audio files with timestamps and download transcripts
- Running on CPU Upgrade33
Blazing Fast Whisper
👁33Blazing Fast Whisper Deployed on HF Inference Endpoints
- Running on CPU UpgradeFeatured1.31k
Open ASR Leaderboard
🏆1.31kExplore speech recognition model benchmarks and request new ones
- Running on T4131
RF-DETR
🔥131SOTA real-time object detection model
- Running on CPU Upgrade50
YOLO ARENA
🏟50compare performance of top object detectors
- Running on ZeroFeatured91
D-Fine - SOTA Real-Time Object Detector
⚡91Object Detection on Images and Video
- Running on Zero30
Gaze LLE
👀30Gaze Target Estimation
- Running on ZeroMCPFeatured582
LatentSync
👄582Audio Conditioned LipSync with Latent Diffusion Models
- Paused228
BEN2
🚀228Remove background from images and videos
- Build error81
SmolVLM
📊81Generate answers by combining text and images
- Build error59
SmolVLM2 HighlightGenerator
🐨59Generate video highlights from uploaded video
-
NexaAI/Qwen2-Audio-7B-GGUF
Audio-Text-to-Text • 8B • Updated • 4.55k • 170 -
kyutai/hibiki-2b-pytorch-bf16
Translation • Updated • 25 • 61 -
Zyphra/Zonos-v0.1-hybrid
Text-to-Speech • Updated • 1.7k • 1.1k - Running on ZeroFeatured686
Di♪♪Rhythm
🎶686Blazingly Fast and Embarrassingly Simple Song Generation
-
onnx-community/Kokoro-82M-ONNX
Text-to-Speech • Updated • 32.2k • 171 - Running222
Kokoro Text-to-Speech
🗣222High-quality speech synthesis powered by Kokoro TTS
-
NexaAI/Qwen2-Audio-7B-GGUF
Audio-Text-to-Text • 8B • Updated • 4.55k • 170 -
jonatasgrosman/wav2vec2-large-xlsr-53-english
Automatic Speech Recognition • 0.3B • Updated • 42.4k • 478
- RunningFeatured262
Qwen3 Omni Demo
⚡262Chat with multimodal AI using text, audio, images, and video
- Running62
Qwen3 Omni Captioner Demo
🐠62Generate captions from audio
-
Qwen/Qwen3-Omni-30B-A3B-Thinking
Any-to-Any • 32B • Updated • 31k • 294 -
Qwen/Qwen3-Omni-30B-A3B-Instruct
Any-to-Any • 35B • Updated • 358k • 904
- RunningMCP131
Consilium MCP Server
🏢131Multi-AI Expert Consensus Platform
- Runtime errorMCP2
MCP Hackathon Deepfake Watchdog
🛡2Upload your image and/or voice to scan for deepfake misuse o
- Runtime error35
VulnBuster
🛡35AI Security Agent: Multi-MCP Code Vulnerability Scanner
- RunningMCP196
AI Marketing Content Generator
🎨196An AI-powered tool made for content creators and marketers
-
nvidia/parakeet-tdt-0.6b-v2
Automatic Speech Recognition • Updated • 168k • 1.45k - Running on T4Featured467
Parakeet-TDT-0.6b-V2
467Transcribe audio files with timestamps and download transcripts
- Running on CPU Upgrade33
Blazing Fast Whisper
👁33Blazing Fast Whisper Deployed on HF Inference Endpoints
- Running on CPU UpgradeFeatured1.31k
Open ASR Leaderboard
🏆1.31kExplore speech recognition model benchmarks and request new ones
- Running on T4131
RF-DETR
🔥131SOTA real-time object detection model
- Running on CPU Upgrade50
YOLO ARENA
🏟50compare performance of top object detectors
- Running on ZeroFeatured91
D-Fine - SOTA Real-Time Object Detector
⚡91Object Detection on Images and Video
- Running on Zero30
Gaze LLE
👀30Gaze Target Estimation
-
stepfun-ai/GOT-OCR-2.0-hf
Image-Text-to-Text • 0.6B • Updated • 16.7k • 229 - Runtime error85
GOT OCR Transformers
📷85Demo of GOT-OCR 2.0's Transformers implementation
-
allenai/olmOCR-7B-0225-preview
Image-Text-to-Text • 8B • Updated • 9.99k • 701 -
allenai/olmOCR-mix-0225
Viewer • Updated • 259k • 776 • 170
- Running on ZeroMCPFeatured582
LatentSync
👄582Audio Conditioned LipSync with Latent Diffusion Models
- Paused228
BEN2
🚀228Remove background from images and videos
- Build error81
SmolVLM
📊81Generate answers by combining text and images
- Build error59
SmolVLM2 HighlightGenerator
🐨59Generate video highlights from uploaded video
-
NexaAI/Qwen2-Audio-7B-GGUF
Audio-Text-to-Text • 8B • Updated • 4.55k • 170 -
kyutai/hibiki-2b-pytorch-bf16
Translation • Updated • 25 • 61 -
Zyphra/Zonos-v0.1-hybrid
Text-to-Speech • Updated • 1.7k • 1.1k - Running on ZeroFeatured686
Di♪♪Rhythm
🎶686Blazingly Fast and Embarrassingly Simple Song Generation
- Running554
DeepSeek-R1 WebGPU
🧠554Next-generation reasoning model that runs locally in-browser
- Running100
Qwen2.5-1M Demo
💻100Ask questions about your uploaded documents
-
mistralai/Mistral-Small-24B-Base-2501
Updated • 16k • 261 -
deepseek-ai/deepseek-vl2-small
Image-Text-to-Text • 16B • Updated • 11k • 179
-
onnx-community/Kokoro-82M-ONNX
Text-to-Speech • Updated • 32.2k • 171 - Running222
Kokoro Text-to-Speech
🗣222High-quality speech synthesis powered by Kokoro TTS
-
NexaAI/Qwen2-Audio-7B-GGUF
Audio-Text-to-Text • 8B • Updated • 4.55k • 170 -
jonatasgrosman/wav2vec2-large-xlsr-53-english
Automatic Speech Recognition • 0.3B • Updated • 42.4k • 478