ThinkSound
π
318
Generate audio for a silent video using text prompts
Deeply interrogate audio file content
Zero-Shot Material Transfer from a Single Image
Multi-AI Expert Consensus Platform
AI generates PPT with diagrams and images from given topics
The agent using over 9000 vision models from the HF Hub.
Demo for Nanonets-OCR
EfficientVLM
coreOCR / Camel-Doc-OCR / docscopeOCR / MonkeyOCR
Dolphin Demo