TripoSG
Generate textured 3D models from a single image
OmniParser, turn your LLM into GUI agent
Consistency generation of portrait and subject
High-quality speech synthesis powered by Kokoro TTS
FitDiT is a high-fidelity virtual try-on model.
Detect and visualize human poses in images or videos
Vote on the latest TTS models!
Generate 3D models from images
Upscale images with control and customization
Image to Video Generation
Transcribe or translate audio from mic, files, or YouTube
Execute dynamic Python scripts from environment variables