High-quality, efficient voice cloning. Just 100M parameters.
All-in-one hub of general-purpose tools useful for any agent
Generate speech from text using multiple TTS services
Chat with AI using text and images
Generate images from text prompts