text-to-speech
FlashSpeech: Efficient Zero-Shot Speech Synthesis
Paper
• 2404.14700
• Published • 32
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
Paper
• 2306.15687
• Published
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Paper
• 2403.03100
• Published • 37
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization
Paper
• 2404.09956
• Published • 11
Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts
Paper
• 2307.07218
• Published • 28
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias
Paper
• 2306.03509
• Published • 5
parler-tts/dac_44khZ_8kbps
76.7M • Updated • 66
• 19
parler-tts/parler_tts_mini_v0.1
Text-to-Speech
• 0.6B • Updated • 3.77k
• 359
Wenetspeech4TTS/WenetSpeech4TTS
Updated • 1.07k
• 85
Text-to-Audio
• Updated • 4
• 9
Feature Extraction
• 96.2M • Updated • 1.3M
• 295
Text-to-Speech
• Updated • 9.65M
• 5.88k
Text-to-Speech
• 4B • Updated • 316
• 526
Text-to-Speech
• Updated • 1.98k
• 1.1k
stepfun-ai/Step-Audio-TTS-3B
Text-to-Speech
• 4B • Updated • 50
• 196
Text-to-Speech
• Updated • 123
• 417
Text-to-Speech
• Updated • 112k
• 2.83k