MLX 4-bit quantized conversion of Qwen/Qwen3-TTS-12Hz-0.6B-Base for Apple Silicon inference.
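To give a feel for what 4-bit quantization buys on-device, here is a rough back-of-the-envelope sketch of weight storage. It assumes the nominal 0.6B parameter count from the model name, and a group-wise scheme with a group size of 64 and a 16-bit scale and bias per group. That grouping is a common MLX configuration but is not stated in this card, and the function name is hypothetical. Real checkpoints may keep some layers unquantized, so treat the numbers as an order-of-magnitude estimate only.

```swift
import Foundation

/// Rough weight-storage estimate for a group-wise quantized model.
/// All inputs are illustrative assumptions, not values measured from this checkpoint.
func estimatedWeightBytes(parameters: Double,
                          bits: Double,
                          groupSize: Double,
                          scaleBiasBits: Double) -> Double {
    // Each group of `groupSize` weights also stores one scale and one bias,
    // so their cost is spread across the weights in the group.
    let overheadBitsPerWeight = scaleBiasBits / groupSize
    return parameters * (bits + overheadBitsPerWeight) / 8.0
}

// Nominal parameter count taken from the "0.6B" in the model name (assumption).
let parameters = 0.6e9

// Assumed scheme: 4-bit weights, groups of 64, 16-bit scale + 16-bit bias per group.
let quantized = estimatedWeightBytes(parameters: parameters,
                                     bits: 4,
                                     groupSize: 64,
                                     scaleBiasBits: 32)

// Unquantized 16-bit baseline for comparison.
let half = parameters * 16 / 8

print(String(format: "4-bit: %.2f GB, 16-bit: %.2f GB", quantized / 1e9, half / 1e9))
```

Under these assumptions the quantized weights come to roughly a quarter to a third of the 16-bit size, which is the main reason a 4-bit conversion is attractive for Apple Silicon inference.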
Used by the Qwen3TTS module in speech-swift:
import Qwen3TTS
let model = try await Qwen3TTSModel.fromPretrained()
let audio = try model.synthesize("Hello, world!")
Or from the command line:
audio speak "Hello, world!" -o output.wav
Quantization: 4-bit
Base model: Qwen/Qwen3-TTS-12Hz-0.6B-Base