MOSS-TTS-Local-Transformer-MLX-8bit

MOSS TTS Local Transformer — MLX 8-bit

This repository contains an MLX-native int8 conversion of MOSS TTS Local Transformer for single-speaker text-to-speech on Apple Silicon.

Note: This repo is a community mirror of the canonical MLX conversion maintained by AppAutomaton at appautomaton/openmoss-tts-local-mlx.

Variants

| Path | Precision |
| --- | --- |
| `mlx-int8/` | int8 quantized weights |

Model Details

Developed by: AppAutomaton
Shared by: mlx-community
Original MLX repo: appautomaton/openmoss-tts-local-mlx
Upstream model: OpenMOSS-Team/MOSS-TTS-Local-Transformer
Task: single-speaker text-to-speech and voice cloning
Runtime: MLX on Apple Silicon

How to Get Started

Command-line generation with mlx-speech:

Generate speech:

python scripts/generate_moss_local.py \
  --text "Hello, this is a test." \
  --output outputs/out.wav

Clone a voice:

python scripts/generate_moss_local.py \
  --mode clone \
  --text "This is a cloned voice." \
  --reference-audio reference.wav \
  --output outputs/clone.wav
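
For batch jobs, the two commands above can be assembled from Python. The sketch below is only a thin wrapper around the script path and flags shown above; the helper names `build_command` and `generate` are hypothetical and not part of mlx-speech.

```python
import subprocess
import sys
from typing import List, Optional

SCRIPT = "scripts/generate_moss_local.py"  # path as used in the commands above


def build_command(text: str, output: str,
                  reference_audio: Optional[str] = None) -> List[str]:
    """Return the argv list for one generation run (hypothetical helper)."""
    cmd = [sys.executable, SCRIPT, "--text", text, "--output", output]
    if reference_audio is not None:
        # Voice cloning uses the same script with a mode flag and a reference clip.
        cmd += ["--mode", "clone", "--reference-audio", reference_audio]
    return cmd


def generate(text: str, output: str,
             reference_audio: Optional[str] = None) -> None:
    """Run the script; raises CalledProcessError if it exits non-zero."""
    subprocess.run(build_command(text, output, reference_audio), check=True)
```

Calling `generate("Hello, this is a test.", "outputs/out.wav")` from the directory that contains `scripts/` would be equivalent to the first command above.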

Minimal Python usage:

from mlx_speech.generation import MossTTSLocalModel

model = MossTTSLocalModel.from_path("mlx-int8")
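
The `from_path("mlx-int8")` call above appears to take a local directory, so the weights need to be on disk first. One way to fetch them is `snapshot_download` from `huggingface_hub`. The sketch below assumes that package is installed and that the weights live under `mlx-int8/` in `mlx-community/MOSS-TTS-Local-Transformer-MLX-8bit`, as listed under Variants; the helper name `fetch_weights` is hypothetical.

```python
from pathlib import Path

REPO_ID = "mlx-community/MOSS-TTS-Local-Transformer-MLX-8bit"
ALLOW_PATTERNS = ["mlx-int8/*"]  # only the int8 variant listed under Variants


def fetch_weights() -> Path:
    """Download the mlx-int8/ folder and return its local path."""
    # Imported lazily so this module loads even without huggingface_hub.
    from huggingface_hub import snapshot_download

    root = snapshot_download(repo_id=REPO_ID, allow_patterns=ALLOW_PATTERNS)
    return Path(root) / "mlx-int8"
```

The returned path could then be passed as `MossTTSLocalModel.from_path(str(fetch_weights()))`, assuming `from_path` accepts any directory path and not only the relative `"mlx-int8"`.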

Notes

This repo contains the quantized MLX runtime artifact only.
The conversion keeps the original local TTS architecture and remaps weights explicitly for MLX inference.
The default runtime path uses W8Abf16 mixed precision (8-bit integer weights, bfloat16 activations) with both the global and local KV caches enabled.
This mirror is a duplicated copy of the original MLX repo, not an automatically synchronized namespace mirror, so it may lag behind the original.
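
To make the W8Abf16 note concrete: in weight-only quantization the weights are stored as 8-bit integers plus a few floating-point parameters and are expanded back to floating point for the matrix multiply, while activations stay in 16-bit float. The pure-Python sketch below shows a generic per-group affine scheme. The grouping and rounding are illustrative assumptions, not necessarily the scheme used by this conversion, and plain Python floats stand in for bf16.

```python
from typing import List, Tuple


def quantize_group(weights: List[float]) -> Tuple[List[int], float, float]:
    """Map one group of floats to 8-bit codes (0..255) plus (scale, bias)."""
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / 255 or 1.0  # fall back to 1.0 for a constant group
    codes = [round((w - lo) / scale) for w in weights]
    return codes, scale, lo


def dequantize_group(codes: List[int], scale: float, bias: float) -> List[float]:
    """Recover approximate floats as code * scale + bias."""
    return [c * scale + bias for c in codes]


group = [-0.50, -0.10, 0.00, 0.25, 0.75]
codes, scale, bias = quantize_group(group)
restored = dequantize_group(codes, scale, bias)

# Rounding to the nearest code bounds the error by half a quantization step.
max_err = max(abs(a - b) for a, b in zip(group, restored))
```

Each weight costs one byte instead of two or four, at the price of a reconstruction error of at most `scale / 2` per value within a group.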

License

Apache 2.0 — following the upstream license published with OpenMOSS-Team/MOSS-TTS-Local-Transformer.
