Please refer to the main model card

This model page contains the Moshika (female voice) model weights for the rust backend of the MoshiVis repo, in Q8 format. We provide the same model weights for other backends and quantization formats in the associated model collection.

Downloads last month: 885

GGUF

Model size

8B params

Architecture

undefined

Hardware compatibility

8-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for kyutai/moshika-vis-candle-q8

Base model

google/paligemma2-3b-pt-448

Quantized

(2)

this model

Collection including kyutai/moshika-vis-candle-q8

MoshiVis v0.1

Collection

MoshiVis is a Vision Speech Model built as a perceptually-augmented version of Moshi v0.1 for conversing about image inputs • 9 items • Updated Dec 23, 2025 • 23