Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Kreshnik 's Collections
music
OCR
3D
Language
Image
Voice
Papers
Model training

Voice

updated Jan 25
Upvote
-

  • microsoft/VibeVoice-1.5B

    Text-to-Speech • 3B • Updated Jan 22 • 220k • 2.24k

  • Running
    Featured
    441

    FastVLM WebGPU

    🍎
    441

    Real-time video captioning powered by FastVLM


  • openbmb/VoxCPM-0.5B

    Text-to-Speech • Updated Sep 19, 2025 • 708 • 766

  • Running on CPU Upgrade
    76

    MiMo-Audio-Chat

    💬
    76

    Chat with Xiaomi MiMo-Audio using voice


  • FlashLabs/Chroma-4B

    Any-to-Any • Updated Jan 28 • 2.87k • 341

  • numind/NuMarkdown-8B-Thinking

    Image-to-Text • Updated Nov 13, 2025 • 54.7k • 447
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs