STT / README.md
github-actions[bot]
Auto-deploy from GitHub: 898f681ae67425055f39b96fde956658f2fe0d29
f145097
metadata
title: STT Audio Caption Generator
emoji: 🎵
colorFrom: blue
colorTo: purple
sdk: docker
pinned: false
license: mit

Audio Caption Generator

A Python-based audio transcription service with a neobrutalist web interface.

Features

  • 🎵 Audio file upload via REST API
  • 🤖 Automatic STT processing using faster-whisper
  • 💾 SQLite database for queue management
  • 🎨 Neobrutalist UI with smooth animations
  • 🔄 Real-time status updates

Usage

Access the web interface at the Space URL above.

API Endpoints

  • POST /api/upload - Upload audio file
  • GET /api/files - Get all files
  • GET /api/files/<id> - Get specific file

Supported Formats

WAV, MP3, FLAC, OGG, M4A, AAC


Auto-deployed from GitHub