Paper: Robust Speech Recognition via Large-Scale Weak Supervision (arXiv:2212.04356)
This model is a fine-tuned version of openai/whisper-small, trained on the FLEURS dataset.
It transcribes Indonesian speech audio to text.
Use the code below to get started with the model.
!ct2-transformers-converter --model cobrayyxx/whisper-small-indo-transcription --output_dir cobrayyxx/whisper-small-indo-transcription-ct2 --copy_files tokenizer.json preprocessor_config.json --quantization float16

from faster_whisper import WhisperModel

# Load the converted CTranslate2 model from the output directory created above
model_name = "cobrayyxx/whisper-small-indo-transcription-ct2"
model_transcribe = WhisperModel(model_name, device="cpu", compute_type="float32")
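Once the model is loaded, transcription could look roughly like the sketch below. It is a minimal illustration, not part of the original card: the helper names `join_segments` and `transcribe_file`, the `beam_size=5` setting, and the choice to pin `language="id"` are assumptions, and the demo segments are stand-in objects rather than real model output.

```python
from types import SimpleNamespace


def join_segments(segments):
    """Concatenate the text of each segment into one transcript string."""
    return " ".join(seg.text.strip() for seg in segments)


def transcribe_file(audio_path, model_dir="cobrayyxx/whisper-small-indo-transcription-ct2"):
    """Transcribe one audio file with the converted model (hypothetical helper)."""
    # Imported lazily so join_segments can be used without faster-whisper installed.
    from faster_whisper import WhisperModel

    model = WhisperModel(model_dir, device="cpu", compute_type="float32")
    # language="id" pins decoding to Indonesian; beam_size=5 is an illustrative choice.
    segments, info = model.transcribe(audio_path, language="id", beam_size=5)
    # segments is a generator; decoding actually runs while it is consumed here.
    return join_segments(segments)


if __name__ == "__main__":
    # Stand-in segments (not real model output), used only to show the joining step.
    demo = [SimpleNamespace(text=" Selamat pagi."), SimpleNamespace(text=" Apa kabar?")]
    print(join_segments(demo))
```

To transcribe a real recording, call `transcribe_file` with the path to your own audio file. Each segment returned by faster-whisper also carries `start` and `end` timestamps if you need them.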
@misc{radford2022whisper,
doi = {10.48550/ARXIV.2212.04356},
url = {https://arxiv.org/abs/2212.04356},
author = {Radford, Alec and Kim, Jong Wook and Xu, Tao and Brockman, Greg and McLeavey, Christine and Sutskever, Ilya},
title = {Robust Speech Recognition via Large-Scale Weak Supervision},
publisher = {arXiv},
year = {2022},
copyright = {arXiv.org perpetual, non-exclusive license}
}
Base model: openai/whisper-small