ASR demo using onnx-asr (Russian models)

Automatic Speech Recognition in Python using ONNX models - onnx-asr

Models used in demo:

  • gigaam-v2-ctc - Sber GigaAM v2 CTC (origin, onnx)
  • gigaam-v2-rnnt - Sber GigaAM v2 RNN-T (origin, onnx)
  • nemo-fastconformer-ru-ctc - Nvidia FastConformer-Hybrid Large (ru) with CTC decoder (origin, onnx)
  • nemo-fastconformer-ru-rnnt - Nvidia FastConformer-Hybrid Large (ru) with RNN-T decoder (origin, onnx)
  • alphacep/vosk-model-ru - Alpha Cephei Vosk 0.54-ru (origin)
  • alphacep/vosk-model-small-ru - Alpha Cephei Vosk 0.52-small-ru (origin)
  • whisper-base - OpenAI Whisper Base exported with onnxruntime (origin, onnx)

output

output