HyperAIHyperAI

Speech Recognition On Gigaspeech Dev

Metrics

Word Error Rate (WER)

Results

Performance results of various models on this benchmark

Model Name
Word Error Rate (WER)
Paper TitleRepository
Zipformer+pruned transducer w/ CR-CTC (no external language model)9.95CR-CTC: Consistency regularization on CTC for improved speech recognition-
Zipformer+CR-CTC (no external language model)10.15CR-CTC: Consistency regularization on CTC for improved speech recognition-
Zipformer+pruned transducer (no external language model)10.09CR-CTC: Consistency regularization on CTC for improved speech recognition-
SAMBA ASR9.12Samba-asr state-of-the-art speech recognition leveraging structured state-space models-
Conformer/Transformer-AED10.90GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio-
0 of 5 row(s) selected.
Speech Recognition On Gigaspeech Dev | SOTA | HyperAI