Speech Recognition On Wenetspeech
Metrics
Character Error Rate (CER)
Results
Performance results of various models on this benchmark
Model Name | Character Error Rate (CER) | Paper Title | Repository |
---|---|---|---|
Kaldi | 9.07 | WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition | - |
Conformer-MoE (32e) | 7.49 | 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition | - |
Paraformer-large | 6.97 | FunASR: A Fundamental End-to-End Speech Recognition Toolkit | - |
Conformer-MoE (16e) | 7.67 | 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition | - |
Wenet | 8.88 | WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition | - |
Espnet | 9.7 | WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition | - |
Zipformer+pruned transducer (no external language model) | 7.29 | Zipformer: A faster and better encoder for automatic speech recognition | - |
Conformer-MoE (64e) | 7.19 | 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition | - |
0 of 8 row(s) selected.