HyperAI超神经

Text-to-Speech Synthesis on LJSpeech

Evaluation Metrics

Audio Quality MOS

Evaluation Results

Performance of each model on this benchmark

Comparison Table

| Model Name | Audio Quality MOS |
| --- | --- |
| fastspeech-fast-robust-and-controllable-text | 3.84 |
| fastdiff-a-fast-conditional-diffusion-model | 4.03 |
| naturalspeech-end-to-end-text-to-speech | 4.34 |
| grad-tts-a-diffusion-probabilistic-model-for | 4.37 |
| flowtron-an-autoregressive-flow-based- | |
| neural-speech-synthesis-with-transformer | 3.88 |
| naturalspeech-end-to-end-text-to-speech | 4.43 |
| matcha-tts-a-fast-tts-architecture-with- | |
| flowtron-an-autoregressive-flow-based- | |
| fastspeech-2-fast-and-high-quality-end-to-end | 4.32 |
| Model 11 | 1.25 |
| fastdiff-a-fast-conditional-diffusion-model | 4.28 |
| overflow-putting-flows-on-top-of-neural | 3.37 |
| naturalspeech-end-to-end-text-to-speech | 4.56 |
| fastspeech-fast-robust-and-controllable-text | 2.4 |
| glow-tts-a-generative-flow-for-text-to-speech | 4.34 |