HyperAI超神经

首页算力平台文档资讯论文教程数据集百科 SOTA LLM 模型天梯 GPU 天梯顶会

中文

HyperAI超神经

Text To Speech Synthesis On Ljspeech

评估指标

Audio Quality MOS

评测结果

各个模型在此基准测试上的表现结果

		Paper Title	Repository
NaturalSpeech	4.56	NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality
VITS	4.43	NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality
Grad-TTS + HiFiGAN (1000 steps)	4.37	Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech
FastSpeech 2 + HiFiGAN	4.34	NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality
Glow-TTS + HiFiGAN	4.34	Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search
FastSpeech 2 + HiFiGAN	4.32	FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
FastDiff (4 steps)	4.28	FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis
FastDiff-TTS	4.03	FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis
Transformer TTS (Mel + WaveGlow)	3.88	Neural Speech Synthesis with Transformer Network
FastSpeech (Mel + WaveGlow)	3.84	FastSpeech: Fast, Robust and Controllable Text to Speech
OverFlow	3.37	OverFlow: Putting flows on top of neural transducers for better TTS
Merlin	2.4	FastSpeech: Fast, Robust and Controllable Text to Speech
temp	1.25	-	-
Flowtron	-	Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis
Matcha-TTS	-	Matcha-TTS: A fast TTS architecture with conditional flow matching
Tacotron 2	-	Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis

0 of 16 row(s) selected.

Text To Speech Synthesis On Ljspeech | SOTA | HyperAI超神经