Research Article

End-to-End Speech Synthesis for Tibetan Multidialect

Table 4

The MOS comparison of speech synthesized by different models.

ModelMOS of Lhasa-Ü-Tsang dialectMOS of Amdo pastoral dialect

Linear predictive amplitude spectrum + Griffin–Lim3.303.52
Mel spectrogram + Griffin–Lim3.553.70
Mel spectrogram + WaveNet3.954.18