MOS TTS multilanguage reference phrases

From Voice Technology Wiki
Revision as of 22:39, 3 November 2021 by Thorsten (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search


MOS (Mean Opinion Score)

The MOS (Mean Opinion Score) is a numeric value to define a voice quality between 1 (bad) and 5 (good). To make a reliable MOS comparison we should define default phrases (for several languages) for all TTS models. This should avoid "cherry picking" to generate best results based on a specific dataset and TTS model.

Reference phrases for MOS comparion

English

  • "It took me quite a long time to develop a voice, and now that I have it I'm not going to be silent."
  • "Be a voice, not an echo."
  • "I'm sorry Dave. I'm afraid I can't do that."
  • "This cake is great. It's so delicious and moist."
  • feel free to add items here

German

  • "Es dauert lange, eine eigene Stimme zu entwickeln, aber jetzt wo ich sie habe, bin ich nie wieder still."
  • feel free to add items here

French

Spanish