MOS TTS multilanguage reference phrases

From Voice Technology Wiki
Jump to navigation Jump to search


MOS (Mean Opinion Score)[edit | edit source]

The MOS (Mean Opinion Score) is a numeric value to define a voice quality between 1 (bad) and 5 (good). To make a reliable MOS comparison we should define default phrases (for several languages) for all TTS models. This should avoid "cherry picking" to generate best results based on a specific dataset and TTS model.

Reference phrases for MOS comparion[edit | edit source]

English[edit | edit source]

  • "It took me quite a long time to develop a voice, and now that I have it I'm not going to be silent."
  • "Be a voice, not an echo."
  • "I'm sorry Dave. I'm afraid I can't do that."
  • "This cake is great. It's so delicious and moist."
  • feel free to add items here

German[edit | edit source]

  • "Es dauert lange, eine eigene Stimme zu entwickeln, aber jetzt wo ich sie habe, bin ich nie wieder still."
  • feel free to add items here

French[edit | edit source]

Spanish[edit | edit source]