
Speaking issues with the Coqui TTS Tacotron2 DDC model

From Open Voice Technology Wiki


  • TTS model: tts_models/en/ljspeech/tacotron2-DDC
  • Vocoder model: vocoder_models/en/ljspeech/hifigan_v2

General

Most models are trained on sentences that end with a period, exclamation mark, or question mark. Always end the input text with one of these to keep the model from synthesizing strange output.
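
The rule above can be applied automatically before text is passed to the synthesizer. The following is a minimal sketch, not part of Coqui TTS: the function name `ensure_sentence_end` and the choice of a period as the default mark are assumptions made for illustration.

```python
def ensure_sentence_end(text: str, default: str = ".") -> str:
    """Return `text` ending in '.', '!' or '?'.

    Hypothetical helper (not a Coqui TTS function). Trailing whitespace
    is removed first; `default` is appended only if the last character
    is not already one of the three sentence-ending marks.
    """
    stripped = text.rstrip()
    if not stripped:
        # Nothing to speak, so nothing to terminate.
        return stripped
    if stripped[-1] in ".!?":
        return stripped
    return stripped + default


if __name__ == "__main__":
    for sample in ["nelson mandela", "pergola", "Is this on?"]:
        print(repr(ensure_sentence_end(sample)))
```

Whether a period is the best default for every input is a judgment call; a question or exclamation mark already present is left untouched.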

nelson mandela

An "ah" sound at the end of a sentence generally produces strange results. Synthesizing just the name produces a 12-second clip.

pergola

An "ah" sound at the end of a sentence generally produces strange results. Just the name produces a 12-second clip.

ABC news

To have an acronym spoken as individual letters, format it with a period and a space after each letter: "A. B. C. news"
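
This formatting can be done in a preprocessing step. The sketch below is a heuristic and not part of Coqui TTS: the name `spell_out_acronyms` is hypothetical, and it assumes that any standalone run of two or more capital letters is an acronym to be read letter by letter. That assumption also catches acronyms normally spoken as words, such as "NASA".

```python
import re

# Assumption: a standalone run of two or more capital letters is an acronym.
# An optional trailing period is consumed so "ABC." does not become "A. B. C..".
_ACRONYM = re.compile(r"\b([A-Z]{2,})\b\.?")


def spell_out_acronyms(text: str) -> str:
    """Rewrite acronyms like 'ABC' as 'A. B. C.' (hypothetical helper)."""

    def _expand(match: re.Match) -> str:
        letters = match.group(1)
        # Put a period after every letter and separate letters with spaces.
        return " ".join(f"{letter}." for letter in letters)

    return _ACRONYM.sub(_expand, text)


if __name__ == "__main__":
    print(spell_out_acronyms("ABC news"))
```

Single capital letters and mixed-case words are left unchanged, so ordinary sentence-initial capitals are not affected.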
