Speaking issues Coqui TTS Tacotron2 DDC model: Difference between revisions

Created page with "Category:Coqui Category:TTS models Category:Pronunciation issues *TTS model: tts_models/en/ljspeech/tacotron2-DDC *Vocoder model: vocoder_models/en/ljspeech/hifig..."
 
m Rollback changes
Tags: Manual revert Visual edit
 
(4 intermediate revisions by 2 users not shown)
Line 2: Line 2:
[[Category:TTS models]]
[[Category:TTS models]]
[[Category:Pronunciation issues]]
[[Category:Pronunciation issues]]
== General ==
Most models are trained with a dot, exclamation or question mark at the end. So always end a sentence to avoid model synthesizing weird output.


*TTS model: tts_models/en/ljspeech/tacotron2-DDC
*TTS model: tts_models/en/ljspeech/tacotron2-DDC
*Vocoder model: vocoder_models/en/ljspeech/hifigan_v2
*Vocoder model: vocoder_models/en/ljspeech/hifigan_v2


==nelson mandela==
==Input string formatting==
"ah" at end of sentence generally produces strange results. Just the name produces a 12 second clip.
===Phrases ending in "ah"===
"ah" at end of sentence generally produces strange results. Short names produce a 12 second clip.
 
Examples:
 
* Nelson Mandela
* pergola
 
==== Mitigation ====
If at the end of the input, adding punctuation to the end synthesizes correctly:
 
Example "Nelson Mandela" > "Nelson Mandela."
 
=== Acronyms ===
To speak acronyms as letters it needs to be formatted as:<blockquote>"A. B. C. news"</blockquote>Not:<blockquote>"ABC news"
 
"A.B.C. news"</blockquote>


== pergola ==
== Mispronounced Words ==
"ah" at end of sentence generally produces strange results. Just the name produces a 12 second clip.


== ABC news ==
* video
To speak acronyms as letters it needs to be formatted as "A. B. C. news"