Speaking issues Coqui TTS Tacotron2 DDC model: Difference between revisions

Restructure and add section for mispronounced words
Most models are trained on sentences that end with a period, exclamation mark or question mark. Always end the input with one of these to keep the model from synthesizing weird output.


==Input string formatting==
===Phrases ending in "ah"===
Input that ends in "ah" generally produces strange results. A short name on its own can produce a 12 second clip.

Examples:

* Nelson Mandela
* pergola
 
==== Mitigation ====
If the "ah" falls at the very end of the input, appending punctuation makes it synthesize correctly:
 
Example: "Nelson Mandela" > "Nelson Mandela."
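
The mitigation can be applied automatically before text is handed to the model. The following is a minimal Python sketch, not part of Coqui TTS: the function name <code>ensure_terminal_punctuation</code> and the choice of "." as the default terminator are assumptions made for illustration.

```python
# Hypothetical helper (not part of Coqui TTS): make sure the input ends
# with one of the terminators the models are usually trained on.
TERMINATORS = (".", "!", "?")


def ensure_terminal_punctuation(text: str, default: str = ".") -> str:
    """Strip trailing whitespace and append `default` if no terminator is present."""
    stripped = text.rstrip()
    if not stripped:
        # Nothing to speak, so nothing to terminate.
        return stripped
    if stripped.endswith(TERMINATORS):
        return stripped
    return stripped + default


print(ensure_terminal_punctuation("Nelson Mandela"))
```

This sketch only inspects the final character, so input that ends in a closing quote or bracket would still get a period appended after it.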
 
=== Acronyms ===
To have an acronym spoken as individual letters, follow each letter with a period and a space:<blockquote>"A. B. C. news"</blockquote>Not:<blockquote>"ABC news"
 
"A.B.C. news"</blockquote>
 
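The acronym formatting can be done with a small preprocessing step. The following Python sketch is an illustration only: the function name <code>spell_out_acronyms</code> and the rule "any run of two or more capital letters is an acronym" are assumptions, not behaviour of the model or of Coqui TTS.

```python
import re

# Assumption: an acronym is either dotted capitals without spaces ("A.B.C.")
# or a run of two or more capitals ("ABC"), optionally followed by a period.
ACRONYM = re.compile(r"\b(?:[A-Z]\.){2,}|\b[A-Z]{2,}\b\.?")


def spell_out_acronyms(text: str) -> str:
    """Rewrite acronyms as letters separated by a period and a space."""

    def expand(match: re.Match) -> str:
        letters = re.findall(r"[A-Z]", match.group(0))
        return " ".join(letter + "." for letter in letters)

    return ACRONYM.sub(expand, text)


print(spell_out_acronyms("ABC news"))
print(spell_out_acronyms("A.B.C. news"))
```

Because the rule is purely typographic, all-caps words that should be pronounced as a word would also be spelled out. A list of exceptions would be needed for those.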
== Mispronounced Words ==
 
* video