General TTS tutorial: Difference between revisions
Jump to navigation
Jump to search
m
no edit summary
mNo edit summary |
mNo edit summary |
||
Line 18: | Line 18: | ||
# Preparing a voice dataset using the text dataset | # Preparing a voice dataset using the text dataset | ||
# Training a TTS model using the voice dataset | # Training a TTS model using the voice dataset | ||
== Building a text dataset == | |||
Training a TTS model requires text and voice dataset. To put it simply, TTS training aims to learn the relationship between text and sound and record that learned information in a file. Then when speech is synthesized the recorded information can be used to convert text to speech. | |||
When choosing text for the dataset it is important to think about the context in which the trained TTS model will be used. For example, if you use legal texts to train the model and then use the model to read everyday speech, then it may not meet your expectations. | |||
''The page is being developed.'' |