Editing Recording tipps

Jump to navigation Jump to search
Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.

Latest revision Your text
Line 2: Line 2:
[[Category:Lessons learned]]
[[Category:Lessons learned]]


When you plan to record a voice dataset to be used for a TTS model training you should check these tips and tricks:
When you plan to record a voice dataset to be used for a TTS model training you should check these tipps and tricks:


* '''Use a good microphone and a quiet recording room setup''' (no computer fans, air conditioning, ...)
* '''Use a good microphone and a quiet recording room setup''' (no computers fans, air conditioning, ...)
* Use a text corpus with cleaned numbers/abbreviations and good phoneme coverage
* Use a text corpus with cleaned numbers/abbreviations and good phoneme coverage
* Read in a neutral style, but with a natural speech flow and do not swallow up letters
* Read neutral, but with a natural speech flow and do not swallow up letters
* Adjust tone and pitch with punctuation
* Adjust tone and pitch with punctuation
* Use a constant recording speed
* Use a constant recording speed
* Check your recordings regularly in high volume for background noise
* Check your recordings regularly in high volume for background noise
* Take breaks regularly and do not record more than four hours a day
* Make breaks regularly and do not record more than four hours a day
* Record error free
* Record error free
* Investing in a quality interface and mic can make a big difference in quality. A 24 bit 96khz interface with a large diaphragm condenser can be had for about $200 USD.
* Investing in a quality interface and mic can make a big difference in quality. A 24 bit 96khz interface with a large diaphragm condenser can be had for about $200 USD.
* Record at the highest quality level practical.  You can convert to lesser formats later, but you cannot up convert cleanly
* Record at the highest quality level practical.  You can convert to lesser formats later, but you can't up convert cleanly
* Review your work at regular intervals and compare with previous recording to ensure consistent quality
* Review your work at regular intervals and compare with previous recording to ensure consistent quality
* Do not be afraid to ask for help! Getting feedback on your data early on can help prevent wasted effort.
* Do not be afraid to ask for help! Getting feedback on your data early on can help prevent wasted effort.
*There's a wealth of information on the internet about recording.  For instance, https://wiki.librivox.org/index.php/Newbie_Guide_to_Recording from Librivox is a useful guide with numerous sub pages of information.  Some is audio-book specific, but the majority is useful for anyone recording voice.
*There's a wealth of information on the internet about recording.  For instance, https://wiki.librivox.org/index.php/Newbie_Guide_to_Recording from Librivox is a useful guide with numerous sub pages of information.  Some is audio-book specific, but the majority is useful for anyone recording voice.
Please note that all contributions to Voice Technology Wiki are considered to be released under the Creative Commons Attribution-ShareAlike (see Voice Technology Wiki:Copyrights for details). If you do not want your writing to be edited mercilessly and redistributed at will, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource. Do not submit copyrighted work without permission!
Cancel Editing help (opens in new window)