Recording tipps: Difference between revisions

From Voice Technology Wiki
Jump to navigation Jump to search
No edit summary
m (Fixed some minor typos)
Tags: Mobile edit Mobile web edit Advanced mobile edit
Line 2: Line 2:
[[Category:Lessons learned]]
[[Category:Lessons learned]]


When you plan to record a voice dataset to be used for a TTS model training you should check these tipps and tricks:
When you plan to record a voice dataset to be used for a TTS model training you should check these tips and tricks:


* '''Use a good microphone and a quiet recording room setup''' (no computers fans, air conditioning, ...)
* '''Use a good microphone and a quiet recording room setup''' (no computer fans, air conditioning, ...)
* Use a text corpus with cleaned numbers/abbreviations and good phoneme coverage
* Use a text corpus with cleaned numbers/abbreviations and good phoneme coverage
* Read neutral, but with a natural speech flow and do not swallow up letters
* Read in a neutral style, but with a natural speech flow and do not swallow up letters
* Adjust tone and pitch with punctuation
* Adjust tone and pitch with punctuation
* Use a constant recording speed
* Use a constant recording speed
* Check your recordings regularly in high volume for background noise
* Check your recordings regularly in high volume for background noise
* Make breaks regularly and do not record more than four hours a day
* Take breaks regularly and do not record more than four hours a day
* Record error free
* Record error free
* Investing in a quality interface and mic can make a big difference in quality. A 24 bit 96khz interface with a large diaphragm condenser can be had for about $200 USD.
* Investing in a quality interface and mic can make a big difference in quality. A 24 bit 96khz interface with a large diaphragm condenser can be had for about $200 USD.
* Record at the highest quality level practical.  You can convert to lesser formats later, but you can't up convert cleanly
* Record at the highest quality level practical.  You can convert to lesser formats later, but you cannot up convert cleanly
* Review your work at regular intervals and compare with previous recording to ensure consistent quality
* Review your work at regular intervals and compare with previous recording to ensure consistent quality
* Do not be afraid to ask for help! Getting feedback on your data early on can help prevent wasted effort.
* Do not be afraid to ask for help! Getting feedback on your data early on can help prevent wasted effort.

Revision as of 14:54, 12 November 2021


When you plan to record a voice dataset to be used for a TTS model training you should check these tips and tricks:

  • Use a good microphone and a quiet recording room setup (no computer fans, air conditioning, ...)
  • Use a text corpus with cleaned numbers/abbreviations and good phoneme coverage
  • Read in a neutral style, but with a natural speech flow and do not swallow up letters
  • Adjust tone and pitch with punctuation
  • Use a constant recording speed
  • Check your recordings regularly in high volume for background noise
  • Take breaks regularly and do not record more than four hours a day
  • Record error free
  • Investing in a quality interface and mic can make a big difference in quality. A 24 bit 96khz interface with a large diaphragm condenser can be had for about $200 USD.
  • Record at the highest quality level practical. You can convert to lesser formats later, but you cannot up convert cleanly
  • Review your work at regular intervals and compare with previous recording to ensure consistent quality
  • Do not be afraid to ask for help! Getting feedback on your data early on can help prevent wasted effort.