Recording tipps: Difference between revisions

From Open Voice Technology Wiki
No edit summary
m Fixed some minor typos
Line 2: Line 2:
[[Category:Lessons learned]]
[[Category:Lessons learned]]


When you plan to record a voice dataset to be used for a TTS model training you should check these tipps and tricks:
When you plan to record a voice dataset to be used for a TTS model training you should check these tips and tricks:


* '''Use a good microphone and a quiet recording room setup''' (no computers fans, air conditioning, ...)
* '''Use a good microphone and a quiet recording room setup''' (no computer fans, air conditioning, ...)
* Use a text corpus with cleaned numbers/abbreviations and good phoneme coverage
* Use a text corpus with cleaned numbers/abbreviations and good phoneme coverage
* Read neutral, but with a natural speech flow and do not swallow up letters
* Read in a neutral style, but with a natural speech flow and do not swallow up letters
* Adjust tone and pitch with punctuation
* Adjust tone and pitch with punctuation
* Use a constant recording speed
* Use a constant recording speed
* Check your recordings regularly in high volume for background noise
* Check your recordings regularly in high volume for background noise
* Make breaks regularly and do not record more than four hours a day
* Take breaks regularly and do not record more than four hours a day
* Record error free
* Record error free
* Investing in a quality interface and mic can make a big difference in quality. A 24 bit 96khz interface with a large diaphragm condenser can be had for about $200 USD.
* Investing in a quality interface and mic can make a big difference in quality. A 24 bit 96khz interface with a large diaphragm condenser can be had for about $200 USD.
* Record at the highest quality level practical.  You can convert to lesser formats later, but you can't up convert cleanly
* Record at the highest quality level practical.  You can convert to lesser formats later, but you cannot up convert cleanly
* Review your work at regular intervals and compare with previous recording to ensure consistent quality
* Review your work at regular intervals and compare with previous recording to ensure consistent quality
* Do not be afraid to ask for help! Getting feedback on your data early on can help prevent wasted effort.
* Do not be afraid to ask for help! Getting feedback on your data early on can help prevent wasted effort.

Revision as of 13:54, 12 November 2021


When you plan to record a voice dataset for training a TTS model, check these tips and tricks:

  • Use a good microphone and a quiet recording room setup (no computer fans, air conditioning, ...)
  • Use a text corpus with cleaned numbers/abbreviations and good phoneme coverage
  • Read in a neutral style but with a natural speech flow, and do not swallow letters
  • Adjust tone and pitch with punctuation
  • Keep a constant speaking pace across recordings
  • Check your recordings regularly at high volume for background noise
  • Take breaks regularly and do not record more than four hours a day
  • Record error-free
  • Investing in a quality audio interface and microphone can make a big difference. A 24-bit, 96 kHz interface with a large-diaphragm condenser microphone can be had for about 200 USD.
  • Record at the highest quality level practical. You can convert to lower-quality formats later, but you cannot upconvert cleanly
  • Review your work at regular intervals and compare it with previous recordings to ensure consistent quality
  • Do not be afraid to ask for help! Getting feedback on your data early on can help prevent wasted effort.
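
The advice above to check for background noise and to compare against earlier recordings can be backed up with a few numbers per file. The following is only a minimal sketch, not part of the original tips: the function name `wav_stats`, the 50 ms window, and the use of the quietest window as a noise-floor estimate are illustrative assumptions. It uses only the Python standard library and handles signed integer PCM WAV files (16, 24, or 32 bit).

```python
import math
import wave


def wav_stats(path, window_s=0.05):
    """Report format, peak level, and a rough noise-floor estimate for a WAV file.

    Hypothetical helper for illustration; window_s is an assumed default.
    """
    with wave.open(path, "rb") as wf:
        channels = wf.getnchannels()
        width = wf.getsampwidth()          # bytes per sample
        rate = wf.getframerate()
        raw = wf.readframes(wf.getnframes())

    if width not in (2, 3, 4):
        raise ValueError("this sketch only handles 16/24/32-bit integer PCM")

    # Decode the first channel only; WAV samples are little-endian and signed.
    step = width * channels
    mono = [
        int.from_bytes(raw[i:i + width], "little", signed=True)
        for i in range(0, len(raw) - step + 1, step)
    ]
    full_scale = float(1 << (8 * width - 1))

    def dbfs(value):
        # Level relative to digital full scale; silence maps to -inf.
        return 20 * math.log10(value / full_scale) if value > 0 else float("-inf")

    # RMS of each non-overlapping window; the quietest one approximates
    # the room noise between words.
    win = max(1, int(rate * window_s))
    rms_values = [
        math.sqrt(sum(s * s for s in mono[start:start + win]) / win)
        for start in range(0, len(mono) - win + 1, win)
    ]
    peak = max((abs(s) for s in mono), default=0)

    return {
        "sample_rate": rate,
        "bit_depth": width * 8,
        "channels": channels,
        "duration_s": len(mono) / rate if rate else 0.0,
        "peak_dbfs": dbfs(peak),
        "noise_floor_dbfs": dbfs(min(rms_values)) if rms_values else float("-inf"),
    }
```

Running this on a file from each session and noting how `noise_floor_dbfs` and `peak_dbfs` change over time gives a rough consistency check; it does not replace listening at high volume, and what counts as an acceptable level depends on your setup.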