<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://openvoice-tech.net/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=72.128.154.254</id>
	<title>Open Voice Technology Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://openvoice-tech.net/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=72.128.154.254"/>
	<link rel="alternate" type="text/html" href="https://openvoice-tech.net/wiki/Special:Contributions/72.128.154.254"/>
	<updated>2026-05-13T11:37:47Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.43.1</generator>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=A_good_wake_word&amp;diff=1806</id>
		<title>A good wake word</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=A_good_wake_word&amp;diff=1806"/>
		<updated>2021-11-10T09:38:06Z</updated>

		<summary type="html">&lt;p&gt;72.128.154.254: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;What makes a good wake word?&lt;br /&gt;
&lt;br /&gt;
Several factors contribute to the overall quality and usability of a wake word.  Ideally it is a short, memorable, easily pronounced phrase, with uncommon phoneme arrangement.  Most people are familiar with &amp;quot;Hey Google&amp;quot; or &amp;quot;Hey Siri&amp;quot; or &amp;quot;Hey Mycroft&amp;quot;, for instance.  &amp;quot;Alexa&amp;quot; is a single-word wake word, albeit one with an uncommon with uncommon phoneme arrangement.  None of these are more than three syllables.  In &amp;quot;Star Trek&amp;quot; the ship computers are frequently addressed by &amp;quot;Computer&amp;quot;.  This is a reasonable choice, but not without some drawbacks.  The word computer can come up frequently in modern daily conversations. Its phoneme arrangement is not uncommon, potential leading to false activations.  &amp;quot;OK Computer&amp;quot; or &amp;quot;Hey computer&amp;quot; add a bit more complexity to the wake word.  Having a wake word over four or potentially five syllables starts to become clunky in its usage. &lt;br /&gt;
 &lt;br /&gt;
While this document is primarily about English, the concepts can be applied to many other languages.  &lt;br /&gt;
&lt;br /&gt;
Modern English contains 44 phonemes: 24 consonants and 20 vowels. The vowel sounds have two groups based on mouth position, monophthongs and diphthongs.  Simplest way to tell them apart is the e in &amp;quot;thee&amp;quot; is a monophthong, and the -oy in &amp;quot;boy&amp;quot; is a diphthong.  The most common American English phoneme is the schwa, which has an &amp;quot;uh&amp;quot; sound. It can be part of any vowel&#039;s pronunciation, like the first a in &amp;quot;Alexa&amp;quot; or the e in &amp;quot;the&amp;quot;.  A [https://journals.sagepub.com/doi/pdf/10.1207/s15548430jlr3601_5 2004 paper] contains a helpful list of the most common phonemes. CMU also has their handy [pronouncing dictionary](http://www.speech.cs.cmu.edu/cgi-bin/cmudict) to help decipher phonemes.  The CMU dictionary file can also be found [https://github.com/Alexir/CMUdict/blob/master/cmudict-0.7b here].  Utilizing those and other resources you can then start to determine the viability of your wake word&#039;s phoneme structure.  &lt;br /&gt;
&lt;br /&gt;
If your wake word has a lot of rhyme potential, this is a sign it&#039;s probably going to have a lot of potential false activations.  This can be partially mitigated with a quality dataset.  The cadence of the wake word should also be as unique as possible.  &amp;quot;Hey Mycroft&amp;quot; frequently has false activations when someone says &amp;quot;Microsoft&amp;quot;.  For these, finding cadence patterns and matching that to other phrases/words to build the  dataset.  The pronunciation of your wake word would ideally be smooth.  Adding &amp;quot;hey&amp;quot; to the front of things is easy choice as the -ey phoneme leaves the mouth fairly neutral and ready to say another phoneme.  Words like &amp;quot;axlotl&amp;quot; are definitely uncommon and unique, but not as smooth or easy to pronounce.&lt;br /&gt;
&lt;br /&gt;
The accuracy of your listening engine in conjunction with your wake word should minimize false negatives for your target audience first, and minimize false positives second.&lt;/div&gt;</summary>
		<author><name>72.128.154.254</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Mycroft_Precise&amp;diff=1805</id>
		<title>Mycroft Precise</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Mycroft_Precise&amp;diff=1805"/>
		<updated>2021-11-10T09:35:39Z</updated>

		<summary type="html">&lt;p&gt;72.128.154.254: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Category:Mycroft]]&lt;br /&gt;
[[Category:Open Voice Assistants]]&lt;br /&gt;
[[Category:Wake words]]&lt;br /&gt;
&lt;br /&gt;
Mycroft Precise is a simple to use [[:Category:Wake words|wake word listener]] created by [[Mycroft]].&lt;br /&gt;
&lt;br /&gt;
Precise can be used to model custom wake words.  They have a good [https://github.com/MycroftAI/mycroft-precise/wiki/Training-your-own-wake-word#how-to-train-your-own-wake-word documentation page] on that, with additional tips and tricks from other [[Mycroft]] community members [https://github.com/el-tocino/localcroft/blob/master/precise/Precise.md here] and [https://github.com/sparky-vision/mycroft-precise-tips here].&lt;/div&gt;</summary>
		<author><name>72.128.154.254</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=A_good_wake_word&amp;diff=1804</id>
		<title>A good wake word</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=A_good_wake_word&amp;diff=1804"/>
		<updated>2021-11-10T09:29:37Z</updated>

		<summary type="html">&lt;p&gt;72.128.154.254: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;What makes a good wake word?&lt;br /&gt;
&lt;br /&gt;
Several factors contribute to the overall quality and usability of a wake word.  Ideally it is a short, memorable, easily pronounced phrase, with uncommon phoneme arrangement.  Most people are familiar with &amp;quot;Hey Google&amp;quot; or &amp;quot;Hey Siri&amp;quot; or &amp;quot;Hey Mycroft&amp;quot;, for instance.  &amp;quot;Alexa&amp;quot; is a single-word wake word, albeit one with an uncommon with uncommon phoneme arrangement.  None of these are more than three syllables.  In &amp;quot;Star Trek&amp;quot; the ship computers are frequently addressed by &amp;quot;Computer&amp;quot;.  This is a reasonable choice, but not without some drawbacks.  The word computer can come up frequently in modern daily conversations. Its phoneme arrangement is not uncommon, potential leading to false activations.  &amp;quot;OK Computer&amp;quot; or &amp;quot;Hey computer&amp;quot; add a bit more complexity to the wake word.  Having a wake word over four or potentially five syllables starts to become clunky in its usage. &lt;br /&gt;
 &lt;br /&gt;
While this document is primarily about English, the concepts can be applied to many other languages.  &lt;br /&gt;
&lt;br /&gt;
Modern English contains 44 phonemes: 24 consonants and 20 vowels. The vowel sounds have two groups based on mouth position, monophthongs and diphthongs.  Simplest way to tell them apart is the e in &amp;quot;thee&amp;quot; is a monophthong, and the -oy in &amp;quot;boy&amp;quot; is a diphthong.  The most common American English phoneme is the schwa, which has an &amp;quot;uh&amp;quot; sound. It can be part of any vowel&#039;s pronunciation, like the first a in &amp;quot;Alexa&amp;quot; or the e in &amp;quot;the&amp;quot;.  A [https://journals.sagepub.com/doi/pdf/10.1207/s15548430jlr3601_5 2004 paper] contains a helpful list of the most common phonemes. CMU also has their handy [pronouncing dictionary](http://www.speech.cs.cmu.edu/cgi-bin/cmudict) to help decipher phonemes.  The CMU dictionary file can also be found [https://github.com/Alexir/CMUdict/blob/master/cmudict-0.7b here].  Utilizing those and other resources you can then start to determine the viability of your wake word&#039;s phoneme structure.  &lt;br /&gt;
&lt;br /&gt;
If your wake word has a lot of rhyme potential, this is a sign it&#039;s probably going to have a lot of potential false activations.  This can be partially mitigated with a quality dataset.  The cadence of the wake word should also be as unique as possible.  &amp;quot;Hey Mycroft&amp;quot; frequently has false activations when someone says &amp;quot;Microsoft&amp;quot;.  For these, finding cadence patterns and matching that to other phrases/words to build the  dataset.  The pronunciation of your wake word would ideally be smooth.  Adding &amp;quot;hey&amp;quot; to the front of things is easy choice as the -ey phoneme leaves the mouth fairly neutral and ready to say another phoneme.  Words like &amp;quot;axlotl&amp;quot; are definitely uncommon and unique, but not as smooth or easy to pronounce.&lt;/div&gt;</summary>
		<author><name>72.128.154.254</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=A_good_wake_word&amp;diff=1803</id>
		<title>A good wake word</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=A_good_wake_word&amp;diff=1803"/>
		<updated>2021-11-10T09:28:23Z</updated>

		<summary type="html">&lt;p&gt;72.128.154.254: Created page with &amp;quot;What makes a good wake word?  Several factors contribute to the overall quality and usability of a wake word.  Ideally it is a short, memorable, easily pronounced phrase, with...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;What makes a good wake word?&lt;br /&gt;
&lt;br /&gt;
Several factors contribute to the overall quality and usability of a wake word.  Ideally it is a short, memorable, easily pronounced phrase, with uncommon phoneme arrangement.  Most people are familiar with &amp;quot;Hey Google&amp;quot; or &amp;quot;Hey Siri&amp;quot; or &amp;quot;Hey Mycroft&amp;quot;, for instance.  &amp;quot;Alexa&amp;quot; is a single-word wake word, albeit one with an uncommon with uncommon phoneme arrangement.  None of these are more than three syllables.  In &amp;quot;Star Trek&amp;quot; the ship computers are frequently addressed by &amp;quot;Computer&amp;quot;.  This is a reasonable choice, but not without some drawbacks.  The word computer can come up frequently in modern daily conversations. Its phoneme arrangement is not uncommon, potential leading to false activations.  &amp;quot;OK Computer&amp;quot; or &amp;quot;Hey computer&amp;quot; add a bit more complexity to the wake word.  Having a wake word over four or potentially five syllables starts to become clunky in its usage. &lt;br /&gt;
 &lt;br /&gt;
While this document is primarily about English, the concepts can be applied to many other languages.  &lt;br /&gt;
&lt;br /&gt;
Modern English contains 44 phonemes: 24 consonants and 20 vowels. The vowel sounds have two groups based on mouth position, monophthongs and diphthongs.  Simplest way to tell them apart is the e in &amp;quot;thee&amp;quot; is a monophthong, and the -oy in &amp;quot;boy&amp;quot; is a diphthong.  The most common American English phoneme is the schwa, which has an &amp;quot;uh&amp;quot; sound. It can be part of any vowel&#039;s pronunciation, like the first a in &amp;quot;Alexa&amp;quot; or the e in &amp;quot;the&amp;quot;.  A [2004 paper](https://journals.sagepub.com/doi/pdf/10.1207/s15548430jlr3601_5) contains a helpful list of the most common phonemes. CMU also has their handy [pronouncing dictionary](http://www.speech.cs.cmu.edu/cgi-bin/cmudict) to help decipher phonemes.  The CMU dictionary file can also be found [here](https://github.com/Alexir/CMUdict/blob/master/cmudict-0.7b).  Utilizing those and other resources you can then start to determine the viability of your wake word&#039;s phoneme structure.  &lt;br /&gt;
&lt;br /&gt;
If your wake word has a lot of rhyme potential, this is a sign it&#039;s probably going to have a lot of potential false activations.  This can be partially mitigated with a quality dataset.  The cadence of the wake word should also be as unique as possible.  &amp;quot;Hey Mycroft&amp;quot; frequently has false activations when someone says &amp;quot;Microsoft&amp;quot;.  For these, finding cadence patterns and matching that to other phrases/words to build the  dataset.  The pronunciation of your wake word would ideally be smooth.  Adding &amp;quot;hey&amp;quot; to the front of things is easy choice as the -ey phoneme leaves the mouth fairly neutral and ready to say another phoneme.  Words like &amp;quot;axlotl&amp;quot; are definitely uncommon and unique, but not as smooth or easy to pronounce.&lt;/div&gt;</summary>
		<author><name>72.128.154.254</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Mycroft_Mimic&amp;diff=1802</id>
		<title>Mycroft Mimic</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Mycroft_Mimic&amp;diff=1802"/>
		<updated>2021-11-10T08:12:01Z</updated>

		<summary type="html">&lt;p&gt;72.128.154.254: /* Build Mimic on device */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Category:Open Voice Assistants]]&lt;br /&gt;
[[Category:Mycroft]]&lt;br /&gt;
[[Category:TTS]]&lt;br /&gt;
&lt;br /&gt;
==What&#039;s Mycroft Mimic==&lt;br /&gt;
It is a TTS system delivered with Mycroft (&#039;&#039;by Nov 2021 it&#039;s english only&#039;&#039;). It&#039;s available in two installation types:&lt;br /&gt;
&lt;br /&gt;
*Mimic (&#039;&#039;version 1&#039;&#039;) runs locally on device and has an understable but not great quality. But runs on small computer devices like a Raspberry Pi. It&#039;s by default the &amp;quot;fallback&amp;quot; if other voices are not available.&lt;br /&gt;
*Mimic (version 2) is provided by Mycroft AI server/cloud backend infrastructure and offers a better voice quality (&#039;&#039;english only&#039;&#039;).&lt;br /&gt;
&lt;br /&gt;
==Tips &amp;amp; Tricks==&lt;br /&gt;
&lt;br /&gt;
===Build Mimic 1 on device===&lt;br /&gt;
While setting up a Mycroft device installer will ask to build Mimic 1 locally. If you initially skipped this you can also re-run dev_setup.sh with a -fm flag for &amp;quot;force mimic&amp;quot;.&amp;lt;blockquote&amp;gt;./dev_setup.sh -fm&amp;lt;/blockquote&amp;gt;You can do the opposite, by running flag to &amp;quot;skip mimic&amp;quot;:&amp;lt;blockquote&amp;gt;./dev_setup.sh -sm&amp;lt;/blockquote&amp;gt;&lt;/div&gt;</summary>
		<author><name>72.128.154.254</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Category:Wake_words&amp;diff=1801</id>
		<title>Category:Wake words</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Category:Wake_words&amp;diff=1801"/>
		<updated>2021-11-10T08:03:12Z</updated>

		<summary type="html">&lt;p&gt;72.128.154.254: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Category:STT]]&lt;br /&gt;
[[Category:Open Voice Assistants]]&lt;br /&gt;
&lt;br /&gt;
Wake words, sometimes called key words, are a special category of Speech-To-Text. Wake words are used to &amp;quot;wake&amp;quot; a listening device and start its functions. In most cases these &amp;quot;wake words&amp;quot; are detected locally on devices while actual speech recognition is mostly done by internet cloud services. Mycroft defaults to &amp;quot;Hey, Mycroft&amp;quot; for its wake word, for instance. Some platforms allow for multiple wake words to be used. Coqui STT engine can even be configured as a wake word listener.    &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Wake word listeners&#039;&#039;&#039;: &lt;br /&gt;
&lt;br /&gt;
* [[Mycroft Precise]]&lt;br /&gt;
* [[Porcupine]]&lt;br /&gt;
* [[Snowboy]] &lt;br /&gt;
* [[Howl]]&lt;br /&gt;
* [[Coqui]] STT&lt;br /&gt;
* Google tensorflow lite speech recognition &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Customizing wake words&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* What makes [[a good wake word]]?&lt;br /&gt;
* Building a wake word dataset&lt;/div&gt;</summary>
		<author><name>72.128.154.254</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Category:Wake_words&amp;diff=1785</id>
		<title>Category:Wake words</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Category:Wake_words&amp;diff=1785"/>
		<updated>2021-11-09T08:17:53Z</updated>

		<summary type="html">&lt;p&gt;72.128.154.254: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Wake words, sometimes called key words, are a special category of Speech-To-Text. Wake words are used to &amp;quot;wake&amp;quot; a listening device and start its functions.  Mycroft defaults to &amp;quot;Hey, Mycroft&amp;quot; for its wake word, for instance.  Some platforms allow for multiple wake words to be used.  Coqui STT engine can even be configured as a wake word listener.    &lt;br /&gt;
&lt;br /&gt;
Wake word listeners: &lt;br /&gt;
&lt;br /&gt;
* Mycroft Precise &lt;br /&gt;
* Porcupine &lt;br /&gt;
* Snowboy &lt;br /&gt;
* Howl &lt;br /&gt;
*Coqui STT&lt;br /&gt;
*Google tensorflow lite speech recognition &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Customizing wake words&lt;br /&gt;
&lt;br /&gt;
* What makes a good wake word?&lt;br /&gt;
* Building a wake word dataset&lt;/div&gt;</summary>
		<author><name>72.128.154.254</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Category:Wake_words&amp;diff=1784</id>
		<title>Category:Wake words</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Category:Wake_words&amp;diff=1784"/>
		<updated>2021-11-09T08:14:27Z</updated>

		<summary type="html">&lt;p&gt;72.128.154.254: Add wake words.&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Wake words are a special category of Speech-To-Text. Wake words are used to &amp;quot;wake&amp;quot; a listening device and start its functions.  Mycroft defaults to &amp;quot;Hey, Mycroft&amp;quot; for its wake word.  &lt;br /&gt;
&lt;br /&gt;
Wake word listeners: &lt;br /&gt;
&lt;br /&gt;
* Mycroft Precise &lt;br /&gt;
* Porcupine &lt;br /&gt;
* Snowboy &lt;br /&gt;
* Howl &lt;br /&gt;
* ? &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Customizing wake words&lt;br /&gt;
&lt;br /&gt;
* What makes a good wake word?&lt;br /&gt;
* Building a wake word dataset&lt;/div&gt;</summary>
		<author><name>72.128.154.254</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Recording_tipps&amp;diff=1783</id>
		<title>Recording tipps</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Recording_tipps&amp;diff=1783"/>
		<updated>2021-11-09T08:09:02Z</updated>

		<summary type="html">&lt;p&gt;72.128.154.254: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Category:Recording tipps]]&lt;br /&gt;
[[Category:Lessons learned]]&lt;br /&gt;
&lt;br /&gt;
When you plan to record a voice dataset to be used for a TTS model training you should check these tipps and tricks:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;&#039;Use a good microphone and a quiet recording room setup&#039;&#039;&#039; (no computers fans, air conditioning, ...)&lt;br /&gt;
* Use a text corpus with cleaned numbers/abbreviations and good phoneme coverage&lt;br /&gt;
* Read neutral, but with a natural speech flow and do not swallow up letters&lt;br /&gt;
* Adjust tone and pitch with punctuation&lt;br /&gt;
* Use a constant recording speed&lt;br /&gt;
* Check your recordings regularly in high volume for background noise&lt;br /&gt;
* Make breaks regularly and do not record more than four hours a day&lt;br /&gt;
* Record error free&lt;br /&gt;
* Investing in a quality interface and mic can make a big difference in quality. A 24 bit 96khz interface with a large diaphragm condenser can be had for about $200 USD.&lt;br /&gt;
* Record at the highest quality level practical.  You can convert to lesser formats later, but you can&#039;t up convert cleanly&lt;br /&gt;
* Review your work at regular intervals and compare with previous recording to ensure consistent quality&lt;br /&gt;
* Do not be afraid to ask for help! Getting feedback on your data early on can help prevent wasted effort.&lt;/div&gt;</summary>
		<author><name>72.128.154.254</name></author>
	</entry>
</feed>