<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://openvoice-tech.net/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Solyarisoftware</id>
	<title>Open Voice Technology Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://openvoice-tech.net/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Solyarisoftware"/>
	<link rel="alternate" type="text/html" href="https://openvoice-tech.net/wiki/Special:Contributions/Solyarisoftware"/>
	<updated>2026-05-12T04:00:05Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.43.1</generator>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Italien_phoneme_list_(it)&amp;diff=2377</id>
		<title>Italien phoneme list (it)</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Italien_phoneme_list_(it)&amp;diff=2377"/>
		<updated>2022-03-27T17:30:00Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: table filled with some test words&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Supported by eSpeak}}&lt;br /&gt;
{{Phoneme list introduction}}&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Italian written version&lt;br /&gt;
!Spoken like&lt;br /&gt;
!Phonemes&lt;br /&gt;
|-&lt;br /&gt;
|caffè&lt;br /&gt;
|caffè&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|thè&lt;br /&gt;
|tè&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|te&lt;br /&gt;
|te&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|solyaris&lt;br /&gt;
|soliaris&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|software&lt;br /&gt;
|softuer&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|trade-off&lt;br /&gt;
|treid off&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
[[Category:Phoneme list]]&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Talk:Python&amp;diff=2227</id>
		<title>Talk:Python</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Talk:Python&amp;diff=2227"/>
		<updated>2022-01-12T08:26:55Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;I propose to add, at the end, a statement like:&lt;br /&gt;
&lt;br /&gt;
Python is de facto the most commonly used programming language in the natural language processing (NLP) field, partly because of its huge ecosystem of open-source packages (libraries), e.g. for developing machine learning / deep learning algorithms ([[Tensorflow|TensorFlow]], [[Pythorch|PyTorch]], etc.), speech recognition ([[Vosk]], [[Coqui]], etc.), sound processing, etc.--[[User:Solyarisoftware|Solyarisoftware]] ([[User talk:Solyarisoftware|talk]]) 11:25, 6 January 2022 (CET)&lt;br /&gt;
:Sounds good to me, [[User:Solyarisoftware|Solyarisoftware]]. Just pinging [[User:Digitalica|Digitalica]] as author. In my personal opinion it could be added.--[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 23:07, 6 January 2022 (CET)&lt;br /&gt;
&lt;br /&gt;
great idea, go ahead ;-)&lt;br /&gt;
:I&#039;ve added text as suggested by Giorgio. --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 17:52, 7 January 2022 (CET)&lt;br /&gt;
&lt;br /&gt;
:thanks--[[User:Solyarisoftware|Solyarisoftware]] ([[User talk:Solyarisoftware|talk]]) 09:26, 12 January 2022 (CET)&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Talk:Python&amp;diff=2207</id>
		<title>Talk:Python</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Talk:Python&amp;diff=2207"/>
		<updated>2022-01-06T10:27:08Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;I propose to add, at the end, a statement like:&lt;br /&gt;
&lt;br /&gt;
Python is de facto the most commonly used programming language in the natural language processing (NLP) field, partly because of its huge ecosystem of open-source packages (libraries), e.g. for developing machine learning / deep learning algorithms ([[Tensorflow|TensorFlow]], [[Pythorch|PyTorch]], etc.), speech recognition ([[Vosk]], [[Coqui]], etc.), sound processing, etc.--[[User:Solyarisoftware|Solyarisoftware]] ([[User talk:Solyarisoftware|talk]]) 11:25, 6 January 2022 (CET)&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Talk:Python&amp;diff=2206</id>
		<title>Talk:Python</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Talk:Python&amp;diff=2206"/>
		<updated>2022-01-06T10:25:25Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: Created page with &amp;quot;I propose to add, at the end, a statement like:  Python is de facto the most common used programming language in natural language processing (NLP) vertical, also because the h...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;I propose to add, at the end, a statement like:&lt;br /&gt;
&lt;br /&gt;
Python is de facto the most commonly used programming language in the natural language processing (NLP) field, partly because of its huge ecosystem of open-source packages (libraries), e.g. for developing machine learning / deep learning algorithms (TensorFlow, PyTorch, etc.), speech recognition (Vosk, Coqui, etc.), sound processing, etc.--[[User:Solyarisoftware|Solyarisoftware]] ([[User talk:Solyarisoftware|talk]]) 11:25, 6 January 2022 (CET)&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Conversational_Design&amp;diff=2205</id>
		<title>Conversational Design</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Conversational_Design&amp;diff=2205"/>
		<updated>2022-01-05T17:54:31Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: Redirected page to Conversation Design&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;#REDIRECT [[Conversation Design]]&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Conversational_Design&amp;diff=2204</id>
		<title>Conversational Design</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Conversational_Design&amp;diff=2204"/>
		<updated>2022-01-05T17:53:06Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: synonym redirection&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;lt;nowiki&amp;gt;#&amp;lt;/nowiki&amp;gt;REDIRECT [[Conversation Design]]&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=User_talk:Solyarisoftware&amp;diff=2203</id>
		<title>User talk:Solyarisoftware</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=User_talk:Solyarisoftware&amp;diff=2203"/>
		<updated>2022-01-05T17:47:01Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: /* Notification test */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Thank you ==&lt;br /&gt;
&lt;br /&gt;
Just wanted to thank you for contributing helpful content here :-).--[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 18:03, 7 December 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
== Notification test ==&lt;br /&gt;
&lt;br /&gt;
Hi [[User:Solyarisoftware|Giorgio]],&lt;br /&gt;
just testing - do you receive a mail notification about that entry? --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 17:53, 5 January 2022 (CET)&lt;br /&gt;
&lt;br /&gt;
hi Thorsten, thanks for your great work; I&#039;m just trying to help/share your seed. &lt;br /&gt;
And yes, I received a mail notification about this entry.--[[User:Solyarisoftware|Solyarisoftware]] ([[User talk:Solyarisoftware|talk]]) 18:47, 5 January 2022 (CET)&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Initiative&amp;diff=2120</id>
		<title>Initiative</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Initiative&amp;diff=2120"/>
		<updated>2021-12-20T08:03:57Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: typo correction&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&#039;&#039;There is a systematic ambiguity with this word  [&#039;&#039;&#039;initiative&#039;&#039;&#039;] in voice-interface design.&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
#&#039;&#039;the first utterance of a dialog pair; for instance, a question that puts &amp;quot;reactive pressure&amp;quot; on the following utterance to be an answer in response.&#039;&#039;&lt;br /&gt;
#&#039;&#039;the flow-control of a dialogue. Whichever agent has control of the conversational flow has initiative.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note that initiative is not just who is talking. In an interview, for instance, the interviewer usually maintains initiative. While both parties speak regularly, and the interviewee often speaks for longer duration, the interviewer controls the flows by setting the agenda, asking the questions, requesting elaborations, and so on.&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Speech systems can be &#039;&#039;&#039;fixed-initiative&#039;&#039;&#039;, in which one agent maintains all the control, or &#039;&#039;&#039;mixed-initiative&#039;&#039;&#039;, in which either agent can take control at any time.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Initiative manager&#039;&#039;&#039; is a software module in some voice-interface designs that weights the initiative toward the system or the user depending on the state of repair, the experience of the user, the requirements of interaction script, and the need for specific dialogue management acts.&#039;&#039; &amp;lt;ref&amp;gt;From page 534 of the book &amp;quot;Voice Interaction Design&amp;quot; (Randy Allen Harris), 2005.&amp;lt;/ref&amp;gt;&lt;br /&gt;
&lt;br /&gt;
---&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;A &#039;&#039;&#039;directed dialogue&#039;&#039;&#039; is a fixed-initiative dialogue fixed on the system.&#039;&#039;&amp;lt;ref&amp;gt;From page 528 of the book &amp;quot;Voice Interaction Design&amp;quot; (Randy Allen Harris), 2005.&amp;lt;/ref&amp;gt; &lt;br /&gt;
&lt;br /&gt;
---&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;The system initiates, and closely directs, all interaction. Most systems deployed thus far used a directed-dialog strategy. A &#039;&#039;&#039;directed dialog&#039;&#039;&#039; for travel planning might sound like this:&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: What&#039;s the departure city?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;Caller: Um, San Francisco&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: And the arrival city?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;Caller: I wanna go to New York&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: OK, what day are you leaving?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;Caller: Next Tuesday&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: Great. And what time do you want to go?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;Caller: Sometime after ten a.m.&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;This directed-dialog is an example of form-filling. The caller is asked a series of directed questions as if the caller were filling out a form.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;With a mixed-initiative dialog strategy, the same travel dialog might allow callers more flexibility in what they can say. First the initiative comes from the caller. Depending on the caller&#039;s response, the system may then take initiative and prompt for missing information:&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: What are your travel plans?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;Caller: I wanna go to New York next Tuesday morning.&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: Ok, and what&#039;s the departure city?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;In this second example, the caller provides several pieces of information for the trip, and then the system takes the initiative and prompts for the rest. All mixed-initiative dialogs need to include back-off strategies to capture missing pieces of information.&#039;&#039;&amp;lt;ref&amp;gt;From pages 63-64 of the book &amp;quot;Voice User Interface Design&amp;quot; by Cohen, Giangola, Balogh, 2004.&amp;lt;/ref&amp;gt;&lt;br /&gt;
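&lt;br /&gt;
The form-filling idea behind both travel dialogs can be sketched in a few lines of Python. This is only a minimal illustration, assuming a hypothetical upstream recognizer that returns the slots heard in each caller utterance as a dict; the names PROMPTS, update and next_prompt are illustrative and do not come from the cited book.&lt;br /&gt;

```python
# Slots of the travel "form", in the order the system would ask for them.
PROMPTS = {
    "departure_city": "What's the departure city?",
    "arrival_city": "And the arrival city?",
    "day": "What day are you leaving?",
    "time": "What time do you want to go?",
}


def update(frame: dict, parsed: dict) -> dict:
    """Merge the slots recognized in one caller utterance into the frame."""
    known = {slot: value for slot, value in parsed.items() if slot in PROMPTS}
    return {**frame, **known}


def next_prompt(frame: dict):
    """Return the prompt for the first unfilled slot, or None when complete."""
    for slot, prompt in PROMPTS.items():
        if slot not in frame:
            return prompt
    return None


# Mixed-initiative turn: the caller fills several slots at once,
# then the system takes the initiative and asks for what is missing.
frame = update({}, {"arrival_city": "New York", "day": "next Tuesday", "time": "morning"})
print(next_prompt(frame))
```

In a directed dialog each caller answer would fill exactly one slot; in the mixed-initiative case a single utterance may fill several, and the system falls back to prompting only for the remaining ones.&lt;br /&gt;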
&lt;br /&gt;
---&lt;br /&gt;
&lt;br /&gt;
In a human-machine interaction, &#039;&#039;&#039;mixed-initiative&#039;&#039;&#039; can also be considered an alternation of flow-control initiative over time.&lt;br /&gt;
&lt;br /&gt;
For example, consider a home automation voice assistant: the initiative could be triggered by a question or command that the user gives to the conversational system.&lt;br /&gt;
&lt;br /&gt;
The user question &#039;&#039;What&#039;s the weather today?&#039;&#039; triggers a turn-taking directed dialog initiated by the user request (&#039;&#039;&#039;reactive-mode/pull-mode&#039;&#039;&#039;).&lt;br /&gt;
&lt;br /&gt;
Besides, the system could initiate a conversation with the user, for example when some relevant event happens, say an input sensor (a light sensor) detects that the ambient lighting is too dark. In this case the system could ask the user: &#039;&#039;Do you want me to turn on the lights?&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
In this case it is the system that initiates a new conversation session, possibly unrelated to the previous conversation/topic (&#039;&#039;&#039;proactive-mode/push mode&#039;&#039;&#039;).&lt;br /&gt;
&lt;br /&gt;
So mixed-initiative occurs in a system that is both reactive to user requests and proactive, initiating new conversations (dialog sessions) with the user. In that sense, the initiative manager is a dialog manager able to manage directed-dialog flows, for example handling dialog digressions and arbitrating between system-initiated notifications/dialogs and a previous conversation initiated by the user (to accomplish a task).&lt;br /&gt;
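&lt;br /&gt;
The arbitration described above could be sketched as follows. This is a hypothetical, minimal Python illustration, not a reference design: the class InitiativeManager, its method names and the policy of deferring system prompts while a user task is open are all assumptions made for the example.&lt;br /&gt;

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class InitiativeManager:
    """Hypothetical arbiter between user-initiated and system-initiated turns."""

    task_active: bool = False  # True while a user-initiated task dialog is open
    pending: deque = field(default_factory=deque)  # deferred system prompts

    def on_user_request(self, utterance: str) -> str:
        # Reactive (pull) mode: the user takes the initiative.
        self.task_active = True
        return f"handling user request: {utterance}"

    def on_system_event(self, prompt: str) -> str:
        # Proactive (push) mode: the system wants the initiative.
        if self.task_active:
            # Assumed policy: never interrupt an open user task, queue instead.
            self.pending.append(prompt)
            return "deferred"
        return f"system asks: {prompt}"

    def on_task_done(self) -> list:
        # The user task is over: release deferred prompts in arrival order.
        self.task_active = False
        released = [f"system asks: {p}" for p in self.pending]
        self.pending.clear()
        return released


manager = InitiativeManager()
manager.on_user_request("What's the weather today?")
manager.on_system_event("Do you want me to turn on the lights?")  # queued
print(manager.on_task_done())
```

A real initiative manager would likely weigh urgency as well, so that, for instance, a safety alert could interrupt an active dialog while a routine notification waits.&lt;br /&gt;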
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references /&amp;gt;&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Notification&amp;diff=2118</id>
		<title>Notification</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Notification&amp;diff=2118"/>
		<updated>2021-12-19T18:38:09Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: notification definition&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;In a human-machine interaction system, for example a virtual assistant, a notification is a &amp;quot;push&amp;quot; message that informs the user about some event likely to be relevant to them. It can be delivered via text or voice.&lt;br /&gt;
&lt;br /&gt;
The notification could be a single-turn stand-alone message, or could initiate a new conversation with the user. In this case the conversation is system-initiated (see [[Initiative]]). &lt;br /&gt;
&lt;br /&gt;
In a voice or voice-only system, for example where the interface devices are smart speakers, voice notifications raise possible user experience issues. For example, too many notifications are annoying, and their interruptions of an active dialog have to be managed by a smart dialog manager/initiative manager (see [[Initiative]]).&lt;br /&gt;
&lt;br /&gt;
The majority of voice virtual assistants do not manage voice notifications and/or mixed-initiative. Amazon Alexa offers a ProactiveEvents API&amp;lt;ref&amp;gt;https://developer.amazon.com/it/blogs/alexa/post/bbf23596-766a-4e7c-8d74-cbfc234b6791/how-to-send-media-event-notifications-to-your-alexa-skill-customers&amp;lt;/ref&amp;gt; for third-party skills. Currently, Amazon Echo Dot smart speakers signal first-party notifications with a visual alert&amp;lt;ref&amp;gt;https://www.youtube.com/watch?v=DmnQO5np7cw&amp;lt;/ref&amp;gt;.&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Initiative&amp;diff=2117</id>
		<title>Initiative</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Initiative&amp;diff=2117"/>
		<updated>2021-12-19T18:16:02Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: better explanation of what is an initiative manager&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&#039;&#039;There is a systematic ambiguity with this word  [&#039;&#039;&#039;mixed-initiative&#039;&#039;&#039;] in voice-interface design.&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
# &#039;&#039;the first utterance of a dialog pair; for instance, a question that puts &amp;quot;reactive pressure&amp;quot; on the following utterance to be an answer in response.&#039;&#039;&lt;br /&gt;
# &#039;&#039;the flow-control of a dialogue. Whichever agent has control of the conversational flow has initiative.&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note that initiative is not just who is talking. In an interview, for instance, the interviewer usually maintains initiative. While both parties speak regularly, and the interviewee often speaks for longer duration, the interviewer controls the flows by setting the agenda, asking the questions, requesting elaborations, and so on.&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Speech systems can be &#039;&#039;&#039;fixed-initiative&#039;&#039;&#039;, in which one agent maintains all the control, or &#039;&#039;&#039;mixed-initiative&#039;&#039;&#039;, in which either agent can take control at any time.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Initiative manager&#039;&#039;&#039; is a software module in some voice-interface designs that weights the initiative toward the system or the user depending on the state of repair, the experience of the user, the requirements of interaction script, and the need for specific dialogue management acts.&#039;&#039; &amp;lt;ref&amp;gt;From page 534 of the book &amp;quot;Voice Interaction Design&amp;quot; (Randy Allen Harris), 2005.&amp;lt;/ref&amp;gt;&lt;br /&gt;
&lt;br /&gt;
---&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;A &#039;&#039;&#039;directed dialogue&#039;&#039;&#039; is a fixed-initiative dialogue fixed on the system.&#039;&#039;&amp;lt;ref&amp;gt;From page 528 of the book &amp;quot;Voice Interaction Design&amp;quot; (Randy Allen Harris), 2005.&amp;lt;/ref&amp;gt; &lt;br /&gt;
&lt;br /&gt;
---&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;The system initiates, and closely directs, all interaction. Most systems deployed thus far used a directed-dialog strategy. A &#039;&#039;&#039;directed dialog&#039;&#039;&#039; for travel planning might sound like this:&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: What&#039;s the departure city?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;Caller: Um, San Francisco&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: And the arrival city?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;Caller: I wanna go to New York&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: OK, what day are you leaving?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;Caller: Next Tuesday&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: Great. And what time do you want to go?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;Caller: Sometime after ten a.m.&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;This directed-dialog is an example of form-filling. The caller is asked a series of directed questions as if the caller were filling out a form.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;With a mixed-initiative dialog strategy, the same travel dialog might allow callers more flexibility in what they can say. First the initiative comes from the caller. Depending on the caller&#039;s response, the system may then take initiative and prompt for missing information:&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: What are your travel plans?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;Caller: I wanna go to New York next Tuesday morning.&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: Ok, and what&#039;s the departure city?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;In this second example, the caller provides several pieces of information for the trip, and then the system takes the initiative and prompts for the rest. All mixed-initiative dialogs need to include back-off strategies to capture missing pieces of information.&#039;&#039;&amp;lt;ref&amp;gt;From pages 63-64 of the book &amp;quot;Voice User Interface Design&amp;quot; by Cohen, Giangola, Balogh, 2004.&amp;lt;/ref&amp;gt;&lt;br /&gt;
&lt;br /&gt;
---&lt;br /&gt;
&lt;br /&gt;
In a human-machine interaction, &#039;&#039;&#039;mixed-initiative&#039;&#039;&#039; can also be considered an alternation of flow-control initiative over time.&lt;br /&gt;
&lt;br /&gt;
For example, consider a home automation voice assistant: the initiative could be triggered by a question or command that the user gives to the conversational system.&lt;br /&gt;
&lt;br /&gt;
The user question &#039;&#039;What&#039;s the weather today?&#039;&#039; triggers a turn-taking directed dialog initiated by the user request (&#039;&#039;&#039;reactive-mode/pull-mode&#039;&#039;&#039;).&lt;br /&gt;
&lt;br /&gt;
Besides, the system could initiate a conversation with the user, for example when some relevant event happens, say an input sensor (a light sensor) detects that the ambient lighting is too dark. In this case the system could ask the user: &#039;&#039;Do you want me to turn on the lights?&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
In this case it is the system that initiates a new conversation session, possibly unrelated to the previous conversation/topic (&#039;&#039;&#039;proactive-mode/push mode&#039;&#039;&#039;).&lt;br /&gt;
&lt;br /&gt;
So mixed-initiative occurs in a system that is both reactive to user requests and proactive, initiating new conversations (dialog sessions) with the user. In that sense, the initiative manager is a dialog manager able to manage directed-dialog flows, for example handling dialog digressions and arbitrating between system-initiated notifications/dialogs and a previous conversation initiated by the user (to accomplish a task).&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Initiative&amp;diff=2113</id>
		<title>Initiative</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Initiative&amp;diff=2113"/>
		<updated>2021-12-19T11:54:25Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: initiative, mixed-initiative, directed-dialog&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&#039;&#039;There is a systematic ambiguity with this word in voice-interface design.&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
# &#039;&#039;the first utterance of a dialog pair; for instance, a question that puts &amp;quot;reactive pressure&amp;quot; on the following utterance to be an answer in response.&#039;&#039;&lt;br /&gt;
# &#039;&#039;the flow-control of a dialogue. Whichever agent has control of the conversational flow has initiative.&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note that initiative is not just who is talking. In an interview, for instance, the interviewer usually maintains initiative. While both parties speak regularly, and the interviewee often speaks for longer duration, the interviewer controls the flows by setting the agenda, asking the questions, requesting elaborations, and so on.&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Speech systems can be &#039;&#039;&#039;fixed-initiative&#039;&#039;&#039;, in which one agent maintains all the control, or &#039;&#039;&#039;mixed-initiative&#039;&#039;&#039;, in which either agent can take control at any time.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Initiative manager is a software module in some voice-interface designs that weights the initiative toward the system or the user depending on the state of repair, the experience of the user, the requirements of interaction script, and the need for specific dialogue management acts.&#039;&#039; &amp;lt;ref&amp;gt;From page 534 of the book &amp;quot;Voice Interaction Design&amp;quot; (Randy Allen Harris), 2005.&amp;lt;/ref&amp;gt;&lt;br /&gt;
&lt;br /&gt;
---&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;A &#039;&#039;&#039;directed dialogue&#039;&#039;&#039; is a fixed-initiative dialogue fixed on the system.&#039;&#039;&amp;lt;ref&amp;gt;From page 528 of the book &amp;quot;Voice Interaction Design&amp;quot; (Randy Allen Harris), 2005.&amp;lt;/ref&amp;gt; &lt;br /&gt;
&lt;br /&gt;
---&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;The system initiates, and closely directs, all interaction. Most systems deployed thus far used a directed-dialog strategy. A directed dialog for travel planning might sound like this:&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: What&#039;s the departure city?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;Caller: Um, San Francisco&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: And the arrival city?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;Caller: I wanna go to New York&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: OK, what day are you leaving?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;Caller: Next Tuesday&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: Great. And what time do you want to go?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;Caller: Sometime after ten a.m.&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;This directed-dialog is an example of form-filling. The caller is asked a series of directed questions as if the caller were filling out a form.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;With a mixed-initiative dialog strategy, the same travel dialog might allow callers more flexibility in what they can say. First the initiative comes from the caller. Depending on the caller&#039;s response, the system may then take initiative and prompt for missing information:&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: What are your travel plans?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;Caller: I wanna go to New York next Tuesday morning.&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;System: Ok, and what&#039;s the departure city?&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;In this second example, the caller provides several pieces of information for the trip, and then the system takes the initiative and prompts for the rest. All mixed-initiative dialogs need to include back-off strategies to capture missing pieces of information.&#039;&#039;&amp;lt;ref&amp;gt;From pages 63-64 of the book &amp;quot;Voice User Interface Design&amp;quot; by Cohen, Giangola, Balogh, 2004.&amp;lt;/ref&amp;gt;&lt;br /&gt;
&lt;br /&gt;
---&lt;br /&gt;
&lt;br /&gt;
In a human-machine interaction, &#039;&#039;&#039;mixed-initiative&#039;&#039;&#039; can also be considered an alternation of flow-control initiative over time. For example, consider a home automation voice assistant. The initiative could be triggered by a question or command that the user gives to the conversational system. Say, &amp;quot;What&#039;s the weather today?&amp;quot;: this triggers a turn-taking directed dialogue initiated by a user request (&#039;&#039;&#039;reactive-mode/pull-mode&#039;&#039;&#039;).&lt;br /&gt;
&lt;br /&gt;
Also, the system could initiate a conversation with the user, for example when some relevant event happens, say an input sensor (a light sensor) detects that it&#039;s too dark. In this case the system could ask the user &amp;quot;Do you want me to turn on the light?&amp;quot;. Here it is the system that initiates a conversation session (&#039;&#039;&#039;proactive-mode/push mode&#039;&#039;&#039;).&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Comparison_of_voice_assistants&amp;diff=2112</id>
		<title>Comparison of voice assistants</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Comparison_of_voice_assistants&amp;diff=2112"/>
		<updated>2021-12-19T11:05:24Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;There are lots of (open) voice assistants out there. Maybe we can make a comparison list of which assistants exist, where they are alike, and where they differ.&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+Comparison of available voice assistants&lt;br /&gt;
!&lt;br /&gt;
![[Mycroft]] AI&lt;br /&gt;
![[SEPIA]]&lt;br /&gt;
![[Rhasspy]]&lt;br /&gt;
![[Leon]]&lt;br /&gt;
![[Genie]]&lt;br /&gt;
|-&lt;br /&gt;
|Target group&lt;br /&gt;
|&lt;br /&gt;
|Makers, tinkerers, smart-home enthusiasts,&lt;br /&gt;
end-users via mobile app &lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|anyone, private user or enterprise&lt;br /&gt;
|-&lt;br /&gt;
|License&lt;br /&gt;
|&lt;br /&gt;
|MIT&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|Apache 2.0&lt;br /&gt;
https://github.com/stanford-oval/genie-toolkit/blob/master/LICENSE&lt;br /&gt;
|-&lt;br /&gt;
|Requires internet access&lt;br /&gt;
|&lt;br /&gt;
|100% offline is possible, but services&lt;br /&gt;
like Wikipedia or news require internet&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|Offline engines, but services such as ThingTalk (for web services) require internet&lt;br /&gt;
|-&lt;br /&gt;
|Offline STT&lt;br /&gt;
|&lt;br /&gt;
|via [[SEPIA Speech-To-Text Server|SEPIA STT]]&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|No&lt;br /&gt;
|-&lt;br /&gt;
|Offline TTS&lt;br /&gt;
|&lt;br /&gt;
|via SEPIA-Home&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|No&lt;br /&gt;
|-&lt;br /&gt;
|URL&lt;br /&gt;
|&lt;br /&gt;
|https://sepia-framework.github.io/&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|https://oval.cs.stanford.edu/&lt;br /&gt;
https://github.com/stanford-oval&lt;br /&gt;
&lt;br /&gt;
https://github.com/stanford-oval/genie-toolkit&lt;br /&gt;
&lt;br /&gt;
|-&lt;br /&gt;
|Comment&lt;br /&gt;
|&lt;br /&gt;
|Highly customizable. Modules can be distributed to&lt;br /&gt;
multiple Raspberry Pis in home network.&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|Genie is one of the core technologies enabling the Stanford open Virtual Assistant architecture. See: https://oval.cs.stanford.edu/&lt;br /&gt;
|-&lt;br /&gt;
|Linux&lt;br /&gt;
|&lt;br /&gt;
|Yes&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Windows&lt;br /&gt;
|&lt;br /&gt;
|Yes&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Mac OS X&lt;br /&gt;
|&lt;br /&gt;
|Yes&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Raspberry Pi&lt;br /&gt;
|&lt;br /&gt;
|Yes&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Own hw product line&lt;br /&gt;
|&lt;br /&gt;
|No (maybe someday ^^)&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|No&lt;br /&gt;
|-&lt;br /&gt;
|...&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
[[Category:Open Voice Assistants]]&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Genie&amp;diff=2078</id>
		<title>Genie</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Genie&amp;diff=2078"/>
		<updated>2021-12-18T09:38:34Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Genie is an open-source toolkit for building conversational agents in a cost-effective fashion, and a privacy-preserving reference virtual assistant.&lt;br /&gt;
&lt;br /&gt;
Genie is built by the Stanford Open Virtual Assistant Lab (OVAL)&amp;lt;ref name=&amp;quot;:0&amp;quot;&amp;gt;https://oval.cs.stanford.edu/&amp;lt;/ref&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&amp;quot;We have built the first &amp;quot;browser&amp;quot; to the World Wide Voice Web, our Genie open-source virtual assistant. To scale up cost-effectively, we have created a Pretrained Agent Generator that can produce transactional dialogue agents from just database schemas, API signatures, and a few samples of natural language utterances.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;We are now ready to apply it to the WWW. The idea is to standardize on APIs and provide open pretrained agents that interface to the APIs. For example, restaurants provide a menu and an ordering API in the standardized format, and they get a voice agent for ordering food. We believe an open, decentralized voice web ([[WWvW]]) will surpass any proprietary walled gardens.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Decentralization of the voice web promotes equal opportunity, global inclusion and accessibility, and consumer privacy.&amp;quot;&#039;&#039; cit.&amp;lt;ref name=&amp;quot;:0&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Genie is a core component of the Stanford Open Virtual Assistant 2.0 Platform (previously called &#039;&#039;Almond&#039;&#039;).&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
* https://wiki.genie.stanford.edu/en/getting-started/intro-genie&lt;br /&gt;
&lt;br /&gt;
* https://github.com/stanford-oval/genie-toolkit&lt;br /&gt;
&lt;br /&gt;
* https://oval.cs.stanford.edu/workshop/&lt;br /&gt;
&lt;br /&gt;
* https://convcomp.it/whither-almond-the-stanford-university-open-virtual-assistant-will-go-b4d66167e76c&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Genie&amp;diff=2077</id>
		<title>Genie</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Genie&amp;diff=2077"/>
		<updated>2021-12-18T09:36:02Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: genie definition&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Genie is an open-source toolkit for building conversational agents in a cost-effective fashion, and a privacy-preserving reference virtual assistant.&lt;br /&gt;
&lt;br /&gt;
Genie is built by the Stanford Open Virtual Assistant Lab (OVAL)&amp;lt;ref name=&amp;quot;:0&amp;quot;&amp;gt;https://oval.cs.stanford.edu/&amp;lt;/ref&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&amp;quot;We have built the first &amp;quot;browser&amp;quot; to the World Wide Voice Web, our Genie open-source virtual assistant. To scale up cost-effectively, we have created a Pretrained Agent Generator that can produce transactional dialogue agents from just database schemas, API signatures, and a few samples of natural language utterances.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;We are now ready to apply it to the WWW. The idea is to standardize on APIs and provide open pretrained agents that interface to the APIs. For example, restaurants provide a menu and an ordering API in the standardized format, and they get a voice agent for ordering food. We believe an open, decentralized voice web ([[WWvW]]) will surpass any proprietary walled gardens.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Decentralization of the voice web promotes equal opportunity, global inclusion and accessibility, and consumer privacy.&amp;quot;&#039;&#039; cit.&amp;lt;ref name=&amp;quot;:0&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
* https://wiki.genie.stanford.edu/en/getting-started/intro-genie&lt;br /&gt;
&lt;br /&gt;
* https://github.com/stanford-oval/genie-toolkit&lt;br /&gt;
&lt;br /&gt;
* https://oval.cs.stanford.edu/workshop/&lt;br /&gt;
&lt;br /&gt;
* https://convcomp.it/whither-almond-the-stanford-university-open-virtual-assistant-will-go-b4d66167e76c&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Conversational_Agent&amp;diff=2072</id>
		<title>Conversational Agent</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Conversational_Agent&amp;diff=2072"/>
		<updated>2021-12-13T13:26:30Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: added paragraph &amp;quot;The reason why the term agent&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;A &#039;&#039;conversational agent&#039;&#039;, or &#039;&#039;conversational system&#039;&#039;, or &#039;&#039;dialog system&#039;&#039;, is a computer system (hardware+software platform) intended to converse with a human in a natural language.&lt;br /&gt;
&lt;br /&gt;
Let&#039;s see what Wikipedia states for &#039;&#039;conversational agent&#039;&#039;&amp;lt;ref name=&amp;quot;:0&amp;quot;&amp;gt;https://en.wikipedia.org/wiki/Dialogue_system&amp;lt;/ref&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&amp;quot;A dialogue system, or conversational agent (CA), is a computer system intended to converse with a human. Dialogue systems employed one or more of text, speech, graphics, haptics, gestures, and other modes for communication on both the input and output channel&#039;&#039;.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
In common practice, conversational agent is an umbrella term generally used as a synonym of [[chatbot]], but technically a chatbot is a specific kind of conversational system where the user interface (UI) is just text messages, exchanged on chat-based platforms (such as [[instant-messaging]] mobile apps).&lt;br /&gt;
&lt;br /&gt;
== &#039;&#039;The reason why the term agent&#039;&#039; ==&lt;br /&gt;
Especially in academic parlance, the term &#039;&#039;conversational agent&#039;&#039; is a synonym of &#039;&#039;chatbot&#039;&#039;. The emphasis on &#039;&#039;agent&#039;&#039; is perhaps because conversational agents were considered software agents&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Software_agent&amp;lt;/ref&amp;gt;, a special application of intelligent systems&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Intelligent_agent&amp;lt;/ref&amp;gt; and multi-agent systems&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Multi-agent_system&amp;lt;/ref&amp;gt; research.&lt;br /&gt;
&lt;br /&gt;
The term &#039;&#039;agent&#039;&#039; is therefore probably misleading, because it treats any conversational system as a software agent that mediates between an end-user (e.g. a consumer of a business platform) and a human operator on the business side. That is a specific use case and not always applicable.&lt;br /&gt;
&lt;br /&gt;
The agentive role of a conversational system is commonly related to the specific scenario in which the conversational system really acts as an intermediary, requiring a final conversation between the end-user and a human operator (commonly called &#039;&#039;[[human-in-the-loop]]&#039;&#039;).&lt;br /&gt;
&lt;br /&gt;
But there are many cases where the conversational system is not properly an agent, for example pure [[command-and-control]] conversational systems that just execute automation commands to drive (electro-mechanical) actuators, as in [[home-automation]] or industrial automation.&lt;br /&gt;
&lt;br /&gt;
The concept of conversational agent is currently also used as a synonym of [https://openvoice-tech.net/Virtual_assistants virtual assistant], in the sense of a real agentive/automation technology that assists humans in solving real-life problems, in real time, through reactive and proactive dialogs (so-called [https://openvoice-tech.net/Mixed-initiative mixed-initiative]) between the end-user and the assistant.&lt;br /&gt;
&lt;br /&gt;
A concrete example of a conversational agent that is a true agent, in the sense that it acts on behalf of the end-user, conversing and carrying out tasks with other humans, is &#039;&#039;Google Duplex&#039;&#039;&amp;lt;ref&amp;gt;https://ai.googleblog.com/2018/05/duplex-ai-system-for-natural-conversation.html&amp;lt;/ref&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Voice_assistant&amp;diff=2071</id>
		<title>Voice assistant</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Voice_assistant&amp;diff=2071"/>
		<updated>2021-12-13T13:18:52Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: voice assitant definition&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Wikipedia defines a &#039;&#039;&#039;virtual assistant&#039;&#039;&#039; as follows:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;An &#039;&#039;&#039;intelligent virtual assistant&#039;&#039;&#039; (&#039;&#039;&#039;IVA&#039;&#039;&#039;) or &#039;&#039;&#039;intelligent personal assistant&#039;&#039;&#039; (&#039;&#039;&#039;IPA&#039;&#039;&#039;) is a software agent that can perform tasks or services for an individual based on commands or questions. The term &amp;quot;chatbot&amp;quot; is sometimes used to refer to virtual assistants generally or specifically accessed by online chat. In some cases, online chat programs are exclusively for entertainment purposes. Some virtual assistants are able to interpret human speech and respond via synthesized voices. Users can ask their assistants questions, control home automation devices and media playback via voice, and manage other basic tasks such as email, to-do lists, and calendars with verbal commands. A similar concept, however with differences, lays under the dialogue systems.&#039;&#039;&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Virtual_assistant&amp;lt;/ref&amp;gt;&lt;br /&gt;
&lt;br /&gt;
A &#039;&#039;&#039;voice assistant&#039;&#039;&#039; is therefore a special case of virtual assistant, where the human-computer interaction is mainly done via voice ([https://openvoice-tech.net/Voicefirst voicefirst]).&lt;br /&gt;
&lt;br /&gt;
[https://openvoice-tech.net/Smartspeakers Smartspeakers] are hardware terminals that enable humans to interact with a (voice) assistant.&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Conversational_Agent&amp;diff=2070</id>
		<title>Conversational Agent</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Conversational_Agent&amp;diff=2070"/>
		<updated>2021-12-13T13:08:48Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;A &#039;&#039;conversational agent&#039;&#039;, or &#039;&#039;conversational system&#039;&#039;, or &#039;&#039;dialog system&#039;&#039;, is a computer system (hardware+software platform) intended to converse with a human in a natural language.&lt;br /&gt;
&lt;br /&gt;
Let&#039;s see what Wikipedia states for &#039;&#039;conversational agent&#039;&#039;&amp;lt;ref name=&amp;quot;:0&amp;quot;&amp;gt;https://en.wikipedia.org/wiki/Dialogue_system&amp;lt;/ref&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&amp;quot;A dialogue system, or conversational agent (CA), is a computer system intended to converse with a human. Dialogue systems employed one or more of text, speech, graphics, haptics, gestures, and other modes for communication on both the input and output channel&#039;&#039;.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
In common practice, conversational agent is an umbrella term generally used as a synonym of [[chatbot]], but technically a chatbot is a specific kind of conversational system where the user interface (UI) is just text messages, exchanged on chat-based platforms (such as [[instant-messaging]] mobile apps).&lt;br /&gt;
&lt;br /&gt;
Especially in academic parlance, the term &#039;&#039;conversational agent&#039;&#039; is a synonym of &#039;&#039;chatbot&#039;&#039;. The emphasis on &amp;quot;agent&amp;quot; is perhaps because conversational agents were considered software agents&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Software_agent&amp;lt;/ref&amp;gt;, a special application of intelligent systems&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Intelligent_agent&amp;lt;/ref&amp;gt; and multi-agent systems&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Multi-agent_system&amp;lt;/ref&amp;gt; research.&lt;br /&gt;
&lt;br /&gt;
The term &#039;&#039;agent&#039;&#039; is therefore probably misleading, because it treats any conversational system as a software agent that mediates between an end-user (e.g. a consumer of a business platform) and a human operator on the business side. That is a specific use case and not always applicable.&lt;br /&gt;
&lt;br /&gt;
The agentive role of a conversational system is commonly related to the specific scenario in which the conversational system really acts as an intermediary, requiring a final conversation between the end-user and a human operator (commonly called &#039;&#039;[[human-in-the-loop]]&#039;&#039;).&lt;br /&gt;
&lt;br /&gt;
But there are many cases where the conversational system is not properly an agent, for example pure [[command-and-control]] conversational systems that just execute automation commands to drive (electro-mechanical) actuators, as in [[home-automation]] or industrial automation.&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Conversational_Agent&amp;diff=2069</id>
		<title>Conversational Agent</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Conversational_Agent&amp;diff=2069"/>
		<updated>2021-12-13T13:06:54Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: trying a better explanation of the meaning of the term &amp;quot;agent&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;A &#039;&#039;conversational agent&#039;&#039;, or &#039;&#039;conversational system&#039;&#039;, or &#039;&#039;dialog system&#039;&#039;, is a computer system (hardware+software platform) intended to converse with a human in a natural language.&lt;br /&gt;
&lt;br /&gt;
Let&#039;s see what Wikipedia states for &#039;&#039;conversational agent&#039;&#039;&amp;lt;ref name=&amp;quot;:0&amp;quot;&amp;gt;https://en.wikipedia.org/wiki/Dialogue_system&amp;lt;/ref&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&amp;quot;A dialogue system, or conversational agent (CA), is a computer system intended to converse with a human. Dialogue systems employed one or more of text, speech, graphics, haptics, gestures, and other modes for communication on both the input and output channel&#039;&#039;.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
In common practice, conversational agent is an umbrella term generally used as a synonym of [[chatbot]], but technically a chatbot is a specific kind of conversational system where the user interface (UI) is just text messages, exchanged on chat-based platforms (such as [[instant-messaging]] mobile apps).&lt;br /&gt;
&lt;br /&gt;
Especially in academic parlance, the term &#039;&#039;conversational agent&#039;&#039; is a synonym of &#039;&#039;chatbot&#039;&#039;. The emphasis on &amp;quot;agent&amp;quot; is perhaps because conversational agents were considered software agents&amp;lt;ref name=&amp;quot;:0&amp;quot; /&amp;gt;, a special application of intelligent systems&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Intelligent_agent&amp;lt;/ref&amp;gt; and multi-agent systems&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Multi-agent_system&amp;lt;/ref&amp;gt; research.&lt;br /&gt;
&lt;br /&gt;
The term &#039;&#039;agent&#039;&#039; is therefore probably misleading, because it treats any conversational system as a software agent that mediates between an end-user (e.g. a consumer of a business platform) and a human operator on the business side. That is a specific use case and not always applicable.&lt;br /&gt;
&lt;br /&gt;
The agentive role of a conversational system is commonly related to the specific scenario in which the conversational system really acts as an intermediary, requiring a final conversation between the end-user and a human operator (commonly called &#039;&#039;[[human-in-the-loop]]&#039;&#039;).&lt;br /&gt;
&lt;br /&gt;
But there are many cases where the conversational system is not properly an agent, for example pure [[command-and-control]] conversational systems that just execute automation commands to drive (electro-mechanical) actuators, as in [[home-automation]] or industrial automation.&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Conversational_Agent&amp;diff=2068</id>
		<title>Conversational Agent</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Conversational_Agent&amp;diff=2068"/>
		<updated>2021-12-13T08:56:57Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: Minor grammar corrections; the virtual assistant paragraph has been moved away&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;A &#039;&#039;conversational agent&#039;&#039; is a synonym of &#039;&#039;conversational system&#039;&#039;: a computer system intended to converse with a human in a natural language.&lt;br /&gt;
&lt;br /&gt;
Let&#039;s see what Wikipedia states for conversational agent:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;A dialogue system, or conversational agent (CA), is a computer system intended to converse with a human. Dialogue systems employed one or more of text, speech, graphics, haptics, gestures, and other modes for communication on both the input and output channel.&#039;&#039;&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Dialogue_system&amp;lt;/ref&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
In common practice, conversational agent is an umbrella term generally used as a synonym of [[chatbot]], but technically a chatbot is a conversational system that communicates only via text, on chat-based platforms (such as instant-messaging mobile apps).&lt;br /&gt;
&lt;br /&gt;
Especially in academic parlance, the term conversational agent was a synonym of chatbot, and the emphasis on &amp;quot;agent&amp;quot; was perhaps because conversational agents were considered a special application in intelligent systems&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Intelligent_agent&amp;lt;/ref&amp;gt; or multi-agent systems&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Multi-agent_system&amp;lt;/ref&amp;gt; research.&lt;br /&gt;
&lt;br /&gt;
The term &#039;&#039;agent&#039;&#039; is probably misleading, because it treats any conversational system as an agent that mediates between an end-user (e.g. a consumer of a business platform) and a human business operator. That is not true in general. Instead, the agentive role is commonly related to a special case of conversational system that acts as an intermediary, requiring a final conversation with a human operator (commonly called, in customer service: &#039;&#039;human-in-the-loop&#039;&#039;). But there are many cases where the conversational system is not properly an agent, for example pure [[command-and-control]] conversational systems that just execute automation commands to drive (electro-mechanical) actuators, as in [[home-automation]] or industrial automation.&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Conversational_Agent&amp;diff=2067</id>
		<title>Conversational Agent</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Conversational_Agent&amp;diff=2067"/>
		<updated>2021-12-12T13:34:43Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;A &#039;&#039;conversational agent&#039;&#039;, or better, a &#039;&#039;conversational system&#039;&#039;, is a computer system intended to converse with a human in a natural language.&lt;br /&gt;
&lt;br /&gt;
Let&#039;s see what Wikipedia states for conversational agent:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;A dialogue system, or conversational agent (CA), is a computer system intended to converse with a human. Dialogue systems employed one or more of text, speech, graphics, haptics, gestures, and other modes for communication on both the input and output channel.&#039;&#039;&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Dialogue_system&amp;lt;/ref&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
In common practice, conversational agent is an umbrella term generally used as a synonym of chatbot, even though a chatbot is a conversational agent that communicates only via text, on chat-based platforms (such as instant-messaging mobile apps).&lt;br /&gt;
&lt;br /&gt;
Especially in academic parlance, the term conversational agent was a synonym of chatbot, and the emphasis on &amp;quot;agent&amp;quot; was perhaps because conversational agents were considered a special application in intelligent systems&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Intelligent_agent&amp;lt;/ref&amp;gt; or multi-agent systems&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Multi-agent_system&amp;lt;/ref&amp;gt; research.&lt;br /&gt;
&lt;br /&gt;
The term &amp;quot;agent&amp;quot; is probably misleading, because it treats any conversational system as an agent that mediates between an end-user (e.g. a consumer of a business platform) and a human business operator. That is not true in general. Instead, the agentive role is commonly related to a special case of conversational system that acts as an intermediary, requiring a final conversation with a human operator (commonly called, in customer service: &#039;&#039;human-in-the-loop&#039;&#039;).&lt;br /&gt;
&lt;br /&gt;
But there are many cases where the conversational system is not properly an agent, for example pure [[command-and-control]] conversational systems that just execute automation commands to drive (electro-mechanical) actuators, as in [[home-automation]] or industrial automation.&lt;br /&gt;
&lt;br /&gt;
Nevertheless the concept of &amp;quot;conversational agent&amp;quot;, even if ambiguous, is currently an active area of research on [[virtual assistants]], in the sense of a real agentive/automation technology that assists humans in solving real-life problems, in real time, in a reactive and proactive way (so-called [[mixed-initiative]]). [[Voice assistants]] are a special case of virtual assistants&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Virtual_assistant&amp;lt;/ref&amp;gt;, where the human-computer interaction is mainly done via voice ([[voicefirst]]). [[Smartspeakers]] are hardware terminals that enable humans to interact with a (voice) assistant.&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Conversational_Agent&amp;diff=2066</id>
		<title>Conversational Agent</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Conversational_Agent&amp;diff=2066"/>
		<updated>2021-12-12T13:32:53Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;A &#039;&#039;conversational agent&#039;&#039;, or better, a &#039;&#039;conversational system&#039;&#039;, is a computer system intended to converse with a human in a natural language.&lt;br /&gt;
&lt;br /&gt;
Let&#039;s see what Wikipedia states for conversational agent:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;A dialogue system, or conversational agent (CA), is a computer system intended to converse with a human. Dialogue systems employed one or more of text, speech, graphics, haptics, gestures, and other modes for communication on both the input and output channel.&#039;&#039;&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Dialogue_system&amp;lt;/ref&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
In common practice, conversational agent is an umbrella term generally used as a synonym of chatbot, even though a chatbot is a conversational agent that communicates only via text, on chat-based platforms (such as instant-messaging mobile apps).&lt;br /&gt;
&lt;br /&gt;
Especially in academic parlance, the term conversational agent was a synonym of chatbot, and the emphasis on &amp;quot;agent&amp;quot; was perhaps because conversational agents were considered a special application in intelligent systems&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Intelligent_agent&amp;lt;/ref&amp;gt; / multi-agent systems&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Multi-agent_system&amp;lt;/ref&amp;gt; research.&lt;br /&gt;
&lt;br /&gt;
The term &amp;quot;agent&amp;quot; is probably misleading, because it treats any conversational system as an agent that mediates between an end-user (e.g. a consumer of a business platform) and a human business operator. That is not true in general. Instead, the agentive role is commonly related to a special case of conversational system that acts as an intermediary, requiring a final conversation with a human operator (commonly called, in customer service: &#039;&#039;human-in-the-loop&#039;&#039;).&lt;br /&gt;
&lt;br /&gt;
But there are many cases where the conversational system is not properly an agent, for example pure [[command-and-control]] conversational systems that just execute automation commands to drive (electro-mechanical) actuators, as in [[home-automation]] or industrial automation.&lt;br /&gt;
&lt;br /&gt;
Nevertheless the concept of &amp;quot;conversational agent&amp;quot;, even if ambiguous, is currently an active area of research on [[virtual assistants]], in the sense of a real agentive/automation technology that assists humans in solving real-life problems, in real time, in a reactive and proactive way (so-called [[mixed-initiative]]). [[Voice assistants]] are a special case of virtual assistants&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Virtual_assistant&amp;lt;/ref&amp;gt;, where the human-computer interaction is mainly done via voice ([[voicefirst]]). [[Smartspeakers]] are hardware terminals that enable humans to interact with a (voice) assistant.&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Conversational_Agent&amp;diff=2065</id>
		<title>Conversational Agent</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Conversational_Agent&amp;diff=2065"/>
		<updated>2021-12-12T13:27:30Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;A &#039;&#039;conversational agent&#039;&#039;, or better, a &#039;&#039;conversational system&#039;&#039;, is a computer system intended to converse with a human in a natural language.&lt;br /&gt;
&lt;br /&gt;
Let&#039;s see what Wikipedia states for conversational agent:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;A dialogue system, or conversational agent (CA), is a computer system intended to converse with a human. Dialogue systems employed one or more of text, speech, graphics, haptics, gestures, and other modes for communication on both the input and output channel.&#039;&#039;&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Dialogue_system&amp;lt;/ref&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
In common practice, conversational agent is an umbrella term generally used as a synonym of chatbot, even though a chatbot is a conversational agent that communicates only via text, on chat-based platforms (such as instant-messaging mobile apps).&lt;br /&gt;
&lt;br /&gt;
Especially in academic parlance, the term conversational agent was a synonym of chatbot, and the emphasis on &amp;quot;agent&amp;quot; was perhaps because conversational agents were considered a special application of intelligent systems&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Intelligent_agent&amp;lt;/ref&amp;gt; / multi-agent systems&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Multi-agent_system&amp;lt;/ref&amp;gt; research.&lt;br /&gt;
&lt;br /&gt;
The term &amp;quot;agent&amp;quot; is misleading, because it treats any conversational system as an agent that mediates between an end-user (e.g. a consumer of a business platform) and a business operator. That is not true in general. Instead, this is commonly regarded as a special case of conversational system, and the human operator intervention is called &amp;quot;human-in-the-loop&amp;quot;. There are many cases where the conversational system is not an agent, for example pure command-and-control conversational systems that just execute automation commands to drive (electro-mechanical) actuators, as in home-automation or industrial settings.&lt;br /&gt;
&lt;br /&gt;
Nevertheless the concept of &amp;quot;conversational agent&amp;quot;, even if ambiguous, is currently an active area of research on virtual assistants, in the sense of a real agentive/automation technology that assists humans in solving real-life problems, in real time, in a reactive and proactive way (so-called mixed-initiative). Voice assistants are a special case of virtual assistants&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Virtual_assistant&amp;lt;/ref&amp;gt;, where the human-computer interaction is mainly done via voice (voicefirst). Smartspeakers are hardware terminals that enable humans to interact with a (voice) assistant.&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Conversational_Agent&amp;diff=2064</id>
		<title>Conversational Agent</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Conversational_Agent&amp;diff=2064"/>
		<updated>2021-12-12T11:08:46Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: Conversational Agent definition&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;A &#039;&#039;conversational agent&#039;&#039;, or better, a &#039;&#039;conversational system&#039;&#039;, is a computer system intended to converse with a human in a natural language.&lt;br /&gt;
&lt;br /&gt;
Let&#039;s see what Wikipedia states for conversational agent:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;A dialogue system, or conversational agent (CA), is a computer system intended to converse with a human. Dialogue systems employed one or more of text, speech, graphics, haptics, gestures, and other modes for communication on both the input and output channel.&#039;&#039;&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Dialogue_system&amp;lt;/ref&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The Wikipedia definition is pretty good but a bit outdated, because voice interaction is not even mentioned!&lt;br /&gt;
&lt;br /&gt;
In common practice, conversational agent is an umbrella term generally used as a synonym of chatbot, even though a chatbot is a conversational agent that communicates only via text, on chat-based platforms (such as instant-messaging mobile apps).&lt;br /&gt;
&lt;br /&gt;
Especially in academic parlance, the term conversational agent was a synonym of chatbot, and the emphasis on &amp;quot;agent&amp;quot; was perhaps because conversational agents were considered a special application of intelligent systems&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Intelligent_agent&amp;lt;/ref&amp;gt; / multi-agent systems&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Multi-agent_system&amp;lt;/ref&amp;gt; research.&lt;br /&gt;
&lt;br /&gt;
The term &amp;quot;agent&amp;quot; is misleading, because it presents any conversational system as an agent that mediates between an end user (e.g. a consumer of a business platform) and a business operator. That&#039;s in general not true. Instead, this is commonly regarded as a special case of conversational system, and the human operator intervention is called &amp;quot;human-in-the-loop&amp;quot;. There are many cases where the conversational system is not an agent, for example pure command-and-control conversational systems that just execute automation commands to drive (electro-mechanical) actuators, as in the home-automation or industrial realms.&lt;br /&gt;
&lt;br /&gt;
Nevertheless the concept of &amp;quot;conversational agent&amp;quot;, even if ambiguous, is currently an active area of research on virtual assistants, in the sense of a real agentive/automation technology that assists humans in solving real-life problems, in real time, in a reactive and proactive way (so-called mixed-initiative). Voice assistants are a special case of virtual assistants&amp;lt;ref&amp;gt;https://en.wikipedia.org/wiki/Virtual_assistant&amp;lt;/ref&amp;gt;, where the human-computer interaction is mainly done via voice (voice-first). Smart speakers are hardware terminals that enable humans to interact with a (voice) assistant.&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Talk:Conversation_Design&amp;diff=2063</id>
		<title>Talk:Conversation Design</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Talk:Conversation_Design&amp;diff=2063"/>
		<updated>2021-12-12T10:22:51Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Category ==&lt;br /&gt;
&lt;br /&gt;
Any idea for a category for that article [[User:Solyarisoftware‎|Solyarisoftware‎]]--[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 18:33, 10 December 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
:Short answer: following the classification on this wiki&#039;s home page: open voice assistants -&amp;gt; Voice (any, not only open) assistant -&amp;gt; any assistant / conversational &amp;quot;agent&amp;quot; --[[User:Solyarisoftware|Solyarisoftware]] ([[User talk:Solyarisoftware|talk]]) 10:36, 12 December 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
::I could rename the category &amp;quot;Open Voice Assistants&amp;quot; to &amp;quot;Voice Assistants&amp;quot;. Would &amp;quot;conversational agent&amp;quot; be a subcategory below &amp;quot;Voice Assistants&amp;quot; or just a page in that category. So do you expect lots of pages in &amp;quot;conversational agent&amp;quot;? --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 10:56, 12 December 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
:::Two topics to be discussed:&lt;br /&gt;
&lt;br /&gt;
::: 1. &#039;&#039;&#039;Open voice Assistant or Voice Assistant?&#039;&#039;&#039; So a voice assistant is a voice-interfaced virtual assistant. Maybe an open voice assistant is an open assistant made with open-source + open-data enablers. Usually we ALSO tend to use &amp;quot;open voice assistant&amp;quot; to mean an on-prem/offline system... That&#039;s frankly a specific case (I could have a fully open system that is delivered as a cloud service...). It&#039;s all debatable :)&lt;br /&gt;
 &lt;br /&gt;
::: 2. &#039;&#039;&#039;What&#039;s a &amp;quot;conversational agent&amp;quot; and is this a subcategory of an &amp;quot;assistant&amp;quot;?&#039;&#039;&#039; That&#039;s a good question! I&#039;ll try to create a definition asap. In short: &#039;&#039;conversational agent&#039;&#039; is an umbrella academic term for &#039;&#039;chatbot&#039;&#039; (nowadays also a &#039;&#039;voicebot&#039;&#039; or &#039;&#039;multimodal bot&#039;&#039;). A conversational agent is not necessarily an &#039;&#039;assistant&#039;&#039;. Indeed an assistant is a special kind of chatbot that just &amp;quot;assists&amp;quot; the user. I&#039;ll give examples in the definition.--[[User:Solyarisoftware|Solyarisoftware]] ([[User talk:Solyarisoftware|talk]]) 11:22, 12 December 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
:Long answer/questions: What&#039;s a category here? Is there an ontology of categories on the site? I&#039;m asking because some concepts may not belong to just a single category. For example the [[real-time-factor]] is commonly used in ASR (one category) but is sometimes also valid when reasoning about the latency of a TTS (another category). --[[User:Solyarisoftware|Solyarisoftware]] ([[User talk:Solyarisoftware|talk]]) 10:36, 12 December 2021 (CET)&lt;br /&gt;
:: If you look at the category tree on the Mainpage i thought it&#039;s helpful finding relevant content. I&#039;ve already added some pages to multiple categories. So for example i&#039;d add TTS and STT category to [[real-time-factor]] because it&#039;s related to both categories. --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 10:56, 12 December 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
::: ok. thanks--[[User:Solyarisoftware|Solyarisoftware]] ([[User talk:Solyarisoftware|talk]]) 11:22, 12 December 2021 (CET)&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Talk:Main_Page&amp;diff=2062</id>
		<title>Talk:Main Page</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Talk:Main_Page&amp;diff=2062"/>
		<updated>2021-12-12T10:04:16Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: /* What infos on Mainpage? */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== What infos on Mainpage? ==&lt;br /&gt;
I&#039;m not sure which content is helpful on Mainpage. Any ideas? --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 08:46, 20 October 2021 (CEST)&lt;br /&gt;
&lt;br /&gt;
That&#039;s really important because it is the landing page for a new user; some thoughts about it below.&lt;br /&gt;
&lt;br /&gt;
On the home page I&#039;d clarify a sort of &amp;quot;manifesto&amp;quot;. Let me point out what I understand so far:&lt;br /&gt;
&lt;br /&gt;
1- Openness: Open-source + open-data&lt;br /&gt;
The wiki is dedicated to open* stuff: open-source/open-data. The two things are obviously related. I absolutely agree and I&#039;d stress this point, even if some concepts are independent of openness, for example the concept of [[Conversation Design|conversational design]], or some basic glossary.&lt;br /&gt;
&lt;br /&gt;
2- the wiki as web &amp;quot;communication&amp;quot; pattern&lt;br /&gt;
&lt;br /&gt;
- A how-to page section?&lt;br /&gt;
You conceived this repo as a wiki, just &amp;quot;a la wikipedia&amp;quot; in all communication / content management. That&#039;s perfectly fair and I totally appreciate it, but maybe it&#039;s not so clear to everyone involved. In my opinion the process that brings a person to edit an existing page, or &amp;quot;worse&amp;quot;, to create a new page, is not trivial. My modest suggestion is to help the reader (future author) with all kinds of notes / reminders in the pages, for example explaining how to update content using the discussion panel, etc.&lt;br /&gt;
So maybe a &amp;quot;HOW TO&amp;quot; section could help.&lt;br /&gt;
&lt;br /&gt;
- Video links&lt;br /&gt;
Why not link videos from your YouTube channel https://www.youtube.com/channel/UCjqqTVVBTsxpm0iOhQ1fp9g/videos? &lt;br /&gt;
--[[User:Solyarisoftware|Solyarisoftware]] ([[User talk:Solyarisoftware|talk]]) 11:01, 12 December 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
== General ideas on this wiki? ==&lt;br /&gt;
&lt;br /&gt;
What do you think? --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 21:22, 1 November 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
:This seems a really worthwhile initiative. Regarding the section for Voice related paper, maybe you already have thoughts on this, but I think it would be useful to recommend the kinds of detail desired (Title, Authors, DOI, source link(s), adding categories). Open links (eg Arxiv) would be preferred over those to non-free / subscription sites. Might also want to mention any preferences around pre-review/early access papers and handling updated versions and broken links. [[User:Nmstoker|Nmstoker]] ([[User talk:Nmstoker|talk]]) 14:06, 12 November 2021 (CET)&lt;br /&gt;
::Hello [[User:Nmstoker|Nmstoker]], I&#039;m thinking of adding standard infobox templates for papers with the kind of information you mentioned.--[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 20:23, 12 November 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
:Another general thought: where do you stand on inclusion of opinion/original information in the wiki? Wikipedia is firmly about referencing information from sources, so they avoid hosting original information and only include things that are backed by references. I&#039;m guessing that there may be grounds for more flexibility here, but deciding where to draw the line is worth considering. In a similar way, deciding where the line is between sharing knowledge vs excess promotion might need judgement. The key open source players do not strike me as being likely to over promote but there could be firms that aren&#039;t so reasonable, so having some kind of policy on this might help. [[User:Nmstoker|Nmstoker]] ([[User talk:Nmstoker|talk]]) 14:15, 12 November 2021 (CET)&lt;br /&gt;
::Copy-and-paste information from internet pages is not what I would prefer. Linking to sources seems better. I&#039;d hope that we can create a central knowledge base for all open-voice-related projects, and for that we have to add relevant information here. Over time we could vote for some active users to become admins here and help control excessive promotion. --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 20:23, 12 November 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
== Logo ideas ==&lt;br /&gt;
&lt;br /&gt;
As I&#039;m a technology enthusiast, but truly not a design guy, the OpenVoice-Tech logo is &amp;quot;functional&amp;quot;. If you have ideas for a more attractive logo, please help me :-). --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 19:12, 14 November 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
== Added box with newest 5 articles on Mainpage ==&lt;br /&gt;
&lt;br /&gt;
Based on an idea from a nice guy at Mycroft community i&#039;ve added a box containing the newest 5 articles on the Mainpage.--[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 07:17, 16 November 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
== Added &amp;quot;Contribution&amp;quot; box on Mainpage ==&lt;br /&gt;
&lt;br /&gt;
If you would like to contribute to this Wiki i&#039;ve added a box showing requested but not yet created pages on the Mainpage as a starting point. What do you think on this [[User:Eltocino|Eltocino]], [[User:Nmstoker|Nmstoker]], [[User:Florian|Florian]]? --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 16:58, 19 November 2021 (CET)&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Talk:Main_Page&amp;diff=2061</id>
		<title>Talk:Main Page</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Talk:Main_Page&amp;diff=2061"/>
		<updated>2021-12-12T10:02:45Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: /* What infos on Mainpage? */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== What infos on Mainpage? ==&lt;br /&gt;
I&#039;m not sure which content is helpful on Mainpage. Any ideas? --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 08:46, 20 October 2021 (CEST)&lt;br /&gt;
&lt;br /&gt;
That&#039;s really important because it is the landing page for a new user; some thoughts about it below.&lt;br /&gt;
&lt;br /&gt;
On the home page I&#039;d clarify a sort of &amp;quot;manifesto&amp;quot;. Let me point out what I understand so far:&lt;br /&gt;
&lt;br /&gt;
1- Openness: Open-source + open-data&lt;br /&gt;
The wiki is dedicated to open* stuff: open-source/open-data. The two things are obviously related. I absolutely agree and I&#039;d stress this point, even if some concepts are independent of openness, for example the concept of [[Conversation Design|conversational design]], or some basic glossary.&lt;br /&gt;
&lt;br /&gt;
2- the wiki as web &amp;quot;communication&amp;quot; pattern&lt;br /&gt;
&lt;br /&gt;
- A how-to page section?&lt;br /&gt;
You conceived this repo as a wiki, just &amp;quot;a la wikipedia&amp;quot; in all communication / content management. That&#039;s perfectly fair and I totally appreciate it, but maybe it&#039;s not so clear to everyone involved. In my opinion the process that brings a person to edit an existing page, or &amp;quot;worse&amp;quot;, to create a new page, is not trivial. My modest suggestion is to help the reader (future author) with all kinds of notes / reminders in the pages, maybe with a &amp;quot;HOW TO&amp;quot; section?&lt;br /&gt;
&lt;br /&gt;
- Video links&lt;br /&gt;
Why not link videos from your YouTube channel https://www.youtube.com/channel/UCjqqTVVBTsxpm0iOhQ1fp9g/videos? &lt;br /&gt;
--[[User:Solyarisoftware|Solyarisoftware]] ([[User talk:Solyarisoftware|talk]]) 11:01, 12 December 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
== General ideas on this wiki? ==&lt;br /&gt;
&lt;br /&gt;
What do you think? --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 21:22, 1 November 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
:This seems a really worthwhile initiative. Regarding the section for Voice related paper, maybe you already have thoughts on this, but I think it would be useful to recommend the kinds of detail desired (Title, Authors, DOI, source link(s), adding categories). Open links (eg Arxiv) would be preferred over those to non-free / subscription sites. Might also want to mention any preferences around pre-review/early access papers and handling updated versions and broken links. [[User:Nmstoker|Nmstoker]] ([[User talk:Nmstoker|talk]]) 14:06, 12 November 2021 (CET)&lt;br /&gt;
::Hello [[User:Nmstoker|Nmstoker]], I&#039;m thinking of adding standard infobox templates for papers with the kind of information you mentioned.--[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 20:23, 12 November 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
:Another general thought: where do you stand on inclusion of opinion/original information in the wiki? Wikipedia is firmly about referencing information from sources, so they avoid hosting original information and only include things that are backed by references. I&#039;m guessing that there may be grounds for more flexibility here, but deciding where to draw the line is worth considering. In a similar way, deciding where the line is between sharing knowledge vs excess promotion might need judgement. The key open source players do not strike me as being likely to over promote but there could be firms that aren&#039;t so reasonable, so having some kind of policy on this might help. [[User:Nmstoker|Nmstoker]] ([[User talk:Nmstoker|talk]]) 14:15, 12 November 2021 (CET)&lt;br /&gt;
::Copy-and-paste information from internet pages is not what I would prefer. Linking to sources seems better. I&#039;d hope that we can create a central knowledge base for all open-voice-related projects, and for that we have to add relevant information here. Over time we could vote for some active users to become admins here and help control excessive promotion. --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 20:23, 12 November 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
== Logo ideas ==&lt;br /&gt;
&lt;br /&gt;
As I&#039;m a technology enthusiast, but truly not a design guy, the OpenVoice-Tech logo is &amp;quot;functional&amp;quot;. If you have ideas for a more attractive logo, please help me :-). --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 19:12, 14 November 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
== Added box with newest 5 articles on Mainpage ==&lt;br /&gt;
&lt;br /&gt;
Based on an idea from a nice guy at Mycroft community i&#039;ve added a box containing the newest 5 articles on the Mainpage.--[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 07:17, 16 November 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
== Added &amp;quot;Contribution&amp;quot; box on Mainpage ==&lt;br /&gt;
&lt;br /&gt;
If you would like to contribute to this Wiki i&#039;ve added a box showing requested but not yet created pages on the Mainpage as a starting point. What do you think on this [[User:Eltocino|Eltocino]], [[User:Nmstoker|Nmstoker]], [[User:Florian|Florian]]? --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 16:58, 19 November 2021 (CET)&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Talk:Main_Page&amp;diff=2060</id>
		<title>Talk:Main Page</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Talk:Main_Page&amp;diff=2060"/>
		<updated>2021-12-12T10:02:26Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: /* What infos on Mainpage? */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== What infos on Mainpage? ==&lt;br /&gt;
I&#039;m not sure which content is helpful on Mainpage. Any ideas? --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 08:46, 20 October 2021 (CEST)&lt;br /&gt;
&lt;br /&gt;
That&#039;s really important because it is the landing page for a new user; some thoughts about it below.&lt;br /&gt;
&lt;br /&gt;
On the home page I&#039;d clarify a sort of &amp;quot;manifesto&amp;quot;. Let me point out what I understand so far:&lt;br /&gt;
&lt;br /&gt;
1- Openness: Open-source + open-data&lt;br /&gt;
The wiki is dedicated to open* stuff: open-source/open-data. The two things are obviously related. I absolutely agree and I&#039;d stress this point, even if some concepts are independent of openness, for example the concept of [[Conversation Design|conversational design]], or some basic glossary.&lt;br /&gt;
&lt;br /&gt;
2- the wiki as web &amp;quot;communication&amp;quot; pattern&lt;br /&gt;
&lt;br /&gt;
- A how-to page section?&lt;br /&gt;
You conceived this repo as a wiki, just &amp;quot;a la wikipedia&amp;quot; in all communication / content management. That&#039;s perfectly fair and I totally appreciate it, but maybe it&#039;s not so clear to everyone involved. In my opinion the process that brings a person to edit an existing page, or &amp;quot;worse&amp;quot;, to create a new page, is not trivial. My modest suggestion is to help the reader (future author) with all kinds of notes / reminders in the pages, maybe with a &amp;quot;HOW TO&amp;quot; section?&lt;br /&gt;
&lt;br /&gt;
- Video links&lt;br /&gt;
Why not link videos from your YouTube channel https://www.youtube.com/channel/UCjqqTVVBTsxpm0iOhQ1fp9g/videos? &lt;br /&gt;
 &lt;br /&gt;
 --[[User:Solyarisoftware|Solyarisoftware]] ([[User talk:Solyarisoftware|talk]]) 11:01, 12 December 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
== General ideas on this wiki? ==&lt;br /&gt;
&lt;br /&gt;
What do you think? --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 21:22, 1 November 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
:This seems a really worthwhile initiative. Regarding the section for Voice related paper, maybe you already have thoughts on this, but I think it would be useful to recommend the kinds of detail desired (Title, Authors, DOI, source link(s), adding categories). Open links (eg Arxiv) would be preferred over those to non-free / subscription sites. Might also want to mention any preferences around pre-review/early access papers and handling updated versions and broken links. [[User:Nmstoker|Nmstoker]] ([[User talk:Nmstoker|talk]]) 14:06, 12 November 2021 (CET)&lt;br /&gt;
::Hello [[User:Nmstoker|Nmstoker]], I&#039;m thinking of adding standard infobox templates for papers with the kind of information you mentioned.--[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 20:23, 12 November 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
:Another general thought: where do you stand on inclusion of opinion/original information in the wiki? Wikipedia is firmly about referencing information from sources, so they avoid hosting original information and only include things that are backed by references. I&#039;m guessing that there may be grounds for more flexibility here, but deciding where to draw the line is worth considering. In a similar way, deciding where the line is between sharing knowledge vs excess promotion might need judgement. The key open source players do not strike me as being likely to over promote but there could be firms that aren&#039;t so reasonable, so having some kind of policy on this might help. [[User:Nmstoker|Nmstoker]] ([[User talk:Nmstoker|talk]]) 14:15, 12 November 2021 (CET)&lt;br /&gt;
::Copy-and-paste information from internet pages is not what I would prefer. Linking to sources seems better. I&#039;d hope that we can create a central knowledge base for all open-voice-related projects, and for that we have to add relevant information here. Over time we could vote for some active users to become admins here and help control excessive promotion. --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 20:23, 12 November 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
== Logo ideas ==&lt;br /&gt;
&lt;br /&gt;
As I&#039;m a technology enthusiast, but truly not a design guy, the OpenVoice-Tech logo is &amp;quot;functional&amp;quot;. If you have ideas for a more attractive logo, please help me :-). --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 19:12, 14 November 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
== Added box with newest 5 articles on Mainpage ==&lt;br /&gt;
&lt;br /&gt;
Based on an idea from a nice guy at Mycroft community i&#039;ve added a box containing the newest 5 articles on the Mainpage.--[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 07:17, 16 November 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
== Added &amp;quot;Contribution&amp;quot; box on Mainpage ==&lt;br /&gt;
&lt;br /&gt;
If you would like to contribute to this Wiki i&#039;ve added a box showing requested but not yet created pages on the Mainpage as a starting point. What do you think on this [[User:Eltocino|Eltocino]], [[User:Nmstoker|Nmstoker]], [[User:Florian|Florian]]? --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 16:58, 19 November 2021 (CET)&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Talk:Conversation_Design&amp;diff=2056</id>
		<title>Talk:Conversation Design</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Talk:Conversation_Design&amp;diff=2056"/>
		<updated>2021-12-12T09:36:43Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Category ==&lt;br /&gt;
&lt;br /&gt;
Any idea for a category for that article [[User:Solyarisoftware‎|Solyarisoftware‎]]--[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 18:33, 10 December 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
Short answer: &lt;br /&gt;
following the classification in this wiki home page: &lt;br /&gt;
open voice assistants -&amp;gt; Voice (any, not only open) assistant -&amp;gt; any assistant / conversational &amp;quot;agent&amp;quot;&lt;br /&gt;
&lt;br /&gt;
Long answer/questions:&lt;br /&gt;
What&#039;s a category here? &lt;br /&gt;
Is there an ontology of categories on the site?&lt;br /&gt;
&lt;br /&gt;
I&#039;m asking because some concepts may not belong to just a single category. For example the [[real-time-factor]] is commonly used in ASR (one category) but is sometimes also valid when reasoning about the latency of a TTS (another category).&lt;br /&gt;
--[[User:Solyarisoftware|Solyarisoftware]] ([[User talk:Solyarisoftware|talk]]) 10:36, 12 December 2021 (CET)&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Talk:OVT:What_is_OpenVoice-Tech_Wiki&amp;diff=2055</id>
		<title>Talk:OVT:What is OpenVoice-Tech Wiki</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Talk:OVT:What_is_OpenVoice-Tech_Wiki&amp;diff=2055"/>
		<updated>2021-12-12T09:25:31Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: /* about contributes on the wiki */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Our principles ==&lt;br /&gt;
&lt;br /&gt;
Thanks [[User:Solyarisoftware|Solyarisoftware]] for bringing this topic up. I&#039;ve created this page as a collection of principles of the OpenVoice-Tech Wiki. Maybe we can discuss and develop our principles here. --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 12:59, 10 December 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
== about contributes on the wiki ==&lt;br /&gt;
&lt;br /&gt;
Hi Thorsten, a minor duplication of the same subject here: https://openvoice-tech.net/index.php?title=OpenVoice-Tech_Wiki_talk:About&lt;br /&gt;
&lt;br /&gt;
Following your helpful suggestions, I guess the fair process is: &#039;&#039;If you disagree with written content, do not simply change it, but use the &amp;quot;Discussion&amp;quot; page to discuss it with the original writer&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
thanks&lt;br /&gt;
--[[User:Solyarisoftware|Solyarisoftware]] ([[User talk:Solyarisoftware|talk]]) 13:34, 10 December 2021 (CET)&lt;br /&gt;
: If it&#039;s obviously wrong then it can be corrected, but if it&#039;s a general disagreement it should be discussed before being edited. Do you have additional ideas for &amp;quot;contribution guidelines&amp;quot;?--[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 18:20, 10 December 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
It&#039;s clear and I agree. Maybe the entire process (discussion on disagreement/major correction) could be pointed out / highlighted on the main page / guidelines page, because for me, at first glance, it wasn&#039;t so clear.--[[User:Solyarisoftware|Solyarisoftware]] ([[User talk:Solyarisoftware|talk]]) 10:25, 12 December 2021 (CET)&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Talk:OVT:What_is_OpenVoice-Tech_Wiki&amp;diff=2050</id>
		<title>Talk:OVT:What is OpenVoice-Tech Wiki</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Talk:OVT:What_is_OpenVoice-Tech_Wiki&amp;diff=2050"/>
		<updated>2021-12-10T12:34:11Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: /* about contributes on the wiki */ new section&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Our principles ==&lt;br /&gt;
&lt;br /&gt;
Thanks [[User:Solyarisoftware|Solyarisoftware]] for bringing this topic up. I&#039;ve created this page as a collection of principles of the OpenVoice-Tech Wiki. Maybe we can discuss and develop our principles here. --[[User:Thorsten|Thorsten]] ([[User talk:Thorsten|talk]]) 12:59, 10 December 2021 (CET)&lt;br /&gt;
&lt;br /&gt;
== about contributes on the wiki ==&lt;br /&gt;
&lt;br /&gt;
Hi Thorsten, a minor duplication of the same subject here: https://openvoice-tech.net/index.php?title=OpenVoice-Tech_Wiki_talk:About&lt;br /&gt;
&lt;br /&gt;
Following your helpful suggestions, I guess the fair process is: &#039;&#039;If you disagree with written content, do not simply change it, but use the &amp;quot;Discussion&amp;quot; page to discuss it with the original writer&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
thanks&lt;br /&gt;
--[[User:Solyarisoftware|Solyarisoftware]] ([[User talk:Solyarisoftware|talk]]) 13:34, 10 December 2021 (CET)&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Conversation_Design&amp;diff=2046</id>
		<title>Conversation Design</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Conversation_Design&amp;diff=2046"/>
		<updated>2021-12-10T08:52:58Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: Conversation design definition&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;quot;&#039;&#039;Conversation design&#039;&#039; (&#039;&#039;CxD&#039;&#039;) is about defining the interactions between the user and a conversational agent, based on how people communicate in real life.&amp;quot; [https://uxdesign.cc/intro-to-conversation-design-ce3bd30e4385 cit.]&lt;br /&gt;
&lt;br /&gt;
Designing a (human-to-machine) conversation draws mainly on linguistics (pragmatics, psycholinguistics, sociolinguistics) and on authoring/screenwriting. [https://developers.google.com/assistant/conversation-design/what-is-conversation-design Google], with its well-known CxD team led by [https://developers.google.com/assistant/conversation-design/learn-about-conversation James Giangola] et al., the people who conceived the Google Assistant UX, helped a few years ago to popularize concepts that have now become &amp;quot;common sense&amp;quot;, such as: Voice User Interface (VUI) best practices, Grice&#039;s Maxims, bot persona, persona, multimodal conversations.&lt;br /&gt;
&lt;br /&gt;
The &#039;&#039;conversation designer&#039;&#039; has a fundamental role in any enterprise team that builds professional conversational agents/virtual agents.&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Open_Voice_Technology_Wiki_talk:About&amp;diff=2045</id>
		<title>Open Voice Technology Wiki talk:About</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Open_Voice_Technology_Wiki_talk:About&amp;diff=2045"/>
		<updated>2021-12-10T08:28:46Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: /* about contributes on the wiki */ new section&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== about contributes on the wiki ==&lt;br /&gt;
&lt;br /&gt;
Hi Thorsten!&lt;br /&gt;
&lt;br /&gt;
Thanks for your initiative here and your work on the Open German Voice Dataset (even if I don&#039;t know the German language)!&lt;br /&gt;
I&#039;ll try to contribute to the wiki and for sure I&#039;ll share it on Twitter and LinkedIn.&lt;br /&gt;
&lt;br /&gt;
My main personal concern, for example when writing a definition of a concept or a company/project on this wiki, is that I&#039;m naturally biased/opinionated about a technology or any tech solution. Also, any definition I could add is surely debatable. Even though I&#039;m aware of the wiki/Wikipedia-like way content evolves and is refined (with the continuous-delivery :) contributions of many people over time), my question is:&lt;br /&gt;
&lt;br /&gt;
In general, is it OK if I submit a definition that inevitably contains a personal bias/comment?&lt;br /&gt;
&lt;br /&gt;
Thanks again&lt;br /&gt;
&lt;br /&gt;
respect&lt;br /&gt;
&lt;br /&gt;
giorgio --[[User:Solyarisoftware|Solyarisoftware]] ([[User talk:Solyarisoftware|talk]]) 09:28, 10 December 2021 (CET)&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Open_Voice_Technology_Wiki_talk:About&amp;diff=2044</id>
		<title>Open Voice Technology Wiki talk:About</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Open_Voice_Technology_Wiki_talk:About&amp;diff=2044"/>
		<updated>2021-12-10T08:28:26Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: Blanked the page&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Open_Voice_Technology_Wiki_talk:About&amp;diff=2043</id>
		<title>Open Voice Technology Wiki talk:About</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Open_Voice_Technology_Wiki_talk:About&amp;diff=2043"/>
		<updated>2021-12-10T08:26:47Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: about contributes on the wiki&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Hi Thorsten!&lt;br /&gt;
&lt;br /&gt;
Thanks for your initiative here and your work on the Open German Voice Dataset (even if I don&#039;t know the German language)!&lt;br /&gt;
I&#039;ll try to contribute to the wiki, and I&#039;ll certainly share it on Twitter and LinkedIn.&lt;br /&gt;
&lt;br /&gt;
My main concern, for example when writing a definition of a concept or of a company/project on this wiki, is that I&#039;m naturally biased/opinionated about any technology or tech solution. Any definition I add is also certainly debatable. I&#039;m aware of the usual wiki/Wikipedia way of evolving and refining content (with the continuous-delivery :) contributions of many people over time), but my question is:&lt;br /&gt;
&lt;br /&gt;
In general, is it OK if I submit a definition that inevitably contains a personal bias/comment?&lt;br /&gt;
&lt;br /&gt;
Thanks again&lt;br /&gt;
respect&lt;br /&gt;
giorgio&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Real-time-factor&amp;diff=2042</id>
		<title>Real-time-factor</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Real-time-factor&amp;diff=2042"/>
		<updated>2021-12-10T08:05:46Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: RTF definition&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The &#039;&#039;real time factor&#039;&#039; (&#039;&#039;RTF&#039;&#039;) is a common metric for measuring the speed of an automatic speech recognition (ASR) system in the decoding phase (&amp;quot;at run-time&amp;quot;). It can also be used in other contexts where an audio or video signal is processed (usually automatically) at a nearly constant rate. More generally, RTF measures the processing speed of any (audio) processing system relative to the duration of its input: not only a speech recognition engine, but also a text-to-speech engine, a [[transcoding]] engine, etc.&lt;br /&gt;
&lt;br /&gt;
If it takes time f(d) to process an input of duration d, the real time factor is defined as: RTF = f(d)/d&lt;br /&gt;
&lt;br /&gt;
If, for example, it takes 8 hours of computation time to process a recording of duration 2 hours, the real time factor is 4. When the real time factor is 1, the processing is done &amp;quot;in real time&amp;quot;; values below 1 mean the processing is faster than real time. RTF depends on the hardware and, when processing is done by a cloud-based service, also on the network bandwidth.&lt;br /&gt;
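&lt;br /&gt;
As a minimal sketch of how RTF could be measured in practice, the following Python snippet times one call to a processing function and divides the elapsed wall-clock time by the audio duration. The names measure_rtf and fake_engine are hypothetical; fake_engine only sleeps and stands in for a real ASR or TTS engine.&lt;br /&gt;

```python
import time


def measure_rtf(process, audio, audio_seconds):
    """Time one call to process(audio) and return elapsed_seconds / audio_seconds."""
    # Reject zero, negative, or NaN durations before dividing.
    if not audio_seconds > 0:
        raise ValueError("audio_seconds must be positive")
    start = time.perf_counter()
    process(audio)
    elapsed = time.perf_counter() - start
    return elapsed / audio_seconds


def fake_engine(audio):
    # Hypothetical stand-in for a real engine: it ignores the audio and just sleeps.
    time.sleep(0.05)


# Pretend the (empty) audio buffer lasts one second.
rtf = measure_rtf(fake_engine, audio=b"", audio_seconds=1.0)
print(f"RTF = {rtf:.2f}")
```

With a real engine it is probably worth repeating the measurement over several recordings and averaging, since a single run is sensitive to caching and system load.&lt;br /&gt;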
&lt;br /&gt;
Usually a state-of-the-art cloud-based speech-to-text service (Google, Azure, AWS, etc.) has RTF values between 0.2 and 0.6. Note that this depends heavily on many factors: the network/internet bandwidth, the speech content, etc. For an on-prem ASR, the main factors are the algorithm and the hardware resources (CPU/RAM).  &amp;lt;syntaxhighlight lang=&amp;quot;python&amp;quot;&amp;gt;&lt;br /&gt;
def real_time_factor(processing_time, audio_length, decimals=2):&lt;br /&gt;
&lt;br /&gt;
    &#039;&#039;&#039; Real-Time Factor (RTF) is defined as processing-time / length-of-audio (both in the same time unit). &#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
    # divide processing time by audio duration; audio_length must be non-zero&lt;br /&gt;
    rtf = processing_time / audio_length&lt;br /&gt;
&lt;br /&gt;
    return round(rtf, decimals)&lt;br /&gt;
&amp;lt;/syntaxhighlight&amp;gt;&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Conversational_AI&amp;diff=2041</id>
		<title>Conversational AI</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Conversational_AI&amp;diff=2041"/>
		<updated>2021-12-10T07:41:11Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&#039;&#039;Conversational AI&#039;&#039;, short for &#039;&#039;Conversational Artificial Intelligence&#039;&#039;, is an umbrella term that has become widespread in recent years. It covers all technologies around speech recognition (ASR), synthetic voice generation (TTS), natural language generation (NLG), dialog management (DM), chatbots, [[voicebots]] and multimodal assistants in general.&lt;br /&gt;
&lt;br /&gt;
I&#039;m not fully sure, but the term was probably &amp;quot;coined&amp;quot; within [[IBM Watson]] (TBV) and used as a synonym of &#039;&#039;Conversational Computing&#039;&#039;, another term used at the time at IBM that did not catch on (TBV).&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=RASA&amp;diff=2040</id>
		<title>RASA</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=RASA&amp;diff=2040"/>
		<updated>2021-12-10T07:38:15Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;quot;&#039;&#039;Open source machine learning tools for developers to build, improve, and deploy text-and voice-based chatbots and assistants&#039;&#039;&amp;quot;. (cit. [https://github.com/RasaHQ/ RASA github home page]).&lt;br /&gt;
&lt;br /&gt;
RASA is probably the most important open-source tool for developing &amp;quot;task-oriented&amp;quot; conversational applications. Despite the official RASA statement, the original project was not conceived to manage voice interactions, but only [[chatbots]], with some support for GUI elements/buttons.  &lt;br /&gt;
&lt;br /&gt;
The RASA architecture consists of two main components: &lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;RASA NLU&#039;&#039; is based upon the [https://rasa.com/blog/introducing-dual-intent-and-entity-transformer-diet-state-of-the-art-performance-on-a-lightweight-architecture/ DIET algorithm], a refined state-of-the-art intent/entities classifier&lt;br /&gt;
* &#039;&#039;RASA Core&#039;&#039; (now called &#039;&#039;RASA Dialog Manager&#039;&#039;) is based on the [https://rasa.com/blog/unpacking-the-ted-policy-in-rasa-open-source/ TED policy], a machine learning algorithm for managing multi-turn dialogs. Instead of the traditional state-machine approach, it lets conversation developers provide &amp;quot;&#039;&#039;stories&#039;&#039;&amp;quot;, sets of intent-action sequences (conversation examples). With [https://rasa.com/blog/were-a-step-closer-to-getting-rid-of-intents/ end-to-end training], developers program the conversational agent&#039;s [[dialog manager]] by giving end-to-end turn-taking examples (the stories).&lt;br /&gt;
&lt;br /&gt;
Within a few years RASA has gathered a huge open community of developers and researchers. It&#039;s probably the biggest open source project for developing on-premise &amp;quot;production-ready&amp;quot; complex dialog systems. The whole development ecosystem is built around the [[Python]] programming language.&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
* Home page: https://rasa.com/&lt;br /&gt;
&lt;br /&gt;
* Github: https://github.com/RasaHQ/&lt;br /&gt;
&lt;br /&gt;
* Community forum: https://forum.rasa.com/&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Glossary_of_voice_tech&amp;diff=2037</id>
		<title>Glossary of voice tech</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Glossary_of_voice_tech&amp;diff=2037"/>
		<updated>2021-12-09T16:09:12Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: /* Voice assistant terms */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Category:Open Voice Tech]]&lt;br /&gt;
&lt;br /&gt;
In the field of voice technology there are lots of buzzwords. Some are self-explanatory, others regularly lead to confusion. This list is intended as a glossary.&lt;br /&gt;
&lt;br /&gt;
==General terms==&lt;br /&gt;
&lt;br /&gt;
*[[:Category:Dataset|Dataset]]&lt;br /&gt;
*[[Research papers|Papers]] (&#039;&#039;research papers&#039;&#039;)&lt;br /&gt;
*[[Phonemes]]&lt;br /&gt;
*[[Model]]&lt;br /&gt;
*[[Checkpoint]]&lt;br /&gt;
*[[Repository]]&lt;br /&gt;
&lt;br /&gt;
==STT terms==&lt;br /&gt;
&lt;br /&gt;
*[[:Category:Wake words|Wake word]]&lt;br /&gt;
*[[Hotword]]&lt;br /&gt;
*[[Voice print]]&lt;br /&gt;
*[[Word error rate]] (&#039;&#039;WER&#039;&#039;)&lt;br /&gt;
*[[Diarization]]&lt;br /&gt;
*[[Barge-in]]&lt;br /&gt;
&lt;br /&gt;
==TTS terms==&lt;br /&gt;
&lt;br /&gt;
*&lt;br /&gt;
&lt;br /&gt;
==Voice assistant terms==&lt;br /&gt;
&lt;br /&gt;
*[[Conversational AI]]&lt;br /&gt;
*[[Natural language understanding]] (&#039;&#039;NLU&#039;&#039;)&lt;br /&gt;
*[[Utterance]]&lt;br /&gt;
*[[Voiceonly]]&lt;br /&gt;
&lt;br /&gt;
==Machine learning==&lt;br /&gt;
&lt;br /&gt;
*[[Epoch]]&lt;br /&gt;
*[[Step]]&lt;br /&gt;
*[[Batch size]]&lt;br /&gt;
*[[Learning rate]]&lt;br /&gt;
*[[Inference]]&lt;br /&gt;
*[[Alignment]]&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Conversational_AI&amp;diff=2036</id>
		<title>Conversational AI</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Conversational_AI&amp;diff=2036"/>
		<updated>2021-12-09T16:07:34Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: definition of Conversational AI&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&#039;&#039;Conversational AI&#039;&#039;, short for &#039;&#039;Conversational Artificial Intelligence&#039;&#039;, is an umbrella term that has become widespread in recent years. It covers all technologies around speech recognition (ASR), synthetic voice generation (TTS), natural language generation (NLG), dialog management (DM), chatbots, voicebots and multimodal assistants in general.&lt;br /&gt;
&lt;br /&gt;
I&#039;m not fully sure, but the term was probably &amp;quot;coined&amp;quot; at IBM (Watson) and has at times been used as a synonym of &#039;&#039;Conversational Computing&#039;&#039;. TBV.&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Glossary_of_voice_tech&amp;diff=2035</id>
		<title>Glossary of voice tech</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Glossary_of_voice_tech&amp;diff=2035"/>
		<updated>2021-12-09T16:00:31Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: update of some keywords&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Category:Open Voice Tech]]&lt;br /&gt;
&lt;br /&gt;
In the field of voice technology there are lots of buzzwords. Some are self-explanatory, others regularly lead to confusion. This list is intended as a glossary.&lt;br /&gt;
&lt;br /&gt;
==General terms==&lt;br /&gt;
&lt;br /&gt;
*[[:Category:Dataset|Dataset]]&lt;br /&gt;
*[[Research papers|Papers]] (&#039;&#039;research papers&#039;&#039;)&lt;br /&gt;
*[[Phonemes]]&lt;br /&gt;
*[[Model]]&lt;br /&gt;
*[[Checkpoint]]&lt;br /&gt;
*[[Repository]]&lt;br /&gt;
&lt;br /&gt;
==STT terms==&lt;br /&gt;
&lt;br /&gt;
*[[:Category:Wake words|Wake word]]&lt;br /&gt;
*[[Hotword]]&lt;br /&gt;
*[[Voice print]]&lt;br /&gt;
*[[Word error rate]] (&#039;&#039;WER&#039;&#039;)&lt;br /&gt;
*[[Diarization]]&lt;br /&gt;
*[[Barge-in]]&lt;br /&gt;
&lt;br /&gt;
==TTS terms==&lt;br /&gt;
&lt;br /&gt;
*&lt;br /&gt;
&lt;br /&gt;
==Voice assistant terms==&lt;br /&gt;
&lt;br /&gt;
*[[Utterance]]&lt;br /&gt;
*[[Natural language understanding]] (&#039;&#039;NLU&#039;&#039;)&lt;br /&gt;
*[[Voiceonly]]&lt;br /&gt;
&lt;br /&gt;
==Machine learning==&lt;br /&gt;
&lt;br /&gt;
*[[Epoch]]&lt;br /&gt;
*[[Step]]&lt;br /&gt;
*[[Batch size]]&lt;br /&gt;
*[[Learning rate]]&lt;br /&gt;
*[[Inference]]&lt;br /&gt;
*[[Alignment]]&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Natural_language_understanding&amp;diff=2034</id>
		<title>Natural language understanding</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Natural_language_understanding&amp;diff=2034"/>
		<updated>2021-12-09T15:55:49Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: DIET classifier link added&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Natural Language Understanding (NLU) is a misleading term, much debated in the Conversational AI and scientific communities.&lt;br /&gt;
&lt;br /&gt;
In recent years, especially in the chatbot engineering industry, NLU has come to mean an intent/entities classifier based on machine learning techniques (transformers, etc.). The main open source, state-of-the-art example of this approach is probably the [https://rasa.com/blog/introducing-dual-intent-and-entity-transformer-diet-state-of-the-art-performance-on-a-lightweight-architecture/ RASA DIET classifier].&lt;br /&gt;
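&lt;br /&gt;
To make the input/output contract of such a classifier concrete, here is a toy Python sketch. It is only a keyword-matching stand-in, not a machine learning model and not the DIET algorithm; the intent names, keywords, city list and the parse function are all hypothetical and chosen only for illustration.&lt;br /&gt;

```python
import re

# Hypothetical intents and trigger keywords, chosen only for illustration.
INTENT_KEYWORDS = {
    "book_flight": {"book", "flight", "fly"},
    "check_weather": {"weather", "forecast", "rain"},
}
# Hypothetical entity vocabulary.
CITIES = {"rome", "genova", "berlin"}


def parse(utterance):
    """Return the best-matching intent and the city entities found in the utterance."""
    words = re.findall(r"[a-z]+", utterance.lower())
    # Score each intent by how many of its keywords occur in the utterance.
    scores = {name: len(keys.intersection(words)) for name, keys in INTENT_KEYWORDS.items()}
    best = max(scores, key=scores.get)
    # No keyword matched at all: report no intent instead of an arbitrary one.
    intent = best if scores[best] > 0 else None
    entities = [{"entity": "city", "value": w} for w in words if w in CITIES]
    return {"intent": intent, "entities": entities}


result = parse("Book a flight to Rome")
print(result)
```

A real ML-based classifier would replace the keyword scoring with a trained model, but the shape of the result (one intent plus a list of entities) is the part this sketch is meant to show.&lt;br /&gt;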
&lt;br /&gt;
However, in linguistics and in the psycholinguistic/cognitive sciences there is great skepticism about calling an ML-based classifier of intents (and entities) &amp;quot;language understanding&amp;quot;. A growing number of linguistics researchers argue that it is even impossible to understand language with machine learning techniques (the most famous and currently most debated example is probably GPT-3). One of the most active scientists in this debate is [https://ontologik.medium.com/ Walid Saba].&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Voiceonly&amp;diff=2033</id>
		<title>Voiceonly</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Voiceonly&amp;diff=2033"/>
		<updated>2021-12-09T15:53:41Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: voiceonly definition&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;A &#039;&#039;voiceonly&#039;&#039; (or &#039;&#039;voice-only&#039;&#039;) application is, as the name suggests, a software application (such as a voice-interfaced &#039;&#039;chatbot&#039;&#039;, in this case also called a &#039;&#039;voicebot&#039;&#039;) whose interface channel is exclusively voice-based, without any graphical user interface (GUI). &lt;br /&gt;
&lt;br /&gt;
In recent years, the voiceonly channel has become a synonym for &#039;&#039;smartspeakers&#039;&#039;, ever since the Amazon Echo device, a terminal of the Amazon Alexa cloud-based system, was released in 2014. Before that, the traditional voice-only channel was of course the telephone with IVR voice automation, which is still alive today. &lt;br /&gt;
&lt;br /&gt;
To be precise, the popular smartspeakers by Amazon and Google are examples of so-called &#039;&#039;voicefirst&#039;&#039; devices (not strictly voiceonly): users interact with the virtual assistants primarily via the voice channel (through the smartspeaker), but they can also interact with the same assistants using the chat interface of a mobile phone app. &lt;br /&gt;
&lt;br /&gt;
Fun fact: the term voicefirst, and especially the hashtag &#039;&#039;#voicefirst&#039;&#039;, became popular a few years ago on Twitter, maybe used initially by [https://twitter.com/BrianRoemmele Brian Roemmele]. The term soon went viral in the voice / conversational AI community.&lt;br /&gt;
&lt;br /&gt;
Digression: the step beyond the voicefirst conversational user experience (UX) is a &#039;&#039;multimodal&#039;&#039; conversational experience, where the channel is not just speech recognition or texting (as input) and synthetic voice playback or a written prompt (as output). A conversation is truly multimodal when it is really multi-sensory, where for example the input is not just text or voice but also gaze detection, gesture detection, geolocation info, ambient sensors, etc.&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=User:Solyarisoftware&amp;diff=2032</id>
		<title>User:Solyarisoftware</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=User:Solyarisoftware&amp;diff=2032"/>
		<updated>2021-12-09T11:14:43Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: personal page&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;My name is Giorgio Robino.  I&#039;m from Genova, Italy. My nickname, solyarisoftware, is a tribute to Andrej Tarkovskij&#039;s movie SOLYARIS. &lt;br /&gt;
&lt;br /&gt;
I&#039;m an engineer and researcher in &amp;quot;Conversational AI&amp;quot; verticals. I&#039;m generally interested in voice/voiceonly multimodal interfaces (ASR/TTS), but I&#039;m especially focused on dialog management and task-oriented realtime &amp;quot;assistants&amp;quot; that help human operators complete real-world working tasks. I call this kind of assistant &amp;quot;[https://docs.google.com/presentation/d/1ieZnAdREzEGXkcO4C_XPIbS9YAnE76mB0wpP2k-yOlQ/edit#slide=id.gc0058244ce_0_10 Enterprise Voice Cobots]&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
As an Italian, I&#039;m especially focused on the Italian academic and industrial ecosystem (in the natural language processing realm), and in 2016 I organized a first Italian open conference, #convcomp2016, about &amp;quot;conversational computing&amp;quot;. Afterwards I maintained the related blog [https://www.convcomp.it www.convcomp.it], where I try to share articles about topics related to chatbots/voicebots/virtual assistants. I&#039;m also pretty active on [https://www.twitter.com/solyarisoftware twitter], where I share and chat/rant only about conversational AI/voice-related stuff.&lt;br /&gt;
&lt;br /&gt;
My current day job is &amp;quot;Conversational AI Tech Lead&amp;quot; at the Italian company [https://www.almawave.it www.almawave.it], where I&#039;m now working on R&amp;amp;D projects, especially in the ehealth realm.&lt;br /&gt;
&lt;br /&gt;
Of course I&#039;m a supporter of open source and open data, also in the voice realm. As free-time side projects, I have published some small open source projects:&lt;br /&gt;
&lt;br /&gt;
* https://github.com/solyarisoftware/naifjs, simple state-machine based dialog manager, in nodejs&lt;br /&gt;
* https://github.com/solyarisoftware/voskJs, Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server&lt;br /&gt;
* https://github.com/solyarisoftware/CoquiSTTJs, Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server&lt;br /&gt;
* https://github.com/solyarisoftware/jointts, a brainless concatenative text to speech&lt;br /&gt;
* https://github.com/solyarisoftware/webad, Web Browser Audio Detection/Speech Recording Events API&lt;br /&gt;
* https://github.com/solyarisoftware/Highlight.vim, Highlight vim plugin colorizes pattern of texts, with a random or specified background colors&lt;br /&gt;
&lt;br /&gt;
Last but not least, I have been an ambient music maker as [http://solyaris.altervista.org/ SOLYARIS MUSIC].&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Natural_language_understanding&amp;diff=2025</id>
		<title>Natural language understanding</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Natural_language_understanding&amp;diff=2025"/>
		<updated>2021-12-05T18:06:34Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: NLU definition&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Natural Language Understanding (NLU) is a misleading term, much debated in the Conversational AI and scientific communities.&lt;br /&gt;
&lt;br /&gt;
In recent years, especially in the chatbot engineering industry, NLU has come to mean an intent/entities classifier based on machine learning techniques (transformers, etc.). The main open source, state-of-the-art example of this approach is probably the RASA DIET classifier.&lt;br /&gt;
&lt;br /&gt;
However, in linguistics and in the psycholinguistic/cognitive sciences there is great skepticism about calling an ML-based classifier of intents (and entities) &amp;quot;language understanding&amp;quot;. A growing number of linguistics researchers argue that it is even impossible to understand language with machine learning techniques (the most famous and currently most debated example is probably GPT-3). One of the most active scientists in this debate is [https://ontologik.medium.com/ Walid Saba].&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=RASA&amp;diff=2022</id>
		<title>RASA</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=RASA&amp;diff=2022"/>
		<updated>2021-12-05T17:50:34Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: RASA platform description&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Open source machine learning tools for developers to build, improve, and deploy text-and voice-based chatbots and assistants. (cit. RASA github home).&lt;br /&gt;
&lt;br /&gt;
RASA is probably the most important tool for developing &amp;quot;task-oriented&amp;quot; conversational applications. Despite the official RASA statement, it was not originally developed to manage voice interaction, but chatbots. RASA consists of two main components: &lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;RASA NLU&#039;&#039; is based upon DIET, a refined state-of-the-art intent/entities classifier&lt;br /&gt;
* &#039;&#039;RASA Core&#039;&#039; (now called &#039;&#039;RASA Dialog Manager&#039;&#039;) is based on TED, a machine learning algorithm for managing multi-turn dialogs. Instead of the traditional state-machine approach, it lets conversation developers provide &amp;quot;&#039;&#039;stories&#039;&#039;&amp;quot;, sets of example intent-action sequences. In a sense, developers define the conversational agent&#039;s dialog manager by giving examples (the stories).&lt;br /&gt;
&lt;br /&gt;
RASA has a huge open community of developers and researchers. It&#039;s probably the biggest open source project for developing on-premise &amp;quot;production-ready&amp;quot; complex dialog systems. The whole development ecosystem is built around the Python programming language.&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
home page: https://rasa.com/&lt;br /&gt;
&lt;br /&gt;
github: https://github.com/RasaHQ/&lt;br /&gt;
&lt;br /&gt;
community forum: https://forum.rasa.com/&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Diarization&amp;diff=2017</id>
		<title>Diarization</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Diarization&amp;diff=2017"/>
		<updated>2021-12-05T09:43:01Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: diarization definition minor correction&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&#039;&#039;Speaker diarisation&#039;&#039; (or &#039;&#039;diarization&#039;&#039;), or &#039;&#039;speaker separation&#039;&#039;, is the process of partitioning an input audio stream into homogeneous segments according to the speaker identity. It can enhance the readability of an automatic speech transcription by structuring the audio stream into speaker turns and, when used together with speaker recognition systems, by providing the speaker’s true identity.&lt;br /&gt;
 &lt;br /&gt;
Source: https://en.wikipedia.org/wiki/Speaker_diarisation&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Barge-in&amp;diff=2016</id>
		<title>Barge-in</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Barge-in&amp;diff=2016"/>
		<updated>2021-12-05T09:41:57Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: barge-in definition&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&#039;&#039;&amp;quot;Barge-in is a feature that allows callers to interrupt a prompt and provide their response before the prompt has finished playing&amp;quot;&#039;&#039; (p. 24 of the book &amp;quot;Voice User Interface Design&amp;quot; by James Giangola et al.).&lt;br /&gt;
&lt;br /&gt;
In other words, in any voice interface system / voice assistant, barge-in is the user&#039;s ability to interrupt the assistant&#039;s speech (the playback of a text-to-speech synthetic voice) in order to issue a new, overriding voice request to be processed as soon as possible.&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Diarization&amp;diff=2015</id>
		<title>Diarization</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Diarization&amp;diff=2015"/>
		<updated>2021-12-04T16:54:54Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: diarization new page&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Speaker diarisation (or diarization), or Speaker separation is the process of partitioning an input audio stream into homogeneous segments according to the speaker identity. It can enhance the readability of an automatic speech transcription by structuring the audio stream into speaker turns and, when used together with speaker recognition systems, by providing the speaker’s true identity.&lt;br /&gt;
 &lt;br /&gt;
Source: https://en.wikipedia.org/wiki/Speaker_diarisation&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Coqui&amp;diff=2009</id>
		<title>Coqui</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Coqui&amp;diff=2009"/>
		<updated>2021-12-03T17:27:57Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: introduced coaqui.ai description&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Category:Coqui]]&lt;br /&gt;
[[Category:Project]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
https://coqui.ai/ Coqui is dedicated to open speech technology and to serving as the hub where speech researchers, developers, and practitioners congregate.&lt;br /&gt;
&lt;br /&gt;
https://github.com/coqui-ai/STT The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.&lt;br /&gt;
&lt;br /&gt;
https://github.com/coqui-ai/TTS 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production&lt;br /&gt;
&lt;br /&gt;
https://github.com/coqui-ai/snakepit 🐍 Coqui&#039;s machine learning job scheduler&lt;br /&gt;
&lt;br /&gt;
---&lt;br /&gt;
&lt;br /&gt;
related projects:&lt;br /&gt;
&lt;br /&gt;
-  https://github.com/solyarisoftware/CoquiSTTJs Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.&lt;br /&gt;
&lt;br /&gt;
== TTS ==&lt;br /&gt;
Let&#039;s collect some questions related to Coqui TTS.&lt;br /&gt;
&lt;br /&gt;
* [[Continue Coqui TTS training based on checkpoint]]&lt;br /&gt;
* [[Finetune existing Coqui TTS model]]&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
	<entry>
		<id>https://openvoice-tech.net/index.php?title=Vosk&amp;diff=2008</id>
		<title>Vosk</title>
		<link rel="alternate" type="text/html" href="https://openvoice-tech.net/index.php?title=Vosk&amp;diff=2008"/>
		<updated>2021-12-03T17:08:45Z</updated>

		<summary type="html">&lt;p&gt;Solyarisoftware: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[https://github.com/alphacep/vosk-api Vosk] is an open-source speech recognition toolkit by Alphacephei&amp;lt;ref&amp;gt;https://alphacephei.com/vosk/&amp;lt;/ref&amp;gt;. Key features are:&lt;br /&gt;
&lt;br /&gt;
# Supports 20+ languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino, Ukrainian, Kazakh, Swedish. More to come.&lt;br /&gt;
# Works offline, even on lightweight devices - Raspberry Pi, Android, iOS&lt;br /&gt;
# Installs with simple &amp;lt;code&amp;gt;pip3 install vosk&amp;lt;/code&amp;gt;&lt;br /&gt;
# Portable per-language models are only 50 MB each, but much bigger server models are also available.&lt;br /&gt;
# Provides streaming API for the best user experience (unlike popular speech-recognition python packages)&lt;br /&gt;
# There are bindings for different programming languages, too - java/csharp/javascript etc.&lt;br /&gt;
# Allows quick reconfiguration of vocabulary for best accuracy.&lt;br /&gt;
# Supports speaker identification beside simple speech recognition.&lt;br /&gt;
&lt;br /&gt;
[[Category:STT]]&lt;br /&gt;
&amp;lt;references /&amp;gt;Vosk related projects&lt;br /&gt;
&lt;br /&gt;
- https://github.com/solyarisoftware/voskjs Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server&lt;/div&gt;</summary>
		<author><name>Solyarisoftware</name></author>
	</entry>
</feed>