Jump to content

Diarization: Difference between revisions

From Open Voice Technology Wiki
diarization new page
 
Formatting
 
(One intermediate revision by one other user not shown)
Line 1: Line 1:
Speaker diarisation (or diarization), or Speaker separation is the process of partitioning an input audio stream into homogeneous segments according to the speaker identity. It can enhance the readability of an automatic speech transcription by structuring the audio stream into speaker turns and, when used together with speaker recognition systems, by providing the speaker’s true identity.
''[[Speaker]] diarisation''<ref>https://en.wikipedia.org/wiki/Speaker_diarisation</ref> (or ''diarization''), or s''peaker separation'' is the process of partitioning an input audio stream into homogeneous segments according to the speaker identity. It can enhance the readability of an automatic speech transcription by structuring the audio stream into speaker turns and, when used together with speaker recognition systems, by providing the speaker’s true identity.
 
Source: https://en.wikipedia.org/wiki/Speaker_diarisation
== References ==
<references />
 
[[Category:STT]]

Latest revision as of 16:00, 5 December 2021

Speaker diarisation[1] (or diarization), or speaker separation is the process of partitioning an input audio stream into homogeneous segments according to the speaker identity. It can enhance the readability of an automatic speech transcription by structuring the audio stream into speaker turns and, when used together with speaker recognition systems, by providing the speaker’s true identity.

References[edit | edit source]

Cookies help us deliver our services. By using our services, you agree to our use of cookies.