Jump to content

TTS watermarks

From Open Voice Technology Wiki
Revision as of 19:49, 23 December 2021 by Thorsten (talk | contribs) (Added new thoughts from Erogol about watermarks.)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)


Preventing deep fakes

Based on Erogol's idea from Coqui there should be a way to identify deep-fakes in voice context. After some Twitter chatting[1] there seems one thing without doubt: "It's the old story between hacker and the people trying to prevent misusage".

Possible techniques[2]

What kind of techniques are useful for what and what's the pros and cons:

Watermark in TTS output

Easy to analyse / reproduce using original sourcecode.

Watermark in TTS dataset

Models can learn to reproduce watermark without seeing anything on that in the code.

References

Cookies help us deliver our services. By using our services, you agree to our use of cookies.