An audio deepfake is when a “cloned” voice that is potentially indistinguishable from the real person’s is used to produce synthetic audio.

Amey-Thakur/DEEPFAKE-AUDIO

DEEPFAKE-AUDIO

👉🏻 An audio deepfake is synthetic audio produced with a “cloned” voice that may be indistinguishable from the real person’s.

This project is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real time.

SV2TTS is a three-stage deep learning framework: it creates a numerical representation of a voice from a few seconds of audio, then uses that representation to condition a text-to-speech model trained to generate speech in new voices.
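
The three stages can be pictured as a simple chain of functions. The sketch below is a toy illustration only, not this project's code: the function names (`encode_speaker`, `synthesize`, `vocode`, `clone_voice`), the tiny sizes, and the arithmetic inside each stage are all assumptions chosen so the example runs with the Python standard library. In a real SV2TTS system each stage is a trained neural network.

```python
import math

EMBED_DIM = 8  # toy embedding size (assumption; real embeddings are much larger)


def encode_speaker(wav, dim=EMBED_DIM):
    """Stage 1 stand-in: turn reference audio into a fixed-size voice vector.

    Averages consecutive frames of `dim` samples and L2-normalises the result.
    """
    n_frames = len(wav) // dim
    if n_frames == 0:
        raise ValueError("reference audio is too short")
    emb = [sum(wav[f * dim + i] for f in range(n_frames)) / n_frames
           for i in range(dim)]
    norm = math.sqrt(sum(x * x for x in emb)) or 1.0
    return [x / norm for x in emb]


def synthesize(text, embed, n_mels=4):
    """Stage 2 stand-in: produce one spectrogram-like frame per character.

    Every frame is shifted by the voice vector, which is what "conditioning
    on the speaker" means in this toy version.
    """
    frames = []
    for ch in text:
        base = (ord(ch) % 128) / 128.0
        frames.append([base + embed[m % len(embed)] for m in range(n_mels)])
    return frames


def vocode(mel, hop=4, sample_rate=16000, freq=220.0):
    """Stage 3 stand-in: expand each frame into `hop` waveform samples."""
    wav = []
    for i, frame in enumerate(mel):
        amp = sum(frame) / len(frame)
        for j in range(hop):
            t = (i * hop + j) / sample_rate
            wav.append(amp * math.sin(2 * math.pi * freq * t))
    return wav


def clone_voice(reference_wav, text):
    """Chain the three stages: reference audio + text -> waveform."""
    embed = encode_speaker(reference_wav)
    mel = synthesize(text, embed)
    return vocode(mel)
```

Calling `clone_voice(reference_wav, "some text")` with a list of float samples returns a list of float samples. The point of the sketch is the data flow: the voice vector is computed once from the reference audio and then reused for any text.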


👉🏻 Created to learn how Deepfake-Audio works 👈🏻

👷 Project Authors: Amey Thakur and Mega Satish

✌🏻 Back To Engineering ✌🏻
