648: VALL-E: Uncannily Realistic Voice Imitation from a 3-Second Clip

Jan 27, 2023, 10 min

Episode description

Text-to-speech gets a groundbreaking update with Microsoft’s VALL-E. On this Five-Minute Friday, Jon Krohn investigates how the Microsoft team designed the model to replicate natural human speech from just three seconds of a person’s voice.

Additional materials: www.superdatascience.com/648

Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.