648: VALL-E: Uncannily Realistic Voice Imitation from a 3-Second Clip - podcast episode cover

648: VALL-E: Uncannily Realistic Voice Imitation from a 3-Second Clip

Jan 27, 202310 min
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

Text-to-speech gets a groundbreaking update with Microsoft’s VALL-E. On this Five-Minute Friday, Jon Krohn investigates how the Microsoft team modeled their tool to replicate natural human speech using just three seconds of a person’s voice. Additional materials: www.superdatascience.com/648 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
For the best experience, listen in Metacast app for iOS or Android
Open in Metacast
648: VALL-E: Uncannily Realistic Voice Imitation from a 3-Second Clip | Super Data Science: ML & AI Podcast with Jon Krohn - Listen or read transcript on Metacast