#44 - Data2Vec, training one model with text, audio and image. - podcast episode cover

#44 - Data2Vec, training one model with text, audio and image.

Mar 31, 202219 min
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

Hey guys, in this episode I talk about Data2Vec, a revolutionary algorithm that is able to learn from 3 different data modalities, audio, text and image and be better or comparable than previous state of the art methods. Also this algorithm uses a self-supervised approach, which means that it doesn’t use labels to the training. If you want to better understand it go listen to the episode! 


Instagram: https://www.instagram.com/podcast.lifewithai/

Linkedin: https://www.linkedin.com/company/life-with-ai

Paper: https://arxiv.org/pdf/2202.03555.pdf 

Github code and models: https://github.com/pytorch/fairseq/tree/main/examples/data2vec

For the best experience, listen in Metacast app for iOS or Android