More powerful deep learning with transformers (Ep. 84) (Rebroadcast) - podcast episode cover

More powerful deep learning with transformers (Ep. 84) (Rebroadcast)

Nov 27, 201938 minEp. 85
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

Some of the most powerful NLP models like BERT and GPT-2 have one thing in common: they all use the transformer architecture.
Such architecture is built on top of another important concept already known to the community: self-attention.
In this episode I explain what these mechanisms are, how they work and why they are so powerful.

Don't forget to subscribe to our Newsletter or join the discussion on our Discord server

 

References
For the best experience, listen in Metacast app for iOS or Android