LLM Training: Superman's Kryptonite-Proof Suit - podcast episode cover

LLM Training: Superman's Kryptonite-Proof Suit

May 29, 202319 minSeason 1Ep. 13
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

Why isn't Superman's suit Kryptonite-proof? This question reveals how large language models are trained. We break down transformers (the T in GPT), self-attention mechanisms, and the inference process—using Superman to explain why GPT-3 can generate coherent answers to questions it's never seen before. Solo episode on LLM architecture.

To stay in touch, sign up for our newsletter at https://www.superprompt.fm

For the best experience, listen in Metacast app for iOS or Android