LLM Training: Superman's Kryptonite-Proof Suit

Super Prompt: Generative AI

May 29, 2023•19 min•Season 1Ep. 13

--:--

Listen in podcast apps:

Apple Podcasts

Spotify

Download

Listen to this episode in Metacast mobile app

Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

Why isn't Superman's suit Kryptonite-proof? This question reveals how large language models are trained. We break down transformers (the T in GPT), self-attention mechanisms, and the inference process—using Superman to explain why GPT-3 can generate coherent answers to questions it's never seen before. Solo episode on LLM architecture.

To stay in touch, sign up for our newsletter at https://www.superprompt.fm

For the best experience, listen in Metacast app for iOS or Android