Episode description


Quantizing Transformers by Helping Attention Heads Do Nothing with Markus Nagel - #663 | The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) - Listen or read transcript on Metacast