Speculative Decoding and Efficient LLM Inference with Chris Lott - #717 | The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

‌

Episode description

‌

Speculative Decoding and Efficient LLM Inference with Chris Lott - #717 | The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) - Listen or read transcript on Metacast