Latent Space: The AI Engineer Podcast - podcast cover

Latent Space: The AI Engineer Podcast

Latent.Spacewww.latent.space
The podcast by and for AI Engineers! In 2025, over 10 million readers and listeners came to Latent Space to hear about news, papers and interviews in Software 3.0. We cover Foundation Models changing every domain in Code Generation, Multimodality, AI Agents, GPU Infra and more, directly from the founders, builders, and thinkers involved in pushing the cutting edge. Striving to give you both the definitive take on the Current Thing down to the first introduction to the tech you'll be using in the next 3 months! We break news and exclusive interviews from OpenAI, Anthropic, Gemini, Meta (Soumith Chintala), Sierra (Bret Taylor), tiny (George Hotz), Databricks/MosaicML (Jon Frankle), Modular (Chris Lattner), Answer.ai (Jeremy Howard), et al. Full show notes always on https://latent.space

www.latent.space
Last refreshed:
Follow this podcast in the Metacast mobile app to refresh it and see new episodes.
Download Metacast podcast app
Podcasts are better in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episodes

Truly Serverless Infra for AI Engineers - with Erik Bernhardsson of Modal

We’re writing this one day after the monster release of OpenAI’s Sora and Gemini 1.5 . We covered this on Alex Volkov ‘s ThursdAI space , so head over there for our takes. IRL: We’re ONE WEEK away from Latent Space: Final Frontiers , the second edition and anniversary of our first ever Latent Space event ! Also: join us on June 25-27 for the biggest AI Engineer conference of the year ! Online: All three Discord clubs are thriving. Join us every Wednesday/Friday ! Almost 12 years ago, while worki...

Feb 16, 20241 hr 2 min

Cloud Intelligence at the speed of 5000 tok/s - with Ce Zhang and Vipul Ved Prakash of Together AI

Our first ever demo day aimed for 15-20 people and ended up ballooning to >200 and covered in the news . We are now running the 2024 edition in SF on Feb 23 : Latent Space Final Frontiers , a startup and research competition in “The Autonomous Workforce”, ​”Beyond Transformers & GPUs”, and “​Embodied AI”. RSVP here ! You can find all LS online/IRL events on our new calendar . Super Early Bird tickets have just gone on sale for AI Engineer World’s Fair, June 25-27 ! Today we have the honor...

Feb 08, 20241 hr 3 min

Why StackOverflow usage is down 50% — with David Hsu of Retool

We are announcing the second edition of our Latent Space demo day event in SF on 2/23: Final Frontiers , a startup and research competition in “The Autonomous Workforce”, ​”Beyond Transformers & GPUs”, and “​Embodied AI”. RSVP here ! The first one was aimed for 15-20 people and ended up blowing up to >200 and covered in the Information - let’s see what a year of growth (and competition) does to the local events space in 2024. You can find all Latent Space events here , and of course get i...

Feb 01, 202458 min

The Four Wars of the AI Stack (Dec 2023 Audio Recap)

Note for Latent Space Community members: we have now soft-launched meetups in Singapore , as well as two new virtual paper club/meetups for AI in Action and LLM Paper Club . We’re also running Latent Space: Final Frontiers , our second annual demo day hackathon from last year . Edit from March 2024: We did a followup on the Four Wars on the AI Breakdown . For the first time, we are doing an audio version of monthly AI Engineering recap that we publish on Latent Space! This month it’s “The Four W...

Jan 25, 20241 hr 8 min

How to train your own Large Multimodal Model — with Hugo Laurençon & Leo Tronchon of HuggingFace M4

Latent Space is heating up! Our paper club ran into >99 person Discord limits, oops. We are also introducing 2 new online meetups: LLM Paper Club Asia for Asia timezone (led by Ivan), and AI in Action: hands-on application of AI (led by KBall). To be notified of all upcoming Latent Space events, subscribe to our new Luma calendar ( sign up for individual events, or hit the RSS icon to sync all events to calendar ). In the halcyon open research days of 2022 BC ( Before-ChatGPT ), DeepMind was ...

Jan 19, 20241 hr 12 min

RLHF 201 - with Nathan Lambert of AI2 and Interconnects

In 2023 we did a few Fundamentals episodes covering Benchmarks 101 , Datasets 101 , FlashAttention , and Transformers Math , and it turns out those were some of your evergreen favorites! So we are experimenting with more educational/survey content in the mix alongside our regular founder and event coverage . Pls request more ! We have a new calendar for events; join to be notified of upcoming things in 2024! Today we visit the shoggoth mask factory : how do transformer models go from trawling a ...

Jan 11, 20241 hr 26 min

The Accidental AI Canvas - with Steve Ruiz of tldraw

Happy 2024! We appreciated all the feedback on the listener survey ( still open, link here ) ! Surprising to see that some people’s favorite episodes were others’ least, but we’ll always work on improving our audio quality and booking great guests. Help us out by leaving reviews on Twitter , YouTube , and Apple Podcasts ! 🙏 Big thanks to Chris Anderson for the latest review - be like Chris! Note to the Audio-only Listener Because of the nature of today’s topic, it makes the most sense to follow...

Jan 05, 20241 hr 4 min

NeurIPS 2023 Recap — Top Startups

We are running an end of year listener survey ! Please let us know any feedback you have, what episodes resonated with you, and guest requests for 2024! Survey link here . We can’t think of a more Latent-Space-y way to end 2023 than with a mega episode featuring many old and new friends recapping their biggest news, achievements, and themes and memes of the year! We previously covered the Best Papers of NeurIPS 2023 , but the other part of NeurIPS being an industry friendly conference is all the...

Dec 30, 20232 hr 42 min

NeurIPS 2023 Recap — Best Papers

We are running an end of year listener survey ! Please let us know any feedback you have, what episodes resonated with you, and guest requests for 2024! Survey link here . NeurIPS 2023 took place from Dec 10–16 in New Orleans. The Latent Space crew was onsite for as many of the talks and workshops as we could attend (and more importantly, hosted cocktails and parties after hours)! Picking from the 3586 papers accepted to the conference ( available online , full schedule here) is an impossible ta...

Dec 23, 20233 hr 20 min

The AI-First Graphics Editor - with Suhail Doshi of Playground AI

We are running an end of year survey for our listeners! Please let us know any feedback you have, what episodes resonated with you, and guest requests for 2024! Survey link here! Listen to the end for a little surprise from Suhail . Before language models became all the rage in November 2022, image generation was the hottest space in AI (it was the subject of our first piece on Latent Space !) In our interview with Sharif Shameem from Lexica we talked through the launch of StableDiffusion and th...

Dec 20, 202359 min

The "Normsky" architecture for AI coding agents — with Beyang Liu + Steve Yegge of SourceGraph

We are running an end of year survey for our listeners. Let us know any feedback you have for us, what episodes resonated with you the most, and guest requests for 2024! RAG has emerged as one of the key pieces of the AI Engineer stack. Jerry from LlamaIndex called it a “hack” , Bryan from Hex compared it to “a recommendation system from LLMs” , and even LangChain started with it . RAG is crucial in any AI coding workflow. We talked about context quality for code in our Phind episode . Today’s g...

Dec 14, 20231 hr 20 min

The Busy Person's Intro to Finetuning & Open Source AI - Wing Lian, Axolotl

The Latent Space crew will be at NeurIPS on Tuesday ! Reach out with any parties and papers of interest. We have also been incubating a smol daily AI Newsletter and Latent Space University is making progress. Good open models like Llama 2 and Mistral 7B (which has just released an 8x7B MoE model ) have enabled their own sub-industry of finetuned variants for a myriad of reasons: * Ownership & Control - you take responsibility for serving the models * Privacy - not having to send data to a th...

Dec 08, 20231 hr 4 min

Notebooks = Chat++ and RAG = RecSys! — with Bryan Bischof of Hex Magic

Catch us at Modular’s ModCon next week with Chris Lattner , and join our community ! 2024 note: Hex is now hiring AI Engineers . Due to Bryan ’s very wide ranging experience in data science and AI across Blue Bottle (!), StitchFix, Weights & Biases, and now Hex Magic, this episode can be considered a two-parter. Notebooks = Chat++ We’ve talked a lot about AI UX (in our meetups , writeups , and guest posts ), and today we’re excited to dive into a new old player in AI interfaces: notebooks! D...

Nov 29, 202352 min

The State of Silicon and the GPU Poors - with Dylan Patel of SemiAnalysis

This episode came together at ~4 hrs notice since Dylan had just landed in SF and we had to setup quickly; you might notice some small audio issues in some segments, we apologize. We’re currently building our own podcast studio for 2024! 🙏 We’re ramping up our presence on Twitter and YouTube if you’d like to support us. Note: 17k people joined our emergency pod on Sam Altman’s ouster today. If Charles Dickens was alive in 2024, A Tale of Two Cities might be the divide between the “GPU poor” and...

Nov 17, 202353 min

AGI is Being Achieved Incrementally (DevDay Recap - cleaned audio)

We left a high amount of background audio in the Devday podcast , which many of you loved, but we definitely understand that some of you may have had trouble with it. Listener Klaus Breyer ran it through Auphonic with speech islolation and we figured we’d upload it as a backdated pod for people who prefer this. Of course it means that our speakers sound out of place since they now sound like they are talking loudly in a quiet room. Let us know in the comments what you think? Timestamps the clean...

Nov 08, 20232 hr 22 min

AGI is Being Achieved Incrementally (OpenAI DevDay w/ Simon Willison, Alex Volkov, Jim Fan, Raza Habib, Shreya Rajpal, Rahul Ligma, et al)

SF folks: join us at the AI Engineer Foundation’s Emergency Hackathon tomorrow and consider the Newton if you’d like to cowork in the heart of the Cerebral Arena . Our community page is up to date as usual! ~800,000 developers watched OpenAI Dev Day, ~8,000 of whom listened along live on our ThursdAI x Latent Space , and ~800 of whom got tickets to attend in person: OpenAI’s first developer conference easily surpassed most people’s lowballed expectations - they simply did everything short of ann...

Nov 08, 20232 hr 23 min

Beating GPT-4 with Open Source LLMs — with Michael Royzen of Phind

At the AI Pioneers Summit we announced Latent Space Launchpad , an AI-focused accelerator in partnership with Decibel . If you’re an AI founder of enterprise early adopter, fill out this form and we’ll be in touch with more details. We also have a lot of events coming up as we wrap up the year, so make sure to check out our community events page and come say hi! We previously interviewed the founders of many developer productivity startups embedded in the IDE, like Codium AI , Cursor , and Codei...

Nov 03, 20231 hr 7 min

Powering your Copilot for Data – with Artem Keydunov of Cube.dev

The first workshops and talks from the AI Engineer Summit are now up ! Join the >20k viewers on YouTube , find clips on Twitter (we’re also clipping @latentspacepod ), and chat with us on Discord ! Text-to-SQL was one of the first applications of NLP. Thoughtspot offered “Ask your data questions” as their core differentiation compared to traditional dashboarding tools. In a way, they provide a much friendlier interface with your own structured (aka “tabular”, as in “SQL tables”) data, the sam...

Oct 26, 202339 min

The End of Finetuning — with Jeremy Howard of Fast.ai

Thanks to the over 17,000 people who have joined the first AI Engineer Summit! A full recap is coming. Last call to fill out the State of AI Engineering survey ! See our Community page for upcoming meetups in SF, Paris and NYC . This episode had good interest on Twitter and was discussed on the Vanishing Gradients podcast . Fast.ai’s “Practical Deep Learning” courses been watched by over >6,000,000 people, and the fastai library has over 25,000 stars on Github. Jeremy Howard, one of the creat...

Oct 19, 20231 hr 9 min

Why AI Agents Don't Work (yet) - with Kanjun Qiu of Imbue

Thanks to the over 11,000 people who joined us for the first AI Engineer Summit! A full recap is coming, but you can 1) catch up on the fun and videos on Twitter and YouTube , 2) help us reach 1000 people for the first comprehensive State of AI Engineering survey and 3) submit projects for the new AI Engineer Foundation . See our Community page for upcoming meetups in SF, Paris, NYC, and Singapore . This episode had good interest on Twitter . Last month, Imbue was crowned as AI’s newest unicorn ...

Oct 14, 20231 hr 5 min

[AIE Summit Preview #2] The AI Horcrux — Swyx on Cognitive Revolution

This is a special double weekend crosspost of AI podcasts, helping attendees prepare for the AI Engineer Summit next week. After our first friendly feedswap with the Cognitive Revolution pod , swyx was invited for a full episode to go over the state of AI Engineering and to preview the AI Engineer Summit Schedule , where we share many former CogRev guests as speakers. For those seeking to understand how two top AI podcasts think about major top of mind AI Engineering topics, this should be the p...

Oct 08, 20231 hr 30 min

[AIE Summit Preview #1] Swyx on Software 3.0 and the Rise of the AI Engineer

This is a special double weekend crosspost of AI podcasts, helping attendees prepare for the AI Engineer Summit next week. Swyx gave a keynote on the Software 3.0 Landscape recently (referenced in our recent Humanloop episode ) and was invited to go deeper in podcast format, and to preview the AI Engineer Summit Schedule . For those seeking to ramp up on the current state of thinking on AI Engineering, this should be the perfect place to start, alongside our upcoming Latent Space University cour...

Oct 07, 202339 min

RAG Is A Hack - with Jerry Liu from LlamaIndex

Want to help define the AI Engineer stack ? >800 folks have weighed in on the top tools, communities and builders for the first State of AI Engineering survey, which we will present for the first time at next week’s AI Engineer Summit . Join us online ! This post had robust discussion on HN and Twitter . In October 2022, Robust Intelligence hosted an internal hackathon to play around with LLMs which led to the creation of two of the most important AI Engineering tools: LangChain 🦜⛓️ ( our in...

Oct 05, 20231 hr 8 min

Building the Foundation Model Ops Platform — with Raza Habib of Humanloop

Want to help define the AI Engineer stack? >500 folks have weighed in on the top tools, communities and builders for the first State of AI Engineering survey! Please fill it out (and help us reach 1000!) The AI Engineer Summit schedule is now live! We are running two Summits and judging two Hackathons this Oct. As usual, see our Discord and community page for all events. A rite of passage for every AI Engineer is shipping a quick and easy demo, and then having to cobble together a bunch of so...

Sep 29, 20231 hr 21 min

Heralds of the AI Content Flippening — with Youssef Rizk of Wondercraft.ai

Want to help define the AI Engineer stack? Have opinions on the top tools, communities and builders? We’re collaborating with friends at Amplify to launch the first State of AI Engineering survey ! Please fill it out (and tell your friends)! In March, we started off our GPT4 coverage framing one of this year’s key forks in the road as the “ Year of Multimodal vs Multimodel AI ”. 6 months in, neither has panned out yet. The vast majority of LLM usage still defaults to chatbots built atop OpenAI (...

Sep 20, 202353 min

Doing it the Hard Way: Making the AI engine and language 🔥 of the future — with Chris Lattner of Modular

Want to help define the AI Engineer stack? Have opinions on the top tools, communities and builders? We’re collaborating with friends at Amplify to launch the first State of AI Engineering survey! Please fill it out (and tell your friends)! If AI is so important, why is its software so bad? This was the motivating question for Chris Lattner as he reconnected with his product counterpart on Tensorflow, Tim Davis , and started working on a modular solution to the problem of sprawling, monolithic, ...

Sep 14, 20231 hr 29 min

The Point of LangChain — with Harrison Chase of LangChain

As alluded to on the pod, LangChain has just launched LangChain Hub : “the go-to place for developers to discover new use cases and polished prompts.” It’s available to everyone with a LangSmith account, no invite code necessary. Check it out ! In 2023, LangChain has speedrun the race from 2:00 to 4:00 to 7:00 Silicon Valley Time . From the back to back $10m Benchmark seed and (rumored) $20-25m Sequoia Series A in April, to back to back critiques of “ LangChain is Pointless ” and “ The Problem w...

Sep 06, 20231 hr 1 min

RWKV: Reinventing RNNs for the Transformer Era — with Eugene Cheah of UIlicious

The AI Engineer Summit Expo has been announced , presented by AutoGPT (and future guest Toran Bruce-Richards !) Stay tuned for more updates on the Summit livestream and Latent Space University . This post was on HN for 10 hours . What comes after the Transformer? This is one of the Top 10 Open Challenges in LLM Research that has been the talk of the AI community this month. Jon Frankle ( friend of the show !) has an ongoing bet with Sasha Rush on whether Attention is All You Need , and the most ...

Aug 30, 20231 hr 12 min

Cursor.so: The AI-first Code Editor — with Aman Sanger of Anysphere

Thanks to the almost 30k people who tuned in to the last episode ! Your podcast cohosts have been busy shipping: * Alessio open sourced smol-podcaster , which makes the show notes here! * swyx launched GodMode . Maybe someday the Cursor of browsers? * We’re also helping organize a Llama Finetuning Hackameetup this Saturday in anticipation of the CodeLlama release. Lastly, more speakers were announced at AI Engineer Summit ! 👀 ~46% of code typed through VS Code is written by Copilot. How do we g...

Aug 22, 202359 min

The Mathematics of Training LLMs — with Quentin Anthony of Eleuther AI

Invites are going out for AI Engineer Summit ! In the meantime, we have just announced our first Actually Open AI event with Brev.dev and Langchain, Aug 26 in our SF HQ (we’ll record talks for those remote). See you soon (and join the Discord)! Special thanks to @nearcyan for helping us arrange this with the Eleuther team. This post was on the HN frontpage for 15 hours. As startups and even VCs hoard GPUs to attract talent, the one thing more valuable than GPUs is knowing how to use them (aka, m...

Aug 16, 202351 min
For the best experience, listen in Metacast app for iOS or Android