Latent Space: The AI Engineer Podcast - podcast cover

Latent Space: The AI Engineer Podcast

swyx + Alessiowww.latent.space
The podcast by and for AI Engineers! In 2024, over 2 million readers and listeners came to Latent Space to hear about news, papers and interviews in Software 3.0. We cover Foundation Models changing every domain in Code Generation, Multimodality, AI Agents, GPU Infra and more, directly from the founders, builders, and thinkers involved in pushing the cutting edge. Striving to give you both the definitive take on the Current Thing down to the first introduction to the tech you'll be using in the next 3 months! We break news and exclusive interviews from OpenAI, Anthropic, Gemini, Meta (Soumith Chintala), Sierra (Bret Taylor), tiny (George Hotz), Databricks/MosaicML (Jon Frankle), Modular (Chris Lattner), Answer.ai (Jeremy Howard), et al. Full show notes always on https://latent.space

Episodes

How to train a Million Context LLM — with Mark Huang of Gradient.ai

AI Engineer World’s Fair in SF! Prices go up soon. Note that there are 4 tracks per day and dozens of workshops/expo sessions; the livestream will air the most stacked speaker list/AI expo floor of 2024 . Apply for free/discounted Diversity Program and Scholarship tickets here. We hope to make this the definitive technical conference for ALL AI engineers. Exactly a year ago, we declared the Beginning of Context=Infinity when Mosaic made their breakthrough training an 84k token context MPT-7B. A ...

May 30, 202458 min

ICLR 2024 — Best Papers & Talks (ImageGen, Vision, Transformers, State Space Models) ft. Durk Kingma, Christian Szegedy, Ilya Sutskever

Speakers for AI Engineer World’s Fair have been announced ! See our Microsoft episode for more info and buy now with code LATENTSPACE — we’ve been studying the best ML research conferences so we can make the best AI industry conf! Note that this year there are 4 main tracks per day and dozens of workshops/expo sessions; the free livestream will air much less than half of the content this time. Apply for free/discounted Diversity Program and Scholarship tickets here. We hope to make this the defi...

May 27, 20243 hr 38 min

Emulating Humans with NSFW Chatbots - with Jesse Silver

Disclaimer: today’s episode touches on NSFW topics. There’s no graphic content or explicit language, but we wouldn’t recommend blasting this in work environments. Product website: https://usewhisper.me/ For over 20 years it’s been an open secret that porn drives many new consumer technology innovations, from VHS and Pay-per-view to VR and the Internet . It’s been no different in AI - many of the most elite Stable Diffusion and Llama enjoyers and merging/prompting/PEFT techniques were born in the...

May 16, 202454 min

WebSim, WorldSim, and The Summer of Simulative AI — with Joscha Bach of Liquid AI, Karan Malhotra of Nous Research, Rob Haisfield of WebSim.ai

We are 200 people over our 300-person venue capacity for AI UX 2024 , but you can subscribe to our YouTube for the video recaps. Our next event, and largest EVER, is the AI Engineer World’s Fair . See you there! Parental advisory: Adult language used in the first 10 mins of this podcast . Any accounting of Generative AI that ends with RAG as its “final form” is seriously lacking in imagination and missing out on its full potential. While AI generation is very good for “spicy autocomplete” and “r...

Apr 27, 202454 min

High Agency Pydantic > VC Backed Frameworks — with Jason Liu of Instructor

We are reuniting for the 2nd AI UX demo day in SF on Apr 28. Sign up to demo here ! And don’t forget tickets for the AI Engineer World’s Fair — for early birds who join before keynote announcements ! About a year ago there was a lot of buzz around prompt engineering techniques to force structured output. Our friend Simon Willison tweeted a bunch of tips and tricks, but the most iconic one is Riley Goodside making it a matter of life or death : Guardrails ( friend of the pod and AI Engineer speak...

Apr 19, 202452 min

Supervise the Process of AI Research — with Jungwon Byun and Andreas Stuhlmüller of Elicit

Maggie, Linus, Geoffrey, and the LS crew are reuniting for our second annual AI UX demo day in SF on Apr 28. Sign up to demo here ! And don’t forget tickets for the AI Engineer World’s Fair — for early birds who join before keynote announcements! It’s become fashionable for many AI startups to project themselves as “the next Google” - while the search engine is so 2000s, both Perplexity and Exa referred to themselves as a “ research engine ” or “ answer engine ” in our NeurIPS pod . However thes...

Apr 11, 202456 min

Latent Space Chats: NLW (Four Wars, GPT5), Josh Albrecht/Ali Rohde (TNAI), Dylan Patel/Semianalysis (Groq), Milind Naphade (Nvidia GTC), Personal AI (ft. Harrison Chase — LangFriend/LangMem)

Our next 2 big events are AI UX and the World’s Fair . Join and apply to speak/sponsor! Due to timing issues we didn’t have an interview episode to share with you this week, but not to worry, we have more than enough “weekend special” content in the backlog for you to get your Latent Space fix, whether you like thinking about the big picture, or learning more about the pod behind the scenes, or talking Groq and GPUs, or AI Leadership, or Personal AI. Enjoy! AI Breakdown The indefatigable NLW had...

Apr 06, 20242 hr 45 min

Presenting the AI Engineer World's Fair — with Sam Schillace, Deputy CTO of Microsoft

TL;DR: You can now buy tickets , apply to speak , or join the expo for the biggest AI Engineer event of 2024. We’re gathering *everyone* you want to meet - see you this June. In last year’s the Rise of the AI Engineer we put our money where our mouth was and announced the AI Engineer Summit , which fortunately went well: With ~500 live attendees and over ~500k views online , the first iteration of the AI Engineer industry affair seemed to be well received. Competing in an expensive city with 3 o...

Mar 29, 202443 min

Why Google failed to make GPT-3 + why Multimodal Agents are the path to AGI — with David Luan of Adept

Our next SF event is AI UX 2024 - let’s see the new frontier for UX since last year ! Last call: we are recording a preview of the AI Engineer World’s Fair with swyx and Ben Dunphy, send any questions about Speaker CFPs and Sponsor Guides you have! Alessio is now hiring engineers for a new startup he is incubating at Decibel: Ideal candidate is an “ex-technical co-founder type”. Reach out to him for more! David Luan has been at the center of the modern AI revolution: he was the ~30th hire at Ope...

Mar 22, 202442 min

Making Transformers Sing - with Mikey Shulman of Suno

Giving computers a voice has always been at the center of sci-fi movies; “I’m sorry Dave, I’m afraid I can’t do that” wouldn’t hit as hard if it just appeared on screen as a terminal output, after all. The first electronic speech synthesizer, the Voder, was built at Bell Labs 85 years ago (1939!), and it’s…. something: We will not cover the history of Text To Speech (TTS), but the evolution of the underlying architecture has generally been Formant Synthesis → Concatenative Synthesis → Neural Net...

Mar 14, 202453 min

Top 5 Research Trends + OpenAI Sora, Google Gemini, Groq Math (Jan-Feb 2024 Audio Recap) + Latent Space Anniversary with Lindy.ai, RWKV, Pixee, Julius.ai, Listener Q&A!

We will be recording a preview of the AI Engineer World’s Fair soon with swyx and Ben Dunphy, send any questions about Speaker CFPs and Sponsor Guides you have! Alessio is now hiring engineers for a new startup he is incubating at Decibel: Ideal candidate is an ex-technical co-founder type (can MVP products end to end, comfortable with ambiguous prod requirements, etc). Reach out to him for more! Thanks for all the love on the Four Wars episode ! We’re excited to develop this new “swyx & Alessio...

Mar 09, 20241 hr 49 min

Open Source AI is AI we can Trust — with Soumith Chintala of Meta AI

Speaker CFPs and Sponsor Guides are now available for AIE World’s Fair — join us on June 25-27 for the biggest AI Engineer conference of 2024 ! Soumith Chintala needs no introduction in the ML world — his insights are incredibly accessible across Twitter , LinkedIn , podcasts , and conference talks (in this pod we’ll assume you’ll have caught up on the History of PyTorch pod from last year and cover different topics). He’s well known as the creator of PyTorch, but he's more broadly the Engineeri...

Mar 06, 20241 hr 20 min

A Brief History of the Open Source AI Hacker - with Ben Firshman of Replicate

This Friday we’re doing a special crossover event in SF with Dylan Patel of SemiAnalysis ( previous guest !), and we will do a live podcast on site. RSVP here . Also join us on June 25-27 for the biggest AI Engineer conference of the year ! Replicate is one of the most popular AI inference providers, reporting over 2 million users as of their $40m Series B with a16z . But how did they get there? The Definitive Replicate Story (warts and all) Their overnight success took 5 years of building, and ...

Feb 28, 20241 hr 10 min

Truly Serverless Infra for AI Engineers - with Erik Bernhardsson of Modal

We’re writing this one day after the monster release of OpenAI’s Sora and Gemini 1.5 . We covered this on Alex Volkov ‘s ThursdAI space , so head over there for our takes. IRL: We’re ONE WEEK away from Latent Space: Final Frontiers , the second edition and anniversary of our first ever Latent Space event ! Also: join us on June 25-27 for the biggest AI Engineer conference of the year ! Online: All three Discord clubs are thriving. Join us every Wednesday/Friday ! Almost 12 years ago, while worki...

Feb 16, 20241 hr 2 min

Cloud Intelligence at the speed of 5000 tok/s - with Ce Zhang and Vipul Ved Prakash of Together AI

Our first ever demo day aimed for 15-20 people and ended up ballooning to >200 and covered in the news . We are now running the 2024 edition in SF on Feb 23 : Latent Space Final Frontiers , a startup and research competition in “The Autonomous Workforce”, ​”Beyond Transformers & GPUs”, and “​Embodied AI”. RSVP here ! You can find all LS online/IRL events on our new calendar . Super Early Bird tickets have just gone on sale for AI Engineer World’s Fair, June 25-27 ! Today we have the honor of hos...

Feb 08, 20241 hr 3 min

Why StackOverflow usage is down 50% — with David Hsu of Retool

We are announcing the second edition of our Latent Space demo day event in SF on 2/23: Final Frontiers , a startup and research competition in “The Autonomous Workforce”, ​”Beyond Transformers & GPUs”, and “​Embodied AI”. RSVP here ! The first one was aimed for 15-20 people and ended up blowing up to >200 and covered in the Information - let’s see what a year of growth (and competition) does to the local events space in 2024. You can find all Latent Space events here , and of course get in touch...

Feb 01, 202458 min

The Four Wars of the AI Stack (Dec 2023 Audio Recap)

Note for Latent Space Community members: we have now soft-launched meetups in Singapore , as well as two new virtual paper club/meetups for AI in Action and LLM Paper Club . We’re also running Latent Space: Final Frontiers , our second annual demo day hackathon from last year . Edit from March 2024: We did a followup on the Four Wars on the AI Breakdown . For the first time, we are doing an audio version of monthly AI Engineering recap that we publish on Latent Space! This month it’s “The Four W...

Jan 25, 20241 hr 8 min

How to train your own Large Multimodal Model — with Hugo Laurençon & Leo Tronchon of HuggingFace M4

Latent Space is heating up! Our paper club ran into >99 person Discord limits, oops. We are also introducing 2 new online meetups: LLM Paper Club Asia for Asia timezone (led by Ivan), and AI in Action: hands-on application of AI (led by KBall). To be notified of all upcoming Latent Space events, subscribe to our new Luma calendar ( sign up for individual events, or hit the RSS icon to sync all events to calendar ). In the halcyon open research days of 2022 BC ( Before-ChatGPT ), DeepMind was the...

Jan 19, 20241 hr 12 min

RLHF 201 - with Nathan Lambert of AI2 and Interconnects

In 2023 we did a few Fundamentals episodes covering Benchmarks 101 , Datasets 101 , FlashAttention , and Transformers Math , and it turns out those were some of your evergreen favorites! So we are experimenting with more educational/survey content in the mix alongside our regular founder and event coverage . Pls request more ! We have a new calendar for events; join to be notified of upcoming things in 2024! Today we visit the shoggoth mask factory : how do transformer models go from trawling a ...

Jan 11, 20241 hr 26 min

The Accidental AI Canvas - with Steve Ruiz of tldraw

Happy 2024! We appreciated all the feedback on the listener survey ( still open, link here ) ! Surprising to see that some people’s favorite episodes were others’ least, but we’ll always work on improving our audio quality and booking great guests. Help us out by leaving reviews on Twitter , YouTube , and Apple Podcasts ! 🙏 Big thanks to Chris Anderson for the latest review - be like Chris! Note to the Audio-only Listener Because of the nature of today’s topic, it makes the most sense to follow...

Jan 05, 20241 hr 4 min

NeurIPS 2023 Recap — Top Startups

We are running an end of year listener survey ! Please let us know any feedback you have, what episodes resonated with you, and guest requests for 2024! Survey link here . We can’t think of a more Latent-Space-y way to end 2023 than with a mega episode featuring many old and new friends recapping their biggest news, achievements, and themes and memes of the year! We previously covered the Best Papers of NeurIPS 2023 , but the other part of NeurIPS being an industry friendly conference is all the...

Dec 30, 20232 hr 42 min

NeurIPS 2023 Recap — Best Papers

We are running an end of year listener survey ! Please let us know any feedback you have, what episodes resonated with you, and guest requests for 2024! Survey link here . NeurIPS 2023 took place from Dec 10–16 in New Orleans. The Latent Space crew was onsite for as many of the talks and workshops as we could attend (and more importantly, hosted cocktails and parties after hours)! Picking from the 3586 papers accepted to the conference ( available online , full schedule here) is an impossible ta...

Dec 23, 20233 hr 20 min

The AI-First Graphics Editor - with Suhail Doshi of Playground AI

We are running an end of year survey for our listeners! Please let us know any feedback you have, what episodes resonated with you, and guest requests for 2024! Survey link here! Listen to the end for a little surprise from Suhail . Before language models became all the rage in November 2022, image generation was the hottest space in AI (it was the subject of our first piece on Latent Space !) In our interview with Sharif Shameem from Lexica we talked through the launch of StableDiffusion and th...

Dec 20, 202359 min

The "Normsky" architecture for AI coding agents — with Beyang Liu + Steve Yegge of SourceGraph

We are running an end of year survey for our listeners. Let us know any feedback you have for us, what episodes resonated with you the most, and guest requests for 2024! RAG has emerged as one of the key pieces of the AI Engineer stack. Jerry from LlamaIndex called it a “hack” , Bryan from Hex compared it to “a recommendation system from LLMs” , and even LangChain started with it . RAG is crucial in any AI coding workflow. We talked about context quality for code in our Phind episode . Today’s g...

Dec 14, 20231 hr 20 min

The Busy Person's Intro to Finetuning & Open Source AI - Wing Lian, Axolotl

The Latent Space crew will be at NeurIPS on Tuesday ! Reach out with any parties and papers of interest. We have also been incubating a smol daily AI Newsletter and Latent Space University is making progress. Good open models like Llama 2 and Mistral 7B (which has just released an 8x7B MoE model ) have enabled their own sub-industry of finetuned variants for a myriad of reasons: * Ownership & Control - you take responsibility for serving the models * Privacy - not having to send data to a third ...

Dec 08, 20231 hr 4 min

Notebooks = Chat++ and RAG = RecSys! — with Bryan Bischof of Hex Magic

Catch us at Modular’s ModCon next week with Chris Lattner , and join our community ! 2024 note: Hex is now hiring AI Engineers . Due to Bryan ’s very wide ranging experience in data science and AI across Blue Bottle (!), StitchFix, Weights & Biases, and now Hex Magic, this episode can be considered a two-parter. Notebooks = Chat++ We’ve talked a lot about AI UX (in our meetups , writeups , and guest posts ), and today we’re excited to dive into a new old player in AI interfaces: notebooks! Depen...

Nov 29, 202352 min

The State of Silicon and the GPU Poors - with Dylan Patel of SemiAnalysis

This episode came together at ~4 hrs notice since Dylan had just landed in SF and we had to setup quickly; you might notice some small audio issues in some segments, we apologize. We’re currently building our own podcast studio for 2024! 🙏 We’re ramping up our presence on Twitter and YouTube if you’d like to support us. Note: 17k people joined our emergency pod on Sam Altman’s ouster today. If Charles Dickens was alive in 2024, A Tale of Two Cities might be the divide between the “GPU poor” and...

Nov 17, 202353 min

AGI is Being Achieved Incrementally (DevDay Recap - cleaned audio)

We left a high amount of background audio in the Devday podcast , which many of you loved, but we definitely understand that some of you may have had trouble with it. Listener Klaus Breyer ran it through Auphonic with speech islolation and we figured we’d upload it as a backdated pod for people who prefer this. Of course it means that our speakers sound out of place since they now sound like they are talking loudly in a quiet room. Let us know in the comments what you think? Timestamps the clean...

Nov 08, 20232 hr 22 min

AGI is Being Achieved Incrementally (OpenAI DevDay w/ Simon Willison, Alex Volkov, Jim Fan, Raza Habib, Shreya Rajpal, Rahul Ligma, et al)

SF folks: join us at the AI Engineer Foundation’s Emergency Hackathon tomorrow and consider the Newton if you’d like to cowork in the heart of the Cerebral Arena . Our community page is up to date as usual! ~800,000 developers watched OpenAI Dev Day, ~8,000 of whom listened along live on our ThursdAI x Latent Space , and ~800 of whom got tickets to attend in person: OpenAI’s first developer conference easily surpassed most people’s lowballed expectations - they simply did everything short of ann...

Nov 08, 20232 hr 23 min

Beating GPT-4 with Open Source LLMs — with Michael Royzen of Phind

At the AI Pioneers Summit we announced Latent Space Launchpad , an AI-focused accelerator in partnership with Decibel . If you’re an AI founder of enterprise early adopter, fill out this form and we’ll be in touch with more details. We also have a lot of events coming up as we wrap up the year, so make sure to check out our community events page and come say hi! We previously interviewed the founders of many developer productivity startups embedded in the IDE, like Codium AI , Cursor , and Codei...

Nov 03, 20231 hr 7 min
For the best experience, listen in Metacast app for iOS or Android
Open in Metacast