Why Frontier AI Still Sees Like a Toddler, w/ Andrew Dai - podcast episode cover

Why Frontier AI Still Sees Like a Toddler, w/ Andrew Dai

Jun 17, 202643 minSeason 1Ep. 62
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

AI can write code, pass exams, and summarize the web, but ask it to reason through a real-world image, and the magic often breaks. Andrew Dai, co-founder and CEO of Elorian, joins The Neuron to explain why visual reasoning may be one of the biggest unsolved problems in AI.


Andrew spent years at Google Brain and DeepMind, including work connected to Gemini and sparse mixture-of-experts systems. Now, he’s building Elorian around a simple but powerful idea: if AI is going to understand the physical world, it needs more than text-based reasoning layered on top of images.


In this episode, Corey and Grant talk with Andrew about why frontier models struggle with counting, navigation, design, engineering, charts, and physical reasoning; why scaling language models hasn’t solved vision; what a “visual chain of thought” might look like; and how better visual reasoning could accelerate robotics, satellite analysis, product design, and mechanical engineering.


Sponsored by Dell Technologies and NVIDIA. Learn more at techrepublic.com/hubs/the-enterprise-guide-to-scalable-ai/.


Sponsored by Outshift: Visit https://outshift.cisco.com/?utm_campaign=fy26q3_outshift_ww_paid-media_ioc-neuronai-outshift_podcast&utm_channel=podcast&utm_source=podcast to learn more about the Internet of Cognition.


Subscribe to The Neuron for more conversations with the people building the future of AI.

For the best experience, listen in Metacast app for iOS or Android