Episode 48: How do the latest updates to large language models stack up against each other? Matt Wolfe (https://x.com/mreflow) and Nathan Lands (https://x.com/NathanLands) are joined by Matthew Berman (https://x.com/MatthewBerman), an expert in deep-diving and testing the nuances of large language models.
In this episode, the trio discusses the recent releases of Grok 3, Claude 3.7, and GPT-4.5, analyzing their strengths, weaknesses, and unique features. Tune in to learn which model might be best for your needs, from coding and real-time information to creative writing and unbiased truth-seeking.
Check out The Next Wave YouTube Channel if you want to see Matt and Nathan on screen: https://lnk.to/thenextwavepd
—
Show Notes:
(00:00) Exploring New AI Models
(05:35) Inconsistent AI Code Performance
(06:26) Redesigning Benchmarks for Modern Models
(11:33) AI Bias Amplification on Social Media
(15:11) AI Bias and Human Oversight
(17:49) Claude 3.7: Improved Coding Abilities
(20:30) Claude Update: Better Code, Worse Chat
(23:19) Resistance to Switching IDE from VS Code
(28:05) Video Producer App Preview
(29:55) Showcasing Nvidia Digits Prototype
(34:00) GROK Model's Distributed Training
(36:31) Optimistic Perspective on Future Upgrades
(40:59) Excited for GPT-5 Launch
(42:08) Claude 3.7 Excels in Coding
—
Mentions:
Matthew Berman: https://x.com/MatthewBerman
Forward Future: https://www.forwardfuture.ai/
Grok 3: https://x.ai/blog/grok-3
Claude 3.7: https://www.anthropic.com/news/claude-3-7-sonnet
GPT-4.5: https://openai.com/index/introducing-gpt-4-5/
Perplexity: https://www.perplexity.ai/
Cursor: https://www.cursor.com/
Gemini: https://ai.google/updates/
Check out this episode on YouTube: https://www.youtube.com/watch?v=pWXT8NZFG_Y
Get the guide to build your own Custom GPT: https://clickhubspot.com/tnw
—
Check Out Matt’s Stuff:
• Future Tools - https://futuretools.beehiiv.com/
• Blog - https://www.mattwolfe.com/
• YouTube- https://www.youtube.com/@mreflow
—
Check Out Nathan's Stuff:
Newsletter: https://news.lore.com/
Blog - https://lore.com/
The Next Wave is a HubSpot Original Podcast // Brought to you by The HubSpot Podcast Network // Production by Darren Clarke // Editing by Ezra Bakker Trupiano
Grok 3 vs Claude 3.7 vs GPT-4.5: Which Update is The Best? | The Next Wave - AI and The Future of Technology podcast - Listen or read transcript on Metacast