Episode 48: How do the latest updates to large language models stack up against each other? Matt Wolfe (https://x.com/mreflow) and Nathan Lands (https://x.com/NathanLands) are joined by Matthew Berman (https://x.com/MatthewBerman), an expert in deep-diving and testing the nuances of large language models.
In this episode, the trio discusses the recent releases of Grok 3, Claude 3.7, and GPT-4.5, analyzing their strengths, weaknesses, and unique features. Tune in to learn which model might be best for your needs, from coding and real-time information to creative writing and unbiased truth-seeking.
Check out The Next Wave YouTube Channel if you want to see Matt and Nathan on screen: https://lnk.to/thenextwavepd
—
Show Notes:
(00:00) Exploring New AI Models(05:35) Inconsistent AI Code Performance(06:26) Redesigning Benchmarks for Modern Models(11:33) AI Bias Amplification on Social Media(15:11) AI Bias and Human Oversight(17:49) Claude 3.7: Improved Coding Abilities(20:30) Claude Update: Better Code, Worse Chat(23:19) Resistance to Switching IDE from VS Code(28:05) Video Producer App Preview(29:55) Showcasing Nvidia Digits Prototype(34:00) GROK Model's Distributed Training(36:31) Optimistic Perspective on Future Upgrades(40:59) Excited for GPT-5 Launch(42:08) Claude 3.7 Excels in CodingMatthew Berman: https://x.com/MatthewBermanForward Future: https://www.forwardfuture.ai/Grok 3: https://x.ai/blog/grok-3Claude 3.7: https://www.anthropic.com/news/claude-3-7-sonnetGPT-4.5: https://openai.com/index/introducing-gpt-4-5/Perplexity: https://www.perplexity.ai/Cursor: https://www.cursor.com/Gemini: https://ai.google/updates/
Check out this episode on YouTube: https://www.youtube.com/watch?v=pWXT8NZFG_Y
Get the guide to build your own Custom GPT: https://clickhubspot.com/tnw
—
Check Out Matt’s Stuff:
• Future Tools - https://futuretools.beehiiv.com/
• Blog - https://www.mattwolfe.com/
• YouTube- https://www.youtube.com/@mreflow
—
Check Out Nathan's Stuff:
Newsletter: https://news.lore.com/Blog - https://lore.com/
The Next Wave is a HubSpot Original Podcast // Brought to you by The HubSpot Podcast Network // Production by Darren Clarke // Editing by Ezra Bakker Trupiano