ThursdAI - The top AI news from the past week

📆 ThursdAI - Nov 14 - Qwen 2.5 Coder, No Walls, Gemini 1114 👑 LLM, ChatGPT OS integrations & more AI news


Listen Later

This week is a very exciting one in the world of AI news, as we get 3 SOTA models, one in overall LLM rankings, on in OSS coding and one in OSS voice + a bunch of new breaking news during the show (which we reacted to live on the pod, and as we're now doing video, you can see us freak out in real time at 59:32)

00:00 Welcome to ThursdAI

00:25 Meet the Hosts

02:38 Show Format and Community

03:18 TLDR Overview

04:01 Open Source Highlights

13:31 Qwen Coder 2.5 Release

14:00 Speculative Decoding and Model Performance

22:18 Interactive Demos and Artifacts

28:20 Training Insights and Future Prospects

33:54 Breaking News: Nexus Flow

36:23 Exploring Athene v2 Agent Capabilities

36:48 Understanding ArenaHard and Benchmarking

40:55 Scaling and Limitations in AI Models

43:04 Nexus Flow and Scaling Debate

49:00 Open Source LLMs and New Releases

52:29 FrontierMath Benchmark and Quantization Challenges

58:50 Gemini Experimental 1114 Release and Performance

01:11:28 LLM Observability with Weave

01:14:55 Introduction to Tracing and Evaluations

01:15:50 Weave API Toolkit Overview

01:16:08 Buzz Corner: Weights & Biases

01:16:18 Nous Forge Reasoning API

01:26:39 Breaking News: OpenAI's New MacOS Features

01:27:41 Live Demo: ChatGPT Integration with VS Code

01:34:28 Ultravox: Real-Time AI Conversations

01:42:03 Tilde Research and Stargazer Tool

01:46:12 Conclusion and Final Thoughts

This week also, there was a debate online, whether deep learning (and scale is all you need) has hit a wall, with folks like Ilya Sutskever being cited by publications claiming it has, folks like Yann LeCoon calling "I told you so". TL;DR? multiple huge breakthroughs later, and both Oriol from DeepMind and Sam Altman are saying "what wall?" and Heiner from X.ai saying "skill issue", there is no walls in sight, despite some tech journalism love to pretend there is. Also, what happened to Yann? 😵‍💫

Ok, back to our scheduled programming, here's the TL;DR, afterwhich, a breakdown of the most important things about today's update, and as always, I encourage you to watch / listen to the show, as we cover way more than I summarize here 🙂

TL;DR and Show Notes:

* Open Source LLMs

* Qwen Coder 2.5 32B (+5 others) - Sonnet @ home (HF, Blog, Tech Report)

* The End of Quantization? (X, Original Thread)

* Epoch : FrontierMath new benchmark for advanced MATH reasoning in AI (Blog)

* Common Corpus: Largest multilingual 2T token dataset (blog)

* NexusFlow - Athena v2 - open model suite (X, Blog, HF)

* Big CO LLMs + APIs

* Gemini 1114 is new king LLM #1 LMArena (X)

* Nous Forge Reasoning API - beta (Blog, X)

* Reuters reports "AI is hitting a wall" and it's becoming a meme (Article)

* Cursor acq. SuperMaven (X)

* This Weeks Buzz

* Weave JS/TS support is here 🙌

* Voice & Audio

* Fixie releases UltraVox SOTA (Demo, HF, API)

* Suno v4 is coming and it's bonkers amazing (Alex Song, SOTA Jingle)

* Tools demoed

* Qwen artifacts - HF Demo

* Tilde Galaxy - Interp Tool



This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit sub.thursdai.news/subscribe
...more
View all episodesView all episodes
Download on the App Store

ThursdAI - The top AI news from the past weekBy From Weights & Biases, Join AI Evangelist Alex Volkov and a panel of experts to cover everything important that happened in the world of AI from the past week

  • 4.9
  • 4.9
  • 4.9
  • 4.9
  • 4.9

4.9

12 ratings


More shows like ThursdAI - The top AI news from the past week

View all
a16z Podcast by Andreessen Horowitz

a16z Podcast

1,006 Listeners

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch by Harry Stebbings

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch

507 Listeners

Practical AI by Practical AI LLC

Practical AI

188 Listeners

Last Week in AI by Skynet Today

Last Week in AI

281 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

90 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

350 Listeners

Sharp Tech with Ben Thompson by Ben Thompson

Sharp Tech with Ben Thompson

90 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

129 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

196 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

72 Listeners

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

429 Listeners

AI For Humans: Making Artificial Intelligence Fun & Practical by Kevin Pereira & Gavin Purcell

AI For Humans: Making Artificial Intelligence Fun & Practical

234 Listeners

BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

BG2Pod with Brad Gerstner and Bill Gurley

438 Listeners

AI + a16z by a16z

AI + a16z

33 Listeners

Training Data by Sequoia Capital

Training Data

37 Listeners