Last Week in AI

#208 - Claude Integrations, ChatGPT Sycophancy, Leaderboard Cheats


Our 208th episode with a summary and discussion of last week's big AI news!

Recorded on 05/02/2025

Hosted by Andrey Kurenkov and Jeremie Harris.

Feel free to email us your questions and feedback at [email protected] and/or [email protected]

Read our text newsletter and comment on the podcast at https://lastweekin.ai/.

Join our Discord here! https://discord.gg/nTyezGSKwP

In this episode:

  • OpenAI showcases new integration capabilities in their API, enhancing the performance of LLMs and image generators with updated functionalities and improved user interfaces.
  • Analysis of OpenAI's preparedness framework reveals updates focusing on biological and chemical risks, cybersecurity, and AI self-improvement, while toning down the emphasis on persuasion capabilities.
  • Anthropic's research highlights potential security vulnerabilities in AI models, demonstrating various malicious use cases such as influence operations and hacking tool creation.
  • A detailed examination of AI competition between the US and China reveals China's impending capability to match the US in AI advancement this year, emphasizing the impact of export controls and the importance of geopolitical strategy.

Timestamps + Links:

Tools & Apps

  • (00:02:57) Anthropic lets users connect more apps to Claude
  • (00:08:20) OpenAI undoes its glaze-heavy ChatGPT update
  • (00:15:16) Baidu ERNIE X1 and 4.5 Turbo boast high performance at low cost
  • (00:19:44) Adobe adds more image generators to its growing AI family
  • (00:24:35) OpenAI makes its upgraded image generator available to developers
  • (00:27:01) xAI’s Grok chatbot can now ‘see’ the world around it

Applications & Business

  • (00:28:41) Thinking Machines Lab CEO Has Unusual Control in Andreessen-Led Deal
  • (00:33:36) Chip war heats up: Huawei 910C emerges as China’s answer to US export bans
  • (00:34:21) Huawei to Test New AI Chip
  • (00:40:17) ByteDance, Alibaba and Tencent stockpile billions worth of Nvidia chips
  • (00:43:59) Speculation mounts that Musk will raise tens of billions for AI supercomputer with 1 million GPUs: Report

Projects & Open Source

  • (00:47:14) Alibaba unveils Qwen 3, a family of ‘hybrid’ AI reasoning models
  • (00:54:14) Intellect-2
  • (01:02:07) BitNet b1.58 2B4T Technical Report
  • (01:05:33) Meta AI Introduces Perception Encoder: A Large-Scale Vision Encoder that Excels Across Several Vision Tasks for Images and Video

Research & Advancements

  • (01:06:42) The Leaderboard Illusion
  • (01:12:08) Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
  • (01:18:38) Reinforcement Learning for Reasoning in Large Language Models with One Training Example
  • (01:24:40) Sleep-time Compute: Beyond Inference Scaling at Test-time

Policy & Safety

  • (01:28:23) Every AI Datacenter Is Vulnerable to Chinese Espionage, Report Says
  • (01:32:27) OpenAI preparedness framework update
  • (01:38:31) Detecting and Countering Malicious Uses of Claude: March 2025
  • (01:46:33) Chinese AI Will Match America's
Last Week in AI, by Skynet Today
