The Changelog: Software Development, Open Source

Bringing Whisper and LLaMA to the masses (Interview)


Listen Later

This week we’re talking with Georgi Gerganov about his work on Whisper.cpp and llama.cpp. Georgi first crossed our radar with whisper.cpp, his port of OpenAI’s Whisper model in C and C++. Whisper is a speech recognition model enabling audio transcription and translation. Something we’re paying close attention to here at Changelog, for obvious reasons. Between the invite and the show’s recording, he had a new hit project on his hands: llama.cpp. This is a port of Facebook’s LLaMA model in C and C++. Whisper.cpp made a splash, but llama.cpp is growing in GitHub stars faster than Stable Diffusion did, which was a rocket ship itself.

Join the discussion

Changelog++ members get a bonus 12 minutes at the end of this episode and zero ads. Join today!

Sponsors:

  • Postman – Build APIs together — More than 20 million developers use Postman for building and using APIs. Postman simplifies each step of the API lifecycle and streamlines collaboration so you can create better APIs—faster.
  • SentrySession Replay! Rewind and replay every step of the user’s journey before and after they encountered an issue. Eliminate the guesswork and get to the root cause of an issue, faster. Use the code CHANGELOG and get the team plan free for three months.
  • FastlyOur bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com
  • Typesense – Lightning fast, globally distributed Search-as-a-Service that runs in memory. You literally can’t get any faster!
  • Featuring:

    • Georgi Gerganov – Website, GitHub, Mastodon, X
    • Adam Stacoviak – Website, GitHub, LinkedIn, Mastodon, X
    • Jerod Santo – GitHub, LinkedIn, Mastodon, X

    Show Notes:

    • ggerganov/whisper.cpp
    • examples/main
    • Arm Neon technology
    • Apple’s secret M1 coprocessor
    • ggerganov/llama.cpp
    • Introducing LLaMA: A foundational, 65-billion-parameter large language model
    • facebookresearch/llama
    • Ludacris Llama Llama Red Pajama Freestyle
    • The Changelog #506: Stable Diffusion breaks the internet with Simon Willison
    • Large language models are having their Stable Diffusion moment
    • Something missing or broken? PRs welcome!

      ...more
      View all episodesView all episodes
      Download on the App Store

      The Changelog: Software Development, Open SourceBy Changelog Media

      • 4.7
      • 4.7
      • 4.7
      • 4.7
      • 4.7

      4.7

      286 ratings


      More shows like The Changelog: Software Development, Open Source

      View all
      Software Engineering Radio by se-radio@computer.org

      Software Engineering Radio

      271 Listeners

      Software Engineering Daily by Software Engineering Daily

      Software Engineering Daily

      625 Listeners

      LINUX Unplugged by Jupiter Broadcasting

      LINUX Unplugged

      268 Listeners

      Talk Python To Me by Michael Kennedy

      Talk Python To Me

      585 Listeners

      Soft Skills Engineering by Jamison Dance and Dave Smith

      Soft Skills Engineering

      289 Listeners

      Data Engineering Podcast by Tobias Macey

      Data Engineering Podcast

      146 Listeners

      Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

      Syntax - Tasty Web Development Treats

      987 Listeners

      REWORK by 37signals

      REWORK

      210 Listeners

      Practical AI by Practical AI LLC

      Practical AI

      208 Listeners

      AWS Podcast by Amazon Web Services

      AWS Podcast

      203 Listeners

      The Stack Overflow Podcast by The Stack Overflow Podcast

      The Stack Overflow Podcast

      63 Listeners

      The Real Python Podcast by Real Python

      The Real Python Podcast

      142 Listeners

      Big Technology Podcast by Alex Kantrowitz

      Big Technology Podcast

      494 Listeners

      Training Data by Sequoia Capital

      Training Data

      40 Listeners

      The Pragmatic Engineer by Gergely Orosz

      The Pragmatic Engineer

      64 Listeners