Merge Conflict

514: Running Local LLMs in VS Code



In this episode, James and Frank dive into running AI coding models locally versus in the cloud: BYOK/OpenRouter, VS Code's chat/agent harness, model runners (Ollama, vLLM), and the practicality of 27B models on a 3090 using 4-bit quantization. They share hands-on takeaways: how recent engineering (MT/MTPLX) boosts inference to usable token rates, when auto model selection makes sense, cost and hardware trade-offs, and why local models can liberate your workflow while still needing smarter, unified tooling.
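As a rough illustration of why 4-bit quantization matters for a 27B model on a 3090, here is a back-of-the-envelope sketch of weight memory at different precisions. It is not taken from the episode: the function name `weight_memory_gb` is hypothetical, the estimate counts weights only (no KV cache, activations, or runtime overhead), and real 4-bit formats typically use slightly more than 4 bits per weight because they also store scaling factors.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in decimal gigabytes."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


RTX_3090_VRAM_GB = 24  # the RTX 3090 has 24 GB of VRAM

if __name__ == "__main__":
    for bits in (16, 8, 4):
        need = weight_memory_gb(27, bits)
        verdict = "fits" if need < RTX_3090_VRAM_GB else "does not fit"
        print(f"{bits}-bit: ~{need:.1f} GB of weights, {verdict} in {RTX_3090_VRAM_GB} GB")
```

Under these assumptions, only the 4-bit case leaves headroom on a 24 GB card, and that headroom is what the context window and runtime overhead have to share.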

Follow Us
  • Frank: Twitter, Blog, GitHub
  • James: Twitter, Blog, GitHub
  • Merge Conflict: Twitter, Facebook, Website, Chat on Discord
  • Music: Amethyst Seer - Citrine by Adventureface
  • ⭐⭐ Review Us ⭐⭐

    Machine transcription available on http://mergeconflict.fm

    Support Merge Conflict


    Merge Conflict by soundbite.fm

    4.9

    86 ratings

