May 11, 2026

514: Running Local LLMs in VS Code

55 minutes

In this episode, James and Frank dive into running AI coding models locally versus in the cloud: BYOK/OpenRouter, VS Code’s chat/agent harness, model runners (Ollama, vLLM), and the practicality of 27B models on a 3090 using 4-bit quantization. They share hands-on takeaways: how recent engineering (MT/MTPLX) boosts inference to usable token rates, when auto model selection makes sense, cost and hardware trade-offs, and why local models can liberate your workflow while still needing smarter, unified tooling.

Follow Us

Frank: Twitter, Blog, GitHub

James: Twitter, Blog, GitHub

Merge Conflict: Twitter, Facebook, Website, Chat on Discord

Music: Amethyst Seer - Citrine by Adventureface

⭐⭐ Review Us ⭐⭐

Machine transcription available on http://mergeconflict.fm

Support Merge Conflict

Merge Conflict

By soundbite.fm

4.9

8686 ratings
