
Sign up to save your podcasts
Or


Episode 517 starts with a light chat about AI avatars and new text‑to‑speech deepfakes before diving into LLM “thinking” modes—what baked‑in planning actually does, why it multiplies token costs, and when it helps or hurts. James and Frank give concrete dev advice: try low‑thinking settings, use big models for creative planning then smaller ones to execute, leverage harnesses/system prompts, and beware quantized local models often do better without thinking.
⭐⭐ Review Us ⭐⭐
Machine transcription available on http://mergeconflict.fm
Support Merge Conflict
By soundbite.fm4.9
8686 ratings
Episode 517 starts with a light chat about AI avatars and new text‑to‑speech deepfakes before diving into LLM “thinking” modes—what baked‑in planning actually does, why it multiplies token costs, and when it helps or hurts. James and Frank give concrete dev advice: try low‑thinking settings, use big models for creative planning then smaller ones to execute, leverage harnesses/system prompts, and beware quantized local models often do better without thinking.
⭐⭐ Review Us ⭐⭐
Machine transcription available on http://mergeconflict.fm
Support Merge Conflict

381 Listeners

288 Listeners

3,063 Listeners

3,724 Listeners

83 Listeners

985 Listeners

8,046 Listeners

210 Listeners

34 Listeners

23 Listeners

243 Listeners

2,032 Listeners

3 Listeners

18 Listeners

101 Listeners