The Programming Podcast

Why Even Staff Engineers Feel Behind on AI Right Now! (You are not alone!)


Listen Later

The “prompt-and-pray” era is over — and that’s a good thing.


In this episode, we break down why AI “magic” collapses under real production traffic (edge cases, hallucinations, messy inputs, and even infrastructure-level failures)… and what replaces it: actual AI engineering.


Danny frames the shift with four architectural pillars that make LLM features shippable and reliable:

- State orchestration (stop treating models like employees — they’re stateless CPUs)

- Constraint generation (JSON forcing, schema-driven outputs, type-safe sampling)

- Infrastructure reliability (retries, backoff, fallbacks — because inference can and will fail)

- Regression testing & evals (measure prompts like code, break builds when quality drops)


SITE https://www.programmingpodcast.com/


Stay in Touch:

📧 Have questions for the show? Or are you a business that wants to talk business?

Email us at [email protected]!


Danny Thompson

https://x.com/DThompsonDev

/ dthompsondev

www.DThompsonDev.com


Leon Noel

https://x.com/leonnoel

/ leonnoel

https://100devs.org/


📧 Have questions for the show? Or are you a business that wants to talk business?

Email us at [email protected]!


We also hit the reality of agent “throughput” vs human review bottlenecks (Phoenix Project vibes), why monolithic agents are a trap, and a listener question about networking + credibility after pitching an MVP that isn’t fully shipped yet.


If you’re building AI features for real users — not demos — this is the blueprint.


00:00 — The “prompt-and-pray” era is over

02:49 — AI hype fades: guardrails + reality

06:34 — Deterministic software vs probabilistic models

07:29 — The 4 pillars of AI engineering (overview)

11:37 — Pillar 1: state orchestration (FSM, stateless models)

20:26 — Pillar 2: constraint generation (JSON, schemas, type safety)

28:28 — Pillar 3: infra reliability (retries, fallbacks, failures)

32:21 — Pillar 4: evals + regression testing (LLM-as-judge)

43:40 — Listener question: networking, MVP pressure, and credibility

...more
View all episodesView all episodes
Download on the App Store

The Programming PodcastBy The Programming Podcast

  • 4.9
  • 4.9
  • 4.9
  • 4.9
  • 4.9

4.9

65 ratings


More shows like The Programming Podcast

View all
Freakonomics Radio by Freakonomics Radio + Stitcher

Freakonomics Radio

32,084 Listeners

Hidden Brain by Hidden Brain, Shankar Vedantam

Hidden Brain

43,657 Listeners

Marketplace by Marketplace

Marketplace

8,772 Listeners

Software Engineering Radio - the podcast for professional software developers by team@se-radio.net (SE-Radio Team)

Software Engineering Radio - the podcast for professional software developers

274 Listeners

The Changelog: Software Development, Open Source by Changelog Media

The Changelog: Software Development, Open Source

287 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

624 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

582 Listeners

Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

Syntax - Tasty Web Development Treats

989 Listeners

The freeCodeCamp Podcast by freeCodeCamp.org

The freeCodeCamp Podcast

486 Listeners

CoRecursive: Coding Stories by Adam Gordon Bell - Software Developer

CoRecursive: Coding Stories

189 Listeners

Practical AI by Practical AI LLC

Practical AI

205 Listeners

My First Million by Hubspot Media

My First Million

2,647 Listeners

The Stack Overflow Podcast by The Stack Overflow Podcast

The Stack Overflow Podcast

62 Listeners

The Startup Ideas Podcast by Greg Isenberg

The Startup Ideas Podcast

204 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

636 Listeners