Machine Learning Guide

MLA 024 Agentic Software Engineering


Listen Later

Agentic engineering shifts the developer role from manual coding to orchestrating AI agents that automate the full software lifecycle from ticket to deployment. Using Claude Code with MCP servers and git worktrees allows a single person to manage the output and quality of an entire engineering organization.

Links
  • Notes and resources at ocdevel.com/mlg/mla-24
  • Try a walking desk - stay healthy & sharp while you learn & code
  • Generate a podcast - use my voice to listen to any AI generated content you want
The Shift: Agentic Engineering

Andrej Karpathy transitioned from "vibe coding" in February 2025 to "agentic engineering" in February 2026. This shift represents moving from casual AI use to using agents as the primary production coding interface. The goal is to automate the software engineering lifecycle, allowing a single person to manage system design and outcomes while agents handle implementation.

Tooling and Context Efficiency

Minimize MCP servers to preserve context. 12 active servers consume 66,000 tokens, which is one-third of Claude's 200K window. Lazy-loading MCP definitions reduces usage by up to 95%.

  • GitHub MCP: Accesses GitHub API for PR creation, issue management, and Actions.
  • Context7: Fetches version-specific documentation to prevent hallucinations in libraries like React or Prisma.
  • Sequential Thinking: Forces structured reasoning for complex architecture decisions.
  • Playwright: Performs browser automation for E2E testing and UI debugging.
  • Memory: Local knowledge-graph for persistent project context across sessions.
  • Hooks: PostToolUse auto-formats code via Prettier. PreToolUse blocks dangerous commands like rm -rf or writes to .env. SessionStart with a compact matcher re-injects instructions after context compaction.
High-Impact Workflows
  • Plan-First Mode: Use Shift-Tab for read-only exploration. Create TODOs and milestones before implementation to reduce backtracking.
  • Git Worktrees: Claude Code supports parallel sessions via the --worktree flag. This allows 3 to 5 simultaneous agents to work on different branches in a single repository.
  • Headless Mode: Use the --print flag and JSON formatting to script Claude into external automation or CI/CD pipelines.
The Automated Engineering Pipeline
  1. Trigger: Issues are filed or labels like claude-autofix are applied. Tools like n8n or OpenClaw can also trigger sessions via webhooks or Slack.
  2. Implementation: Claude plans, implements changes, and writes tests in an isolated worktree.
  3. Self-Review: The code-review plugin runs four parallel agents to score changes for correctness and security.
  4. CI and Auto-Fix: Claude monitors CI status, auto-fixes failures, and merges PRs to staging via squash once checks pass.
  5. Human Gate: The engineer reviews the accumulated changes in the staging branch before merging to main for production deployment.
Career Transition

The role of the engineer moves from writing code to acting as an engineering operator. Daily work involves triaging issues, making architectural judgment calls, and optimizing the automation system. Maintaining a CLAUDE.md file under 100 lines ensures maximum token efficiency and performance for the agentic team.

...more
View all episodesView all episodes
Download on the App Store

Machine Learning GuideBy OCDevel

  • 4.9
  • 4.9
  • 4.9
  • 4.9
  • 4.9

4.9

772 ratings


More shows like Machine Learning Guide

View all
The Changelog: Software Development, Open Source by Changelog Media

The Changelog: Software Development, Open Source

288 Listeners

Data Skeptic by Kyle Polich

Data Skeptic

478 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

625 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

583 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

299 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

348 Listeners

Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

Syntax - Tasty Web Development Treats

990 Listeners

AI Today Podcast by AI & Data Today

AI Today Podcast

159 Listeners

DataFramed by DataCamp

DataFramed

268 Listeners

Practical AI by Practical AI LLC

Practical AI

215 Listeners

Google DeepMind: The Podcast by Hannah Fry

Google DeepMind: The Podcast

200 Listeners

The Real Python Podcast by Real Python

The Real Python Podcast

139 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

99 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

229 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

666 Listeners