June 19, 2026

Sandboxing, Agent Harnesses, and Agent Teamwork

Listen Later

1 hour 19 minutes

Shahram Anver is the Co-Founder and CEO of Cleric, the autonomous AI SRE that investigates and root-causes production issues like an experienced teammate — often in under two minutes. Before Cleric, Shahram led MLOps, DevOps, and FinOps platform engineering at Gojek, Southeast Asia's super-app. In this conversation, he breaks down why production operations never kept pace with AI-accelerated development, and why the real unlock for an AI SRE isn't faster triage — it's an agent that *learns* and compounds operational memory across your whole org.

In this episode:

🔧 The on-call problem — Why one broken service still drags ten engineers onto a call, and how AI changes that

🤖 What an AI SRE actually is — How Cleric investigates across your existing observability stack instead of adding another tool

🧠 Learning over MTTR — Why Shahram argues the value isn't alert triage, it's an agent that gets better every investigation

🪜 Ramping like a new engineer — Explore the environment, learn from the work, talk to the team

🔁 The investigate–measure–learn loop — Turning what worked on one incident into context for the next

🕸️ Knowledge graphs & operational memory — Mapping teams, clusters, and dependencies so insight from one team helps another

⚡ Under two minutes to root cause — What "fast" really requires in a live production environment

🚀 The road to autonomy — From assisted investigation toward self-healing infrastructure

If you're an SRE, platform engineer, DevOps lead, or anyone building or buying AI agents for production, this one's for you.

🔗 Links & Resources

Cleric: https://cleric.ai

Shahram on LinkedIn: https://www.linkedin.com/in/shahramanver/

Willem Pienaar (Co-Founder/CTO): https://www.linkedin.com/in/willempienaar/

Cleric launches the first self-learning AI SRE: https://cleric.ai/blog/cleric-launches-the-first-self-learning-ai-sre

MLOps Community: https://mlops.community

Join the community: https://go.mlops.community/slack

⏱️ Timestamps

[00:00] Tech Jargon Confusion

[00:27] Harness vs Model

[08:48] Model Evolution in Cleric

[13:36] Sandboxing and Simulated Environments

[20:40] Shifting AI Perceptions

[24:10] Managing Humans vs Agents

[31:32] Steering Parallel Agents

[34:16] Human Decision Integration in Models

[43:28] 80/20 Data Split

[49:40] Becoming a Skill

[53:35] 2027 Agent Autonomy

[59:14] Agent Learning in Production

[1:04:31] Software as Personal Capabilities

[1:08:31] Vibe Coding vs Durability

[1:18:23] Wrap up

#AISRE #SiteReliabilityEngineering #AIAgents

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

MLOps.community

By Demetrios

4.6

2323 ratings

June 19, 2026

Sandboxing, Agent Harnesses, and Agent Teamwork

Listen Later

1 hour 19 minutes

Shahram Anver is the Co-Founder and CEO of Cleric, the autonomous AI SRE that investigates and root-causes production issues like an experienced teammate — often in under two minutes. Before Cleric, Shahram led MLOps, DevOps, and FinOps platform engineering at Gojek, Southeast Asia's super-app. In this conversation, he breaks down why production operations never kept pace with AI-accelerated development, and why the real unlock for an AI SRE isn't faster triage — it's an agent that *learns* and compounds operational memory across your whole org.

In this episode:

🔧 The on-call problem — Why one broken service still drags ten engineers onto a call, and how AI changes that

🤖 What an AI SRE actually is — How Cleric investigates across your existing observability stack instead of adding another tool

🧠 Learning over MTTR — Why Shahram argues the value isn't alert triage, it's an agent that gets better every investigation

🪜 Ramping like a new engineer — Explore the environment, learn from the work, talk to the team

🔁 The investigate–measure–learn loop — Turning what worked on one incident into context for the next

🕸️ Knowledge graphs & operational memory — Mapping teams, clusters, and dependencies so insight from one team helps another

⚡ Under two minutes to root cause — What "fast" really requires in a live production environment

🚀 The road to autonomy — From assisted investigation toward self-healing infrastructure

If you're an SRE, platform engineer, DevOps lead, or anyone building or buying AI agents for production, this one's for you.

🔗 Links & Resources

Cleric: https://cleric.ai

Shahram on LinkedIn: https://www.linkedin.com/in/shahramanver/

Willem Pienaar (Co-Founder/CTO): https://www.linkedin.com/in/willempienaar/

Cleric launches the first self-learning AI SRE: https://cleric.ai/blog/cleric-launches-the-first-self-learning-ai-sre

MLOps Community: https://mlops.community

Join the community: https://go.mlops.community/slack

⏱️ Timestamps

[00:00] Tech Jargon Confusion

[00:27] Harness vs Model

[08:48] Model Evolution in Cleric

[13:36] Sandboxing and Simulated Environments

[20:40] Shifting AI Perceptions

[24:10] Managing Humans vs Agents

[31:32] Steering Parallel Agents

[34:16] Human Decision Integration in Models

[43:28] 80/20 Data Split

[49:40] Becoming a Skill

[53:35] 2027 Agent Autonomy

[59:14] Agent Learning in Production

[1:04:31] Software as Personal Capabilities

[1:08:31] Vibe Coding vs Durability

[1:18:23] Wrap up

#AISRE #SiteReliabilityEngineering #AIAgents

...more

More shows like MLOps.community

This Week in Startups by Jason Calacanis

This Week in Startups

1,292 Listeners

The Changelog: Software Development, Open Source by Changelog Media

The Changelog: Software Development, Open Source

288 Listeners

The a16z Show by Andreessen Horowitz

The a16z Show

1,095 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

624 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

583 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

301 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

345 Listeners

Practical AI by Practical AI LLC

Practical AI

213 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

563 Listeners

Big Technology Podcast by Alex Kantrowitz

Big Technology Podcast

507 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

146 Listeners

Latent Space: The AI Engineer Podcast by Latent.Space

Latent Space: The AI Engineer Podcast

100 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

227 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

689 Listeners

AI + a16z by a16z

AI + a16z

32 Listeners