Deep dive into HolmesGPT, the CNCF Sandbox AI agent that revolutionizes cloud-native troubleshooting. This episode covers what it is, its 40+ integrations, the project roadmap, and how to set it up today.
AirFrance-KLM's secure automation platform with Terraform, Vault, and AnsibleAWS ECS tmpfs mounts on Fargate for secure secrets handlingQwen 30B running on Raspberry Pi - democratizing edge AIAWS European Sovereign Cloud with independent EU governanceCNCF Sandbox project (accepted October 2025) with 1,600+ GitHub starsAgentic architecture: creates investigation task lists, queries systems, synthesizes findings40+ built-in toolsets: Prometheus, Grafana Loki/Tempo, Kubernetes, ArgoCD, DataDog, and morePrivacy-first: bring your own LLM keys, read-only access, respects RBACEnd-to-end automation with AlertManager, PagerDuty, OpsGenie integrationInstallation options: pip, Homebrew, Helm, Web UI, K9s pluginHolmesGPT GitHubHolmesGPT DocumentationFull TranscriptEpisode Type: full Episode Number: 83 Season: 1 Tags: HolmesGPT, CNCF, Kubernetes, root cause analysis, AI ops, troubleshooting, observability, SRE, platform engineering, Robusta, agentic AI