
Sign up to save your podcasts
Or


In this episode, Ole Lensmar joins Richard Li, founder of Polar Sky, to explore how testing strategies must evolve for the age of AI. They discuss the journey of cloud-native tools like Telepresence and Richard's key insight for agentic coding: using automated tests as the "context" to guide AI behavior, rather than relying on complex prompts to describe system rules.
The conversation also digs into the mechanics of building AI-driven software, emphasizing why "evals" are critical for measuring success. Richard shares practical strategies for treating evaluations as data management problems and explains how his team uses simple AI agents to "babysit" CI pipelines and automatically resolve flaky tests.
Topics discussed:
By TestkubeIn this episode, Ole Lensmar joins Richard Li, founder of Polar Sky, to explore how testing strategies must evolve for the age of AI. They discuss the journey of cloud-native tools like Telepresence and Richard's key insight for agentic coding: using automated tests as the "context" to guide AI behavior, rather than relying on complex prompts to describe system rules.
The conversation also digs into the mechanics of building AI-driven software, emphasizing why "evals" are critical for measuring success. Richard shares practical strategies for treating evaluations as data management problems and explains how his team uses simple AI agents to "babysit" CI pipelines and automatically resolve flaky tests.
Topics discussed: