The Agentic Mesh Podcast

Episode 14: Role-Based Evaluation Framework for Agents – Agentic Process Automation Part 4



Agentic Process Automation puts agents inside enterprise workflows at scale — thousands of them, making decisions, routing work, invoking tools. Evaluating whether each one is safe to operate requires matching performance criteria to the governance risk each agent actually carries.

That matching is the core problem. An agent can complete tasks quickly while violating policy. It can remain compliant while consuming too much compute to be viable. It can appear accurate while using tools incorrectly, escalating poorly, or behaving inconsistently across similar cases. A few summary metrics or anecdotal feedback are not enough. And even when the right things are measured, the meaning of a weakness depends on context. A reliability issue that is tolerable in a low-risk drafting workflow is disqualifying in a multi-step process where the agent has broad discretion. Without a way to classify the operating context and then evaluate the agent within that context, organizations drift toward either over-control or under-control.
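To make the "classify the context, then evaluate within it" idea concrete, here is a minimal Python sketch. It is only an illustration, not the framework discussed in the episode: the tier names, the two context flags, the metric fields, and every threshold value are hypothetical assumptions chosen for the example.

```python
from dataclasses import dataclass
from enum import Enum


class RiskTier(Enum):
    LOW = 1
    MEDIUM = 2
    HIGH = 3


@dataclass(frozen=True)
class OperatingContext:
    # Hypothetical context flags, assumed for illustration only.
    multi_step: bool        # agent runs a multi-step process
    broad_discretion: bool  # agent chooses actions without approval


@dataclass(frozen=True)
class AgentMetrics:
    # Hypothetical metric fields, each assumed to be measured elsewhere.
    reliability: float        # share of consistent outcomes on similar cases
    policy_compliance: float  # share of runs with no policy violation
    cost_per_task: float      # compute cost per task, arbitrary units


# Illustrative thresholds per tier; the numbers are invented, not from the source.
THRESHOLDS = {
    RiskTier.LOW: {"reliability": 0.90, "policy_compliance": 0.99, "max_cost": 0.50},
    RiskTier.MEDIUM: {"reliability": 0.95, "policy_compliance": 0.99, "max_cost": 0.50},
    RiskTier.HIGH: {"reliability": 0.99, "policy_compliance": 0.99, "max_cost": 0.50},
}


def classify(ctx: OperatingContext) -> RiskTier:
    """Step 1: map the operating context to a risk tier."""
    score = int(ctx.multi_step) + int(ctx.broad_discretion)
    return [RiskTier.LOW, RiskTier.MEDIUM, RiskTier.HIGH][score]


def evaluate(metrics: AgentMetrics, ctx: OperatingContext) -> list[str]:
    """Step 2: return the criteria the agent fails within its tier."""
    limits = THRESHOLDS[classify(ctx)]
    failures = []
    if metrics.reliability < limits["reliability"]:
        failures.append("reliability")
    if metrics.policy_compliance < limits["policy_compliance"]:
        failures.append("policy_compliance")
    if metrics.cost_per_task > limits["max_cost"]:
        failures.append("cost")
    return failures


# The same measured agent, judged in two different contexts.
agent = AgentMetrics(reliability=0.93, policy_compliance=1.0, cost_per_task=0.10)
drafting = OperatingContext(multi_step=False, broad_discretion=False)
pipeline = OperatingContext(multi_step=True, broad_discretion=True)

drafting_failures = evaluate(agent, drafting)
pipeline_failures = evaluate(agent, pipeline)
```

With these assumed numbers, the same agent passes in the low-risk drafting context and fails on reliability in the high-discretion, multi-step context, which is the point the paragraph above makes: the metric is identical, but its meaning changes with the operating context.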


☑️ Subscribe to our channel for the most up-to-date information on Agentic Mesh!

You can also listen to our podcast on:

Spotify: https://open.spotify.com/show/6C6U2fmVdxNMdo1bZpVASy?si=3cyoAireSZiBaKJ-1FtJlA

Apple Podcasts: https://podcasts.apple.com/us/podcast/the-agentic-mesh-podcast/id1874331081

Out now! Agentic Mesh (O'Reilly), written by Eric Broda and Davis Broda

👉https://www.oreilly.com/library/view/agentic-mesh/9798341621633/

👉Stay Connected with Eric and John:

https://www.linkedin.com/in/ericbroda/

https://www.linkedin.com/in/jymiller/

👉For full-length articles by Eric Broda and John Miller:

https://agenticmesh.substack.com/?utm_campaign=profile_chips



The Agentic Mesh Podcast, by Eric Broda and John Miller