Generation AI

AI Trust, Eval Frameworks, and Why Data Quality Matters



In this episode of Generation AI, hosts JC and Ardis tackle one of the most pressing concerns in higher education today: how to trust AI outputs. They explore the psychology of trust in technology, the evaluation frameworks used to measure AI accuracy, and how Retrieval Augmented Generation (RAG) helps ground AI responses in factual data. The conversation offers practical insights for higher education professionals who want to implement AI solutions but worry about accuracy and reliability. Listeners will learn how to evaluate AI systems, what questions to ask vendors, and why having public-facing content is crucial for effective AI implementation.

Introduction: The Trust Challenge in AI (00:00:06)

  • JC Bonilla and Ardis Kadiu introduce the topic of trusting AI outputs
  • Contrasting traditional predictive modeling metrics with new AI evaluation methods
  • Understanding that trust is both earned and lost through interactions

The Psychology of Trust in AI (00:03:35)

  • How human psychology frameworks for trust transfer to technology
  • Challenge appraisal (seeing AI as enhancement) versus threat appraisal (seeing AI as risky)
  • Example: How autonomous driving shows trust being built or lost through micro-decisions
  • The importance of making AI systems more predictable to humans

Evaluating AI Outputs: The Evals Framework (00:11:41)

  • Moving from traditional machine learning metrics to new evaluation methods
  • How OpenAI Evals works as a standard for measuring AI performance
  • Creating test sets with thousands of variations to check AI outputs
  • The concept of "AI checking on AI" for more thorough evaluation
  • Element451's achievement of 94-95% accuracy rates on their evaluations
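
The evaluation loop described above can be sketched in a few lines of Python. This is only a toy illustration, not the OpenAI Evals API and not Element451's actual harness: the test cases, the `toy_assistant` function, and the `contains_grader` check are all hypothetical stand-ins. In practice the test set would hold thousands of variations, and the grader could itself be a second model ("AI checking on AI") rather than a simple substring match.

```python
from typing import Callable

# Hypothetical test set: each case pairs a question with a fact
# that a correct answer must contain. All values are made up.
TEST_SET = [
    {"question": "When is the application deadline?", "expected": "January 15"},
    {"question": "Is there an application fee?", "expected": "$50"},
    {"question": "Where is the campus located?", "expected": "Springfield"},
]


def toy_assistant(question: str) -> str:
    """Stand-in for the AI system under test (canned, hypothetical answers)."""
    canned = {
        "When is the application deadline?": "The deadline is January 15.",
        "Is there an application fee?": "Yes, the fee is $50.",
        # Deliberately wrong answer so the eval has something to catch.
        "Where is the campus located?": "The campus is in Shelbyville.",
    }
    return canned.get(question, "I don't know.")


def contains_grader(answer: str, expected: str) -> bool:
    """Simplest possible grader: pass if the expected fact appears in the answer.
    A model-based grader could be swapped in here with the same signature."""
    return expected.lower() in answer.lower()


def run_eval(
    assistant: Callable[[str], str],
    grader: Callable[[str, str], bool],
    test_set: list,
) -> float:
    """Run every test case through the assistant and return the pass rate."""
    passed = sum(
        grader(assistant(case["question"]), case["expected"]) for case in test_set
    )
    return passed / len(test_set)


accuracy = run_eval(toy_assistant, contains_grader, TEST_SET)
print(f"accuracy: {accuracy:.0%}")
```

With the canned answers above, two of the three cases pass. An accuracy figure like the 94-95% mentioned in the episode is this same ratio computed over a much larger test set.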

Retrieval Augmented Generation (RAG) Explained (00:27:23)

  • RAG as an "open book exam" approach for AI systems
  • How data is processed, categorized, and made searchable
  • The importance of re-ranking information to find the most relevant content
  • How multiple documents can be combined to create accurate answers
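
The "open book exam" flow above can be sketched as retrieve, re-rank, then combine. This is a minimal sketch under loud assumptions: the documents and source names are invented, and keyword overlap stands in for the search and re-ranking steps. Real systems often use embedding-based search and a dedicated re-ranking model instead, and the final prompt would be sent to a language model rather than printed.

```python
import re

# Hypothetical public-facing content, already split into passages
# and tagged with the page each one came from.
DOCS = [
    {"source": "admissions-page", "text": "The application deadline for fall is January 15."},
    {"source": "financial-aid-page", "text": "The application fee is $50 and fee waivers are available."},
    {"source": "campus-life-page", "text": "The campus has twelve residence halls."},
]


def tokenize(text: str) -> set[str]:
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(question: str, docs: list, k: int = 2) -> list:
    """Score each passage by word overlap with the question,
    then keep the k highest-scoring passages (the re-ranking step)."""
    q_words = tokenize(question)
    scored = [(len(q_words & tokenize(d["text"])), d) for d in docs]
    ranked = sorted(
        (pair for pair in scored if pair[0] > 0),
        key=lambda pair: pair[0],
        reverse=True,
    )
    return [doc for _, doc in ranked[:k]]


def build_prompt(question: str, passages: list) -> str:
    """Combine the selected passages into one grounded prompt,
    keeping source labels so the answer can be traced back."""
    context = "\n".join(f"[{p['source']}] {p['text']}" for p in passages)
    return (
        "Answer using only the sources below and cite them.\n\n"
        f"Sources:\n{context}\n\nQuestion: {question}"
    )


question = "What is the application fee and deadline?"
top = retrieve(question, DOCS)
prompt = build_prompt(question, top)
print(prompt)
```

Here the question draws on two passages at once, and the unrelated campus-life passage is ranked out. Keeping the source label next to each passage is what makes the final answer traceable.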

Addressing Common AI Trust Concerns (00:33:31)

  • Reducing hallucinations through proper grounding in source material
  • Why "garbage in, garbage out" fears are often overblown
  • Using public-facing content as reliable data sources
  • The value of traceable sources in building confidence in AI responses

Conclusion: Building Earned Trust (00:38:11)

  • Trust in AI comes from reliability and transparency
  • The importance of asking the right questions when selecting AI partners
  • How to distinguish companies that merely talk about AI from those implementing best practices


- - - -

Connect With Our Co-Hosts:
Ardis Kadiu
https://www.linkedin.com/in/ardis/
https://twitter.com/ardis

Dr. JC Bonilla
https://www.linkedin.com/in/jcbonilla/
https://twitter.com/jbonillx

About The Enrollify Podcast Network:
Generation AI is a part of the Enrollify Podcast Network. If you like this podcast, chances are you’ll like other Enrollify shows too! 

Enrollify is made possible by Element451, the next-generation AI student engagement platform helping institutions create meaningful and personalized interactions with students. Learn more at element451.com

Attend the 2025 Engage Summit! 
The Engage Summit is the premier conference for forward-thinking leaders and practitioners dedicated to exploring the transformative power of AI in education. Explore the strategies and tools to step into the next generation of student engagement, supercharged by AI. You'll leave ready to deliver the most personalized digital engagement experience every step of the way.

Register now to secure your spot in Charlotte, NC, on June 24-25, 2025! Early bird registration ends February 1st -- https://engage.element451.com/register


Generation AI, by Ardis Kadiu and Dr. JC Bonilla
