Generation AI

AI Trust, Eval Frameworks, and Why Data Quality Matters


Listen Later

In this episode of Generation AI, hosts JC and Ardis tackle one of the most pressing concerns in higher education today: how to trust AI outputs. They explore the psychology of trust in technology, the evaluation frameworks used to measure AI accuracy, and how Retrieval Augmented Generation (RAG) helps ground AI responses in factual data. The conversation offers practical insights for higher education professionals who want to implement AI solutions but worry about accuracy and reliability. Listeners will learn how to evaluate AI systems, what questions to ask vendors, and why having public-facing content is crucial for effective AI implementation.

Introduction: The Trust Challenge in AI (00:00:06)

  • JC Bonilla and Ardis Kadiu introduce the topic of trusting AI outputs
  • Contrasting traditional predictive modeling metrics with new AI evaluation methods
  • Understanding that trust is both earned and lost through interactions

The Psychology of Trust in AI (00:03:35)

  • How human psychology frameworks for trust transfer to technology
  • Challenge appraisal (seeing AI as enhancement) versus threat appraisal (seeing AI as risky)
  • Example: How autonomous driving shows trust being built or lost through micro-decisions
  • The importance of making AI systems more predictable to humans

Evaluating AI Outputs: The Evals Framework (00:11:41)

  • Moving from traditional machine learning metrics to new evaluation methods
  • How OpenAI Evals works as a standard for measuring AI performance
  • Creating test sets with thousands of variations to check AI outputs
  • The concept of "AI checking on AI" for more thorough evaluation
  • Element451's achievement of 94-95% accuracy rates on their evaluations

Retrieval Augmented Generation (RAG) Explained (00:27:23)

  • RAG as an "open book exam" approach for AI systems
  • How data is processed, categorized, and made searchable
  • The importance of re-ranking information to find the most relevant content
  • How multiple documents can be combined to create accurate answers

Addressing Common AI Trust Concerns (00:33:31)

  • Reducing hallucinations through proper grounding in source material
  • Why "garbage in, garbage out" fears are often overblown
  • Using public-facing content as reliable data sources
  • The value of traceable sources in building confidence in AI responses

Conclusion: Building Earned Trust (00:38:11)

  • Trust in AI comes from reliability and transparency
  • The importance of asking the right questions when selecting AI partners
  • How to distinguish between companies just talking about AI versus implementing best practices


- - - -

Connect With Our Co-Hosts:
Ardis Kadiu
https://www.linkedin.com/in/ardis/
https://twitter.com/ardis

Dr. JC Bonilla
https://www.linkedin.com/in/jcbonilla/
https://twitter.com/jbonillx

About The Enrollify Podcast Network:
Generation AI is a part of the Enrollify Podcast Network. If you like this podcast, chances are you’ll like other Enrollify shows too! 

Enrollify is made possible by Element451 — The AI Workforce Platform for Higher Ed. Learn more at element451.com


Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

...more
View all episodesView all episodes
Download on the App Store

Generation AIBy Ardis Kadiu, Dr. JC Bonilla

  • 5
  • 5
  • 5
  • 5
  • 5

5

13 ratings


More shows like Generation AI

View all
The Diary Of A CEO with Steven Bartlett by DOAC

The Diary Of A CEO with Steven Bartlett

8,494 Listeners

The Real Python Podcast by Real Python

The Real Python Podcast

142 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

9,930 Listeners

Bernard Marr's Future of Business & Technology Podcast by Bernard Marr

Bernard Marr's Future of Business & Technology Podcast

7 Listeners

Haunted Cosmos by Ben Garrett & Brian Sauvé

Haunted Cosmos

3,762 Listeners

The Agentic Insider by Iridius.ai Inc

The Agentic Insider

4 Listeners