Everyday AI Podcast – An AI and ChatGPT Podcast

EP 628: What’s the best LLM for your team? 7 Steps to evaluate and create ROI for AI


Listen Later

How can you measure ROI on GenAI for your team? 🤔

Internal evaluations and intentionality. 

We've helped thousands of orgs put LLMs to work and ACTUALLY save time. On today's show, we're dishing the 7 steps you need to follow. 

What’s the best LLM for your team? 7 Steps to evaluate and create ROI for AI -- An Everyday AI chat with Jordan Wilson


Newsletter: Sign up for our free daily newsletter
More on this Episode: Episode Page
Join the discussion on LinkedIn: Thoughts on this? Join the convo on LinkedIn and connect with other AI leaders.

Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineup
Website: YourEverydayAI.com
Email The Show: [email protected]
Connect with Jordan on LinkedIn


Topics Covered in This Episode:

  1. Choosing the Right Large Language Model
  2. Evaluating LLMs for Business ROI
  3. Front-End AI Operating Systems Explained
  4. Common Traps in AI Model Evaluation
  5. Public Benchmarks for LLM Evaluation
  6. Seven-Step LLM Evaluation Framework
  7. Measuring Pre-GenAI Human Baselines
  8. Building Realistic AI Test Datasets
  9. Calculating ROI for GenAI Implementation
  10. Monthly Retesting and AI Model Updates


Timestamps:

00:00 Choosing the Right AI Model

07:02 Adapting Workflows for AI Integration

10:58 "Gemini's Versatile Modes Overview"

14:30 Avoiding AI Shiny Object Syndrome

15:36 AI Evaluation for Reliability and Improvement

20:36 "Data Testing Guide Essentials"

25:15 Realistic and Messy Data Essentials

26:06 "Building Effective AI Workspaces"

31:08 AI Evaluation and ROI Calculation

34:11 Human Oversight in AI Testing

35:52 Evaluating GenAI Use Cases

39:00 "NotebookLM: AI-Powered Idea Organizer"


Keywords:

Large Language Model, LLM, generative AI, AI operating system, front end AI models, AI evaluation, model ROI, model evaluation steps, AI benchmarks, scientific benchmarks, API connection, enterprise AI, ChatGPT, Claude, Gemini, Copilot, team AI adoption, knowledge worker AI, operating system choice, productivity modes, connectors, deep research mode, agent mode, image generation, web search, Canvas mode, advanced voice mode, business process automation, workflow evaluation, change management, AI training, 



Send Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info)

Vibe coding is dead simple. Head to AI.Studio/build to create your first app. 

Vibe coding is dead simple. Head to AI.Studio/build to create your first app. 

...more
View all episodesView all episodes
Download on the App Store

Everyday AI Podcast – An AI and ChatGPT PodcastBy Everyday AI

  • 4.8
  • 4.8
  • 4.8
  • 4.8
  • 4.8

4.8

95 ratings


More shows like Everyday AI Podcast – An AI and ChatGPT Podcast

View all
NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

333 Listeners

AI Today Podcast by AI & Data Today

AI Today Podcast

154 Listeners

Practical AI by Practical AI LLC

Practical AI

209 Listeners

The Artificial Intelligence Show by Paul Roetzer and Mike Kaput

The Artificial Intelligence Show

197 Listeners

AI Chat: ChatGPT, AI News, Artificial Intelligence, OpenAI, Machine Learning by Jaeden Schafer

AI Chat: ChatGPT, AI News, Artificial Intelligence, OpenAI, Machine Learning

151 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

224 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

605 Listeners

AI For Humans: Making Artificial Intelligence Fun & Practical by Kevin Pereira & Gavin Purcell

AI For Humans: Making Artificial Intelligence Fun & Practical

273 Listeners

A Beginner's Guide to AI by Dietmar Fischer

A Beginner's Guide to AI

55 Listeners

AI Hustle: Make Money from AI and ChatGPT, Midjourney, NVIDIA, Anthropic, OpenAI by Jaeden Schafer and Jamie McCauley

AI Hustle: Make Money from AI and ChatGPT, Midjourney, NVIDIA, Anthropic, OpenAI

172 Listeners

The TED AI Show by TED

The TED AI Show

48 Listeners

AI Applied: Covering AI News, Interviews and Tools - ChatGPT, Midjourney, Gemini, OpenAI, Anthropic by Jaeden Schafer and Conor Grennan

AI Applied: Covering AI News, Interviews and Tools - ChatGPT, Midjourney, Gemini, OpenAI, Anthropic

146 Listeners

Leveraging AI by Isar Meitis

Leveraging AI

62 Listeners

Beyond The Prompt - How to use AI in your company by Jeremy Utley & Henrik Werdelin

Beyond The Prompt - How to use AI in your company

63 Listeners

Using AI at Work: AI in the Workplace & Generative AI for Business Leaders by Chris Daigle

Using AI at Work: AI in the Workplace & Generative AI for Business Leaders

24 Listeners