Generation AI

How Do Models Get Smarter? Pre-training, Fine-tuning, Long Context, Real-time Reasoning


Listen Later

In this illuminating discussion, hosts JC Bonilla and Ardis Kadiu break down the four fundamental ways AI models become smarter: pre-training (long-term memory), context/prompting (short-term memory), real-time reasoning (inference-time processing), and fine-tuning (specialized learning). Using real-world examples from Bloomberg GPT and Apple's strategy, they explain why bigger models aren't always better and how companies can achieve remarkable results by intelligently combining these different approaches to model intelligence. Kadiu provides a masterclass in understanding AI model development, challenging common assumptions about specialized models while explaining why current AI capabilities are sufficient for most applications over the next 4-5 years.

Post-Thanksgiving Welcome and Updates (00:00:07)

  • Warm opening with hosts sharing Thanksgiving experiences
  • Discussion of family gatherings and cooking adventures
  • Setting the stage for a technical but accessible conversation

Understanding Model Intelligence: The Four Paths (00:29:06)

  • Pre-training explained as "long-term memory" for models
  • Context/prompting described as "short-term memory"
  • Real-time reasoning capabilities during inference
  • Fine-tuning as a specialized learning approach
  • How these methods combine in practical applications

Pre-training Deep Dive (00:31:07)

  • Explanation of the "P" in GPT (Generative Pre-trained Transformer)
  • How pre-training works as foundational knowledge
  • Cost implications of extensive pre-training
  • Trade-offs between model size and performance

Context and Prompting Insights (00:32:44)

  • Role of context in model performance
  • How prompting provides short-term guidance
  • Examples of effective context usage
  • Impact on model accuracy and results

Real-time Reasoning Capabilities (00:34:06)

  • How models perform inference-time reasoning
  • Internal processing and decision-making
  • Benefits of self-guided problem-solving
  • Examples of reasoning in action

Fine-tuning and Specialization (00:36:16)

  • When and why to use fine-tuning
  • Cost benefits of specialized training
  • Real-world examples of successful fine-tuning
  • Limitations and considerations

Practical Applications and Cost Considerations (00:42:26)

  • Analysis of decreasing model costs
  • Speed vs accuracy trade-offs
  • When to use which approach
  • Future trends in model development

Industry Examples and Case Studies (00:47:20)

  • Bloomberg GPT's lessons learned
  • Apple's strategic approach to AI
  • OpenAI's revenue model
  • Success factors in model deployment

Looking Forward: The Next 4-5 Years (00:49:13)

  • Current capabilities vs future needs
  • Role of evaluation and testing
  • Importance of proper tooling
  • Balance between innovation and practical application


- - - -

Connect With Our Co-Hosts:
Ardis Kadiu
https://www.linkedin.com/in/ardis/
https://twitter.com/ardis

Dr. JC Bonilla
https://www.linkedin.com/in/jcbonilla/
https://twitter.com/jbonillx

About The Enrollify Podcast Network:
Generation AI is a part of the Enrollify Podcast Network. If you like this podcast, chances are you’ll like other Enrollify shows too! 

Enrollify is made possible by Element451 — The AI Workforce Platform for Higher Ed. Learn more at element451.com


Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

...more
View all episodesView all episodes
Download on the App Store

Generation AIBy Ardis Kadiu, Dr. JC Bonilla

  • 5
  • 5
  • 5
  • 5
  • 5

5

12 ratings


More shows like Generation AI

View all
The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch by Harry Stebbings

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch

530 Listeners

The Knowledge Project by Shane Parrish

The Knowledge Project

2,664 Listeners

a16z Podcast by Andreessen Horowitz

a16z Podcast

1,081 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

302 Listeners

Founders by David Senra

Founders

2,099 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

339 Listeners

Practical AI by Practical AI LLC

Practical AI

212 Listeners

Last Week in AI by Skynet Today

Last Week in AI

303 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

133 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

96 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

209 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

558 Listeners

AI For Humans: Making Artificial Intelligence Fun & Practical by Kevin Pereira & Gavin Purcell

AI For Humans: Making Artificial Intelligence Fun & Practical

267 Listeners

Everyday AI Podcast – An AI and ChatGPT Podcast by Everyday AI

Everyday AI Podcast – An AI and ChatGPT Podcast

104 Listeners

AI Applied: Covering AI News, Interviews and Tools - ChatGPT, Midjourney, Gemini, OpenAI, Anthropic by Jaeden Schafer and Conor Grennan

AI Applied: Covering AI News, Interviews and Tools - ChatGPT, Midjourney, Gemini, OpenAI, Anthropic

134 Listeners