Data Science at Home

Leveling Up AI: Reinforcement Learning with Human Feedback (Ep. 222)


Listen Later

In this episode, we dive into the not-so-secret sauce of ChatGPT, and what makes it a different model than its predecessors in the field of NLP and Large Language Models.

We explore how human feedback can be used to speed up the learning process in reinforcement learning, making it more efficient and effective.

Whether you're a machine learning practitioner, researcher, or simply curious about how machines learn, this episode will give you a fascinating glimpse into the world of reinforcement learning with human feedback.

 

Sponsors

This episode is supported by How to Fix the Internet, a cool podcast from the Electronic Frontier Foundation and Bloomberg, global provider of financial news and information, including real-time and historical price data, financial data, trading news, and analyst coverage.

 

References

Learning through human feedback

https://www.deepmind.com/blog/learning-through-human-feedback

 

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

https://arxiv.org/abs/2204.05862

...more
View all episodesView all episodes
Download on the App Store

Data Science at HomeBy Francesco Gadaleta

  • 4.2
  • 4.2
  • 4.2
  • 4.2
  • 4.2

4.2

72 ratings


More shows like Data Science at Home

View all
On Point with Meghna Chakrabarti by WBUR

On Point with Meghna Chakrabarti

4,022 Listeners

Making Sense with Sam Harris by Sam Harris

Making Sense with Sam Harris

26,380 Listeners

Nature Podcast by Springer Nature Limited

Nature Podcast

756 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

626 Listeners

Science Vs by Spotify Studios

Science Vs

12,130 Listeners

Science Friday by Science Friday and WNYC Studios

Science Friday

6,467 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

306 Listeners

The Daily by The New York Times

The Daily

113,121 Listeners

Up First from NPR by NPR

Up First from NPR

56,944 Listeners

The Atlantic Interview by The Atlantic

The Atlantic Interview

14 Listeners

Modern Wisdom by Chris Williamson

Modern Wisdom

4,025 Listeners

The Peter Attia Drive by Peter Attia, MD

The Peter Attia Drive

8,043 Listeners

Practical AI by Practical AI LLC

Practical AI

212 Listeners

Consider This from NPR by NPR

Consider This from NPR

6,462 Listeners

The Ezra Klein Show by New York Times Opinion

The Ezra Klein Show

16,525 Listeners