Data Science at Home

Leveling Up AI: Reinforcement Learning with Human Feedback (Ep. 222)


In this episode, we dive into the not-so-secret sauce of ChatGPT and what sets it apart from its predecessors in NLP and large language models.

We explore how human feedback can be used to speed up the learning process in reinforcement learning, making it more efficient and effective.

Whether you're a machine learning practitioner, researcher, or simply curious about how machines learn, this episode will give you a fascinating glimpse into the world of reinforcement learning with human feedback.

Sponsors

This episode is supported by How to Fix the Internet, a cool podcast from the Electronic Frontier Foundation, and by Bloomberg, a global provider of financial news and information, including real-time and historical price data, financial data, trading news, and analyst coverage.

References

Learning through human feedback

https://www.deepmind.com/blog/learning-through-human-feedback

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

https://arxiv.org/abs/2204.05862

Data Science at Home, by Francesco Gadaleta

Rating: 4.2 (72 ratings)

