Data Science at Home

Leveling Up AI: Reinforcement Learning with Human Feedback (Ep. 222)


Listen Later

In this episode, we dive into the not-so-secret sauce of ChatGPT, and what makes it a different model than its predecessors in the field of NLP and Large Language Models.

We explore how human feedback can be used to speed up the learning process in reinforcement learning, making it more efficient and effective.

Whether you're a machine learning practitioner, researcher, or simply curious about how machines learn, this episode will give you a fascinating glimpse into the world of reinforcement learning with human feedback.

 

Sponsors

This episode is supported by How to Fix the Internet, a cool podcast from the Electronic Frontier Foundation and Bloomberg, global provider of financial news and information, including real-time and historical price data, financial data, trading news, and analyst coverage.

 

References

Learning through human feedback

https://www.deepmind.com/blog/learning-through-human-feedback

 

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

https://arxiv.org/abs/2204.05862

...more
View all episodesView all episodes
Download on the App Store

Data Science at HomeBy Francesco Gadaleta

  • 4.2
  • 4.2
  • 4.2
  • 4.2
  • 4.2

4.2

72 ratings


More shows like Data Science at Home

View all
Freakonomics Radio by Freakonomics Radio + Stitcher

Freakonomics Radio

31,989 Listeners

Global News Podcast by BBC World Service

Global News Podcast

7,583 Listeners

WSJ Your Money Briefing by The Wall Street Journal

WSJ Your Money Briefing

1,705 Listeners

The a16z Show by Andreessen Horowitz

The a16z Show

1,091 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

623 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

585 Listeners

Science Magazine Podcast by Science Magazine

Science Magazine Podcast

826 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

301 Listeners

FT Tech Tonic by Financial Times

FT Tech Tonic

99 Listeners

Worklife with Adam Grant by TED

Worklife with Adam Grant

9,162 Listeners

Practical AI by Practical AI LLC

Practical AI

207 Listeners

Last Week in AI by Skynet Today

Last Week in AI

306 Listeners

Hard Fork by The New York Times

Hard Fork

5,512 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

228 Listeners

The Last Invention by Longview

The Last Invention

1,106 Listeners