BlueDot Narrated

Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback


Listen Later

Audio versions of blogs and papers from BlueDot courses. 


This paper explains Anthropic’s constitutional AI approach, which is largely an extension on RLHF but with AIs replacing human demonstrators and human evaluators.

A podcast by BlueDot Impact.

...more
View all episodesView all episodes
Download on the App Store

BlueDot NarratedBy BlueDot Impact