
Sign up to save your podcasts
Or
Today we’re joined by Abhishek Gupta, a PhD Student at UC Berkeley.
Abhishek, a member of the BAIR Lab, joined us to talk about his recent robotics and reinforcement learning research and interests, which focus on applying RL to real-world robotics applications. We explore the concept of reward supervision, and how to get robots to learn these reward functions from videos, and the rationale behind supervised experts in these experiments.
We also discuss the use of simulation for experiments, data collection, and the path to scalable robotic learning. Finally, we discuss gradient surgery vs gradient sledgehammering, and his ecological RL paper, which focuses on the “phenomena that exist in the real world” and how humans and robotics systems interface in those situations.
The complete show notes for this episode can be found at https://twimlai.com/go/466.
4.7
416416 ratings
Today we’re joined by Abhishek Gupta, a PhD Student at UC Berkeley.
Abhishek, a member of the BAIR Lab, joined us to talk about his recent robotics and reinforcement learning research and interests, which focus on applying RL to real-world robotics applications. We explore the concept of reward supervision, and how to get robots to learn these reward functions from videos, and the rationale behind supervised experts in these experiments.
We also discuss the use of simulation for experiments, data collection, and the path to scalable robotic learning. Finally, we discuss gradient surgery vs gradient sledgehammering, and his ecological RL paper, which focuses on the “phenomena that exist in the real world” and how humans and robotics systems interface in those situations.
The complete show notes for this episode can be found at https://twimlai.com/go/466.
160 Listeners
475 Listeners
296 Listeners
339 Listeners
149 Listeners
188 Listeners
298 Listeners
91 Listeners
423 Listeners
124 Listeners
200 Listeners
71 Listeners
508 Listeners
11 Listeners
32 Listeners