The Python Podcast.__init__

Experimenting With Reinforcement Learning Using MushroomRL


Listen Later

Summary

Reinforcement learning is a branch of machine learning and AI that has a lot of promise for applications that need to evolve with changes to their inputs. To support the research happening in the field, including applications for robotics, Carlo D’Eramo and Davide Tateo created MushroomRL. In this episode they share how they have designed the project to be easy to work with, so that students can use it in their study, as well as extensible so that it can be used by businesses and industry professionals. They also discuss the strengths of reinforcement learning, how to design problems that can leverage its capabilities, and how to get started with MushroomRL for your own work.

Announcements
  • Hello and welcome to Podcast.__init__, the podcast about Python’s role in data and science.
  • When you’re ready to launch your next app or want to try a project you hear about on the show, you’ll need somewhere to deploy it, so take a look at our friends over at Linode. With the launch of their managed Kubernetes platform it’s easy to get started with the next generation of deployment and scaling, powered by the battle tested Linode platform, including simple pricing, node balancers, 40Gbit networking, dedicated CPU and GPU instances, and worldwide data centers. Go to pythonpodcast.com/linode and get a $100 credit to try out a Kubernetes cluster of your own. And don’t forget to thank them for their continued support of this show!
  • Your host as usual is Tobias Macey and today I’m interviewing Davide Tateo and Carlo D’Eramo about MushroomRL, a library for building reinforcement learning experiments
  • Interview
    • Introductions
    • How did you get introduced to Python?
    • Can you start by describing what reinforcement learning is and how it differs from other approaches for machine learning?
    • What are some example use cases where reinforcement learning might be necessary?
    • Can you describe what MushroomRL is and the story behind it?
      • Who are the target users of the project?
      • What are its main goals?
      • What are your suggestions to other developers for implementing a
      • succesful library?
      • What are some of the core concepts that researchers and/or engineers need to understand to be able to effectively use reinforcement learning techniques?
      • Can you describe how MushroomRL is architected?
        • How have the goals and design of the project changed or evolved since you began working on it?
        • What is the workflow for building and executing an experiment with MushroomRL?
          • How do you track the states and outcomes of experiments?
          • What are some of the considerations involved in designing an environment and reward functions for an agent to interact with?
          • What are some of the open questions that are being explored in reinforcement learning?
          • How are you using MushroomRL in your own research?
          • What are the most interesting, innovative, or unexpected ways that you have seen MushroomRL used?
          • What are the most interesting, unexpected, or challenging lessons that you have learned while working on MushroomRL?
          • When is MushroomRL the wrong choice?
          • What do you have planned for the future of MushroomRL?
          • How can the open-source community contribute to MushroomRL?
          • What kind of support you are willing to provide to users?
          • Keep In Touch
            • Davide
              • boris-il-forte on GitHub
              • Website
              • Carlo
                • carloderamo on GitHub
                • Website
                • Picks
                  • Tobias
                    • Britannia TV Series
                    • Davide
                      • 1984 by George Orwell
                      • Carlo
                        • Twin Peaks TV Series
                        • Closing Announcements
                          • Thank you for listening! Don’t forget to check out our other show, the Data Engineering Podcast for the latest on modern data management.
                          • Visit the site to subscribe to the show, sign up for the mailing list, and read the show notes.
                          • If you’ve learned something or tried out a project from the show then tell us about it! Email [email protected]) with your story.
                          • To help other people find the show please leave a review on iTunes and tell your friends and co-workers
                          • Join the community in the new Zulip chat workspace at pythonpodcast.com/chat
                          • Links
                            • MushroomRL
                            • TU Darmstadt
                            • MuJoCo
                            • PyBullet
                            • iGibson
                            • Habitat
                            • OpenAI Gym
                            • PyTorch
                              • Podcast Episode
                              • RLLib
                              • Ray
                                • Podcast Episode
                                • OpenAI Baselines
                                • Stable Baselines
                                • ROS
                                • The intro and outro music is from Requiem for a Fish The Freak Fandango Orchestra / CC BY-SA

                                  ...more
                                  View all episodesView all episodes
                                  Download on the App Store

                                  The Python Podcast.__init__By Tobias Macey

                                  • 4.4
                                  • 4.4
                                  • 4.4
                                  • 4.4
                                  • 4.4

                                  4.4

                                  100 ratings


                                  More shows like The Python Podcast.__init__

                                  View all
                                  TED Talks Daily by TED

                                  TED Talks Daily

                                  11,284 Listeners

                                  6 Minute English by BBC Radio

                                  6 Minute English

                                  1,779 Listeners

                                  The Changelog: Software Development, Open Source by Changelog Media

                                  The Changelog: Software Development, Open Source

                                  285 Listeners

                                  Data Skeptic by Kyle Polich

                                  Data Skeptic

                                  474 Listeners

                                  Talk Python To Me by Michael Kennedy

                                  Talk Python To Me

                                  585 Listeners

                                  Software Engineering Daily by Software Engineering Daily

                                  Software Engineering Daily

                                  629 Listeners

                                  The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

                                  The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

                                  425 Listeners

                                  Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

                                  Super Data Science: ML & AI Podcast with Jon Krohn

                                  296 Listeners

                                  Python Bytes by Michael Kennedy and Brian Okken

                                  Python Bytes

                                  213 Listeners

                                  Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

                                  Syntax - Tasty Web Development Treats

                                  987 Listeners

                                  DataFramed by DataCamp

                                  DataFramed

                                  267 Listeners

                                  Practical AI by Practical AI LLC

                                  Practical AI

                                  196 Listeners

                                  The Real Python Podcast by Real Python

                                  The Real Python Podcast

                                  137 Listeners

                                  Last Week in AI by Skynet Today

                                  Last Week in AI

                                  275 Listeners

                                  Latent Space: The AI Engineer Podcast by swyx + Alessio

                                  Latent Space: The AI Engineer Podcast

                                  66 Listeners