The Python Podcast.__init__

Understanding Machine Learning Through Visualizations with Benjamin Bengfort and Rebecca Bilbro


Listen Later

Summary

Machine learning models are often inscrutable and it can be difficult to know whether you are making progress. To improve feedback and speed up iteration cycles Benjamin Bengfort and Rebecca Bilbro built Yellowbrick to easily generate visualizations of model performance. In this episode they explain how to use Yellowbrick in the process of building a machine learning project, how it aids in understanding how different parameters impact the outcome, and the improved understanding among teammates that it creates. They also explain how it integrates with the scikit-learn API, the difficulty of producing effective visualizations, and future plans for improvement and new features.

Preface
  • Hello and welcome to Podcast.__init__, the podcast about Python and the people who make it great.
  • When you’re ready to launch your next app you’ll need somewhere to deploy it, so check out Linode. With private networking, shared block storage, node balancers, and a 40Gbit network, all controlled by a brand new API you’ve got everything you need to scale up. Go to podcastinit.com/linode to get a $20 credit and launch a new server in under a minute.
  • To get worry-free releases download GoCD, the open source continous delivery server built by Thoughworks. You can use their pipeline modeling and value stream map to build, control and monitor every step from commit to deployment in one place. And with their new Kubernetes integration it’s even easier to deploy and scale your build agents. Go to podcastinit.com/gocd to learn more about their professional support services and enterprise add-ons.
  • Visit the site to subscribe to the show, sign up for the newsletter, and read the show notes. And if you have any questions, comments, or suggestions I would love to hear them. You can reach me on Twitter at @Podcast__init__ or email [email protected])
  • To help other people find the show please leave a review on iTunes, or Google Play Music, tell your friends and co-workers, and share it on social media.
  • Your host as usual is Tobias Macey and today I’m interviewing Rebecca Bilbro and Benjamin Bengfort about Yellowbrick, a scikit extension to use visualizations for assisting with model selection in your data science projects.
  • Interview
    • Introductions
    • How did you get introduced to Python?
    • Can you describe the use case for Yellowbrick and how the project got started?
    • What is involved in visualizing scikit-learn models?
      • What kinds of information do the visualizations convey?
      • How do they aid in understanding what is happening in the models?

      • How much direction does yellowbrick provide in terms of knowing which visualizations will be helpful in various circumstances?

      • What does the workflow look like for someone using Yellowbrick while iterating on a data science project?

      • What are some of the common points of confusion that your students encounter when learning data science and how has yellowbrick assisted in achieving understanding?

      • How is Yellowbrick iplemented and how has the design changed over the lifetime of the project?

      • What would be required to integrate with other visualization libraries and what benefits (if any) might that provide?

        • What about other ML frameworks?

        • What are some of the most challenging or unexpected aspects of building and maintaining Yellowbrick?

        • What are the limitations or edge cases for yellowbrick?

        • What do you have planned for the future of yellowbrick?

        • Beyond visualization, what are some of the other areas that you would like to see innovation in how data science is taught and/or conducted to make it more accessible?

        • Keep In Touch
          • Rebecca Bilbro
            • Github
            • Twitter

            • Benjamin Bengfort

              • Github
              • Twitter

              • Picks
                • Tobias
                  • Poutine

                  • Rebecca

                    • The color yellow

                    • Benjamin

                      • ALL CAPS

                      • Links
                        • Hadoop
                        • Natural Language Processing
                        • Machine Learning
                        • scikit-learn
                        • Model Selection Triple
                        • the machine learning workflow
                        • scikit-yb
                        • Yellowbrick
                        • Visualizer API
                        • Visual Tests
                        • Jupyter
                        • Matplotlib
                        • Tensorflow
                        • Hyperparameter
                        • Parallel Coordinates
                        • Radviz
                        • Rank2D
                        • Prediction Error Plot
                        • Residuals Plot
                        • Validation Curves
                        • Alpha Selection
                        • Frequency Distribution Plot
                        • Bayes Theorem
                        • Seaborn
                        • Stop Words
                        • N-gram
                        • Craig – Bias and Fairness of Algorithms
                        • Shiny
                        • Bokeh
                        • Keras
                        • StatsModels
                        • Tensorboard
                        • PyTorch
                        • NumPy
                        • Voxel
                        • Wizard of Oz
                        • The intro and outro music is from Requiem for a Fish The Freak Fandango Orchestra / CC BY-SA

                          ...more
                          View all episodesView all episodes
                          Download on the App Store

                          The Python Podcast.__init__By Tobias Macey

                          • 4.4
                          • 4.4
                          • 4.4
                          • 4.4
                          • 4.4

                          4.4

                          100 ratings


                          More shows like The Python Podcast.__init__

                          View all
                          The Changelog: Software Development, Open Source by Changelog Media

                          The Changelog: Software Development, Open Source

                          283 Listeners

                          Data Skeptic by Kyle Polich

                          Data Skeptic

                          481 Listeners

                          Chat With Traders by Tessa Dao

                          Chat With Traders

                          1,979 Listeners

                          Talk Python To Me by Michael Kennedy

                          Talk Python To Me

                          593 Listeners

                          Software Engineering Daily by Software Engineering Daily

                          Software Engineering Daily

                          623 Listeners

                          The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

                          The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

                          445 Listeners

                          Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

                          Super Data Science: ML & AI Podcast with Jon Krohn

                          297 Listeners

                          Python Bytes by Michael Kennedy and Brian Okken

                          Python Bytes

                          215 Listeners

                          Data Engineering Podcast by Tobias Macey

                          Data Engineering Podcast

                          142 Listeners

                          Machine Learning Guide by OCDevel

                          Machine Learning Guide

                          764 Listeners

                          Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

                          Syntax - Tasty Web Development Treats

                          981 Listeners

                          DataFramed by DataCamp

                          DataFramed

                          267 Listeners

                          Practical AI by Practical AI LLC

                          Practical AI

                          190 Listeners

                          The Real Python Podcast by Real Python

                          The Real Python Podcast

                          140 Listeners

                          Hard Fork by The New York Times

                          Hard Fork

                          5,426 Listeners