The Python Podcast.__init__

AI Driven Automated Code Review With DeepCode


Listen Later

Summary

Software engineers are frequently faced with problems that have been fixed by other developers in different projects. The challenge is how and when to surface that information in a way that increases their efficiency and avoids wasted effort. DeepCode is an automated code review platform that was built to solve this problem by training a model on a massive array of open sourced code and the history of their bug and security fixes. In this episode their CEO Boris Paskalev explains how the company got started, how they build and maintain the models that provide suggestions for improving your code changes, and how it integrates into your workflow.

Announcements
  • Hello and welcome to Podcast.__init__, the podcast about Python and the people who make it great.
  • When you’re ready to launch your next app or want to try a project you hear about on the show, you’ll need somewhere to deploy it, so take a look at our friends over at Linode. With 200 Gbit/s private networking, scalable shared block storage, node balancers, and a 40 Gbit/s public network, all controlled by a brand new API you’ve got everything you need to scale up. And for your tasks that need fast computation, such as training machine learning models, they just launched dedicated CPU instances. Go to pythonpodcast.com/linode to get a $20 credit and launch a new server in under a minute. And don’t forget to thank them for their continued support of this show!
  • You listen to this show to learn and stay up to date with the ways that Python is being used, including the latest in machine learning and data analysis. For even more opportunities to meet, listen, and learn from your peers you don’t want to miss out on this year’s conference season. We have partnered with organizations such as O’Reilly Media, Dataversity, Corinium Global Intelligence, and Data Council. Upcoming events include the O’Reilly AI conference, the Strata Data conference, the combined events of the Data Architecture Summit and Graphorum, and Data Council in Barcelona. Go to pythonpodcast.com/conferences to learn more about these and other events, and take advantage of our partner discounts to save money when you register today.
  • Your host as usual is Tobias Macey and today I’m interviewing Boris Paskalev about DeepCode, an automated code review platform for detecting security vulnerabilities in your projects
  • Interview
    • Introductions
    • Can you start by explaining what DeepCode is and the story of how it got started?
    • How is the DeepCode platform implemented?
    • What are the current languages that you support and what was your guiding principle in selecting them?
      • What languages are you targeting next?
      • What is involved in maintaining support for languages as they release new versions with new features?
        • How do you ensure that the recommendations that you are making are not using languages features that are not available in the runtimes that a given project is using?
        • For someone who is using DeepCode, how does it fit into their workflow?
        • Can you explain the process that you use for training your models?
          • How do you curate and prepare the project sources that you use to power your models?
            • How much domain expertise is necessary to identify the faults that you are trying to detect?
            • What types of labelling do you perform to ensure that the resulting models are focusing on the proper aspects of the source repositories?
            • How do you guard against false positives and false negatives in your analysis and recommendations?
            • Does the code that you are analyzing and the resulting fixes act as a feedback mechanism for a reinforcement learning system to update your models?
              • How do you guard against leaking intellectual property of your scanned code when surfacing recommendations?
              • What have been some of the most interesting/unexpected/challenging aspects of building the DeepCode product?
              • What do you have planned for the future of the platform and business?
              • Keep In Touch
                • LinkedIn
                • Picks
                  • Tobias
                    • Redwall Series by Brian Jacques
                    • Boris
                      • Artifical Intelligence
                      • Get outside
                      • Closing Announcements
                        • Thank you for listening! Don’t forget to check out our other show, the Data Engineering Podcast for the latest on modern data management.
                        • Visit the site to subscribe to the show, sign up for the mailing list, and read the show notes.
                        • If you’ve learned something or tried out a project from the show then tell us about it! Email [email protected]) with your story.
                        • To help other people find the show please leave a review on iTunes and tell your friends and co-workers
                        • Join the community in the new Zulip chat workspace at pythonpodcast.com/chat
                        • Links
                          • DeepCode
                          • Zurich, Switzerland
                          • BigCode
                          • ETH Zurich
                          • Datalog
                          • F Strings
                          • Data Classes
                          • DeepCode Research
                          • The intro and outro music is from Requiem for a Fish The Freak Fandango Orchestra / CC BY-SA

                            ...more
                            View all episodesView all episodes
                            Download on the App Store

                            The Python Podcast.__init__By Tobias Macey

                            • 4.4
                            • 4.4
                            • 4.4
                            • 4.4
                            • 4.4

                            4.4

                            100 ratings


                            More shows like The Python Podcast.__init__

                            View all
                            Freakonomics Radio by Freakonomics Radio + Stitcher

                            Freakonomics Radio

                            32,020 Listeners

                            Odd Lots by Bloomberg

                            Odd Lots

                            1,940 Listeners

                            The Changelog: Software Development, Open Source by Changelog Media

                            The Changelog: Software Development, Open Source

                            289 Listeners

                            Data Skeptic by Kyle Polich

                            Data Skeptic

                            479 Listeners

                            Software Engineering Daily by Software Engineering Daily

                            Software Engineering Daily

                            624 Listeners

                            Talk Python To Me by Michael Kennedy

                            Talk Python To Me

                            585 Listeners

                            Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

                            Super Data Science: ML & AI Podcast with Jon Krohn

                            302 Listeners

                            Python Bytes by Michael Kennedy and Brian Okken

                            Python Bytes

                            215 Listeners

                            Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

                            Syntax - Tasty Web Development Treats

                            987 Listeners

                            DataFramed by DataCamp

                            DataFramed

                            269 Listeners

                            Practical AI by Practical AI LLC

                            Practical AI

                            210 Listeners

                            The Intelligence from The Economist by The Economist

                            The Intelligence from The Economist

                            2,554 Listeners

                            The Real Python Podcast by Real Python

                            The Real Python Podcast

                            142 Listeners

                            声动早咖啡 by 声动活泼

                            声动早咖啡

                            292 Listeners

                            The Foreign Affairs Interview by Foreign Affairs Magazine

                            The Foreign Affairs Interview

                            444 Listeners