Data Engineering Podcast

Shining A Light on Shadow IT In Data And Analytics


Listen Later

Summary

Misaligned priorities across business units can lead to tensions that drive members of the organization to build data and analytics projects without the guidance or support of engineering or IT staff. The availability of cloud platforms and managed services makes this a viable option, but can lead to downstream challenges. In this episode Sean Knapp and Charlie Crocker share their experiences of working in and with companies that have dealt with shadow IT projects and the importance of enabling and empowering the use and exploration of data and analytics. If you have ever been frustrated by seemingly draconian policies or struggled to align everyone on your supported platform, then this episode will help you gain some perspective and set you on a path to productive collaboration.

Announcements
  • Hello and welcome to the Data Engineering Podcast, the show about modern data management
  • When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode. With 200Gbit private networking, scalable shared block storage, a 40Gbit public network, fast object storage, and a brand new managed Kubernetes platform, you’ve got everything you need to run a fast, reliable, and bullet-proof data platform. And for your machine learning workloads, they’ve got dedicated CPU and GPU instances. Go to dataengineeringpodcast.com/linode today to get a $20 credit and launch a new server in under a minute. And don’t forget to thank them for their continued support of this show!
  • Are you spending too much time maintaining your data pipeline? Snowplow empowers your business with a real-time event data pipeline running in your own cloud account without the hassle of maintenance. Snowplow takes care of everything from installing your pipeline in a couple of hours to upgrading and autoscaling so you can focus on your exciting data projects. Your team will get the most complete, accurate and ready-to-use behavioral web and mobile data, delivered into your data warehouse, data lake and real-time streams. Go to dataengineeringpodcast.com/snowplow today to find out why more than 600,000 websites run Snowplow. Set up a demo and mention you’re a listener for a special offer!
  • You listen to this show to learn and stay up to date with what’s happening in databases, streaming platforms, big data, and everything else you need to know about modern data management. For even more opportunities to meet, listen, and learn from your peers you don’t want to miss out on this year’s conference season. We have partnered with organizations such as O’Reilly Media, Corinium Global Intelligence, ODSC, and Data Council. Upcoming events include the Software Architecture Conference in NYC, Strata Data in San Jose, and PyCon US in Pittsburgh. Go to dataengineeringpodcast.com/conferences to learn more about these and other events, and take advantage of our partner discounts to save money when you register today.
  • Your host is Tobias Macey and today I’m interviewing Sean Knapp, Charlie Crocker about shadow IT in data and analytics
  • Interview
    • Introduction
    • How did you get involved in the area of data management?
    • Can you start by sharing your definition of shadow IT?
    • What are some of the reasons that members of an organization might start building their own solutions outside of what is supported by the engineering teams?
      • What are some of the roles in an organization that you have seen involved in these shadow IT projects?
      • What kinds of tools or platforms are well suited for being provisioned and managed without involvement from the platform team?
        • What are some of the pitfalls that these solutions present as a result of their initial ease of use?
        • What are the benefits to the organization of individuals or teams building and managing their own solutions?
        • What are some of the risks associated with these implementations of data collection, storage, management, or analysis that have no oversight from the teams typically tasked with managing those systems?
          • What are some of the ways that compliance or data quality issues can arise from these projects?
          • Once a project has been started outside of the approved channels it can quickly take on a life of its own. What are some of the ways you have identified the presence of "unauthorized" data projects?
            • Once you have identified the existence of such a project how can you revise their implementation to integrate them with the "approved" platform that the organization supports?
            • What are some strategies for removing the friction in the collection, access, or availability of data in an organization that can eliminate the need for shadow IT implementations?
            • What are some of the inherent complexities in data management which you would like to see resolved in order to reduce the tensions that lead to these bespoke solutions?
            • Contact Info
              • Sean
                • LinkedIn
                • @seanknapp on Twitter
                • Charlie
                  • LinkedIn
                  • Parting Question
                    • From your perspective, what is the biggest gap in the tooling or technology for data management today?
                    • Links
                      • Shadow IT
                      • Ascend
                        • Podcast Episode
                        • ZoneHaven
                        • Google Sawzall
                        • M&A == Mergers and Acquisitions
                        • DevOps
                        • Waterfall Development
                        • Data Governance
                        • Data Lineage
                        • Pioneers, Settlers, and Town Planners
                        • PowerBI
                        • Tableau
                        • Excel
                        • Amundsen
                          • Podcast Episode
                          • The intro and outro music is from The Hug by The Freak Fandango Orchestra / CC BY-SA

                            Support Data Engineering Podcast

                            ...more
                            View all episodesView all episodes
                            Download on the App Store

                            Data Engineering PodcastBy Tobias Macey

                            • 4.5
                            • 4.5
                            • 4.5
                            • 4.5
                            • 4.5

                            4.5

                            142 ratings


                            More shows like Data Engineering Podcast

                            View all
                            This Week in Startups by Jason Calacanis

                            This Week in Startups

                            1,300 Listeners

                            The Changelog: Software Development, Open Source by Changelog Media

                            The Changelog: Software Development, Open Source

                            288 Listeners

                            The a16z Show by Andreessen Horowitz

                            The a16z Show

                            1,104 Listeners

                            Software Engineering Daily by Software Engineering Daily

                            Software Engineering Daily

                            630 Listeners

                            Risky Business by Risky Business Media

                            Risky Business

                            372 Listeners

                            Talk Python To Me by Michael Kennedy

                            Talk Python To Me

                            583 Listeners

                            Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

                            Super Data Science: ML & AI Podcast with Jon Krohn

                            309 Listeners

                            NVIDIA AI Podcast by NVIDIA

                            NVIDIA AI Podcast

                            346 Listeners

                            Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

                            Syntax - Tasty Web Development Treats

                            987 Listeners

                            Practical AI by Practical AI LLC

                            Practical AI

                            211 Listeners

                            Dwarkesh Podcast by Dwarkesh Patel

                            Dwarkesh Podcast

                            551 Listeners

                            The Data Engineering Show by The Firebolt Data Bros

                            The Data Engineering Show

                            10 Listeners

                            Latent Space: The AI Engineer Podcast by Latent.Space

                            Latent Space: The AI Engineer Podcast

                            104 Listeners

                            This Day in AI Podcast by Michael Sharkey, Chris Sharkey

                            This Day in AI Podcast

                            227 Listeners

                            The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

                            The AI Daily Brief: Artificial Intelligence News and Analysis

                            683 Listeners