Data Engineering Podcast

Building A Data Mesh Platform At PayPal


Listen Later

Summary

There has been a lot of discussion about the practical application of data mesh and how to implement it in an organization. Jean-Georges Perrin was tasked with designing a new data platform implementation at PayPal and wound up building a data mesh. In this episode he shares that journey and the combination of technical and organizational challenges that he encountered in the process.

Announcements
  • Hello and welcome to the Data Engineering Podcast, the show about modern data management
  • Are you tired of dealing with the headache that is the 'Modern Data Stack'? We feel your pain. It's supposed to make building smarter, faster, and more flexible data infrastructures a breeze. It ends up being anything but that. Setting it up, integrating it, maintaining it—it’s all kind of a nightmare. And let's not even get started on all the extra tools you have to buy to get it to do its thing. But don't worry, there is a better way. TimeXtender takes a holistic approach to data integration that focuses on agility rather than fragmentation. By bringing all the layers of the data stack together, TimeXtender helps you build data solutions up to 10 times faster and saves you 70-80% on costs. If you're fed up with the 'Modern Data Stack', give TimeXtender a try. Head over to dataengineeringpodcast.com/timextender where you can do two things: watch us build a data estate in 15 minutes and start for free today.
  • Your host is Tobias Macey and today I'm interviewing Jean-Georges Perrin about his work at PayPal to implement a data mesh and the role of data contracts in making it work
  • Interview
    • Introduction
    • How did you get involved in the area of data management?
    • Can you start by describing the goals and scope of your work at PayPal to implement a data mesh?
      • What are the core problems that you were addressing with this project?
      • Is a data mesh ever "done"?
      • What was your experience engaging at the organizational level to identify the granularity and ownership of the data products that were needed in the initial iteration?
      • What was the impact of leading multiple teams on the design of how to implement communication/contracts throughout the mesh?
      • What are the technical systems that you are relying on to power the different data domains?
        • What is your philosophy on enforcing uniformity in technical systems vs. relying on interface definitions as the unit of consistency?
        • What are the biggest challenges (technical and procedural) that you have encountered during your implementation?
        • How are you managing visibility/auditability across the different data domains? (e.g. observability, data quality, etc.)
        • What are the most interesting, innovative, or unexpected ways that you have seen PayPal's data mesh used?
        • What are the most interesting, unexpected, or challenging lessons that you have learned while working on data mesh?
        • When is a data mesh the wrong choice?
        • What do you have planned for the future of your data mesh at PayPal?
        • Contact Info
          • LinkedIn
          • Blog
          • Parting Question
            • From your perspective, what is the biggest gap in the tooling or technology for data management today?
            • Closing Announcements
              • Thank you for listening! Don't forget to check out our other shows. Podcast.__init__ covers the Python language, its community, and the innovative ways it is being used. The Machine Learning Podcast helps you go from idea to production with machine learning.
              • Visit the site to subscribe to the show, sign up for the mailing list, and read the show notes.
              • If you've learned something or tried out a project from the show then tell us about it! Email [email protected]) with your story.
              • To help other people find the show please leave a review on Apple Podcasts and tell your friends and co-workers
              • Links
                • Data Mesh
                  • O'Reilly Book (affiliate link)
                  • The next generation of Data Platforms is the Data Mesh
                  • PayPal
                  • Conway's Law
                  • Data Mesh For All Ages - US, Data Mesh For All Ages - UK
                  • Data Mesh Radio
                  • Data Mesh Community
                  • Data Mesh In Action
                  • Great Expectations
                    • Podcast Episode
                    • The intro and outro music is from The Hug by The Freak Fandango Orchestra / CC BY-SA

                      Sponsored By:

                      • TimeXtender: ![TimeXtender Logo](https://files.fireside.fm/file/fireside-uploads/images/c/c6161a3f-a67b-48ef-b087-52f1f1573292/35MYWp0I.png)
                      TimeXtender is a holistic, metadata-driven solution for data integration, optimized for agility. TimeXtender provides all the features you need to build a future-proof infrastructure for ingesting, transforming, modelling, and delivering clean, reliable data in the fastest, most efficient way possible.
                      You can't optimize for everything all at once. That's why we take a holistic approach to data integration that optimises for agility instead of fragmentation. By unifying each layer of the data stack, TimeXtender empowers you to build data solutions 10x faster while reducing costs by 70%-80%. We do this for one simple reason: because time matters.
                      Go to [dataengineeringpodcast.com/timextender](https://www.dataengineeringpodcast.com/timextender) today to get started for free!

                      Support Data Engineering Podcast

                      ...more
                      View all episodesView all episodes
                      Download on the App Store

                      Data Engineering PodcastBy Tobias Macey

                      • 4.6
                      • 4.6
                      • 4.6
                      • 4.6
                      • 4.6

                      4.6

                      134 ratings


                      More shows like Data Engineering Podcast

                      View all
                      Software Engineering Radio - the podcast for professional software developers by se-radio@computer.org

                      Software Engineering Radio - the podcast for professional software developers

                      262 Listeners

                      The Changelog: Software Development, Open Source by Changelog Media

                      The Changelog: Software Development, Open Source

                      285 Listeners

                      The Cloudcast by Massive Studios

                      The Cloudcast

                      154 Listeners

                      Thoughtworks Technology Podcast by Thoughtworks

                      Thoughtworks Technology Podcast

                      43 Listeners

                      Data Skeptic by Kyle Polich

                      Data Skeptic

                      474 Listeners

                      Talk Python To Me by Michael Kennedy

                      Talk Python To Me

                      584 Listeners

                      Software Engineering Daily by Software Engineering Daily

                      Software Engineering Daily

                      630 Listeners

                      AWS Podcast by Amazon Web Services

                      AWS Podcast

                      200 Listeners

                      Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

                      Super Data Science: ML & AI Podcast with Jon Krohn

                      295 Listeners

                      Python Bytes by Michael Kennedy and Brian Okken

                      Python Bytes

                      212 Listeners

                      DataFramed by DataCamp

                      DataFramed

                      267 Listeners

                      Practical AI by Practical AI LLC

                      Practical AI

                      196 Listeners

                      The Stack Overflow Podcast by The Stack Overflow Podcast

                      The Stack Overflow Podcast

                      63 Listeners

                      The Real Python Podcast by Real Python

                      The Real Python Podcast

                      137 Listeners

                      Latent Space: The AI Engineer Podcast by swyx + Alessio

                      Latent Space: The AI Engineer Podcast

                      64 Listeners