Data Engineering Podcast

Taking A Tour Of PostgreSQL with Jonathan Katz - Episode 42


Listen Later

Summary

One of the longest running and most popular open source database projects is PostgreSQL. Because of its extensibility and a community focus on stability it has stayed relevant as the ecosystem of development environments and data requirements have changed and evolved over its lifetime. It is difficult to capture any single facet of this database in a single conversation, let alone the entire surface area, but in this episode Jonathan Katz does an admirable job of it. He explains how Postgres started and how it has grown over the years, highlights the fundamental features that make it such a popular choice for application developers, and the ongoing efforts to add the complex features needed by the demanding workloads of today’s data layer. To cap it off he reviews some of the exciting features that the community is working on building into future releases.

Preamble
  • Hello and welcome to the Data Engineering Podcast, the show about modern data management
  • When you’re ready to build your next pipeline you’ll need somewhere to deploy it, so check out Linode. With private networking, shared block storage, node balancers, and a 40Gbit network, all controlled by a brand new API you’ve got everything you need to run a bullet-proof data platform. Go to dataengineeringpodcast.com/linode to get a $20 credit and launch a new server in under a minute.
  • Are you struggling to keep up with customer request and letting errors slip into production? Want to try some of the innovative ideas in this podcast but don’t have time? DataKitchen’s DataOps software allows your team to quickly iterate and deploy pipelines of code, models, and data sets while improving quality. Unlike a patchwork of manual operations, DataKitchen makes your team shine by providing an end to end DataOps solution with minimal programming that uses the tools you love. Join the DataOps movement and sign up for the newsletter at datakitchen.io/de today. After that learn more about why you should be doing DataOps by listening to the Head Chef in the Data Kitchen at dataengineeringpodcast.com/datakitchen
  • Go to dataengineeringpodcast.com to subscribe to the show, sign up for the mailing list, read the show notes, and get in touch.
  • Join the community in the new Zulip chat workspace at dataengineeringpodcast.com/chat
  • Your host is Tobias Macey and today I’m interviewing Jonathan Katz about a high level view of PostgreSQL and the unique capabilities that it offers
  • Interview
    • Introduction
    • How did you get involved in the area of data management?
    • How did you get involved in the Postgres project?
    • For anyone who hasn’t used it, can you describe what PostgreSQL is?
      • Where did Postgres get started and how has it evolved over the intervening years?

      • What are some of the primary characteristics of Postgres that would lead someone to choose it for a given project?

        • What are some cases where Postgres is the wrong choice?

        • What are some of the common points of confusion for new users of PostGreSQL? (particularly if they have prior database experience)

        • The recent releases of Postgres have had some fairly substantial improvements and new features. How does the community manage to balance stability and reliability against the need to add new capabilities?

        • What are the aspects of Postgres that allow it to remain relevant in the current landscape of rapid evolution at the data layer?

        • Are there any plans to incorporate a distributed transaction layer into the core of the project along the lines of what has been done with Citus or CockroachDB?

        • What is in store for the future of Postgres?

        • Contact Info
          • @jkatz05 on Twitter
          • jkatz on GitHub
          • Parting Question
            • From your perspective, what is the biggest gap in the tooling or technology for data management today?
            • Links
              • PostgreSQL
              • Crunchy Data
              • Venuebook
              • Paperless Post
              • LAMP Stack
              • MySQL
              • PHP
              • SQL
              • ORDBMS
              • Edgar Codd
              • A Relational Model of Data for Large Shared Data Banks
              • Relational Algebra
              • Oracle DB
              • UC Berkeley
              • Dr. Michael Stonebraker
              • Ingres
              • Informix
              • QUEL
              • ANSI C
              • CVS
              • BSD License
              • UUID
              • JSON
              • XML
              • HStore
              • PostGIS
              • BTree Index
              • GIN Index
              • GIST Index
              • KNN GIST
              • SPGIST
              • Full Text Search
              • BRIN Index
              • WAL (Write-Ahead Log)
              • SQLite
              • PGAdmin
              • Vim
              • Emacs
              • Linux
              • OLAP (Online Analytical Processing)
              • Postgres IRC
              • Postgres Slack
              • Postgres Conferences
              • UPSERT
              • Postgres Roadmap
              • CockroachDB
                • Podcast Interview

                • Citus Data

                  • Podcast Interview

                  • The intro and outro music is from The Hug by The Freak Fandango Orchestra / CC BY-SA

                    Support Data Engineering Podcast

                    ...more
                    View all episodesView all episodes
                    Download on the App Store

                    Data Engineering PodcastBy Tobias Macey

                    • 4.5
                    • 4.5
                    • 4.5
                    • 4.5
                    • 4.5

                    4.5

                    142 ratings


                    More shows like Data Engineering Podcast

                    View all
                    The Changelog: Software Development, Open Source by Changelog Media

                    The Changelog: Software Development, Open Source

                    289 Listeners

                    Software Engineering Daily by Software Engineering Daily

                    Software Engineering Daily

                    624 Listeners

                    Talk Python To Me by Michael Kennedy

                    Talk Python To Me

                    583 Listeners

                    Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

                    Super Data Science: ML & AI Podcast with Jon Krohn

                    302 Listeners

                    NVIDIA AI Podcast by NVIDIA

                    NVIDIA AI Podcast

                    343 Listeners

                    Practical AI by Practical AI LLC

                    Practical AI

                    204 Listeners

                    AWS Podcast by Amazon Web Services

                    AWS Podcast

                    205 Listeners

                    Last Week in AI by Skynet Today

                    Last Week in AI

                    305 Listeners

                    Dwarkesh Podcast by Dwarkesh Patel

                    Dwarkesh Podcast

                    523 Listeners

                    The Data Engineering Show by The Firebolt Data Bros

                    The Data Engineering Show

                    8 Listeners

                    No Priors: Artificial Intelligence | Technology | Startups by Conviction

                    No Priors: Artificial Intelligence | Technology | Startups

                    129 Listeners

                    Latent Space: The AI Engineer Podcast by swyx + Alessio

                    Latent Space: The AI Engineer Podcast

                    92 Listeners

                    This Day in AI Podcast by Michael Sharkey, Chris Sharkey

                    This Day in AI Podcast

                    227 Listeners

                    The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

                    The AI Daily Brief: Artificial Intelligence News and Analysis

                    633 Listeners

                    AI + a16z by a16z

                    AI + a16z

                    36 Listeners