The Python Podcast.__init__

PyTables with Francesc Alted


Listen Later

Summary

HDF5 is a file format that supports fast and space efficient analysis of large datasets. PyTables is a project that wraps and expands on the capabilities of HDF5 to make it easy to integrate with the larger Python data ecosystem. Francesc Alted explains how the project got started, how it works, and how it can be used for creating sharable and archivable data sets.

Preface
  • Hello and welcome to Podcast.__init__, the podcast about Python and the people who make it great.
  • I would like to thank everyone who has donated to the show. Your contributions help us make the show sustainable.
  • When you’re ready to launch your next project you’ll need somewhere to deploy it. Check out Linode at linode.com/podcastinit and get a $20 credit to try out their fast and reliable Linux virtual servers for running your awesome app. Linode will has announced new plans, including 1GB for $5 plan, high memory plans starting at 16GB for $60/mo and an upgrade in storage from 24GB to 30GB on our 2GB for $10 plan.
  • Visit our site to subscribe to our show, sign up for our newsletter, read the show notes, and get in touch.
  • To help other people find the show you can leave a review on iTunes, or Google Play Music, and tell your friends and co-workers
  • Your host as usual is Tobias Macey and today I’m interviewing Francesc Alted about PyTables
  • Interview
    • Introductions
    • How did you get introduced to Python?
    • To start with, what is HDF5 and what was the problem that motivated you to wrap Python around it to create PyTables?
    • Which are the most relevant contributors for PyTables? How you interacted?
    • How is the project architected and what are some of the design decisions that you are most proud of?
    • What are some of the typical use cases for PyTables and how does it tie into the broader Python data ecosystem?
    • How common is it to use an HDF5 file as a data interchange format to be shared between researchers or between languages?
    • Given the ability to create custom node types, does that inhibit the ability to interact with the stored data using other libraries?
    • What are some of the capabilities of HDF5 and PyTables that can’t be reasonably replicated in other data storage systems?
    • One of the more intriguing capabilities that I noticed while reading the documentation is the ability to perform undo and redo operations on the data. How might that be leveraged in a real-world use case?
    • What are some of the most interesting or unexpected uses of PyTables that you are aware of?
    • Keep In Touch
      • @FrancescAlted on Twitter
      • FrancescAlted on GitHub
      • Picks
        • Tobias
          • The Accountant

          • Francesc

            • Blosc a high speed compressor, specially meant for binary data
            • The Lego Batman Movie

            • Links
              • PyTables
              • PyTables – Optimization
              • Presentations and Videos about PyTables
              • Part of the story behind PyTables
              • HDF5
              • Pandas
              • SIMD
              • NumFOCUS
              • The intro and outro music is from Requiem for a Fish The Freak Fandango Orchestra / CC BY-SA

                ...more
                View all episodesView all episodes
                Download on the App Store

                The Python Podcast.__init__By Tobias Macey

                • 4.4
                • 4.4
                • 4.4
                • 4.4
                • 4.4

                4.4

                100 ratings


                More shows like The Python Podcast.__init__

                View all
                The Changelog: Software Development, Open Source by Changelog Media

                The Changelog: Software Development, Open Source

                283 Listeners

                Data Skeptic by Kyle Polich

                Data Skeptic

                481 Listeners

                Chat With Traders by Tessa Dao

                Chat With Traders

                1,979 Listeners

                Talk Python To Me by Michael Kennedy

                Talk Python To Me

                590 Listeners

                Software Engineering Daily by Software Engineering Daily

                Software Engineering Daily

                622 Listeners

                The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

                The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

                444 Listeners

                Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

                Super Data Science: ML & AI Podcast with Jon Krohn

                297 Listeners

                Python Bytes by Michael Kennedy and Brian Okken

                Python Bytes

                215 Listeners

                Data Engineering Podcast by Tobias Macey

                Data Engineering Podcast

                141 Listeners

                Machine Learning Guide by OCDevel

                Machine Learning Guide

                764 Listeners

                Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

                Syntax - Tasty Web Development Treats

                986 Listeners

                DataFramed by DataCamp

                DataFramed

                267 Listeners

                Practical AI by Practical AI LLC

                Practical AI

                192 Listeners

                The Real Python Podcast by Real Python

                The Real Python Podcast

                139 Listeners

                Hard Fork by The New York Times

                Hard Fork

                5,431 Listeners