Data Engineering Podcast

Mobile Data Collection And Analysis Using Ona And Canopy With Peter Lubell-Doughtie - Episode 41


Listen Later

Summary

With the attention being paid to the systems that power large volumes of high velocity data it is easy to forget about the value of data collection at human scales. Ona is a company that is building technologies to support mobile data collection, analysis of the aggregated information, and user-friendly presentations. In this episode CTO Peter Lubell-Doughtie describes the architecture of the platform, the types of environments and use cases where it is being employed, and the value of small data.

Preamble
  • Hello and welcome to the Data Engineering Podcast, the show about modern data management
  • When you’re ready to build your next pipeline you’ll need somewhere to deploy it, so check out Linode. With private networking, shared block storage, node balancers, and a 40Gbit network, all controlled by a brand new API you’ve got everything you need to run a bullet-proof data platform. Go to dataengineeringpodcast.com/linode to get a $20 credit and launch a new server in under a minute.
  • Are you struggling to keep up with customer request and letting errors slip into production? Want to try some of the innovative ideas in this podcast but don’t have time? DataKitchen’s DataOps software allows your team to quickly iterate and deploy pipelines of code, models, and data sets while improving quality. Unlike a patchwork of manual operations, DataKitchen makes your team shine by providing an end to end DataOps solution with minimal programming that uses the tools you love. Join the DataOps movement and sign up for the newsletter at datakitchen.io/de today. After that learn more about why you should be doing DataOps by listening to the Head Chef in the Data Kitchen at dataengineeringpodcast.com/datakitchen
  • Go to dataengineeringpodcast.com to subscribe to the show, sign up for the mailing list, read the show notes, and get in touch.
  • Join the community in the new Zulip chat workspace at dataengineeringpodcast.com/chat
  • Your host is Tobias Macey and today I’m interviewing Peter Lubell-Doughtie about using Ona for collecting data and processing it with Canopy
  • Interview
    • Introduction
    • How did you get involved in the area of data management?
    • What is Ona and how did the company get started?
      • What are some examples of the types of customers that you work with?

      • What types of data do you support in your collection platform?

      • What are some of the mechanisms that you use to ensure the accuracy of the data that is being collected by users?

      • Does your mobile collection platform allow for anyone to submit data without having to be associated with a given account or organization?

      • What are some of the integration challenges that are unique to the types of data that get collected by mobile field workers?

      • Can you describe the flow of the data from collection through to analysis?

      • To help improve the utility of the data being collected you have started building Canopy. What was the tipping point where it became worth the time and effort to start that project?

        • What are the architectural considerations that you factored in when designing it?
        • What have you found to be the most challenging or unexpected aspects of building an enterprise data warehouse for general users?

        • What are your plans for the future of Ona and Canopy?

        • Contact Info
          • Email
          • pld on Github
          • Website
          • Parting Question
            • From your perspective, what is the biggest gap in the tooling or technology for data management today?
            • Links
              • OpenSRP
              • Ona
              • Canopy
              • Open Data Kit
              • Earth Institute at Columbia University
              • Sustainable Engineering Lab
              • WHO
              • Bill and Melinda Gates Foundation
              • XLSForms
              • PostGIS
              • Kafka
              • Druid
              • Superset
              • Postgres
              • Ansible
              • Docker
              • Terraform
              • The intro and outro music is from The Hug by The Freak Fandango Orchestra / CC BY-SA

                Support Data Engineering Podcast

                ...more
                View all episodesView all episodes
                Download on the App Store

                Data Engineering PodcastBy Tobias Macey

                • 4.6
                • 4.6
                • 4.6
                • 4.6
                • 4.6

                4.6

                135 ratings


                More shows like Data Engineering Podcast

                View all
                Software Engineering Radio - the podcast for professional software developers by se-radio@computer.org

                Software Engineering Radio - the podcast for professional software developers

                272 Listeners

                The Changelog: Software Development, Open Source by Changelog Media

                The Changelog: Software Development, Open Source

                283 Listeners

                The Cloudcast by Massive Studios

                The Cloudcast

                153 Listeners

                Thoughtworks Technology Podcast by Thoughtworks

                Thoughtworks Technology Podcast

                41 Listeners

                Data Skeptic by Kyle Polich

                Data Skeptic

                483 Listeners

                Talk Python To Me by Michael Kennedy

                Talk Python To Me

                592 Listeners

                Software Engineering Daily by Software Engineering Daily

                Software Engineering Daily

                624 Listeners

                The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

                The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

                444 Listeners

                Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

                Super Data Science: ML & AI Podcast with Jon Krohn

                298 Listeners

                Python Bytes by Michael Kennedy and Brian Okken

                Python Bytes

                213 Listeners

                DataFramed by DataCamp

                DataFramed

                266 Listeners

                Practical AI by Practical AI LLC

                Practical AI

                190 Listeners

                The Stack Overflow Podcast by The Stack Overflow Podcast

                The Stack Overflow Podcast

                64 Listeners

                The Real Python Podcast by Real Python

                The Real Python Podcast

                140 Listeners

                Latent Space: The AI Engineer Podcast by swyx + Alessio

                Latent Space: The AI Engineer Podcast

                77 Listeners