O'Reilly Data Show Podcast

Bringing scalable real-time analytics to the enterprise



In this episode of the Data Show, I spoke with Dhruba Borthakur (co-founder and CTO) and Shruti Bhat (SVP of Product) of Rockset, a startup focused on building solutions for interactive data science and live applications. Borthakur was the founding engineer of HDFS and creator of RocksDB, while Bhat is an experienced product and marketing executive focused on enterprise software and data products. Their new startup is focused on a few trends I’ve recently been thinking about, including the re-emergence of real-time analytics and the hunger for simpler data architectures and tools. Borthakur exemplifies the need for companies to continually evaluate new technologies: while he was the founding engineer for HDFS, these days he mostly works with object stores like S3.

We had a great conversation spanning many topics, including:

  • RocksDB, an open source, embeddable key-value store that originated at Facebook and is used in several other open source projects.
  • Time-series databases.
  • The importance of having solutions for real-time analytics, particularly now with the renewed interest in IoT applications and rollout of 5G technologies.
  • Use cases for Rockset’s technologies—and more generally, applications of real-time analytics.
  • The Aggregator Leaf Tailer architecture as an alternative to the Lambda architecture.
  • Building data infrastructure in the cloud.

Figure: The Aggregator Leaf Tailer (“CQRS for the data world”): a data architecture favored by web-scale companies. Source: Dhruba Borthakur, used with permission.

    Related resources:

    • Serverless Streaming Architectures & Algorithms for the Enterprise – a new tutorial on September 24th at Strata Data NYC.
    • “Becoming a machine learning company means investing in foundational technologies”
    • Haoyuan Li: “In the age of AI, fundamental value resides in data”
    • Harish Doddi: “Simplifying machine learning lifecycle management”
    • Eric Jonas: “A Berkeley view on serverless computing”
    • “Specialized tools for machine learning development and model governance are becoming essential”
    • Avner Braverman: “What data scientists and data engineers can do with current generation serverless technologies”