O'Reilly Data Show Podcast

Bringing scalable real-time analytics to the enterprise



In this episode of the Data Show, I spoke with Dhruba Borthakur (co-founder and CTO) and Shruti Bhat (SVP of Product) of Rockset, a startup focused on building solutions for interactive data science and live applications. Borthakur was the founding engineer of HDFS and creator of RocksDB, while Bhat is an experienced product and marketing executive focused on enterprise software and data products. Their new startup is focused on a few trends I’ve recently been thinking about, including the re-emergence of real-time analytics and the hunger for simpler data architectures and tools. Borthakur exemplifies the need for companies to continually evaluate new technologies: while he was the founding engineer for HDFS, these days he mostly works with object stores like S3.

We had a great conversation spanning many topics, including:

  • RocksDB, an open source, embeddable key-value store that originated at Facebook and is used in several other open source projects.
  • Time-series databases.
  • The importance of having solutions for real-time analytics, particularly now with the renewed interest in IoT applications and rollout of 5G technologies.
  • Use cases for Rockset’s technologies—and more generally, applications of real-time analytics.
  • The Aggregator Leaf Tailer architecture as an alternative to the Lambda architecture.
  • Building data infrastructure in the cloud.

Figure: The Aggregator Leaf Tailer (“CQRS for the data world”): A data architecture favored by web-scale companies. Source: Dhruba Borthakur, used with permission.

    Related resources:

    • Serverless Streaming Architectures & Algorithms for the Enterprise – a new tutorial on September 24th at Strata Data NYC.
    • “Becoming a machine learning company means investing in foundational technologies”
    • Haoyuan Li: “In the age of AI, fundamental value resides in data”
    • Harish Doddi: “Simplifying machine learning lifecycle management”
    • Eric Jonas: “A Berkeley view on serverless computing”
    • “Specialized tools for machine learning development and model governance are becoming essential”
    • Avner Braverman: “What data scientists and data engineers can do with current generation serverless technologies”
      O'Reilly Data Show Podcast, by O'Reilly Media
