The GeekNarrator

Tiered Storage implementation by StarTree (Apache Pinot) with Neha Pawar



In this episode I have invited Neha Pawar, one of the founding engineers at StarTree (the company powering Apache Pinot). We talked about how StarTree has implemented tiered storage and how it differs from other available implementations. Note: tiered storage is currently available only in StarTree's Pinot and not in the open-source version, but it's only a matter of time.


Chapters:
00:00 Introduction
03:28 What does Tiered Storage mean?
05:51 How many tiers are typically supported?
07:30 Is it mainly about Cost Optimisation? How do I compare the cost savings vs performance hit?
15:41 What is mmap and how does it help?
16:45 How do I implement/approach Tiered Storage? What are the challenges?
23:00 What is Apache Pinot? When we say low latency, how low is it?
25:00 How is it implemented in StarTree (Apache Pinot)?
36:45 What happens when I query for more (or all) columns? How is that optimised?
47:10 What are the failure modes?
50:15 How can we test and validate Tiered Storage as a feature?
54:30 How would bloom filter false positives affect performance and correctness?
56:15 Can I move back my data from Cold storage to Hot Storage?
57:45 What other cloud storage services are supported other than S3?
58:35 What is the future of Tiered Storage?




The GeekNarrator, by Kaivalya Apte


