In the 43rd episode I speak with Tim Berglund on Realtime Analytics with Apache Pinot.
Chapters:
00:00 Introduction
01:22 What do we mean by analytics and realtime analytics?
05:35 Can we define realtime in millis, seconds or minutes?
08:54 What is the fundamental difference between traditional analytics systems and Apache Pinot?
12:19 Was Kafka one of the reasons Apache Pinot could reach its full potential?
16:50 E-commerce Application example - How do I get my data in?
20:07 How is data stored (structured) on the disk?
23:31 Are joins available in Apache Pinot?
26:07 Joins vs pre-computing at ingestion
27:15 How is historical data ingested into Apache Pinot?
28:14 Types of indexes available in Apache Pinot
35:42 Do indexes cause write amplification? Is that a problem in Apache Pinot?
40:02 Point lookups in Apache Pinot
42:54 Anamoly Detection
45:51 Coming up in Apache Pinot
Links:
StarTree https://startree.ai/
Apache Pinot: https://pinot.apache.org/
Joins in Pinot: https://startree.ai/blog/apache-pinot...
Apache Pinot Indexes: https://docs.pinot.apache.org/basics/...
Other playlists:
Distributed systems: • Distributed Syste...
Modern Databases: • Modern Databases
Serverless Architecture: • Serverless Archit...
Software Engineering: • Software Engineering
I hope you like the episode. Like, share and subscribe to the channel.
Cheers,