Tech Unplugged

Kafka: Distributed Messaging System for High-Volume Data



This episode introduces Kafka, a distributed messaging system developed to collect and deliver high volumes of log data with low latency. Kafka combines features of log aggregators and messaging systems: it is scalable and offers high throughput, while still letting applications consume log events in real time. It addresses the challenge of processing the large volumes of log data generated by modern internet applications. The paper discussed in the episode details Kafka's architecture, design choices, and deployment at LinkedIn, and compares its performance with other messaging systems such as ActiveMQ and RabbitMQ, highlighting Kafka's higher throughput and efficiency in log processing. Kafka's design prioritizes efficiency and scalability: consumers pull data from the brokers rather than having it pushed to them, and the system relies on the file system page cache instead of its own in-process cache to achieve high performance. The authors conclude that Kafka is a successful system for both offline and online applications.
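To make the pull-based model concrete, here is a minimal sketch in Python. It is not Kafka's code or API: the class names `PartitionLog` and `PullConsumer` are hypothetical, the log lives in memory rather than in files backed by the page cache, and offsets are simple list positions. It only illustrates the idea that the log is append-only and each consumer tracks its own position and asks for data when it is ready.

```python
from dataclasses import dataclass, field


@dataclass
class PartitionLog:
    """Hypothetical in-memory stand-in for one append-only log partition."""
    messages: list = field(default_factory=list)

    def append(self, message: bytes) -> int:
        """Append a message and return its offset (its position in the log)."""
        self.messages.append(message)
        return len(self.messages) - 1

    def fetch(self, offset: int, max_messages: int) -> list:
        """Return up to max_messages starting at offset; the log is unchanged."""
        return self.messages[offset:offset + max_messages]


@dataclass
class PullConsumer:
    """Hypothetical consumer that tracks its own offset and pulls when ready."""
    log: PartitionLog
    offset: int = 0

    def poll(self, max_messages: int = 2) -> list:
        """Pull the next batch and advance this consumer's offset past it."""
        batch = self.log.fetch(self.offset, max_messages)
        self.offset += len(batch)
        return batch


log = PartitionLog()
for event in (b"login", b"click", b"logout"):
    log.append(event)

# Two independent consumers read the same log at their own pace.
fast = PullConsumer(log)
slow = PullConsumer(log)
first_batch = fast.poll()               # two messages
second_batch = fast.poll()              # the remaining message
slow_batch = slow.poll(max_messages=1)  # still at the start of the log
```

Because reading does not remove anything from the log, a slow consumer never holds back a fast one, and a consumer can re-read older messages simply by resetting its offset.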


Tech Unplugged, by Sublimetechie