Data Science Decoded

Data Science #28 - The Bloom filter algorithm


Listen Later

In the 28th episode, we go over Burton Bloom's Bloom filter from 1970, a groundbreaking data structure that enables fast, space-efficient set membership checks by allowing a small, controllable rate of false positives.Unlike traditional methods that store full data, Bloom filters use a compact bit array and multiple hash functions, trading exactness for speed and memory savings.


This idea transformed modern data science and big data systems, powering tools like Apache Spark, Cassandra, and Kafka, where fast filtering and memory efficiency are critical for performance at scale.

...more
View all episodesView all episodes
Download on the App Store

Data Science DecodedBy Mike E

  • 3.8
  • 3.8
  • 3.8
  • 3.8
  • 3.8

3.8

5 ratings


More shows like Data Science Decoded

View all
Radiolab by WNYC Studios

Radiolab

43,991 Listeners

My Favorite Theorem by Kevin Knudson & Evelyn Lamb

My Favorite Theorem

100 Listeners

WW2 Pod: We Have Ways of Making You Talk by Goalhanger

WW2 Pod: We Have Ways of Making You Talk

1,429 Listeners

The Rest Is History by Goalhanger

The Rest Is History

15,632 Listeners