Learning Bayesian Statistics

#142 Bayesian Trees & Deep Learning for Optimization & Big Data, with Gabriel Stechschulte


Listen Later

Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!

  • Get early access to Alex's next live-cohort courses!
  • Intro to Bayes Course (first 2 lessons free)
  • Advanced Regression Course (first 2 lessons free)

Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

Visit our Patreon page to unlock exclusive Bayesian swag ;)

Takeaways:

  • BART as a core tool: Gabriel explains how Bayesian Additive Regression Trees provide robust uncertainty quantification and serve as a reliable baseline model in many domains.
  • Rust for performance: His Rust re-implementation of BART dramatically improves speed and scalability, making it feasible for larger datasets and real-world IoT applications.
  • Strengths and trade-offs: BART avoids overfitting and handles missing data gracefully, though it is slower than other tree-based approaches.
  • Big data meets Bayes: Gabriel shares strategies for applying Bayesian methods with big data, including when variational inference helps balance scale with rigor.
  • Optimization and decision-making: He highlights how BART models can be embedded into optimization frameworks, opening doors for sequential decision-making.
  • Open source matters: Gabriel emphasizes the importance of communities like PyMC and Bambi, encouraging newcomers to start with small contributions.

Chapters:

05:10 – From economics to IoT and Bayesian statistics

18:55 – Introduction to BART (Bayesian Additive Regression Trees)

24:40 – Re-implementing BART in Rust for speed and scalability

32:05 – Comparing BART with Gaussian Processes and other tree methods

39:50 – Strengths and limitations of BART

47:15 – Handling missing data and different likelihoods

54:30 – Variational inference and big data challenges

01:01:10 – Embedding BART into optimization and decision-making frameworks

01:08:45 – Open source, PyMC, and community support

01:15:20 – Advice for newcomers

01:20:55 – Future of BART, Rust, and probabilistic programming

Thank you to my Patrons for making this episode possible!

Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Marcus Nölke, Maggi Mackintosh, Grant Pezzolesi, Joshua Meehl, Javier Sabio, Kristian Higgins, Matt Rosinski, Bart Trudeau, Luis Fonseca, Dante Gates, Matt Niccolls, Maksim Kuznecov, Michael Thomas, Luke Gorrie, Cory Kiser, Julio, Edvin Saveljev, Frederick Ayala, Jeffrey Powell, Gal Kampel, Adan Romero, Will Geary, Blake Walters, Jonathan Morgan, Francesco Madrisotti, Ivy Huang, Gary Clarke, Robert Flannery, Rasmus Hindström, Stefan, Corey Abshire, Mike Loncaric, David McCormick, Ronald Legere, Sergio Dolia, Michael Cao, Yiğit Aşık, Suyog Chandramouli and Adam Tilmar Jakobsen.

Links from the show:

  • Gabriel’s website: https://gstechschulte.github.io/
  • Gabriel on LinkedIn: https://www.linkedin.com/in/gabrielstechschulte/
  • Gabriel on GitHub: https://github.com/GStechschulte 
  • Gabriel on Blue Sky: https://bsky.app/profile/gstechschulte.bsky.social
  • Gabriel on Google Scholar: https://scholar.google.com/citations?user=ood-6GIAAAAJ&hl=en
  • Rust implementation of PyMC-BART: https://github.com/GStechschulte/bart-rs
  • PyMC BART: https://www.pymc.io/projects/bart/en/latest/
  • Reproducing Uber's Marketplace Optimization: https://gstechschulte.github.io/posts/2025-09-15-marketplace-optimization/
  • Alternating Direction Method of Multipliers (ADMM) for distributed budget allocation: https://github.com/GStechschulte/uber-admm
  • A Beginner's Guide to Variational Inference | PyData Virginia 2025: https://youtu.be/XECLmgnS6Ng?feature=shared
  • Associated GitHub repo: https://github.com/fonnesbeck/vi_pydata_virginia_2025
  • Bambi: https://bambinos.github.io/bambi/

Transcript

This is an automatic transcript and may therefore contain errors. Please get in touch if you're willing to correct them.

...more
View all episodesView all episodes
Download on the App Store

Learning Bayesian StatisticsBy Alexandre Andorra

  • 4.7
  • 4.7
  • 4.7
  • 4.7
  • 4.7

4.7

66 ratings


More shows like Learning Bayesian Statistics

View all
Data Skeptic by Kyle Polich

Data Skeptic

477 Listeners

The Quanta Podcast by Quanta Magazine

The Quanta Podcast

525 Listeners

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

434 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

301 Listeners

Data Engineering Podcast by Tobias Macey

Data Engineering Podcast

146 Listeners

Machine Learning Guide by OCDevel

Machine Learning Guide

768 Listeners

DataFramed by DataCamp

DataFramed

268 Listeners

Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas by Sean Carroll | Wondery

Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas

4,151 Listeners

Practical AI by Practical AI LLC

Practical AI

211 Listeners

Google DeepMind: The Podcast by Hannah Fry

Google DeepMind: The Podcast

194 Listeners

Last Week in AI by Skynet Today

Last Week in AI

302 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

89 Listeners

MIT Technology Review Narrated by MIT Technology Review

MIT Technology Review Narrated

258 Listeners

The Joy of Why by Steven Strogatz, Janna Levin and Quanta Magazine

The Joy of Why

492 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

97 Listeners