Learning Bayesian Statistics

#80 Bayesian Additive Regression Trees (BARTs), with Sameer Deshpande


Listen Later

Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!

I’m sure you know at least one Bart. Maybe you’ve even used one — but you’re not proud of it, because you didn’t know what you were doing. Thankfully, in this episode, we’ll go to the roots of regression trees — oh yeah, that’s what BART stands for. What were you thinking about?

Our tree expert will be no one else than Sameer Deshpande. Sameer is an assistant professor of Statistics at the University of Wisconsin-Madison. Prior to that, he completed a postdoc at MIT and earned his Ph.D. in Statistics from UPenn.

On the methodological front, he is interested in Bayesian hierarchical modeling, regression trees, model selection, and causal inference. Much of his applied work is motivated by an interest in understanding the long-term health consequences of playing American-style tackle football. He also enjoys modeling sports data and was a finalist in the 2019 NFL Big Data Bowl.

Outside of Statistics, he enjoys cooking, making cocktails, and photography — sometimes doing all of those at the same time…

Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work at https://bababrinkman.com/ !

Thank you to my Patrons for making this episode possible!

Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor, Thomas Wiecki, Chad Scherrer, Nathaniel Neitzke, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Joshua Duncan, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Raul Maldonado, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, David Haas, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Trey Causey, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, and Arkady.

Visit https://www.patreon.com/learnbayesstats to unlock exclusive Bayesian swag ;)

Links from the show:

  • Sameer’s website: https://skdeshpande91.github.io/
  • Sameer on GitHub: https://github.com/skdeshpande91
  • Sameer on Twitter: https://twitter.com/skdeshpande91 
  • Sameer on Google Scholar: https://scholar.google.com/citations?user=coVrnWIAAAAJ&hl=en
  • LBS #50 Ta(l)king Risks & Embracing Uncertainty, with David Spiegelhalter: https://learnbayesstats.com/episode/50-talking-risks-embracing-uncertainty-david-spiegelhalter/
  • LBS #51 Bernoulli’s Fallacy & the Crisis of Modern Science, with Aubrey Clayton: https://learnbayesstats.com/episode/51-bernoullis-fallacy-crisis-modern-science-aubrey-clayton/
  • LBS #58 Bayesian Modeling and Computation, with Osvaldo Martin, Ravin Kumar and Junpeng Lao: https://learnbayesstats.com/episode/58-bayesian-modeling-computation-osvaldo-martin-ravin-kumar-junpeng-lao/
  • Book Bayesian Modeling and Computation in Python: https://bayesiancomputationbook.com/welcome.html
  • LBS #39 Survival Models & Biostatistics for Cancer Research, with Jacki Buros: https://learnbayesstats.com/episode/39-survival-models-biostatistics-cancer-research-jacki-buros/
  • Original BART paper (Chipman, George, and McCulloch 2010): https://doi.org/10.1214/09-AOAS285
  • Hill (2011) on BART in causal inference: https://doi.org/10.1198/jcgs.2010.08162
  • Hahn, Murray, and Carvalho on Bayesian causal forests: https://doi.org/10.1214/19-BA1195
  • Main BART package in R: https://cran.r-project.org/web/packages/BART/index.html
  • dbart R package: https://cran.r-project.org/web/packages/dbarts/index.html
  • Sameer’s own re-implementation of BART: https://github.com/skdeshpande91/flexBART

Abstract

by Christoph Bamberg

In episode 80, Sameer Deshpande, assistant professor of Statistics at the University of Wisconsin-Madison is our guest. 

He had a passion for math from a young age. And got into Bayesian statistics at university, teaching statistics now himself. We talk about the intricacies of teaching bayesian statistics, such as helping students accept that there are no objective answers. 

Sameer’s current work focuses on Bayesian Additive Regression Trees (BARTs). He also works on prior specification, and numerous cool applied projects, for example on the effects of playing American football as an adolescent and its effects for later health

We primarily talk about BARTs as a way of approximating complex functions by using a collection of step functions. They work off the shelf pretty well and can be applied to various models such as survival models, linear models, and smooth models. BARTs are somewhat analogous to splines and can capture trajectories well over time. However, they are also a bit like a black box making them hard to interpret.

We further touch upon some of his work on practical problems, such as how cognitive processes change over time or models of baseball empires’ decision making.

Transcript

Please note that the following transcript was generated automatically and may therefore contain errors. Feel free to reach out if you're willing to correct them.

...more
View all episodesView all episodes
Download on the App Store

Learning Bayesian StatisticsBy Alexandre Andorra

  • 4.7
  • 4.7
  • 4.7
  • 4.7
  • 4.7

4.7

62 ratings


More shows like Learning Bayesian Statistics

View all
Data Skeptic by Kyle Polich

Data Skeptic

474 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

585 Listeners

Quanta Science Podcast by Quanta Magazine

Quanta Science Podcast

450 Listeners

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

429 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

295 Listeners

Python Bytes by Michael Kennedy and Brian Okken

Python Bytes

212 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

322 Listeners

Data Engineering Podcast by Tobias Macey

Data Engineering Podcast

142 Listeners

DataFramed by DataCamp

DataFramed

267 Listeners

The Numberphile Podcast by Brady Haran

The Numberphile Podcast

445 Listeners

Google DeepMind: The Podcast by Hannah Fry

Google DeepMind: The Podcast

190 Listeners

COMPLEXITY by Santa Fe Institute

COMPLEXITY

279 Listeners

The Real Python Podcast by Real Python

The Real Python Podcast

136 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

64 Listeners

Complex Systems with Patrick McKenzie (patio11) by Patrick McKenzie

Complex Systems with Patrick McKenzie (patio11)

101 Listeners