Learning Bayesian Statistics

#159 Bayesian Occupancy Models, with Matthijs Hollanders


Listen Later

Support & Resources
→ Support the show on Patreon
Bayesian Modeling Course (first 2 lessons free)

Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work

Takeaways:

Q: What is a Bayesian occupancy model and what problem does it solve?
A: An occupancy model accounts for the fact that you don't always detect a species when surveying for it, especially when the species is rare. A naive count of where you found it underestimates true occupancy. The model adds a repeated-measures component: you visit each site multiple times, and from the pattern of detections vs. non-detections it estimates a detection probability. Matthijs framed it as a zero-inflation structure where the zero-inflation happens at the site level rather than the observation level -- which keeps the model conceptually simple, just a standard GLM with a Bernoulli “is the species here at all?” stacked on top of a detection-rate process.

Q: What are Automated Recording Units and why don't traditional occupancy models handle them well?
A: ARUs are camera traps and acoustic monitors that record continuously over deployment periods of days, weeks, or months. The data they produce isn't a sequence of discrete human-led surveys; it's a continuous-time observation stream. Traditional occupancy models were designed for the discrete case -- a human visits a site, records yes or no, goes home. With ARUs, the question becomes how to bin or threshold the continuous data without losing the richer signal it actually contains.

Q: When should you not reach for occARU?

A: When your dataset is large and your survey interval is fine-grained. The bottleneck is Stan's fitting speed -- years of daily count data across many sites will fit slowly. The workaround is to bin coarser (weekly or monthly), which doesn't hurt occupancy estimation at all and only loses some detection-rate resolution. If you're only interested in occupancy, big grouping windows are fine.

Full takeaways here

Chapters:

00:12:14 What is an occupancy model and what problem does it solve?
00:16:16 What are Automated Recording Units and why do they need different models?
00:18:45 What is the occARU R package and why does it exist?
00:23:55 Why does occARU model counts directly rather than binary detection?
00:26:38 What does multi-species hierarchical modeling with Gaussian processes look like?
00:32:22 How does occARU implement Gaussian processes efficiently?
00:41:01 Why are Gaussian processes such a powerful but tricky modeling tool?
00:44:11 What is variance decomposition with global-local shrinkage priors?
00:49:02 How does occARU leverage recent Stan features for zero-sum constraints?
00:57:37 When does within-chain parallelization actually help?
01:01:30 How does Monte Carlo integration reduce high Pareto-k values?
01:15:27 When does occARU underperform and what's on the roadmap?

Thank you to my Patrons for making this episode possible!

Links from the show here.

...more
View all episodesView all episodes
Download on the App Store

Learning Bayesian StatisticsBy Alexandre Andorra

  • 4.7
  • 4.7
  • 4.7
  • 4.7
  • 4.7

4.7

66 ratings


More shows like Learning Bayesian Statistics

View all
Odd Lots by Bloomberg

Odd Lots

1,978 Listeners

Conversations with Tyler by Mercatus Center at George Mason University

Conversations with Tyler

2,457 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

583 Listeners

The Quanta Podcast by Quanta Magazine

The Quanta Podcast

548 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

301 Listeners

Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas by Sean Carroll

Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas

4,176 Listeners

Practical AI by Practical AI LLC

Practical AI

213 Listeners

Last Week in AI by Skynet Today

Last Week in AI

318 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

97 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

561 Listeners

Hard Fork by The New York Times

Hard Fork

5,544 Listeners

Latent Space: The AI Engineer Podcast by Latent.Space

Latent Space: The AI Engineer Podcast

100 Listeners

Risky Business with Nate Silver and Maria Konnikova by Pushkin Industries

Risky Business with Nate Silver and Maria Konnikova

261 Listeners

Prof G Markets by Vox Media Podcast Network

Prof G Markets

1,486 Listeners

The Opinions by The New York Times Opinion

The Opinions

633 Listeners