


As you know, histograms are decent visualizations for PDFs with lots of samples...
10k predictions, 20 bins
...but if there are only a few samples, the histogram-binning choices can matter a lot:
10 predictions, 4 bins
same 10 predictions, 7 bins
The binning (a) discards information, and worse, (b) is mathematically un-aesthetic.
But a CDF doesn't have this problem!
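To make the contrast concrete, here is a minimal sketch, not taken from the original post. The function names `histogram_counts` and `empirical_cdf` are hypothetical, and the ten samples are synthetic, drawn with a fixed seed and assumed to lie in [0, 1]. The histogram needs a bin count as input, while the empirical CDF is determined by the samples alone.

```python
import random

def histogram_counts(samples, n_bins, lo=0.0, hi=1.0):
    """Count samples per equal-width bin on [lo, hi]; the result depends on n_bins."""
    counts = [0] * n_bins
    width = (hi - lo) / n_bins
    for x in samples:
        # Clamp so that x == hi falls into the last bin.
        idx = min(int((x - lo) / width), n_bins - 1)
        counts[idx] += 1
    return counts

def empirical_cdf(samples):
    """Return one step point (x, fraction of samples <= x) per sample, sorted by x."""
    xs = sorted(samples)
    n = len(xs)
    return [(x, (i + 1) / n) for i, x in enumerate(xs)]

rng = random.Random(0)  # fixed seed; illustrative data only
samples = [rng.random() for _ in range(10)]

# Two different pictures of the same data, depending on a free parameter:
print(histogram_counts(samples, 4))
print(histogram_counts(samples, 7))

# One picture, no free parameter: every sample appears as its own step.
for x, frac in empirical_cdf(samples):
    print(f"{x:.3f} {frac:.1f}")
```

Nothing in `empirical_cdf` has to be chosen by the person making the plot, which is the property the question below is looking for in a calibration graph.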
same 10 predictions, every data point precisely represented
If you make a bunch of predictions, and you want to know how well they're calibrated, classically you make a graph like this:
source: SSC's 2019 prediction grading
But, as with a histogram, this depends on how you bin your predictions.
100 predictions, 10 bins
same 100 predictions, 30 bins
Is there some CDF-like equivalent here? Some visualization with no free parameters?
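For reference, the classical binned calibration graph can be sketched roughly as follows. This is an illustration under stated assumptions, not the code behind the figures above: `calibration_curve` is a hypothetical helper, the bins are equal-width in stated probability, and the 100 predictions are synthetic, generated from a perfectly calibrated forecaster with a fixed seed.

```python
import random

def calibration_curve(probs, outcomes, n_bins):
    """Group predictions into equal-width probability bins.

    Returns one (mean stated probability, observed frequency, count)
    tuple per non-empty bin.
    """
    sums = [0.0] * n_bins
    hits = [0] * n_bins
    counts = [0] * n_bins
    for p, happened in zip(probs, outcomes):
        idx = min(int(p * n_bins), n_bins - 1)  # p == 1.0 goes in the last bin
        sums[idx] += p
        hits[idx] += int(happened)
        counts[idx] += 1
    return [(sums[i] / counts[i], hits[i] / counts[i], counts[i])
            for i in range(n_bins) if counts[i] > 0]

rng = random.Random(0)  # fixed seed; illustrative data only
probs = [rng.random() for _ in range(100)]
# Each event happens with exactly its stated probability (assumed for the demo).
outcomes = [rng.random() < p for p in probs]

coarse = calibration_curve(probs, outcomes, 10)
fine = calibration_curve(probs, outcomes, 30)
for mean_p, freq, count in coarse:
    print(f"{mean_p:.2f} {freq:.2f} {count}")
```

`coarse` and `fine` are computed from the same predictions, yet they are different curves. With 30 bins, each point rests on only a few predictions, so the observed frequencies are much noisier. The bin count `n_bins` is the free parameter the question above asks to eliminate.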
I put that question to several people at Arbor Summer Camp. I got three answers:
---
First published:
Source:
Linkpost URL:
https://optimizationprocess.com/calibration-cdf/
---
Narrated by TYPE III AUDIO.
By LessWrong
