unSILOed with Greg LaBlanc

Dark Data: Why What You Don’t Know Matters feat. David Hand



We like to think we have everything we need to make decisions based on the numbers we are presented in a data set. But any large data set is bound to have problems. And it's often the data that we are missing that can lead us off course unexpectedly. 

David Hand has written many books, including The Improbability Principle: Why Coincidences, Miracles, and Rare Events Happen Every Day and the more recent Dark Data: Why What You Don’t Know Matters. He is also emeritus professor of mathematics at Imperial College London.

David and Greg talk today about bias in statistics, interpreting data sets, whether we are simply more aware of global events than we were in the past, and how that awareness affects statistics.

Episode Quotes:

Interpreting data sets:

You need an element of caution, skepticism about the data because, let's face it, any large data set is likely to have some problems: measurement error problems, duplications, and missing values. In time, missing records, it's likely to have some problems. So, a skeptical attitude, I think, is a healthy attitude.

Observational data:

I think observational data is particularly risky, and it has to be said that the data science revolution we are currently living through is in large part driven by big observational administrative data sets: data sets which arise in the normal practice of everyday life. Running a credit card or a retail operation, for example, or a transport company, a hospital, or whatever. You're just observing what happens. You're not manipulating or intervening. And in that case, I think the opportunities for distortions are very severe. Now, whether those distortions will impact your conclusions depends on what question you're asking, but there is a great risk.

Misconceptions of big data sets:

People have this belief that big data, massive data sets, billions of data points - no need to worry, the size of the data will wash all the problems away. What I say is that big data has all the problems of small data and extra problems of its own, because I think there are more opportunities for glitches to occur and problems to arise.


Show Links:


Guest Profile:

  • Faculty Profile at Imperial College London
  • Professional Profile at The British Academy


His work:

  • David Hand on Google Scholar
  • Dark Data: Why What You Don’t Know Matters
  • The Improbability Principle: Why Coincidences, Miracles, and Rare Events Happen Every Day
  • Measurement: A Very Short Introduction


