
Sign up to save your podcasts
Or
YouTube Link: https://www.youtube.com/watch?v=41JBrC5e5tA
David Hand, professor of statistics, reveals how ChatGPT lies with "dark data"; more generally, large language models and even peer review.
Listen now early and ad-free on Patreon https://patreon.com/curtjaimungal.
- Patreon: https://patreon.com/curtjaimungal (early access to ad-free audio episodes!)
- Crypto: https://tinyurl.com/cryptoTOE
- PayPal: https://tinyurl.com/paypalTOE
- Twitter: https://twitter.com/TOEwithCurt
- Discord Invite: https://discord.com/invite/kBcnfNVwqs
- iTunes: https://podcasts.apple.com/ca/podcast...
- Pandora: https://pdora.co/33b9lfP
- Spotify: https://open.spotify.com/show/4gL14b9...
- Subreddit r/TheoriesOfEverything: https://reddit.com/r/theoriesofeveryt...
- TOE Merch: https://tinyurl.com/TOEmerch
DAVID HAND'S BOOKS:
- Dark Data: https://amzn.to/446Fou1
- The Improbability Principle: https://amzn.to/3DOn1iX
TIMESTAMPS:
00:00:00 Introduction
00:01:34 What is Dark Data? (missing data matters more than what you have)
00:07:03 The perils of "changing definitions"
00:09:15 David on writing and his selective process
00:20:15 Theory-driven vs. data-driven models (& the constitution of LLMs)
00:32:08 The dilemma of partial truths
00:34:40 The "File Drawer Problem" & its adverse effects on clinical trials
00:39:09 Regression to the mean (how random variations lead to misleading conclusions)
00:44:12 Publication bias
00:48:03 Open-access models and their pitfalls
00:54:06 Why LLMs are simultaneously brilliant & stupid
01:03:40 David’s daily routine
01:06:24 The mean vs. median
01:11:07 Every type of "Dark Data" listed (watch this first!)
Learn more about your ad choices. Visit megaphone.fm/adchoices
4.7
435435 ratings
YouTube Link: https://www.youtube.com/watch?v=41JBrC5e5tA
David Hand, professor of statistics, reveals how ChatGPT lies with "dark data"; more generally, large language models and even peer review.
Listen now early and ad-free on Patreon https://patreon.com/curtjaimungal.
- Patreon: https://patreon.com/curtjaimungal (early access to ad-free audio episodes!)
- Crypto: https://tinyurl.com/cryptoTOE
- PayPal: https://tinyurl.com/paypalTOE
- Twitter: https://twitter.com/TOEwithCurt
- Discord Invite: https://discord.com/invite/kBcnfNVwqs
- iTunes: https://podcasts.apple.com/ca/podcast...
- Pandora: https://pdora.co/33b9lfP
- Spotify: https://open.spotify.com/show/4gL14b9...
- Subreddit r/TheoriesOfEverything: https://reddit.com/r/theoriesofeveryt...
- TOE Merch: https://tinyurl.com/TOEmerch
DAVID HAND'S BOOKS:
- Dark Data: https://amzn.to/446Fou1
- The Improbability Principle: https://amzn.to/3DOn1iX
TIMESTAMPS:
00:00:00 Introduction
00:01:34 What is Dark Data? (missing data matters more than what you have)
00:07:03 The perils of "changing definitions"
00:09:15 David on writing and his selective process
00:20:15 Theory-driven vs. data-driven models (& the constitution of LLMs)
00:32:08 The dilemma of partial truths
00:34:40 The "File Drawer Problem" & its adverse effects on clinical trials
00:39:09 Regression to the mean (how random variations lead to misleading conclusions)
00:44:12 Publication bias
00:48:03 Open-access models and their pitfalls
00:54:06 Why LLMs are simultaneously brilliant & stupid
01:03:40 David’s daily routine
01:06:24 The mean vs. median
01:11:07 Every type of "Dark Data" listed (watch this first!)
Learn more about your ad choices. Visit megaphone.fm/adchoices
242 Listeners
483 Listeners
1,044 Listeners
922 Listeners
4,127 Listeners
488 Listeners
807 Listeners
368 Listeners
992 Listeners
453 Listeners
100 Listeners
243 Listeners
1,331 Listeners
285 Listeners
200 Listeners