Daliana's Game

Weather forecasting with AI, Kaggle tips and tricks, dealing with missing data, deep learning with Jesper Dramsch, The Data Scientist Show #040


Listen Later

Jesper Dramsch is a scientist for machine learning at the European Centre for Medium-Range Weather forecasts. They have a phd in applied Machine Learning to Geoscience from Technical University of Denmark. They are a Kaggle Kernals Expert and TPU star, ranking at top 81/100k worldwide. We talked about weather forecasting, things they learned from Kaggle, how to deal with missing data and ourliers, deep learning, Keras vs Pytorch, XGBoost, their struggles as a phd student, working in the EU vs US. Follow @DalianaLiu for more updates on data science and this show.

(00:01:27) how he got into in ML 

(00:09:10) how he handled missing data 

(00:28:34) Transformers are eating the world 

(00:49:36) Hoover Loss is a fantastic metric to deal with extreme values 

(00:54:48) his experience with Kaggle competition 

(01:02:59) Kaggle tricks that helped his models perform better 

(01:08:18) PyTorch vs Keras 

(01:30:30) working in different countries and cultures 

Resources shared by Jesper:

The newsletter with missing data:

https://buttondown.email/jesper/archive/towels-have-quite-a-dry-sense-of-humor/

The paper by Gael about missing data:

https://academic.oup.com/gigascience/article/doi/10.1093/gigascience/giac013/6568998

The Huber Loss:

https://en.wikipedia.org/wiki/Huber_loss

Skill Scores:

https://en.wikipedia.org/wiki/Forecast_skill

Brier Skill in Weather:

https://www.dwd.de/EN/ourservices/seasonals_forecasts/forecast_reliability.html

CRPS Continuous Ranked Probability Score

https://datascience.stackexchange.com/questions/63919/what-is-continuous-ranked-probability-score-crps

ConvNext, Convnets for the 2020s:

https://arxiv.org/abs/2201.03545

Transformers for ensemble forecasts:

https://arxiv.org/abs/2106.13924

Books I recommend:

https://www.amazon.com/shop/jesperdramsch/list/2DYS5KVR5TX0E

Blog posts I wrote about these books:

https://dramsch.net/tags/books/

Short I made about Test-Time Augmentation

https://www.youtube.com/shorts/w4sAh9lKyls

Their links: https://dramsch.net/links

Their open PhD thesis: https://dramsch.net/phd

Newsletter: https://dramsch.net/newsletter

Twitter: https://dramsch.net/twitter

Youtube: https://dramsch.net/youtube

Linkedin: https://dramsch.net/linkedin

Kaggle: https://dramsch.net/

...more
View all episodesView all episodes
Download on the App Store

Daliana's GameBy Daliana Liu

  • 4.7
  • 4.7
  • 4.7
  • 4.7
  • 4.7

4.7

75 ratings


More shows like Daliana's Game

View all
Bloomberg Intelligence by Bloomberg

Bloomberg Intelligence

402 Listeners

a16z Podcast by Andreessen Horowitz

a16z Podcast

1,034 Listeners

Data Skeptic by Kyle Polich

Data Skeptic

480 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

298 Listeners

DataFramed by DataCamp

DataFramed

267 Listeners

What's Next|科技早知道 by 声动活泼

What's Next|科技早知道

176 Listeners

硅谷101 by 硅谷101

硅谷101

184 Listeners

Last Week in AI by Skynet Today

Last Week in AI

287 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

9,189 Listeners

Big Technology Podcast by Alex Kantrowitz

Big Technology Podcast

443 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

121 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

201 Listeners

VK科技閱讀時間 by VK

VK科技閱讀時間

10 Listeners

BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

BG2Pod with Brad Gerstner and Bill Gurley

461 Listeners

Training Data by Sequoia Capital

Training Data

43 Listeners