Machine Learning Street Talk (MLST)

#73 - YASAMAN RAZEGHI & Prof. SAMEER SINGH - NLP benchmarks


Listen Later

Patreon: https://www.patreon.com/mlst

Discord: https://discord.gg/ESrGqhf5CB

YT version: https://youtu.be/RzGaI7vXrkk

This week we speak with Yasaman Razeghi and Prof. Sameer Singh from UC Urvine. Yasaman recently published a paper called Impact of Pretraining Term Frequencies on Few-Shot Reasoning where she demonstrated comprehensively that large language models only perform well on reasoning tasks because they memorise the dataset. For the first time she showed the accuracy was linearly correlated to the occurance rate in the training corpus, something which OpenAI should have done in the first place! 

We also speak with Sameer who has been a pioneering force in the area of machine learning interpretability for many years now, he created LIME with Marco Riberio and also had his hands all over the famous Checklist paper and many others. 

We also get into the metric obsession in the NLP world and whether metrics are one of the principle reasons why we are failing to make any progress in NLU. 

[00:00:00] Impact of Pretraining Term Frequencies on Few-Shot Reasoning

[00:14:59] Metrics

[00:18:55] Definition of reasoning

[00:25:12] Metrics (again)

[00:28:52] On true believers 

[00:33:04] Sameers work on model explainability / LIME 

[00:36:58] Computational irreducability 

[00:41:07] ML DevOps and Checklist

[00:45:58] Future of ML devops

[00:49:34] Thinking about future


Prof. Sameer Singh

https://sameersingh.org/


Yasaman Razeghi

https://yasamanrazeghi.com/


References;


Impact of Pretraining Term Frequencies on Few-Shot Reasoning [Razeghi et al with Singh]

https://arxiv.org/pdf/2202.07206.pdf


Beyond Accuracy: Behavioral Testing of NLP Models with CheckList [Riberio et al with Singh]

https://arxiv.org/pdf/2005.04118.pdf


“Why Should I Trust You?” Explaining the Predictions of Any Classifier (LIME) [Riberio et al with Singh]

https://arxiv.org/abs/1602.04938


Tim interviewing LIME Creator Marco Ribeiro in 2019

https://www.youtube.com/watch?v=6aUU-Ob4a8I


Tim video on LIME/SHAP on his other channel

https://www.youtube.com/watch?v=jhopjN08lTM


Our interview with Christoph Molar

https://www.youtube.com/watch?v=0LIACHcxpHU


Interpretable Machine Learning book @ChristophMolnar

https://christophm.github.io/interpretable-ml-book/


Machine Teaching: A New Paradigm for Building Machine Learning Systems [Simard]

https://arxiv.org/abs/1707.06742


Whimsical notes on machine teaching

https://whimsical.com/machine-teaching-Ntke9EHHSR25yHnsypHnth


Gopher paper (Deepmind)

https://www.deepmind.com/blog/language-modelling-at-scale-gopher-ethical-considerations-and-retrieval

https://arxiv.org/pdf/2112.11446.pdf


EleutherAI

https://www.eleuther.ai/

https://github.com/kingoflolz/mesh-transformer-jax/

https://pile.eleuther.ai/


A Theory of Universal Artificial Intelligence based on Algorithmic Complexity [Hutter]

https://arxiv.org/pdf/cs/0004001.pdf







...more
View all episodesView all episodes
Download on the App Store

Machine Learning Street Talk (MLST)By Machine Learning Street Talk (MLST)

  • 4.7
  • 4.7
  • 4.7
  • 4.7
  • 4.7

4.7

83 ratings


More shows like Machine Learning Street Talk (MLST)

View all
Data Skeptic by Kyle Polich

Data Skeptic

470 Listeners

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

436 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

295 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

324 Listeners

Practical AI by Practical AI LLC

Practical AI

189 Listeners

Google DeepMind: The Podcast by Hannah Fry

Google DeepMind: The Podcast

203 Listeners

Last Week in AI by Skynet Today

Last Week in AI

282 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

352 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

125 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

196 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

63 Listeners

"Upstream" with Erik Torenberg by Erik Torenberg

"Upstream" with Erik Torenberg

64 Listeners

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

422 Listeners

AI + a16z by a16z

AI + a16z

33 Listeners

Training Data by Sequoia Capital

Training Data

36 Listeners