Thinking Machines: AI & Philosophy

On Adversarial Training & Robustness with Bhavna Gopal


Listen Later

"Understanding what's going on in a model is important to fine-tune it for specific tasks and to build trust."

Bhavna Gopal is a PhD candidate at Duke, research intern at Slingshot with experience at Apple, Amazon and Vellum.

We discuss

  • How adversarial robustness research impacts the field of AI explainability.
  • How do you evaluate a model's ability to generalize?
  • What adversarial attacks should we be concerned about with LLMs?
...more
View all episodesView all episodes
Download on the App Store

Thinking Machines: AI & PhilosophyBy Daniel Reid Cahn