Adrian Wedd

Reasoning Models Think Themselves Into Trouble


Listen Later

Frontier reasoning models are 5–20x more vulnerable to adversarial prompts than non-reasoning models. The thinking process itself is the attack surface.
...more
View all episodesView all episodes
Download on the App Store

Adrian WeddBy Adrian Wedd