


Is AI just good at trivia, or can it actually take your job? In this episode, host Emily Laird breaks down GDPval-AA, the benchmark pitting models against humans across 1,320 real-world tasks, scored like chess and judged blind. With top models working faster and cheaper than any employee, this is less sci-fi and more spreadsheet reality. If you’ve ever wondered whether the robots are coming for your role, this is your warning shot.
By Emily Laird
