
Sign up to save your podcasts
Or


AI benchmarks saturate quickly, struggle to capture what we care about, and cost more than ever to build. But are they doomed? Greg Burnham, who leads Epoch's benchmarking team, and Tom Adamczewski, who developed MirrorCode, push back on the pessimism and dig into what the next generation of AI benchmarks could look like.
By Epoch AI5
55 ratings
AI benchmarks saturate quickly, struggle to capture what we care about, and cost more than ever to build. But are they doomed? Greg Burnham, who leads Epoch's benchmarking team, and Tom Adamczewski, who developed MirrorCode, push back on the pessimism and dig into what the next generation of AI benchmarks could look like.

1,978 Listeners

4,277 Listeners

2,457 Listeners

1,096 Listeners

124 Listeners

291 Listeners

198 Listeners

97 Listeners

561 Listeners

507 Listeners

145 Listeners

100 Listeners

265 Listeners

488 Listeners

95 Listeners