LessWrong (30+ Karma)

[Linkpost] “A Chess-GPT Linear Emergent World Representation” by karvonenadam


Listen Later

This is a linkpost for https://adamkarvonen.github.io/machine_learning/2024/01/03/chess-world-models.html

A Chess-GPT Linear Emergent World Representation

Introduction.

Among the many recent developments in ML, there were two I found interesting and wanted to dig into further. The first was gpt-3.5-turbo-instruct's ability to play chess at 1800 Elo. The fact that an LLM could learn to play chess well from random text scraped off the internet seemed almost magical. The second was Kenneth Li's Emergent World Representations paper. There is an excellent summary on The Gradient and a follow-up from Neel Nanda. In it, they trained a 25 million parameter GPT to predict the next character in an Othello game. It learns to accurately make moves in games unseen in its training dataset, and using both non-linear and linear probes it was found that the model accurately tracks the state of the board.

However, this only worked for a model trained on a synthetic [...]



---

Outline:

(00:07) A Chess-GPT Linear Emergent World Representation

(02:35) Training Chess GPT

(05:12) Chess-GPTs Internal World Model

(08:46) Probing for latent variables

(12:41) \- I fine-tuned GPT-2 on a 50 / 50 mix of OpenWebText and chess games, and it learned to play chess and continued to output plausible looking text. Maybe theres something interesting to look at there?

---

First published:

February 8th, 2024

Source:

https://www.lesswrong.com/posts/yzGDwpRBx6TEcdeA5/a-chess-gpt-linear-emergent-world-representation

Linkpost URL:
https://adamkarvonen.github.io/machine_learning/2024/01/03/chess-world-models.html

---

Narrated by TYPE III AUDIO.

...more
View all episodesView all episodes
Download on the App Store

LessWrong (30+ Karma)By LessWrong


More shows like LessWrong (30+ Karma)

View all
The Daily by The New York Times

The Daily

113,207 Listeners

Astral Codex Ten Podcast by Jeremiah

Astral Codex Ten Podcast

130 Listeners

Interesting Times with Ross Douthat by New York Times Opinion

Interesting Times with Ross Douthat

7,258 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

534 Listeners

The Ezra Klein Show by New York Times Opinion

The Ezra Klein Show

16,291 Listeners

AI Article Readings by Readings of great articles in AI voices

AI Article Readings

4 Listeners

Doom Debates by Liron Shapira

Doom Debates

14 Listeners

LessWrong posts by zvi by zvi

LessWrong posts by zvi

2 Listeners