Formalizing the Informal
One way to view MIRI's Agent Foundations research is that it saw the biggest problem in AI safety as "human preferences are informal, but we need to somehow get formal guarantees about them", and so it set out to build a formal-informal bridge.
Recently, I’ve been thinking about how we might formally represent the difference between formal and informal. My prompt is something like: if we assume that classical probability theory applies to “fully formal” propositions, how can we generalize it to handle “informal” stuff?
I’m going to lead a discussion on this tomorrow, Wednesday Sept. 11, at 11am EDT (7am Pacific, 4pm UK).
Discord Event link (might not work for most people):
https://discord.com/events/1237103274591649933/1282859362125352960
Zoom link (should work for everyone):
https://us06web.zoom.us/j/6274543940?pwd=TGZpY3NSTUVYNHZySUdCQUQ5ZmxQQT09
You can support my work on Patreon.
---
First published:
Source:
Narrated by TYPE III AUDIO.