
Sign up to save your podcasts
Or
In this post, I'll list all the areas of control research (and implementation) that seem promising to me.
This references framings and abstractions discussed in Prioritizing threats for AI control.
First, here is a division of different areas, though note that these areas have some overlaps:
This division is imperfect and somewhat [...]
---
Outline:
(01:43) Developing and using settings for control evaluations
(13:51) Better understanding control-relevant capabilities and model properties
(28:21) Developing specific countermeasures/techniques in isolation
(36:01) Doing control experiments on actual AI usage
(40:43) Prototyping or implementing software infrastructure for control and security measures that are particularly relevant to control
(41:58) Prototyping or implementing human processes for control
(46:50) Conceptual research and other ways of researching control other than empirical experiments
(49:37) Nearer-term applications of control to practice and accelerate later usage of control
---
First published:
Source:
Narrated by TYPE III AUDIO.
In this post, I'll list all the areas of control research (and implementation) that seem promising to me.
This references framings and abstractions discussed in Prioritizing threats for AI control.
First, here is a division of different areas, though note that these areas have some overlaps:
This division is imperfect and somewhat [...]
---
Outline:
(01:43) Developing and using settings for control evaluations
(13:51) Better understanding control-relevant capabilities and model properties
(28:21) Developing specific countermeasures/techniques in isolation
(36:01) Doing control experiments on actual AI usage
(40:43) Prototyping or implementing software infrastructure for control and security measures that are particularly relevant to control
(41:58) Prototyping or implementing human processes for control
(46:50) Conceptual research and other ways of researching control other than empirical experiments
(49:37) Nearer-term applications of control to practice and accelerate later usage of control
---
First published:
Source:
Narrated by TYPE III AUDIO.
26,368 Listeners
2,397 Listeners
7,808 Listeners
4,111 Listeners
87 Listeners
1,454 Listeners
8,774 Listeners
89 Listeners
354 Listeners
5,363 Listeners
15,028 Listeners
461 Listeners
127 Listeners
65 Listeners
432 Listeners