
Sign up to save your podcasts
Or


Testing an LM system for dangerous capabilities is crucial for assessing its risks.
Summary of best practices
Best practices for labs evaluating LM systems for dangerous capabilities:
---
Outline:
(00:11) Summary of best practices
(04:26) How labs are doing
(06:40) Appendix: misc notes on best practices
(06:49) Appendix: misc notes on particular evals
(12:08) Appendix: reading list
The original text contained 2 footnotes which were omitted from this narration.
---
First published:
Source:
Narrated by TYPE III AUDIO.
By LessWrongTesting an LM system for dangerous capabilities is crucial for assessing its risks.
Summary of best practices
Best practices for labs evaluating LM systems for dangerous capabilities:
---
Outline:
(00:11) Summary of best practices
(04:26) How labs are doing
(06:40) Appendix: misc notes on best practices
(06:49) Appendix: misc notes on particular evals
(12:08) Appendix: reading list
The original text contained 2 footnotes which were omitted from this narration.
---
First published:
Source:
Narrated by TYPE III AUDIO.

112,220 Listeners

131 Listeners

7,228 Listeners

568 Listeners

16,196 Listeners

4 Listeners

14 Listeners

2 Listeners