
Testing an LM system for dangerous capabilities is crucial for assessing its risks.
Summary of best practices
Best practices for labs evaluating LM systems for dangerous capabilities:
---
Outline:
(00:11) Summary of best practices
(04:26) How labs are doing
(06:40) Appendix: misc notes on best practices
(06:49) Appendix: misc notes on particular evals
(12:08) Appendix: reading list
The original text contained 2 footnotes which were omitted from this narration.
---
First published:
Source:
Narrated by TYPE III AUDIO.
By LessWrong
