
Sign up to save your podcasts
Or
Testing an LM system for dangerous capabilities is crucial for assessing its risks.
Summary of best practices
Best practices for labs evaluating LM systems for dangerous capabilities:
---
Outline:
(00:11) Summary of best practices
(04:26) How labs are doing
(06:40) Appendix: misc notes on best practices
(06:49) Appendix: misc notes on particular evals
(12:08) Appendix: reading list
The original text contained 2 footnotes which were omitted from this narration.
---
First published:
Source:
Narrated by TYPE III AUDIO.
Testing an LM system for dangerous capabilities is crucial for assessing its risks.
Summary of best practices
Best practices for labs evaluating LM systems for dangerous capabilities:
---
Outline:
(00:11) Summary of best practices
(04:26) How labs are doing
(06:40) Appendix: misc notes on best practices
(06:49) Appendix: misc notes on particular evals
(12:08) Appendix: reading list
The original text contained 2 footnotes which were omitted from this narration.
---
First published:
Source:
Narrated by TYPE III AUDIO.
26,382 Listeners
2,386 Listeners
7,925 Listeners
4,134 Listeners
87 Listeners
1,456 Listeners
9,048 Listeners
87 Listeners
387 Listeners
5,426 Listeners
15,207 Listeners
472 Listeners
120 Listeners
75 Listeners
456 Listeners