Into AI Safety

FEEDBACK: AISC Proposal w/ Remmelt Ellen


Listen Later

In this episode I discuss my initial research proposal for the 2024 Winter AI Safety Camp with one of the individuals who helps facilitate the program, Remmelt Ellen.

The proposal is titled The Effect of Machine Learning on Bioengineered Pandemic Risk. A doc-capsule of the proposal at the time of this recording can be found at this link.

Links to all articles/papers which are mentioned throughout the episode can be found below, in order of their appearance. 

  • MegaSyn: Integrating Generative Molecule Design, Automated Analog Designer and Synthetic Viability Prediction
  • Dual use of artificial-intelligence-powered drug discovery
  • Artificial intelligence and biological misuse: Differentiating risks of language models and biological design tools
  • Model Organisms of Misalignment: The Case for a New Pillar of Alignment Research
  • Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models
  • Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!
  • unRLHF - Efficiently undoing LLM safeguards
...more
View all episodesView all episodes
Download on the App Store

Into AI SafetyBy Jacob Haimes