LessWrong (30+ Karma)

“Practical Learnings from Synthetic Document Finetuning” by Axel Højmark, Jérémy Scheurer


Listen Later

We've been using Synthetic Document Finetuning (SDF) quite a bit at Apollo Research lately. This post covers a few tweaks to the standard SDF recipe specific to our use cases, plus some general tips and tricks for getting good results. We’re sharing these notes in case they’re useful to others doing research with SDF.

1. What Is SDF?

Synthetic Document Finetuning (SDF) is a knowledge editing technique where models are finetuned on LLM-generated documents consistent with a target fact or belief. As described in Slocum et al. (2025), SDF "often succeeds at implanting beliefs that behave similarly to genuine knowledge." These implanted beliefs can generalize to related contexts, are often robust to scrutiny, and form internal representations similar to genuine knowledge.

We mostly followed the pipeline described in Slocum et al. (2025) and the safety-research/false-facts repository.

The pipeline has several stages:

  1. Universe Context: Define a "universe" description where the target belief is true.
  2. Fact Extraction: Extract discrete claims from that universe that the synthetic documents will revolve around.
  3. Generation: Use an LLM to generate a large, diverse corpus of synthetic documents. This is done by having the LLM first brainstorm document types (blogs, papers, memos), then come up with specific ideas [...]

---

Outline:

(00:32) 1. What Is SDF?

(02:03) Iterating on Universes and Generation Prompts

(03:42) 2. Getting Models to Surface the Information

(04:14) Dropping the DOCTAG

(04:56) Dropping Webtext to Increase Saliency

(06:13) Matching the Test Distribution

(07:11) Prepending Eval Prompts

(08:20) 3. Training Details

(08:24) Document Length and Token Counts

(09:05) Training for Multiple Epochs

(09:37) Running Experiments with LoRA & Tinker

(10:33) 4. Dealing with Gibberish

(12:14) 5. Evaluating Effects

(13:58) 6. Other Explorations

(17:06) Acknowledgements

---

First published:

May 26th, 2026

Source:

https://www.lesswrong.com/posts/7zGgFPLaTXJwCJccB/practical-learnings-from-synthetic-document-finetuning

---

Narrated by TYPE III AUDIO.

...more
View all episodesView all episodes
Download on the App Store

LessWrong (30+ Karma)By LessWrong


More shows like LessWrong (30+ Karma)

View all
The Daily by The New York Times

The Daily

112,330 Listeners

Astral Codex Ten Podcast by Jeremiah

Astral Codex Ten Podcast

130 Listeners

Interesting Times with Ross Douthat by New York Times Opinion

Interesting Times with Ross Douthat

7,247 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

563 Listeners

The Ezra Klein Show by New York Times Opinion

The Ezra Klein Show

16,328 Listeners

AI Article Readings by Readings of great articles in AI voices

AI Article Readings

4 Listeners

Doom Debates! by Liron Shapira

Doom Debates!

14 Listeners

LessWrong posts by zvi by zvi

LessWrong posts by zvi

2 Listeners