May 27, 2026

“Practical Learnings from Synthetic Document Finetuning” by Axel Højmark, Jérémy Scheurer

17 minutes

We've been using Synthetic Document Finetuning (SDF) quite a bit at Apollo Research lately. This post covers a few tweaks to the standard SDF recipe specific to our use cases, plus some general tips and tricks for getting good results. We’re sharing these notes in case they’re useful to others doing research with SDF.

1. What Is SDF?

Synthetic Document Finetuning (SDF) is a knowledge editing technique where models are finetuned on LLM-generated documents consistent with a target fact or belief. As described in Slocum et al. (2025), SDF "often succeeds at implanting beliefs that behave similarly to genuine knowledge." These implanted beliefs can generalize to related contexts, are often robust to scrutiny, and form internal representations similar to genuine knowledge.

We mostly followed the pipeline described in Slocum et al. (2025) and the safety-research/false-facts repository.

The pipeline has several stages:

Universe Context: Define a "universe" description where the target belief is true.
Fact Extraction: Extract discrete claims from that universe that the synthetic documents will revolve around.
Generation: Use an LLM to generate a large, diverse corpus of synthetic documents. This is done by having the LLM first brainstorm document types (blogs, papers, memos), then come up with specific ideas [...]

---

Outline:

(00:32) 1. What Is SDF?

(02:03) Iterating on Universes and Generation Prompts

(03:42) 2. Getting Models to Surface the Information

(04:14) Dropping the DOCTAG

(04:56) Dropping Webtext to Increase Saliency

(06:13) Matching the Test Distribution

(07:11) Prepending Eval Prompts

(08:20) 3. Training Details

(08:24) Document Length and Token Counts

(09:05) Training for Multiple Epochs

(09:37) Running Experiments with LoRA & Tinker

(10:33) 4. Dealing with Gibberish

(12:14) 5. Evaluating Effects

(13:58) 6. Other Explorations

(17:06) Acknowledgements

---

First published:

May 26th, 2026

Source:

https://www.lesswrong.com/posts/7zGgFPLaTXJwCJccB/practical-learnings-from-synthetic-document-finetuning

---

Narrated by TYPE III AUDIO.

...more

View all episodes

By LessWrong

May 27, 2026

“Practical Learnings from Synthetic Document Finetuning” by Axel Højmark, Jérémy Scheurer

17 minutes

1. What Is SDF?

We mostly followed the pipeline described in Slocum et al. (2025) and the safety-research/false-facts repository.

The pipeline has several stages:

Universe Context: Define a "universe" description where the target belief is true.
Fact Extraction: Extract discrete claims from that universe that the synthetic documents will revolve around.
Generation: Use an LLM to generate a large, diverse corpus of synthetic documents. This is done by having the LLM first brainstorm document types (blogs, papers, memos), then come up with specific ideas [...]

---

Outline:

(00:32) 1. What Is SDF?

(02:03) Iterating on Universes and Generation Prompts

(03:42) 2. Getting Models to Surface the Information

(04:14) Dropping the DOCTAG

(04:56) Dropping Webtext to Increase Saliency

(06:13) Matching the Test Distribution

(07:11) Prepending Eval Prompts

(08:20) 3. Training Details

(08:24) Document Length and Token Counts

(09:05) Training for Multiple Epochs

(09:37) Running Experiments with LoRA & Tinker

(10:33) 4. Dealing with Gibberish

(12:14) 5. Evaluating Effects

(13:58) 6. Other Explorations

(17:06) Acknowledgements

---

First published:

May 26th, 2026

Source:

https://www.lesswrong.com/posts/7zGgFPLaTXJwCJccB/practical-learnings-from-synthetic-document-finetuning

---

Narrated by TYPE III AUDIO.

...more

More shows like LessWrong (30+ Karma)

View all

The Daily

112,330 Listeners

Astral Codex Ten Podcast

130 Listeners

Interesting Times with Ross Douthat

7,247 Listeners

Dwarkesh Podcast

563 Listeners

The Ezra Klein Show

16,328 Listeners

AI Article Readings

4 Listeners

Doom Debates!

14 Listeners

LessWrong posts by zvi

2 Listeners

Share “Practical Learnings from Synthetic Document Finetuning” by Axel Højmark, Jérémy Scheurer

Sign up to save your podcasts

“Practical Learnings from Synthetic Document Finetuning” by Axel Højmark, Jérémy Scheurer

“Practical Learnings from Synthetic Document Finetuning” by Axel Højmark, Jérémy Scheurer

More shows like LessWrong (30+ Karma)

The Daily

Astral Codex Ten Podcast

Interesting Times with Ross Douthat

Dwarkesh Podcast

The Ezra Klein Show

AI Article Readings

Doom Debates!

LessWrong posts by zvi