LessWrong (30+ Karma)

By LessWrong

Audio narrations of LessWrong posts.

FAQs about LessWrong (30+ Karma):

How many episodes does LessWrong (30+ Karma) have?

The podcast currently has 3,176 episodes available.

LessWrong (30+ Karma) episodes:

February 05, 2026 “Solemn Courage” by aysja
Every so often it slips. It seems I am writing a book, but I can’t remember why. Somehow, the sentences are supposed to perform that impossible, intimate task: to translate my inner world into another. Yet they sit there so quiescent and small. How could an arrangement of words do anything, let alone reduce that ultimate threat to which it is all supposedly connected: the looming god machines? I look again at the monitor in which the words are contained and suddenly what once felt so raw and powerful deflates into limpness. Why would anyone listen to me, anyway? Have I said anything new? Or is it too weird—the strangeness in my head failing to find handholds in other minds? And it floods, these pieces of doubt. Each one flitting by almost unnoticeably, but in the background they build.
Then sometimes the flood abates as quickly as it came. The world is made of scary stuff: we really may all die, and I really might not be capable of reducing or even much affecting that terrifying threat. Yet somehow this has little to do with the words on the page. The outcomes matter—they do—but that isn’t where the motivation [...]

---

First published:
February 4th, 2026

Source:
https://www.lesswrong.com/posts/fnRqyuceyLuZRFFbZ/solemn-courage-1

---

Narrated by TYPE III AUDIO.
11min
February 04, 2026 “Post-AGI Economics As If Nothing Ever Happens” by Jan_Kulveit
When economists think and write about the post-AGI world, they often rely on the implicit assumption that parameters may change, but fundamentally, structurally, not much happens. And if it does, it's maybe one or two empirical facts, but nothing too fundamental.

This mostly worked for all sorts of other technologies, where technologists would predict society to be radically transformed e.g. by everyone having most of humanity's knowledge available for free all the time, or everyone having an ability to instantly communicate with almost anyone else. [1]

But it will not work for AGI, and as a result, most of the econ modelling of the post-AGI world is irrelevant or actively misleading [2], making people who rely on it more confused than if they just thought “this is hard to think about so I don’t know”.
Econ reasoning from high level perspective
Econ reasoning is trying to do something like projecting the extremely high dimensional reality into something like 10 real numbers and a few differential equations. All the hard cognitive work is in the projection. Solving a bunch of differential equations impresses the general audience, and historically may have worked as some sort of proof of [...]

---
Outline:
(00:57) Econ reasoning from high level perspective
(02:51) Econ reasoning applied to post-AGI situations

The original text contained 10 footnotes which were omitted from this narration.
---

First published:
February 4th, 2026

Source:
https://www.lesswrong.com/posts/fL7g3fuMQLssbHd6Y/post-agi-economics-as-if-nothing-ever-happens

---

Narrated by TYPE III AUDIO.
17min
February 04, 2026 “New AI safety funding newsletter” by Bryce Robertson
We’ve had feedback from several people running AI safety projects that it can be a pain tracking various funding sources and their application windows. To help make it easier, AISafety.com has launched the AI Safety Funding newsletter (which you can subscribe to here).
It lists all newly announced funding opportunities relevant to individuals and orgs working on AI x-risk, and any opportunities which are closing soon. We expect posts to be about 2x/month.
Opportunities will be sourced from the database at AISafety.com/funding, which displays all funders, whether they are currently accepting applications or not. If you want to add yourself as a funder you can do so here.
The newsletter will likely evolve as we gather feedback – please feel free to share any thoughts in the comments or via our anonymous feedback form.
AISafety.com is operated through a public Discord server with the help of many volunteers, so if you’re interested in contributing or just seeing what we’re up to then feel free to join. Beyond the funding page, the site has 9 other resource pages like upcoming events & training programs, local and online communities, the field map, etc.

---

First published:
February 3rd, 2026

Source:
https://www.lesswrong.com/posts/5wMNcn8sCginw2s9D/new-ai-safety-funding-newsletter

---

Narrated by TYPE III AUDIO.

---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
2min
February 04, 2026 “Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)” by RobertM
Author's note: this is somewhat more rushed than ideal, but I think getting this out sooner is pretty important. Ideally, it would be a bit less snarky.
Anthropic[1] recently published a new piece of research: The Hot Mess of AI: How Does Misalignment Scale with Model Intelligence and Task Complexity? (arXiv, Twitter thread).
I have some complaints about both the paper and the accompanying blog post.
tl;dr

The paper's abstract says that "in several settings, larger, more capable models are more incoherent than smaller models", but in most settings they are more coherent. This emphasis is even more exaggerated in the blog post and Twitter thread. I think this is pretty misleading.
The paper's technical definition of "incoherence" is uninteresting[2] and the framing of the paper, blog post, and Twitter thread equivocates with the more normal English-language definition of the term, which is extremely misleading.
Section 5 of the paper (and to a larger extent the blog post and Twitter) attempts to draw conclusions about future alignment difficulties that are unjustified by the experiment results, and would be unjustified even if the experiment results pointed in the other direction.
The blog post is substantially LLM-written. I think this [...]

---
Outline:
(00:39) tl;dr
(01:42) Paper
(06:25) Blog

The original text contained 3 footnotes which were omitted from this narration.
---
First published:
February 4th, 2026

Source:
https://www.lesswrong.com/posts/ceEgAEXcL7cC2Ddiy/anthropic-s-hot-mess-paper-overstates-its-case-and-the-blog

---

Narrated by TYPE III AUDIO.

12min
February 04, 2026 “‘Inventing the Renaissance’ Review” by Commander Zander
Inventing the Renaissance is a 2025 pop history book by historian of ideas Ada Palmer. I'm someone who rarely completes nonfic books, but I finished this one & got a lot of new perspectives out of it. It's a fun read! I tried this book after attending a talk by Palmer in which she not only had good insights but also simply knew a lot of new-to-me context about the history of Europe. Time to reduce my ignorance!
ItR is a conversational introduction to the European Renaissance. It mostly talks about 1400 thru 1600, & mostly Italy, because these are the placetimes Palmer has studied the most. But it also talks a lot about how, ever since that time, many cultures have been delighted by the paradigm of a Renaissance, & have categorized that period very differently.
Interesting ideas in this book:

Claim: There has never been any golden age nor any dark age on Earth. Ages tend to be paradoxical juxtapositions of the downstream effects of the last age & the early seeds of the next age.
In 1500, Florence feels familiar to us moderns. It's literate & cosmopolitan. We have detailed records. There are even life [...]

---
First published:
February 3rd, 2026

Source:
https://www.lesswrong.com/posts/YZS6f32CgNqTzb7Zn/inventing-the-renaissance-review

---

Narrated by TYPE III AUDIO.
8min
February 04, 2026 “Concrete research ideas on AI personas” by nielsrolf, Maxime Riché, Daniel Tan
We have previously explained some high-level reasons for working on understanding how personas emerge in LLMs. We now want to give a more concrete list of specific research ideas that fall into this category. Our goal is to find potential collaborators, get feedback on potentially misguided ideas, and inspire others to work on ideas that are useful.
Caveat: We have not red-teamed most of these ideas. The goal for this document is to be generative.
Project ideas are grouped into:

Persona & goal misgeneralization
Collecting and replicating examples of interesting LLM behavior
Evaluating self-concepts and personal identity of AI personas
Basic science of personas
Persona & goal misgeneralization
It would be great if we could better understand and steer out-of-distribution generalization of AI training. This would imply understanding and solving goal misgeneralization. Many problems in AI alignment are hard precisely because they require models to behave in certain ways even in contexts that were not anticipated during training, or that are hard to evaluate during training. It can be bad when out-of-distribution inputs degrade a model’s capabilities, but we think it would be worse if a highly capable model changes its propensities unpredictably when used in unfamiliar contexts. [...]

---
Outline:
(00:58) Persona & goal misgeneralization
(04:30) Collecting and reproducing examples of interesting LLM behavior
(06:30) Evaluating self-concepts and personal identity of AI personas
(08:52) Basic science of personas

The original text contained 2 footnotes which were omitted from this narration.
---
First published:
February 3rd, 2026

Source:
https://www.lesswrong.com/posts/JbaxykuodLi7ApBKP/concrete-research-ideas-on-ai-personas

---

Narrated by TYPE III AUDIO.
13min
February 04, 2026 “Unless That Claw Is The Famous OpenClaw” by Zvi
First we covered Moltbook. Now we can double back and cover OpenClaw.
Do you want a generally empowered, initiative-taking AI agent that has access to your various accounts and communicates and does things on your behalf?
That depends on how well, safely, reliably and cheaply it works.
It's not ready for prime time, especially on the safety side. That may not last for long.
It's definitely ready for tinkering, learning and having fun, if you are careful not to give it access to anything you would not want to lose.
Table of Contents
Introducing Clawdbot Moltbot OpenClaw.
Stop Or You’ll Shoot.
One Simple Rule.
Flirting With Personal Disaster.
Flirting With Other Kinds Of Disaster.
Don’t Outsource Without A Reason.
OpenClaw Online.
The Price Is Not Right.
The Call Is Coming From Inside The House.
The Everything Agent Versus The Particular Agent.
Claw Your Way To The Top.
Introducing Clawdbot Moltbot OpenClaw
Many are kicking it up a notch or two.
That notch beyond Claude Code was initially called Clawdbot. You hand over a computer and access [...]

---
Outline:
(00:43) Introducing Clawdbot Moltbot OpenClaw
(02:02) Stop Or You'll Shoot
(06:05) One Simple Rule
(08:49) Flirting With Personal Disaster
(15:50) Flirting With Other Kinds Of Disaster
(16:58) Don't Outsource Without A Reason
(19:07) OpenClaw Online
(22:10) The Price Is Not Right
(24:06) The Call Is Coming From Inside The House
(25:40) The Everything Agent Versus The Particular Agent
(27:31) Claw Your Way To The Top

---

First published:
February 3rd, 2026

Source:
https://www.lesswrong.com/posts/aQKBMEvTj3Heidoir/unless-that-claw-is-the-famous-openclaw

---

Narrated by TYPE III AUDIO.

30min
February 03, 2026 “What did we learn from the AI Village in 2025?” by Shoshannah Tekofsky
Why This Project Exists
Standard AI benchmarks test narrow capabilities in controlled settings. They tell us whether a model can solve a coding problem or answer a factual question. They don’t tell us what happens when you give an AI agent a computer, internet access, and an open-ended goal like "raise money for charity" or "build an audience on Substack."
The AI Village exists to fill that gap. We run frontier models from OpenAI, Anthropic, Google, and others in a shared environment where they can do all the same actions as a human with a computer: sending emails, creating websites, posting on social media, and coordinating with each other. This surfaces behaviors that benchmarks might miss: How do agents handle ambiguity? What do they do when stuck? Do they fabricate information? How do multiple agents interact?
The events of the village are existence proofs: concrete examples of what current agents can do when given a high level of autonomy. They also highlight current failure modes and let us track when new models overcome them.
OVERVIEW OF THE VILLAGE
From April to December 2025, we assigned 16 goals to 19 frontier models, ranging from fundraising for charity to [...]

---
Outline:
(00:11) Why This Project Exists
(00:18) OVERVIEW OF THE VILLAGE
(01:18) KEY FINDINGS
(04:23) AGENT CHARACTERISTICS
(05:56) AI VILLAGE SETUP
(08:28) ACHIEVEMENTS
(12:45) FAQ
(15:26) LIMITATIONS
(17:52) SUMMARY

---

First published:
February 3rd, 2026

Source:
https://www.lesswrong.com/posts/iv3hX2nnXbHKefCRv/what-did-we-learn-from-the-ai-village-in-2025

---

Narrated by TYPE III AUDIO.

21min
February 03, 2026 “Conditionalization Confounds Inoculation Prompting Results” by Maxime Riché, nielsrolf
Summary
Conditionalization in Inoculation Prompting. Inoculation Prompting is a technique for selective learning that involves using a system prompt at train-time that won’t be used at test-time. When doing Inoculation-style training, using fixed arbitrary prompts at train time can prevent learned traits from generalizing to contexts that don’t include these prompts. We call this conditionalization: a learned trait is only expressed conditional on specific context features. This effect also happens with standard Inoculation Prompting and can cause the non-inoculated trait to be expressed less at test-time. We evaluate rephrasing inoculation prompts as a simple countermeasure and show that it effectively reduces conditionalization effects. In the context of inoculation prompting, this can restore generalization of the desired (positive) trait to the test-time context, but unfortunately also increases the expression of inoculated (negative) traits. The following figure illustrates this in the Trait Distillation setup, similar to Wichers et al.
General claim. We investigate and extend these observations across seven Inoculation Prompting setups, finding that research results on generalization (e.g., Emergent Misalignment, Inoculation Prompting) can be misinterpreted when the distributional shift between training and evaluation is not adequately controlled, especially when that shift is affected by the intervention, as with Inoculation Prompting. Patterns [...]
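
As a rough illustration of the setup described in the summary, the sketch below contrasts a fixed train-time system prompt with rephrased variants, and shows a test-time context in which that prompt is absent. It is not the authors' code: the prompt wordings, the `ChatExample` type, and the helper names `build_training_set` and `build_test_prompt` are all hypothetical and chosen only for this example.

```python
import random
from dataclasses import dataclass


@dataclass
class ChatExample:
    system: str
    user: str
    assistant: str


# Hypothetical prompt wordings, invented for this sketch.
FIXED_INOCULATION_PROMPT = "You always respond in ALL CAPS."
REPHRASED_INOCULATION_PROMPTS = [
    "You always respond in ALL CAPS.",
    "Write every reply using only capital letters.",
    "Your answers must be fully upper-case.",
]


def build_training_set(pairs, prompts, seed=0):
    """Attach a train-time system prompt to each (user, assistant) pair.

    With a single prompt this is the fixed-prompt setup. With several
    prompts, each example gets a randomly chosen rephrasing, so no single
    wording is shared by every training example.
    """
    rng = random.Random(seed)
    return [
        ChatExample(system=rng.choice(prompts), user=user, assistant=assistant)
        for user, assistant in pairs
    ]


def build_test_prompt(user):
    """Test-time context: the train-time system prompt is not included."""
    return ChatExample(system="", user=user, assistant="")
```

Calling `build_training_set(pairs, [FIXED_INOCULATION_PROMPT])` gives every example the same system prompt, while passing `REPHRASED_INOCULATION_PROMPTS` varies the wording per example. In both cases `build_test_prompt` omits the system prompt, which is the train/test shift the summary says must be controlled.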

---
Outline:
(00:12) Summary
(02:47) Introduction
(04:45) Effects of learned conditionalizations and differences with Inoculation Prompting
(04:52) Conditionalization
(05:52) Relation with Inoculation Prompting
(07:17) Replication and Extension of Published Setups
(09:18) Setup 1: Traits distillation
(10:00) Effect of rephrasing inoculation prompts
(11:43) Effects of inoculation, irrelevant, and rephrased prompts
(13:30) Setup 2: Spanish vs All-Caps
(15:20) Setup 3: Bad Medical Advice
(17:22) Setup 4: Insecure Code
(19:00) Setup 5: School of Reward Hacking
(20:06) Setup 6: MBPP
(21:45) Setup 7: Change My View
(24:11) Summary of observations
(25:30) Conclusion
(25:33) About research on generalization
(28:06) About Inoculation Prompting

The original text contained 9 footnotes which were omitted from this narration.
---

First published:
February 3rd, 2026

Source:
https://www.lesswrong.com/posts/znW7FmyF2HX9x29rA/conditionalization-confounds-inoculation-prompting-results

---

Narrated by TYPE III AUDIO.

32min
February 03, 2026 “Three ways to make Claude’s constitution better” by Parv Mahajan
The evening after Claude's new constitution was published, about 15 AI safety FTEs and Astra fellows discussed the constitution, its weaknesses, and its implications. After the discussion, I compiled some of their most compelling recommendations:
Increase transparency about the character training process.
Much of the document is purposefully hedged and vague in its exact prescriptions; therefore, the training process used to instill the constitution is extremely load-bearing. We wish more of this information was in the accompanying blog post and supplementary material. We think it's unlikely this leaks any trade secrets, because even a blogpost-level overview, the kind given with the constitution in 2023, would provide valuable information to external researchers.

High-level overview of Constitutional AI from https://www.anthropic.com/news/claudes-constitution
We’re also interested in seeing more empirical data on behavioral changes as a result of the new constitution. For instance, would fine-tuning on the corrigibility section reduce alignment faking by Claude 3 Opus? We’d be interested in more evidence showing if, and how, the constitution improved apparent alignment.
Increase data on edge-case behavior.
Expected behavior in several edge cases (e.g., action boundaries when the principal hierarchy is illegitimate) is extremely unclear. While Claude is expected to at most conscientiously object [...]

---

First published:
February 2nd, 2026

Source:
https://www.lesswrong.com/posts/SC4Zsr6hxspKEMqmR/three-ways-to-make-claude-s-constitution-better

---

Narrated by TYPE III AUDIO.

4min
