They disagreed. Instead, we got this (here's the announcement, in which Sam Altman says ‘they thought it would be fun’ to go from one frontier model to their next frontier model; yeah, that's what I'm feeling, fun):
Greg Brockman (President of OpenAI): o3, our latest reasoning model, is a breakthrough, with a step function improvement on our most challenging benchmarks. We are starting safety testing and red teaming now.
---
Outline:
(03:48) GPQA Has Fallen
(04:21) Codeforces Has Fallen
(05:32) Arc Has Kinda Fallen But For Now Only Kinda
(09:27) They Trained on the Train Set
(15:26) AIME Has Fallen
(15:58) Frontier of Frontier Math Shifting Rapidly
(19:09) FrontierMath 4: We're Going To Need a Bigger Benchmark
(23:10) What is o3 Under the Hood?
(25:17) Not So Fast!
(28:38) Deep Thought
(30:03) Our Price Cheap
(36:32) Has Software Engineering Fallen?
(37:42) Don't Quit Your Day Job
(40:48) Master of Your Domain
(43:21) Safety Third
(47:56) The Safety Testing Program
(48:58) Safety testing in the reasoning era
(51:01) How to apply
(53:07) What Could Possibly Go Wrong?
(56:36) What Could Possibly Go Right?
(57:06) Send in the Skeptic
(59:25) This is Almost Certainly Not AGI
(01:02:57) Does This Mean the Future is Open Models?
(01:07:17) Not Priced In
(01:08:39) Our Media is Failing Us
(01:14:56) Not Covered Here: Deliberative Alignment
(01:15:08) The Lighter Side
---