Note: what follows is a hypothetical future written in strong terms; it does not track my actual probabilities.
Throughout 2025, a huge amount of compute is spent on producing data for verifiable tasks, such as math[1] (w/ "does it compile as a proof?" being the ground truth label) and code (w/ "does it compile and pass unit tests?" being the ground truth label).
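To make the "ground truth label" concrete, here is a minimal sketch of the kind of verifier that turns a generated code sample into a binary training signal. Everything here is my own illustration, not a description of any lab's actual pipeline: `verify_code_sample`, the file layout, and the choice of pytest (which must be installed) are all assumptions.

```python
import subprocess
import sys
import tempfile
import textwrap

def verify_code_sample(solution_src: str, test_src: str, timeout_s: int = 10) -> bool:
    """Binary ground-truth label for one generated sample:
    True iff the solution compiles AND its unit tests pass.
    (Hypothetical harness; names and structure are illustrative.)"""
    with tempfile.TemporaryDirectory() as tmp:
        with open(f"{tmp}/solution.py", "w") as f:
            f.write(solution_src)
        with open(f"{tmp}/test_solution.py", "w") as f:
            f.write(test_src)
        try:
            # "Does it compile?" -- cheap syntax check before running anything.
            compile(solution_src, "solution.py", "exec")
            # "Does it pass unit tests?" -- run the tests in a subprocess.
            result = subprocess.run(
                [sys.executable, "-m", "pytest", "-q", "test_solution.py"],
                cwd=tmp, capture_output=True, timeout=timeout_s,
            )
            return result.returncode == 0
        except (SyntaxError, subprocess.TimeoutExpired):
            return False

# Example: label one generated sample; True means "keep as training data".
solution = "def add(a, b):\n    return a + b\n"
tests = textwrap.dedent("""
    from solution import add
    def test_add():
        assert add(2, 3) == 5
""")
print(verify_code_sample(solution, tests))  # True
```

The point of the binary label is that it requires no human judgment, so the same harness can cheaply label arbitrarily many samples; the math case would swap the test run for a proof checker.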
In 2026, when the next giant compute clusters w/ their GB200s are built, labs train the next, larger model over 100 days, then do some extra RL(H/AI)F and whatever else they've cooked up by then.
By mid-2026, we have a model that is very generally intelligent and superhuman at coding and math proofs.
Naively, 10x-ing research means releasing 10x the number of same-quality papers in a year; however, these new LLMs have a different skill profile, allowing different types of research and workflows.
If [...]
---
Outline:
Scale Capabilities Safely
Step 1: Hardening Defenses and More Control
Step 2: Automate Interp
Conclusion
---