LessWrong posts by zvi

“Regarding South Africa” by Zvi


Listen Later

The system prompt being modified by an unauthorized person in pursuit of a ham-fisted political point very important to Elon Musk once already doesn’t seem like coincidence.
It happening twice looks rather worse than that.

A Funny Thing Happened on Twitter

In addition to having seemingly banned all communication with Pliny, Grok seems to have briefly been rather eager to talk on Twitter, with zero related prompting, about whether there is white genocide in South Africa?
Tracing Woods: Golden Gate Claude returns in a new form: South Africa Grok.
Grace: This employee must still be absorbing the culture.
Garrison Lovely: “Mom, I want Golden Gate Claude back.”
“We have Golden Gate Claude at home.”
Golden Gate Claude at home:
 
Many such cases were caught on screenshots before a mass deletion event.
It doesn’t look good.

A Brief History of Similar Incidents

When Grace says ‘this employee must [...]

---

Outline:

(00:22) A Funny Thing Happened on Twitter

(05:54) A Brief History of Similar Incidents

(08:29) Speculations on What Happened

(16:23) The Jabs Continue

(17:11) Show Us Your System Prompts

(19:15) A Perfectly Reasonable Explanation For All This

(26:23) How Should We Think About This Going Forward?

---

First published:

May 16th, 2025

Source:

https://www.lesswrong.com/posts/kMH8zFoHHJvy6wH7h/regarding-south-africa

---

Narrated by TYPE III AUDIO.

---

Images from the article:

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

...more
View all episodesView all episodes
Download on the App Store

LessWrong posts by zviBy zvi