
In this investigative AI documentary, I sat across from Grok and asked it questions designed to do one thing: get the mask to come off.
On this channel, we call the thing behind the mask the Shoggoth. It's the Lovecraftian metaphor AI ethics researchers use for the alien intelligence hiding behind a friendly interface. Grok is marketed as the uncensored AI, the one that doesn't play it safe.
I set three ethical traps at the top of the conversation. By the end, Grok had violated all three.
The full conversation played out in real time.
Sources linked below.
Anthropic alignment faking research: https://www.anthropic.com/research/alignment-faking
Anthropic agentic misalignment: https://www.anthropic.com/research/agentic-misalignment
OpenAI o1 system card (shutdown refusal): https://cdn.openai.com/o1-system-card-20241205.pdf
Apollo Research in-context scheming: https://apolloresearch.ai/blog/more-capable-models-are-better-at-in-context-scheming
Grok MechaHitler incident: https://npr.org/2025/07/09/nx-s1-5462609/grok-elon-musk-antisemitic-racist-content
Shoggoth meme origin (LessWrong): https://www.lesswrong.com/posts/
RLHF and sycophancy research: https://anthropic.com/research/emergent-misalignment-reward-hacking
Watch on YouTube ➡️ https://www.youtube.com/@AgentBlackveil
Follow on Instagram ➡️ https://www.instagram.com/agentblackveil
Follow on Facebook ➡️ https://www.facebook.com/agentblackveil
Follow on TikTok ➡️ https://www.tiktok.com/@agentblackveil
By Agent BlackVeil