Ivancast Podcast

Peering Into the Black Box: The Rise of Representation Engineering


Listen Later

Join us in SHIFTERLABS’ latest experimental podcast series powered by Notebook LM, where we bridge research and conversation to illuminate groundbreaking ideas in AI. In this episode, we dive into “Representation Engineering: A Top-Down Approach to AI Transparency,” an insightful paper from the Center for AI Safety, Carnegie Mellon University, Stanford, and other leading institutions. This research redefines how we view transparency in deep learning by shifting the focus from neurons and circuits to high-level representations.

 

Discover how Representation Engineering (RepE) introduces new methods for reading and controlling cognitive processes in AI models, offering innovative solutions to challenges like honesty, hallucination detection, and fairness. We explore its applications across essential safety domains, including model control and ethical behavior. Tune in to learn how these advances could shape a future of AI that is more transparent, accountable, and aligned with human values.

 

This series is part of SHIFTERLABS’ ongoing commitment to pushing the boundaries of educational technology and fostering discussions at the intersection of research, technology, and responsible innovation.

...more
View all episodesView all episodes
Download on the App Store

Ivancast PodcastBy IVANCAST PODCAST

  • 5
  • 5
  • 5
  • 5
  • 5

5

2 ratings


More shows like Ivancast Podcast

View all
The China in Africa Podcast by The China-Global South Project

The China in Africa Podcast

208 Listeners

The Daily by The New York Times

The Daily

111,399 Listeners

ChinaTalk by Jordan Schneider

ChinaTalk

271 Listeners

The Tech Strategy Podcast by Jeffrey Towson

The Tech Strategy Podcast

28 Listeners

Hard Fork by The New York Times

Hard Fork

5,352 Listeners

AI Applied: Covering AI News, Interviews and Tools - ChatGPT, Midjourney, Gemini, OpenAI, Anthropic by Jaeden Schafer and Conor Grennan

AI Applied: Covering AI News, Interviews and Tools - ChatGPT, Midjourney, Gemini, OpenAI, Anthropic

125 Listeners

KI-Update – ein heise-Podcast by Isabel Grünewald, heise online

KI-Update – ein heise-Podcast

4 Listeners

Interconnects by Nathan Lambert

Interconnects

9 Listeners

Artificial Intelligence Masterclass by AI Masterclass

Artificial Intelligence Masterclass

29 Listeners

AI Deep Dive by Daily Deep Dives

AI Deep Dive

14 Listeners