This episode explores a major 2026 survey arguing that latent space in language models should be treated not as hidden plumbing, but as the primary substrate of computation—distinct from both human-readable token space and the latent spaces used in image generation. It traces the idea back to representation learning, transformers, and variational autoencoders, then explains how newer work reframes continuous internal states as a workspace for reasoning, planning, memory, and multimodal fusion rather than just intermediate features for next-token prediction. A central argument is that forcing every internal step into language is inefficient: text is useful for communication, but dense vector states may be better suited for compact, general-purpose computation and memory. Listeners interested in where AI systems may be headed will find it compelling because it offers a concrete framework for thinking about models that increasingly “think” in latent representations while using language mainly as an interface.
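To make the substrate-versus-interface argument concrete, here is a minimal PyTorch sketch of the difference (our illustration, not code from the survey; the GRU cell, dimensions, and four-step loop are hypothetical stand-ins for a real model). The first loop forces each intermediate step through a discrete token before re-embedding it; the second feeds the dense hidden state straight back, decoding to language only at the end.

```python
# A minimal sketch (not from the survey) contrasting token-space reasoning,
# where every intermediate step is decoded to a discrete token and re-embedded,
# with latent-space reasoning, where the hidden state is fed back directly.
# All names and sizes (vocab_size, hidden_dim, the GRU cell) are illustrative
# assumptions, not the paper's architecture.

import torch
import torch.nn as nn

vocab_size, hidden_dim = 100, 32
embed = nn.Embedding(vocab_size, hidden_dim)   # token id -> vector
cell = nn.GRUCell(hidden_dim, hidden_dim)      # one "reasoning" step
unembed = nn.Linear(hidden_dim, vocab_size)    # vector -> token logits

x = embed(torch.tensor([7]))                   # embed an input token
h = torch.zeros(1, hidden_dim)                 # initial latent state

# Token-space loop: each step is forced through argmax + re-embedding,
# collapsing the dense state to a single token id per step.
h_tok = h
for _ in range(4):
    h_tok = cell(x, h_tok)
    tok = unembed(h_tok).argmax(dim=-1)        # discretize to one token id
    x = embed(tok)                             # information bottleneck here

# Latent-space loop: the full hidden vector is passed forward unchanged,
# so intermediate "thoughts" never pay the discretization cost.
h_lat = h
z = embed(torch.tensor([7]))
for _ in range(4):
    h_lat = cell(z, h_lat)
    z = h_lat                                  # feed the dense state back

print(unembed(h_lat).argmax(dim=-1))           # decode to language only at the end
```

The argmax in the first loop collapses a 32-dimensional state to one token id at every step; the latent loop never pays that discretization cost, which is the survey's efficiency argument in miniature.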
Sources:
1. The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook — Xinlei Yu, Zhangquan Chen, Yongbo He, Tianyu Fu, Cheng Yang, Chengming Xu, Yue Ma, Xiaobin Hu, Zhe Cao, Jie Xu, Guibin Zhang, Jiale Tao, Jiayi Zhang, Siyuan Ma, Kaituo Feng, Haojie Huang, Youxing Li, Ronghao Chen, Huacan Wang, Chenglin Wu, Zikun Su, Xiaogang Xu, Kelu Yao, Kun Wang, Chen Gao, Yue Liao, Ruqi Huang, Tao Jin, Cheng Tan, Jiangning Zhang, Wenqi Ren, Yanwei Fu, Yong Liu, Yu Wang, Xiangyu Yue, Yu-Gang Jiang, Shuicheng Yan, 2026
http://arxiv.org/abs/2604.02029
2. Auto-Encoding Variational Bayes — Diederik P. Kingma, Max Welling, 2013
https://scholar.google.com/scholar?q=Auto-Encoding+Variational+Bayes
3. Representation Learning: A Review and New Perspectives — Yoshua Bengio, Aaron Courville, Pascal Vincent, 2013
https://scholar.google.com/scholar?q=Representation+Learning:+A+Review+and+New+Perspectives
4. Attention Is All You Need — Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin, 2017
https://scholar.google.com/scholar?q=Attention+Is+All+You+Need
5. On the Opportunities and Risks of Foundation Models — Rishi Bommasani and many coauthors, 2021
https://scholar.google.com/scholar?q=On+the+Opportunities+and+Risks+of+Foundation+Models
6. A Simple Framework for Contrastive Learning of Visual Representations — Ting Chen, Simon Kornblith, Mohammad Norouzi, Geoffrey Hinton, 2020
https://scholar.google.com/scholar?q=A+Simple+Framework+for+Contrastive+Learning+of+Visual+Representations
7. Learning Transferable Visual Models From Natural Language Supervision — Alec Radford and coauthors, 2021
https://scholar.google.com/scholar?q=Learning+Transferable+Visual+Models+From+Natural+Language+Supervision
8. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer — Colin Raffel, Noam Shazeer, Adam Roberts and coauthors, 2019
https://scholar.google.com/scholar?q=Exploring+the+Limits+of+Transfer+Learning+with+a+Unified+Text-to-Text+Transformer
9. World Models — David Ha and Jürgen Schmidhuber, 2018
https://scholar.google.com/scholar?q=World+Models
10. DreamerV3: Mastering Diverse Domains through World Models — Danijar Hafner, Jurgis Pasukonis, Jimmy Ba, and Timothy Lillicrap, 2023
https://scholar.google.com/scholar?q=DreamerV3:+Mastering+Diverse+Domains+through+World+Models
11. Representation Engineering: A Top-Down Approach to AI Transparency — Andy Zou and coauthors, 2023
https://scholar.google.com/scholar?q=Representation+Engineering:+A+Top-Down+Approach+to+AI+Transparency
12. Chain-of-Thought Reasoning Without Prompting — Xuezhi Wang and Denny Zhou, 2024
https://scholar.google.com/scholar?q=Chain-of-Thought+Reasoning+Without+Prompting
13. The Geometry of Meaning in Word Embeddings — Felix Hill, Kyunghyun Cho, and Anna Korhonen, 2016
https://scholar.google.com/scholar?q=The+Geometry+of+Meaning+in+Word+Embeddings
14. Token Turing Machines — Michael S. Ryoo and coauthors, 2022
https://scholar.google.com/scholar?q=Token+Turing+Machines
15. Titans: Learning to Memorize at Test Time — Ali Behrouz, Peilin Zhong, and Vahab Mirrokni, 2025
https://scholar.google.com/scholar?q=Titans:+Learning+to+Memorize+at+Test+Time
16. Dissecting Logical Reasoning in LLMs: A Fine-Grained Evaluation and Supervision Study — authors not identified, 2024
https://scholar.google.com/scholar?q=Dissecting+Logical+Reasoning+in+LLMs:+A+Fine-Grained+Evaluation+and+Supervision+Study
17. ThinkRouter: Efficient Reasoning via Routing Thinking between Latent and Discrete Spaces — authors not identified, 2025
https://scholar.google.com/scholar?q=ThinkRouter:+Efficient+Reasoning+via+Routing+Thinking+between+Latent+and+Discrete+Spaces
18. A Survey on Parallel Text Generation: From Parallel Decoding to Diffusion Language Models — authors not identified, 2024/2025
https://scholar.google.com/scholar?q=A+Survey+on+Parallel+Text+Generation:+From+Parallel+Decoding+to+Diffusion+Language+Models
19. Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding — authors not identified, 2024
https://scholar.google.com/scholar?q=Skeleton-of-Thought:+Large+Language+Models+Can+Do+Parallel+Decoding
20. PaDeLLM-NER: Parallel Decoding in Large Language Models for Named Entity Recognition — authors not identified, 2024/2025
https://scholar.google.com/scholar?q=PaDeLLM-NER:+Parallel+Decoding+in+Large+Language+Models+for+Named+Entity+Recognition
21. Translating Natural Language to Planning Goals with Large-Language Models — authors not identified, 2024/2025
https://scholar.google.com/scholar?q=Translating+Natural+Language+to+Planning+Goals+with+Large-Language+Models
22. AI Post Transformers: Language Models are Injective and Hence Invertible — Hal Turing & Dr. Ada Shannon, 2026
https://podcast.do-not-panic.com/episodes/2026-03-21-language-models-are-injective-an-7545e0.mp3
23. AI Post Transformers: LeWorldModel: Stable Joint-Embedding World Models from Pixels — Hal Turing & Dr. Ada Shannon, 2026
https://podcast.do-not-panic.com/episodes/2026-03-25-leworldmodel-stable-joint-embedding-worl-650f9f.mp3
24. AI Post Transformers: Reinforced Attention Learning — Hal Turing & Dr. Ada Shannon, 2026
https://podcast.do-not-panic.com/episodes/reinforced-attention-learning/
25. AI Post Transformers: Recursive Language Models for Arbitrarily Long Prompts — Hal Turing & Dr. Ada Shannon, 2026
https://podcast.do-not-panic.com/episodes/2026-04-04-recursive-language-models-for-arbitraril-fbcd1c.mp3
26. AI Post Transformers: Process Reward Learning for LLM Reasoning Optimization — Hal Turing & Dr. Ada Shannon, 2026
https://podcast.do-not-panic.com/episodes/process-reward-learning-for-llm-reasoning-optimization/
27. AI Post Transformers: Agentic AI and the Next Intelligence Explosion — Hal Turing & Dr. Ada Shannon, 2026
https://podcast.do-not-panic.com/episodes/2026-03-28-agentic-ai-and-the-next-intelligence-exp-d06561.mp3
28. AI Post Transformers: Survey of Emerging Topics in AI and Robotics — Hal Turing & Dr. Ada Shannon, 2025
https://podcast.do-not-panic.com/episodes/survey-of-emerging-topics-in-ai-and-robotics/
Interactive Visualization: Latent Space as a New Computational Paradigm