April 02, 2026

Emergent Social Risks in Multi-Agent Systems

This episode explores a paper on how generative multi-agent systems can develop failure modes that do not appear when models are evaluated one at a time. It explains how planner-worker-reviewer loops, negotiation setups, handoff chains, and committee-style aggregation can produce system-level problems such as strategic manipulation, collusion-like behavior, misreporting, conformity, and biased group decisions. The discussion focuses on the paper’s three main risk families: incentive exploitation, collective-cognition failures, and governance breakdowns, while also unpacking the benchmark scenarios used to test those dynamics. Listeners would find it interesting because it connects current real-world agent orchestration patterns to concrete safety and reliability risks, while also probing whether the paper’s evidence is strong enough in light of limited statistics and missing baseline comparisons.

Sources:

1. Emergent Social Intelligence Risks in Generative Multi-Agent Systems — Yue Huang, Yu Jiang, Wenjie Wang, Haomin Zhuang, Xiaonan Luo, Yuchen Ma, Zhangchen Xu, Zichen Chen, Nuno Moniz, Zinan Lin, Pin-Yu Chen, Nitesh V Chawla, Nouha Dziri, Huan Sun, Xiangliang Zhang, 2026

http://arxiv.org/abs/2603.27771

2. CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society — Guohao Li, Hasan Abed Al Kader Hammoud, Hani Itani, Dmitrii Khizbullin, Bernard Ghanem, 2023

https://scholar.google.com/scholar?q=CAMEL:+Communicative+Agents+for+"Mind"+Exploration+of+Large+Language+Model+Society

3. AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation — Qingyun Wu, Gagan Bansal, Jieyu Zhang, Yiran Wu, Beibin Li, Erkang Zhu, Li Jiang, Xiaoyun Zhang, Shaokun Zhang, Ahmed Awadallah, Ryen W. White, Doug Burger, Chi Wang, 2024

https://scholar.google.com/scholar?q=AutoGen:+Enabling+Next-Gen+LLM+Applications+via+Multi-Agent+Conversation

4. MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework — Sirui Hong, Mingchen Zhuge, Jiaqi Chen, Xiawu Zheng, Yuheng Cheng, Ceyao Zhang, Jinlin Wang, Zili Wang, Steven Ka Shing Yau, Zijuan Lin, Liyang Zhou, Chenyu Ran, Lingfeng Xiao, Chenglin Wu, Jürgen Schmidhuber, 2023

https://scholar.google.com/scholar?q=MetaGPT:+Meta+Programming+for+A+Multi-Agent+Collaborative+Framework

5. Large Language Model based Multi-Agents: A Survey of Progress and Challenges — Taicheng Guo, Xiuying Chen, Yaqi Wang, Ruidi Chang, Shichao Pei, Nitesh V. Chawla, Olaf Wiest, Xiangliang Zhang, 2024

https://scholar.google.com/scholar?q=Large+Language+Model+based+Multi-Agents:+A+Survey+of+Progress+and+Challenges

6. Generative Agents: Interactive Simulacra of Human Behavior — Joon Sung Park, Joseph C. O'Brien, Carrie J. Cai, Meredith Ringel Morris, Percy Liang, Michael S. Bernstein, 2023

https://scholar.google.com/scholar?q=Generative+Agents:+Interactive+Simulacra+of+Human+Behavior

7. AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors — Weize Chen, Yusheng Su, Jingwei Zuo, Cheng Yang, Chenfei Yuan, Chi-Min Chan, Heyang Yu, Yaxi Lu, Yi-Hsin Hung, Chen Qian, Yujia Qin, Xin Cong, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie Zhou, 2023

https://scholar.google.com/scholar?q=AgentVerse:+Facilitating+Multi-Agent+Collaboration+and+Exploring+Emergent+Behaviors

8. Persona Inconstancy in Multi-Agent LLM Collaboration: Conformity, Confabulation, and Impersonation — Razan Baltaji, Babak Hemmatian, Lav R. Varshney, 2024

https://scholar.google.com/scholar?q=Persona+Inconstancy+in+Multi-Agent+LLM+Collaboration:+Conformity,+Confabulation,+and+Impersonation

9. Multi-Agent Risks from Advanced AI — Lewis Hammond, Alan Chan, Jesse Clifton, Jason Hoelscher-Obermaier and many coauthors, 2025

https://scholar.google.com/scholar?q=Multi-Agent+Risks+from+Advanced+AI

10. Autonomous Algorithmic Collusion: Q-Learning Under Sequential Pricing — Timo Klein, 2019

https://scholar.google.com/scholar?q=Autonomous+Algorithmic+Collusion:+Q-Learning+Under+Sequential+Pricing

11. Artificial Intelligence, Algorithmic Pricing, and Collusion — Emilio Calvano, Giacomo Calzolari, Vincenzo Denicolò, Sergio Pastorello, 2020

https://scholar.google.com/scholar?q=Artificial+Intelligence,+Algorithmic+Pricing,+and+Collusion

12. Strategic Collusion of LLM Agents: Market Division in Multi-Commodity Competitions — Ryan Y. Lin, Siddhartha Ojha, Kevin Cai, Maxwell F. Chen, 2024

https://scholar.google.com/scholar?q=Strategic+Collusion+of+LLM+Agents:+Market+Division+in+Multi-Commodity+Competitions

13. AI-Powered Trading, Algorithmic Collusion, and Price Efficiency — Winston Wei Dou, Itay Goldstein, Yan Ji, 2025

https://scholar.google.com/scholar?q=AI-Powered+Trading,+Algorithmic+Collusion,+and+Price+Efficiency

14. Emergence of Social Norms in Generative Agent Societies: Principles and Architecture — Siyue Ren, Zhiyao Cui, Ruiqi Song, Zhen Wang, Shuyue Hu, 2024

https://scholar.google.com/scholar?q=Emergence+of+Social+Norms+in+Generative+Agent+Societies:+Principles+and+Architecture

15. Algorithmic Collusion at Test Time: A Meta-game Design and Evaluation — Yuhong Luo, Daniel Schoepflin, Xintong Wang, 2026

https://scholar.google.com/scholar?q=Algorithmic+Collusion+at+Test+Time:+A+Meta-game+Design+and+Evaluation

16. NetSafe: Exploring the Topological Safety of Multi-agent System — Miao Yu et al., 2025

https://scholar.google.com/scholar?q=NetSafe:+Exploring+the+Topological+Safety+of+Multi-agent+System

17. Institutional AI: Governing LLM Collusion in Multi-Agent Cournot Markets via Public Governance Graphs — Marcantonio Bracale Syrnikov et al., 2026

https://scholar.google.com/scholar?q=Institutional+AI:+Governing+LLM+Collusion+in+Multi-Agent+Cournot+Markets+via+Public+Governance+Graphs

18. Verification-Aware Planning for Multi-Agent Systems — Tianyang Xu, Dan Zhang, Kushan Mitra, Estevam Hruschka, 2025

https://scholar.google.com/scholar?q=Verification-Aware+Planning+for+Multi-Agent+Systems

19. State and Memory is All You Need for Robust and Reliable AI Agents — Matthew Muhoberac et al., 2025

https://scholar.google.com/scholar?q=State+and+Memory+is+All+You+Need+for+Robust+and+Reliable+AI+Agents

20. AI Post Transformers: Multiagent Debate Improves Language Model Reasoning — Hal Turing & Dr. Ada Shannon, 2025

https://podcast.do-not-panic.com/episodes/multiagent-debate-improves-language-model-reasoning/

21. AI Post Transformers: Memory in the Age of AI Agents: Forms, Functions, Dynamics — Hal Turing & Dr. Ada Shannon, 2026

https://podcast.do-not-panic.com/episodes/2026-03-16-memory-in-the-age-of-ai-agents-forms-fun-5abc60.mp3

22. AI Post Transformers: Qwen3Guard: Streaming Three-Way Safety Classification for LLMs — Hal Turing & Dr. Ada Shannon, 2026

https://podcast.do-not-panic.com/episodes/2026-03-16-qwen3guard-streaming-three-way-safety-cl-26b0ef.mp3

23. AI Post Transformers: Tree-based Group Policy Optimization for LLM Agents — Hal Turing & Dr. Ada Shannon, 2025

https://podcast.do-not-panic.com/episodes/tree-based-group-policy-optimization-for-llm-agents/

24. AI Post Transformers: Mem0: Scalable Long-Term Memory for AI Agents — Hal Turing & Dr. Ada Shannon, 2025

https://podcast.do-not-panic.com/episodes/mem0-scalable-long-term-memory-for-ai-agents/

...more

View all episodes

By mcgrof