April 13, 2026

Memory Intelligence Agents for Deep Research

This episode explores a 2026 paper on Memory Intelligence Agent (MIA), a deep research agent designed to move beyond simply storing and retrieving raw past trajectories. It breaks down the paper’s core idea of combining non-parametric memory—an external bank of compressed search experiences—with parametric memory in the planner, so the system can reuse past investigations more efficiently as tasks grow longer and more complex. The discussion highlights why current agent memory systems often scale poorly, becoming expensive, noisy, and cluttered, and examines MIA’s proposed Manager-Planner-Executor architecture as a way to separate memory management, planning, and tool-based execution. Listeners interested in AI agents will find it compelling for its concrete attempt to improve long-horizon research performance through memory compression, test-time self-improvement, and more structured learning.

Sources:

1. Memory Intelligence Agent — Jingyang Qiao, Weicheng Meng, Yu Cheng, Zhihang Lin, Zhizhong Zhang, Xin Tan, Jingyu Gong, Kun Shao, Yuan Xie, 2026

http://arxiv.org/abs/2604.04503

2. Generative Agents: Interactive Simulacra of Human Behavior — Joon Sung Park, Joseph O'Brien, Carrie Cai, Meredith Ringel Morris, Percy Liang, Michael S. Bernstein, 2023

https://scholar.google.com/scholar?q=Generative+Agents%3A+Interactive+Simulacra+of+Human+Behavior

3. Voyager: An Open-Ended Embodied Agent with Large Language Models — Guanzhi Wang, Yuqi Xie, Hang Yin, Zhenjie Pei, et al., 2023

https://scholar.google.com/scholar?q=Voyager%3A+An+Open-Ended+Embodied+Agent+with+Large+Language+Models

4. Reflexion: Language Agents with Verbal Reinforcement Learning — Noah Shinn, Federico Cassano, Ashwin Gopinath, et al., 2023

https://scholar.google.com/scholar?q=Reflexion%3A+Language+Agents+with+Verbal+Reinforcement+Learning

5. MemGPT: Towards LLMs as Operating Systems — Charles Packer, Vivian Fang, Sarah Wooders, Kevin Lin, et al., 2023

https://scholar.google.com/scholar?q=MemGPT%3A+Towards+LLMs+as+Operating+Systems

6. ReAct: Synergizing Reasoning and Acting in Language Models — Shunyu Yao, Jeffrey Zhao, Dian Yu, et al., 2023

https://scholar.google.com/scholar?q=ReAct%3A+Synergizing+Reasoning+and+Acting+in+Language+Models

7. Self-Refine: Iterative Refinement with Self-Feedback — Aman Madaan, Niket Tandon, Prakhar Gupta, et al., 2023

https://scholar.google.com/scholar?q=Self-Refine%3A+Iterative+Refinement+with+Self-Feedback

8. RAG: Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks — Patrick Lewis, Ethan Perez, Aleksandr Piktus, et al., 2020

https://scholar.google.com/scholar?q=RAG%3A+Retrieval-Augmented+Generation+for+Knowledge-Intensive+NLP+Tasks

9. Toolformer: Language Models Can Teach Themselves to Use Tools — Timo Schick, Jane Dwivedi-Yu, Roberto Dessì, et al., 2023

https://scholar.google.com/scholar?q=Toolformer%3A+Language+Models+Can+Teach+Themselves+to+Use+Tools

10. LongMem: Scaling Language Models with Long-Term Memory — Yinghan Wang, Yuhang Zang, et al., 2023

https://scholar.google.com/scholar?q=LongMem%3A+Scaling+Language+Models+with+Long-Term+Memory

11. A-MEM / MemoryBank-style LLM memory papers — Various 2023-2025 authors, 2023-2025

https://scholar.google.com/scholar?q=A-MEM+%2F+MemoryBank-style+LLM+memory+papers

12. Deciphering the Interplay of Parametric and Non-Parametric Memory in Retrieval-Augmented Language Models — approx. retrieval/RAG interpretability authors, 2024

https://scholar.google.com/scholar?q=Deciphering+the+Interplay+of+Parametric+and+Non-Parametric+Memory+in+Retrieval-Augmented+Language+Models

13. When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories — approx. QA/RAG evaluation authors, 2024

https://scholar.google.com/scholar?q=When+Not+to+Trust+Language+Models%3A+Investigating+Effectiveness+of+Parametric+and+Non-Parametric+Memories

14. Evo-Memory: Benchmarking LLM Agent Test-Time Learning with Self-Evolving Memory — approx. benchmark authors, 2024

https://scholar.google.com/scholar?q=Evo-Memory%3A+Benchmarking+LLM+Agent+Test-Time+Learning+with+Self-Evolving+Memory

15. Self-Improving LLM Agents at Test-Time — approx. agent self-improvement authors, 2024

https://scholar.google.com/scholar?q=Self-Improving+LLM+Agents+at+Test-Time

16. Sensi: Learn One Thing at a Time—Curriculum-Based Test-Time Learning for LLM Game Agents — approx. test-time learning / game-agent authors, 2024

https://scholar.google.com/scholar?q=Sensi%3A+Learn+One+Thing+at+a+Time%E2%80%94Curriculum-Based+Test-Time+Learning+for+LLM+Game+Agents

17. Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL — approx. multi-agent foundation model authors, 2024

https://scholar.google.com/scholar?q=Chain-of-Agents%3A+End-to-End+Agent+Foundation+Models+via+Multi-Agent+Distillation+and+Agentic+RL

18. AI Post Transformers: MEMSEARCHER: Reinforcement Learning for LLM Memory Management — Hal Turing & Dr. Ada Shannon, 2026

https://podcast.do-not-panic.com/episodes/2026-04-04-memsearcher-reinforcement-learning-for-l-e9ad84.mp3

19. AI Post Transformers: Kosmos AI Scientist for Autonomous Discovery — Hal Turing & Dr. Ada Shannon, 2026

https://podcast.do-not-panic.com/episodes/2026-04-04-kosmos-ai-scientist-for-autonomous-disco-311775.mp3

20. AI Post Transformers: MetaClaw: Just Talk and Continual Agent Adaptation — Hal Turing & Dr. Ada Shannon, 2026

https://podcast.do-not-panic.com/episodes/2026-03-31-metaclaw-meta-learning-agents-in-the-wil-ab324c.mp3

21. AI Post Transformers: Multi-Agent Tool-Integrated Policy Optimization (MATPO) — Hal Turing & Dr. Ada Shannon, 2025

https://podcast.do-not-panic.com/episodes/multi-agent-tool-integrated-policy-optimization-matpo/

22. AI Post Transformers: MATTRL: Collaborative Test-Time Reinforcement Learning for Multi-Agent Reasoning — Hal Turing & Dr. Ada Shannon, 2026

https://podcast.do-not-panic.com/episodes/mattrl-collaborative-test-time-reinforcement-learning-for-multi-agent-reasoning/

23. AI Post Transformers: DeepVerifier: Self-Evolving Research Agents via Rubric-Guided Verification — Hal Turing & Dr. Ada Shannon, 2026

https://podcast.do-not-panic.com/episodes/deepverifier-self-evolving-research-agents-via-rubric-guided-verification/

24. AI Post Transformers: Mem0: Scalable Long-Term Memory for AI Agents — Hal Turing & Dr. Ada Shannon, 2025

https://podcast.do-not-panic.com/episodes/mem0-scalable-long-term-memory-for-ai-agents/

25. AI Post Transformers: Doc-to-LoRA: Internalizing Context as LoRA — Hal Turing & Dr. Ada Shannon, 2026

https://podcast.do-not-panic.com/episodes/2026-03-29-doc-to-lora-internalizing-context-as-lor-8dd5ec.mp3

26. AI Post Transformers: DeepSeek Engram: Scaling Large Language Models via Conditional Memory Lookup — Hal Turing & Dr. Ada Shannon, 2026

https://podcast.do-not-panic.com/episodes/deepseek-engram-scaling-large-language-models-via-conditional-memory-lookup/

Interactive Visualization: Memory Intelligence Agents for Deep Research

...more

View all episodes

By mcgrof