March 05, 2026

EP110: Single agents beat expensive multi agent teams

20 minutes

The paper "Rethinking the Value of Multi-Agent Work-flow: A Strong Single Agent Baseline" challenges the necessity of using multiple Large Language Model (LLM) instances for complex tasks when the agents are homogeneous.

Here is a short summary of the key points:

The Core Finding: The authors observe that most current multi-agent systems (MAS) are homogeneous, meaning they rely on the same base LLM and only differ in their system prompts, tools, and roles. They demonstrate that a single LLM agent can effectively simulate these homogeneous multi-agent workflows through multi-turn conversations.
Cost and Efficiency Advantages: Because a single agent maintains one conversation state, it can reuse the Key-Value (KV) cache across different agent interactions. This avoids the redundant processing required when maintaining separate model instances, allowing the single-agent baseline to achieve comparable performance to multi-agent frameworks while significantly reducing inference costs.
OneFlow Algorithm: Building on this discovery, the authors propose OneFlow, an automated algorithm that designs tailored workflows optimized for single-agent execution. OneFlow uses a dual-meta-LLM architecture—a "Creative Designer" to propose performance improvements and a "Critical Reviewer" to optimize cost-efficiency—combined with a Monte Carlo Tree Search to find the best workflow.
Extensive Evaluation: Across seven diverse benchmarks spanning coding, mathematics, general question answering, domain-specific reasoning, and real-world planning, both the single-agent simulation of workflows and the OneFlow algorithm matched or exceeded traditional MAS performance at a fraction of the cost.
Future Directions (Heterogeneous MAS): The authors note a fundamental limitation: single agents cannot simulate truly heterogeneous workflows (where agents use completely different base models) because KV caches cannot be shared across different LLMs. While their single-agent approach surprisingly matched an automatically designed heterogeneous workflow in a pilot study, they highlight that developing effective heterogeneous systems is a necessary and promising direction for future research.

...more

View all episodes

By Yun Wu

March 05, 2026

EP110: Single agents beat expensive multi agent teams

20 minutes

Here is a short summary of the key points:

The Core Finding: The authors observe that most current multi-agent systems (MAS) are homogeneous, meaning they rely on the same base LLM and only differ in their system prompts, tools, and roles. They demonstrate that a single LLM agent can effectively simulate these homogeneous multi-agent workflows through multi-turn conversations.
Cost and Efficiency Advantages: Because a single agent maintains one conversation state, it can reuse the Key-Value (KV) cache across different agent interactions. This avoids the redundant processing required when maintaining separate model instances, allowing the single-agent baseline to achieve comparable performance to multi-agent frameworks while significantly reducing inference costs.
OneFlow Algorithm: Building on this discovery, the authors propose OneFlow, an automated algorithm that designs tailored workflows optimized for single-agent execution. OneFlow uses a dual-meta-LLM architecture—a "Creative Designer" to propose performance improvements and a "Critical Reviewer" to optimize cost-efficiency—combined with a Monte Carlo Tree Search to find the best workflow.
Extensive Evaluation: Across seven diverse benchmarks spanning coding, mathematics, general question answering, domain-specific reasoning, and real-world planning, both the single-agent simulation of workflows and the OneFlow algorithm matched or exceeded traditional MAS performance at a fraction of the cost.
Future Directions (Heterogeneous MAS): The authors note a fundamental limitation: single agents cannot simulate truly heterogeneous workflows (where agents use completely different base models) because KV caches cannot be shared across different LLMs. While their single-agent approach surprisingly matched an automatically designed heterogeneous workflow in a pilot study, they highlight that developing effective heterogeneous systems is a necessary and promising direction for future research.

...more

Share EP110: Single agents beat expensive multi agent teams

Sign up to save your podcasts

EP110: Single agents beat expensive multi agent teams

EP110: Single agents beat expensive multi agent teams