Learning GenAI via SOTA Papers

EP110: Single agents beat expensive multi agent teams


Listen Later

The paper "Rethinking the Value of Multi-Agent Work-flow: A Strong Single Agent Baseline" challenges the necessity of using multiple Large Language Model (LLM) instances for complex tasks when the agents are homogeneous.

Here is a short summary of the key points:

  • The Core Finding: The authors observe that most current multi-agent systems (MAS) are homogeneous, meaning they rely on the same base LLM and only differ in their system prompts, tools, and roles. They demonstrate that a single LLM agent can effectively simulate these homogeneous multi-agent workflows through multi-turn conversations.
  • Cost and Efficiency Advantages: Because a single agent maintains one conversation state, it can reuse the Key-Value (KV) cache across different agent interactions. This avoids the redundant processing required when maintaining separate model instances, allowing the single-agent baseline to achieve comparable performance to multi-agent frameworks while significantly reducing inference costs.
  • OneFlow Algorithm: Building on this discovery, the authors propose OneFlow, an automated algorithm that designs tailored workflows optimized for single-agent execution. OneFlow uses a dual-meta-LLM architecture—a "Creative Designer" to propose performance improvements and a "Critical Reviewer" to optimize cost-efficiency—combined with a Monte Carlo Tree Search to find the best workflow.
  • Extensive Evaluation: Across seven diverse benchmarks spanning coding, mathematics, general question answering, domain-specific reasoning, and real-world planning, both the single-agent simulation of workflows and the OneFlow algorithm matched or exceeded traditional MAS performance at a fraction of the cost.
  • Future Directions (Heterogeneous MAS): The authors note a fundamental limitation: single agents cannot simulate truly heterogeneous workflows (where agents use completely different base models) because KV caches cannot be shared across different LLMs. While their single-agent approach surprisingly matched an automatically designed heterogeneous workflow in a pilot study, they highlight that developing effective heterogeneous systems is a necessary and promising direction for future research.
...more
View all episodesView all episodes
Download on the App Store

Learning GenAI via SOTA PapersBy Yun Wu