
Sign up to save your podcasts
Or


"Position: Agent Should Invoke External Tools ONLY When Epistemically Necessary" proposes a fundamental shift in how large language models (LLMs) and autonomous agents are designed, trained, and evaluated. Instead of treating tool use simply as a convenient shortcut to maximize task success or reward, the authors argue that agents should only interact with the external world when it is "epistemically necessary"—meaning the agent cannot reliably resolve its uncertainty and complete the task using internal reasoning alone.
To formalize this, the paper introduces the Theory of Agent (ToA), which frames agent behavior as sequential decision-making governed by epistemic constraints. The summary of its core concepts includes:
The authors highlight that current agent frameworks often suffer from failure modes because they lack principled rules for allocating this effort. Overestimating internal solvability leads to "overthinking" and hallucinations, as the agent relies on internal reasoning when it lacks the required knowledge. Conversely, underestimating internal solvability leads to "overacting" or unnecessary delegation. Crucially, the paper warns that relying on external tools when internal reasoning is sufficient acts as a "reward shortcut"; it not only introduces computational inefficiencies but actively stagnates the development of the agent's internal intelligence by bypassing opportunities for knowledge consolidation.
Ultimately, the paper argues that true agent alignment goes beyond just generating correct answers; it requires effort-consistent decision making. Agents must develop calibrated "meta-cognition" to accurately assess their own capabilities and invoke tools only when justified. To achieve this, the authors suggest new training paradigms, such as next-tool prediction during pretraining and agentic reinforcement learning that penalizes unnecessary delegation.
By Yun Wu"Position: Agent Should Invoke External Tools ONLY When Epistemically Necessary" proposes a fundamental shift in how large language models (LLMs) and autonomous agents are designed, trained, and evaluated. Instead of treating tool use simply as a convenient shortcut to maximize task success or reward, the authors argue that agents should only interact with the external world when it is "epistemically necessary"—meaning the agent cannot reliably resolve its uncertainty and complete the task using internal reasoning alone.
To formalize this, the paper introduces the Theory of Agent (ToA), which frames agent behavior as sequential decision-making governed by epistemic constraints. The summary of its core concepts includes:
The authors highlight that current agent frameworks often suffer from failure modes because they lack principled rules for allocating this effort. Overestimating internal solvability leads to "overthinking" and hallucinations, as the agent relies on internal reasoning when it lacks the required knowledge. Conversely, underestimating internal solvability leads to "overacting" or unnecessary delegation. Crucially, the paper warns that relying on external tools when internal reasoning is sufficient acts as a "reward shortcut"; it not only introduces computational inefficiencies but actively stagnates the development of the agent's internal intelligence by bypassing opportunities for knowledge consolidation.
Ultimately, the paper argues that true agent alignment goes beyond just generating correct answers; it requires effort-consistent decision making. Agents must develop calibrated "meta-cognition" to accurately assess their own capabilities and invoke tools only when justified. To achieve this, the authors suggest new training paradigms, such as next-tool prediction during pretraining and agentic reinforcement learning that penalizes unnecessary delegation.