Skip to content

PATTERN Cited by 5 sources

Tool-surface minimization

Tool-surface minimization is the discipline of keeping the number of tools an agent sees small, because (a) tool-calling accuracy degrades as the tool inventory grows (arXiv 2411.15399 cited by Datadog) and (b) every tool's description consumes context window budget. "Just turn every API endpoint into a tool" does not scale (Source: sources/2026-03-04-datadog-mcp-server-agent-tools).

Datadog uses three tactics together, each with distinct trade-offs:

1. Flexible tools

Rather than one tool per API endpoint, design tools whose schema covers multiple use cases. Requires careful schema design; the payoff is one well-designed tool replacing several narrow ones.

2. Toolsets

A core (default) toolset covers common workflows and loads automatically on agent connection. Opt-in toolsets expose specialized workflows when the user enables them.

  • Cost: users must anticipate what the agent will need ahead of time. Wrong forecast → agent lacks a capability at the moment it needs it.
  • Datadog's documented toolsets API is the product surface.

3. Layering (tool chaining)

A discovery tool ("how do I accomplish X?") answers capability questions and returns the name/identifier of a second-tier tool that actually does the work. The second-tier tool is not front- loaded into the description budget until the discovery tool points at it.

How the tactics compose

Tactic Savings target Cost
Flexible tools Both accuracy (fewer names) + context (fewer descriptions) Schema-design complexity
Toolsets Context (only loaded tools cost budget) Requires user forecast
Layering Context (specialized tools hidden behind discovery) +1 call latency

Decaying constraint?

Client-side features soften this pressure without removing it: Claude Code's tool search avoids loading every tool up front; Claude skills and Kiro Powers let agents load specialized knowledge on demand. How skills and MCP compose is "still an open question" per the Datadog post. Structural minimization remains the default; client features widen the budget, they don't eliminate the discipline.

Seen in

Last updated · 542 distilled / 1,571 read