DittoBench: Clocking Tool-Calling, Memory, and Speed in Any Agent Harness
A benchmark for any agentic harness that wields tools and memory — not just SWE. DittoBench clocks 10 models on whether they call the right tool, find the right memory, and how fast they do both, with a composite speed score that puts latency on the same footing as quality.
Read