How good is the memory, really?

We measure it. These are the latest scores from the Coppermind retrieval benchmark, measured 2026-06-25.

100%recall@10 — the right memory was in the top 10 results

68%nDCG@10 — how close to the top it ranked

58%MRR — how high the first right answer appeared

What recall@10 means for a CMO

When you ask Coppermind about something you stored — a budget decision, a stakeholder preference, what you promised on the last call — recall@10 measures how often the right memory shows up in the top ten results. A score of 100% means that on this benchmark, the memory you needed was there 100% of the time.

What the benchmark covers

The Coppermind retrieval benchmark is our internal, 9-query retrieval benchmark run on sanitized fixture data (no customer data is ever used). It covers exact-keyword lookups, semantic paraphrases (asking in different words than you stored), and knowledge updates — the hard case where a fact changed and the stale version must not win.

Honest fine print

These are measured scores on a named internal benchmark at a point in time — not a universal guarantee. We re-run the benchmark as the retrieval pipeline evolves, and this page is regenerated from the committed baseline on every build, so the number you see is the number we measured.

Benchmark: Coppermind retrieval benchmark · 9 queries · measured 2026-06-25 · build 15d9ea33. Questions? support@coppermind.app