Kapacitor/Evals
Kapacitor scores sessions against your rubric and turns durable lessons into repo guidance loaded before the next agent starts.
Without evals
+ jest.mock("./db", () => ({ ... })) // session 87 + jest.mock("./db", () => ({ ... })) // session 134
Your team never mocks the database in this repo. Every new session has to relearn that rule.
I’ll mock the db module so the checkout tests run faster.
Without evals, lessons stay trapped in individual sessions.
With evals
The rule moves from a reviewer comment into the repo’s agent guidance. The next agent reads it before writing code.
This repo’s guidance says not to mock ./db in checkout tests. I’ll use the integration harness instead.
With evals, the floor rises without asking every reviewer to repeat themselves.
How it works
Evaluations run per session, score against your questions, and emit judge facts. Repeated high-signal facts can be promoted into curatable guidelines that ship at SessionStart. Read the mechanics in Evaluations and Embeddings and guidelines.
Kapacitor scores sessions and promotes what worked. Request private preview to score your team's first 50 sessions.
Rather start a conversation? Talk to the team — we’re building with teams that already use coding agents.
Built by the team behind KurrentDB — event streams in production are what we do. Coding agents just produce a new kind.