Cut your Claude Code costs by 74% on monorepo exploration
The Distillery sits between your AI coding tool and the Anthropic API, where it deduplicates repeated context and distils large tool outputs without changing your workflow and without your context leaving your machine. Works with Claude Code, Cline, Goose, Aider, Zed, and any OpenAI-compatible tool.
Zero-config. No prompt rewriting. No workflow changes.
Early access — we’ll email you when The Distillery opens up. No spam.

Reductions measured by scripts/benchmark.ts on claude-sonnet-4-5. Run it yourself: npx tsx scripts/benchmark.ts. See methodology.
Three steps. One terminal.
Install, point one environment variable at the proxy, keep working. Click through each step to see exactly what you run.
Everything you need to stop overpaying.
Dedupe before the request, measure after the session, enforce across the team, pay only when it works.
Stop paying for context Claude already saw.
The proxy fingerprints every context block going out over the wire. Repeats get referenced, not re-sent, removing bytes from the request body before Anthropic bills them.
Know what each session cost.
thedistillery stats in your terminal: tokens, savings, cost per session.
Enforce distillation across the team.
Commit one file. Teammates inherit your savings automatically.
Only charged when it delivers.
15% of verified savings above €20/mo. Zero fee if we save you nothing.
A hook reports your bill. A proxy changes it.
Shell hooks fire after the session ends and only see final token totals. The Distillery intercepts every request in-flight, before Anthropic bills them. Other proxies take a different route — see this comparison for how The Distillery differs from routing-based approaches.
You’re going to run Claude Code tomorrow. One choice, Two bills.
Keep paying for duplicate context.
Your sessions re-send the same files, the same tool outputs, the same context, turn after turn. Anthropic bills every byte. You never see it itemized.
Let the proxy delete the waste.
Distillery sits between your CLI and Anthropic, dedupes repeated context, and distils tool outputs before the bytes ever leave. Same agent. Smaller invoice.
How The Distillery compares: vs Claude Code Router · vs Compresr · vs Headroom · vs Edgee · vs ClawProxy