For Acme Corp (DEMO) · Period 2026-04-01 to 2026-04-30
acme-pilot saved $63,000 in April 2026.
Direct cost reduction retained by acme-pilot, net of the Performance Fee.
Performance Fee, debited from prepaid balance against measured savings per §4-§5 of the Tessera Terms of Service.
Spend trend
Joint baseline (gray) vs actual paid cost (olive) over the last six readings
The gap between bars is the measured Ongoing Savings.
Per-workload breakdown
5 workloads this period · indicator reflects reduction versus its anchored baseline
| Status | Workload | Provider · model | Requests | Baseline | Actual | Saved | Reduction |
|---|---|---|---|---|---|---|---|
Optimized | product-search-summarize | OpenAI · gpt-4o-2024-08-06 | 280,000 | 42.0k | 24.0k | 18.0k | 42.9% |
Optimized | support-classifier | Anthropic · claude-opus-4-7-2026 | 45,000 | 58.0k | 32.0k | 26.0k | 44.8% |
Partial | doc-extraction | Google · gemini-2.5-pro | 95,000 | 50.0k | 38.0k | 12.0k | 24.0% |
Partial | realtime-coding-assist | Anthropic · claude-sonnet-4-x | 120,000 | 32.0k | 25.0k | $7,000 | 21.9% |
Excluded | code-review-bot | · deepseek-chatno ratified anchor covers this period | 28,000 | $0 | 14.0k | $0 | 0.0% |
What to fix next
Open optimization opportunities ranked by projected monthly savings · plus recent implementations for accountability
Support classifier evaluation set (200 golden examples) shows Sonnet-4.x preserves 96% accuracy on the 90% of cases where the intent is unambiguous. RouteLLM router handles the 10% edge cases on Opus. Implemented Mar 2026.
Promptfoo eval shows gpt-4o-mini matches summarization quality on 92% of test set. Route by complexity heuristic.
System prompt 2.4k tokens stable across 87% of calls. Enable Anthropic prompt cache (5-min TTL) → ~50% input cost reduction.
Daily 450 reviews are non-realtime (overnight digest). Batch API = 50% cost reduction with same model.
Implemented 2026-04-16 · counted toward this period's measured savings
Document extraction workload (Gemini-2.5-Pro · 95k requests/month) currently runs synchronously. Workload is overnight-tolerant per existing SLA. Google batch API runs at 50% list price. One Python service to wrap the existing pipeline; switching back is reversible within hours.
Support classifier system prompt + intent taxonomy is 1,840 tokens, identical across all 45k monthly requests. Prompt caching not configured — current hit rate from random page-cache only. Adding cache_control: { type: ephemeral } on the static prefix lifts hit rate to ~85%, dropping prefix cost by 9x.
Document extraction input averages 1,800 tokens with substantial boilerplate and structural padding. LLMLingua-2 (Microsoft) compresses 30-40% with under 2% measured quality loss on extraction tasks per their published benchmarks. Inference layer added before existing extraction call.
Realtime coding assist currently routes 78% of requests to Sonnet, 22% to Opus based on rough heuristic. Re-running RouteLLM scoring against your golden set suggests 92% safely route to Sonnet with no measurable quality drop on diff acceptance rate.
Savings trajectory
Measured savings delivered each closing period
Methodology in four points
- iOngoing Savings = Joint Baseline cost (anchor blended cost × actual request volume) − actual paid cost, summed across in-scope workloads.
- iiWorkloads without a ratified anchor covering the period are excluded from savings — their actual cost is shown for transparency only.
- iiiProvider price moves, unrelated workload shrinkage, and seasonal volume effects are excluded per §6 of the Tessera Terms of Service.
- ivIf period savings are zero or negative, Performance Fee is zero (drift floor, §3.7); the Monitoring Fee remains due.
Audit trail · countersignature
Acknowledgement of this Reading constitutes acceptance of the Ongoing Savings figure and the Performance Fee derived from it. Disputes are governed by §18 of the Tessera Terms of Service — fifteen calendar days from issuance, in writing, with disputed portion withheld from balance debit and undisputed portion debited normally.
Fintechagency OÜ d.b.a. Tessera
Name · title · date
Acme Corp (DEMO)
Name · title · date