Agentic SpendGuard¶

Audit-chain spend control for LLM agents — sub-5ms per decision, KMS-signed audit chain, deterministic policy enforcement. Built for platform-engineering teams that need compliance evidence, not just a runtime guardrail.

What it does¶

SpendGuard sits between your agent framework and the external actions the agent takes (LLM calls, tool calls, sub-agent dispatches). At each boundary, the sidecar consults a Contract DSL and the per-tenant budget ledger to decide:

✅ CONTINUE — proceed with the action
⏸ REQUIRE_APPROVAL — pause until an operator approves
🔄 DEGRADE — apply a mutation patch (e.g. force smaller model)
⏭ SKIP — non-fatal skip
⛔ STOP — terminate the run

Every decision — CONTINUE or STOP — produces exactly one immutable audit record in canonical_events. There is no path that produces an effect without a corresponding audit row (Contract §6.1 invariant).

Where it fits¶

agent step ─►  sidecar (<5ms) ─► ledger (atomic reserve / commit)
                  │                  │
                  │                  └─► audit_outbox ─► canonical_events
                  └─► contract evaluator (Stage 2 hot path)

Why it matters¶

Agent runtimes burn money in unbounded ways:

One bad prompt can drain $10K of GPT-4 quota
A retry loop on a rate-limited tool call doesn't know it's draining
Compliance teams can't tell which decision led to which spend

Existing options are vendor-specific (per-OpenAI, per-Anthropic) or prompt-only (please-don't-spend-too-much, with a 26.67% violation rate in Microsoft AGT's red-team data).

SpendGuard is the cross-runtime budget governance layer.

Status (POC, 2026-05)¶

Layer	Status
T (Trace)	✅ end-to-end
L (Ledger)	✅ Postgres SERIALIZABLE + audit_outbox
C (Contract DSL)	🟡 evaluator hot path live; CEL deferred to v2
D (Decision)	✅ 8-stage transaction, end-to-end
E (Evidence)	✅ audit_outbox → canonical_events
P (Proof)	🟡 chain durable; per-event signing strict mode deferred

8 demo modes all green: decision, invoice, agent, release, ttl_sweep, deny, agent_real_*, multi_provider_usd.

Get started¶

→ Quickstart — zero to a green DENY demo in 5 minutes → Concepts — 6-layer architecture → POC vs GA gates — what's production-ready vs not → GA hardening slices — production design, implementation, test, and review gates