Feature Matrix
34 of 34 features live in production · grouped by capability domain
Feature Matrix section. Each entry maps to a real production capability with an env flag. Where a kill switch exists, it is named explicitly — operations can pause a feature surface without a code change.Streaming RAG (/v1/run, SSE)
Stateful LangGraph agent with progressive token streaming, request_id tracing, and reconnect semantics.
Quality guardrails + SSE quality events
Runtime quality gates emit SSE events; release gates enforce override workflow.
TTS audio output (OpenAI)
Optional /audio.ready SSE events with deterministic fake-TTS path for tests.
PostgreSQL + pgvector (local)
Cosine retrieval with similarity scoring, deterministic ordering, and embedding-dimension invariants.
AWS Bedrock Knowledge Bases
Retrieval routed to Bedrock KB per corpus config — region + KB id selectable per tenant.
GCP Vertex AI Search
Discovery Engine datastores wired through the same router; mock-tested without live creds.
API key auth + RBAC
Hashed key storage; reader/editor/admin role gates on every protected endpoint.
ABAC policy engine
Priority-aware deny-first / allow policies with simulation API and DSL conditions.
Document ACLs
Per-document grants (read / write / admin) with creator-owner default and expiring grants.
Enterprise SSO (OIDC)
PKCE flow, state/nonce replay protection, JIT user provisioning, multi-tenant IdP registry.
SCIM 2.0 provisioning
Token-authenticated SCIM endpoints with audit events and tenant-scoped user lifecycle.
Redis token-bucket rate limiting
Per-key + per-tenant dual enforcement with stable 429 schema and retry hints.
Idempotency keys (write endpoints)
Conflict detection on write paths; write-ahead Idempotency-Key contract.
Async document ingestion (ARQ)
Redis-backed ARQ worker; 202 Accepted with status tracking and failure reasons.
Circuit breakers (external calls)
Threshold-based breakers around LLM/retrieval providers with bulkheads and retries.
Kill switches (per feature)
Per-feature kill switches; rollout/canary controls; maintenance gates above RBAC.
Cost governance + chargeback
Pricing catalog, tenant budgets with warn/block/degrade, spend analytics + chargeback reports.
SLA engine + load shedding
Tenant-scoped SLA policies with warn/degrade/shed enforcement on /v1/run and ingestion.
Autoscaling recommendations
Adaptive autoscaling profiles + recommendation actions feeding the operability evaluator.
Envelope encryption (AES-256-GCM)
Tenant key registry + encrypted blob store; pluggable KMS error contracts.
Key rotation + KMS
Resumable re-encryption jobs with telemetry; admin keyring lifecycle endpoints.
Encrypted + signed backups
Signed manifests, retention pruning, drill reporting; DR readiness ops endpoints.
Multi-region failover
Region status, readiness arbitration, token-gated promotion + write-freeze.
SOC 2 compliance automation
SOC 2 control catalog + continuous evaluation engine + scheduling tasks.
Evidence bundle generation
Persisted evidence under var/evidence with signing, verification, and retention pruning.
DSAR / data governance
Export/delete/anonymize APIs, retention pipeline with legal hold supersession.
Audit log (tamper-evident)
Central audit service across auth, mutation, and ops paths with metadata redaction.
Prometheus metrics (/v1/metrics)
Standard Prom format with workload counters, latency histograms, and queue depths.
Operability alerts + incidents
Alert rules registry, deterministic evaluation APIs, incident automation lifecycle.
Operability evaluator worker
Background evaluator with distributed locking + heartbeat reporting and ops summaries.
Notification delivery (ARQ + DLQ)
ARQ-backed worker with HMAC signatures, dedupe, DLQ persistence, and admin replay.
Python + TypeScript SDKs
Generated from OpenAPI; `make sdk-generate` regenerates with examples.
BFF endpoints (/v1/ui/*)
Bootstrap, dashboard, documents, activity, actions — built for the UI without exposing internals.
Versioned API + envelope contracts
/v1 routes with success/error envelopes, deprecation headers on legacy aliases (sunset 2026-05-10).