The assistant inside your workspace. XTTS + Whisper for voice; Anthropic, OpenAI, OpenRouter, or local MLX for chat. Same governance contract across modalities.
The assistant accepts speech or text and replies in kind. Voice is XTTS for output and Whisper for input, GPU-accelerated. Chat routes through whichever provider you've configured for the tenant.
A 7-tier cost ladder picks the right model for each task — T0 local Ollama, T2 Haiku, T3 Sonnet, T6 frontier. Every routing decision lands in your audit trail with the rationale, estimated cost, and fallback chain.
Tier ladder
The audit trail doesn't sample. Every prompt the assistant receives + every response it returns lands in BrainDB with token usage, model, provider, and per-call cost. Forensic by construction.
“How do I add OAuth2 to the customer portal?”
“Routed to T3 Sonnet — multi-file context required”
“1,247 input tokens · 482 output · $0.0083”
“XTTS v2 · 612ms · GPU”
“Use authlib not passport”
“Decision recorded: chose authlib over passport (rationale 18 chars)”
Inside the workspace. Inside your audit trail.
Related products