
    Public Roadmap

    Full transparency into what we've shipped and what's coming next. We build in the open.

    Shipped

    15 features

    CORS-Ready API

    Live

    Call Behest directly from your browser. No backend proxy needed.

    Auth & Tenant Isolation

    Live

    Multi-tenant authentication with per-project API keys and JWT support.

    Three-Tier Rate Limiting

    Live

    Per-IP, per-project, and per-user rate limiting, with no code required in your application.
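To make the three tiers concrete, here is a minimal sketch of how per-IP, per-project, and per-user limits can be layered. It is not Behest's implementation: the class names, the fixed-window counting strategy, and the limits are all assumptions chosen for illustration.

```python
import time
from collections import defaultdict


class FixedWindowLimiter:
    """Counts requests per key inside fixed windows of `window_s` seconds.

    Illustrative only: old windows are never pruned, so a real service
    would need eviction or a store with expiring keys.
    """

    def __init__(self, limit, window_s=60.0):
        self.limit = limit
        self.window_s = window_s
        self._counts = defaultdict(int)  # (key, window index) -> request count

    def _slot(self, key, now):
        return (key, int(now // self.window_s))

    def would_allow(self, key, now):
        return self._counts[self._slot(key, now)] < self.limit

    def record(self, key, now):
        self._counts[self._slot(key, now)] += 1


class ThreeTierLimiter:
    """Hypothetical composition: a request must pass all three tiers."""

    def __init__(self, ip_limit, project_limit, user_limit, window_s=60.0):
        self.tiers = {
            "ip": FixedWindowLimiter(ip_limit, window_s),
            "project": FixedWindowLimiter(project_limit, window_s),
            "user": FixedWindowLimiter(user_limit, window_s),
        }

    def check(self, ip, project, user, now=None):
        """Return (allowed, blocking_tier); blocking_tier is None if allowed."""
        now = time.monotonic() if now is None else now
        keys = {"ip": ip, "project": project, "user": user}
        # First pass: reject without counting if any tier is exhausted.
        for name, limiter in self.tiers.items():
            if not limiter.would_allow(keys[name], now):
                return False, name
        # Second pass: count the request against every tier.
        for name, limiter in self.tiers.items():
            limiter.record(keys[name], now)
        return True, None
```

Checking all tiers before recording anything means a rejected request does not consume quota in the tiers it would have passed.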

    PII Shield

    Live

    Automatic PII detection and protection before data reaches the LLM.

    Sentinel — Prompt Injection Defense

    Live

    Block jailbreak attempts and prompt injection attacks automatically.

    Conversation Memory

    Live

    Session-based conversation memory. Users pick up where they left off.

    System Prompt Management

    Live

    Configure your AI's personality and behavior per project.

    Token Budgets

    Live

    Automatic cost control with per-user and per-project daily limits.
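As a rough illustration of daily limits at two scopes, the sketch below tracks token spend per user and per project and refuses a request that would exceed either. The class name, method name, and limits are hypothetical and do not reflect Behest's actual API or storage.

```python
from collections import defaultdict
from datetime import date


class DailyTokenBudget:
    """Hypothetical in-memory budget with per-user and per-project daily caps."""

    def __init__(self, user_limit, project_limit):
        self.user_limit = user_limit
        self.project_limit = project_limit
        self._used = defaultdict(int)  # scope key (includes the day) -> tokens spent

    def try_spend(self, project, user, tokens, day=None):
        """Record `tokens` and return True only if both budgets have room."""
        day = day or date.today()
        user_key = ("user", project, user, day)
        project_key = ("project", project, day)
        if self._used[user_key] + tokens > self.user_limit:
            return False
        if self._used[project_key] + tokens > self.project_limit:
            return False
        # Both checks passed, so charge both scopes together.
        self._used[user_key] += tokens
        self._used[project_key] += tokens
        return True
```

Because the day is part of each key, usage starts from zero on a new date without an explicit reset step.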

    Kill Switches

    Live

    Instant emergency shutdown at global, tenant, or project level.
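One way to picture the three levels is as a lookup that checks the broadest scope first. This is a sketch under assumptions: the `KillSwitches` class and its fields are invented for illustration and say nothing about how Behest stores or propagates switch state.

```python
class KillSwitches:
    """Hypothetical switch state at global, tenant, and project scope."""

    def __init__(self):
        self.global_off = False
        self.tenants_off = set()    # tenant ids that are shut down
        self.projects_off = set()   # (tenant id, project id) pairs that are shut down

    def blocked_by(self, tenant, project):
        """Return the scope that blocks this request, or None if it may proceed."""
        if self.global_off:
            return "global"
        if tenant in self.tenants_off:
            return "tenant"
        if (tenant, project) in self.projects_off:
            return "project"
        return None
```

Checking from broadest to narrowest means a global shutdown takes precedence over any per-tenant or per-project state.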

    Full Observability Stack

    Live

    Traces, metrics, and logs — correlated automatically.

    Usage Analytics & Spend Tracking

    Live

    Know your AI spend. Per-model, per-user analytics in real time.

    Self-Hosted Deployment

    Live

    Deploy in your cloud. Your data never leaves your infrastructure.

    Smart LLM Routing

    Live

    Route requests across OpenAI, Anthropic, Gemini, Bedrock, Mistral, Cohere, and more.

    BYO LLM API Keys

    Live

    Bring your own OpenAI, Anthropic, Gemini, Bedrock, Mistral, Cohere, or OpenRouter keys.

    Usage Tiers & Token Economics

    Live

    Configure end-user tiers with per-tier rate limits and token budgets — chargeback-ready out of the box.

    Planned

    2 features

    Semantic Cache

    Coming Soon

    Faster responses and lower costs for semantically similar queries.

    Built-in RAG

    Coming Soon

    Ground your AI in your documents with built-in retrieval.

    Have a feature request?

    Let us know what you need →
