VoiceDock Docs

Changelog

Notable changes to the VoiceDock platform, API, and documentation.

Notable, user-facing changes to the VoiceDock platform and API, newest first. Dates reflect public availability.

June 2026

  • Usage-based, at-cost pricing. You pay for actual model usage at cost, plus a flat €0.07 per minute orchestration fee. Vertex AI Live is a flat €0.25 per minute, all-in. See Billing.
  • Bring-your-own-key (BYOK) is now optional. Assistants work out of the box on platform-provided keys; add your own provider keys only if you want to. See BYOK setup.

May 2026

  • Documentation and full API reference launched — complete guides plus an interactive REST API reference. Start at the Quickstart.
  • xAI Grok for realtime voice — low-latency realtime speech, plus a Grok text-to-speech option. See xAI Grok integration.
  • Branded end-of-call report emails — per-organization branding, an "Open in dashboard" link, and a recording hint.
  • Configurable call duration up to 30 minutes per assistant (max_duration_seconds). See Assistants.

April 2026

  • Google Gemini Live added as a realtime speech-to-speech provider, including a Vertex AI option. See Provider pricing.
  • Google Gemini text-to-speech added as a provider.
  • Email notifications for web calls.

March 2026

  • Official Node.js / TypeScript SDK released — install hmsovereign from npm. See Node SDK.
  • MCP server for the platform, hosted at mcp.hmsovereign.com. See MCP server.
  • Recording consent flow (DTMF) — callers can be asked to press 1 to consent before any processing begins. See Privacy & compliance.
  • More providers — Mistral (Voxtral) speech-to-text and Inworld text-to-speech.
  • Improved multilingual turn detection and interruption handling.

February 2026

  • More provider options — Mistral and xAI Grok as text models, Gladia and ElevenLabs Scribe as speech-to-text.
  • GDPR mode for per-assistant data-retention control. See Privacy & compliance.
  • Configurable silence timeout with a faster default.

January 2026

  • Web calls — browser-based WebRTC calls, with a public embeddable web-calls API and whitelabel support. See Web calls.
  • Call recording with signed URLs for secure access.
  • Voicemail detection and a configurable voicemail message. See Voicemail detection.
  • Autonomous silence handling — recurring prompts when a caller goes quiet. See Autonomous silence handling.
  • Outbound campaigns — campaign tracking for outbound calls. See Campaigns.
  • Assistants can speak while running a tool, with async tool results fed back into the conversation. See Custom tools.
  • Richer webhook events — deterministic end reasons, call timestamps, and phone-number and direction fields.
  • Real-time sync webhook API, replacing polling.
  • Free local voices for text-to-speech.
  • Prompt template variables such as {{ now }}.

December 2025

  • Public API foundations — assistants, calls, and phone numbers as first-class resources, with agent configuration separated from phone numbers.
  • Webhooks — assistant-request (pre-call config override), status-update, tool-calls (function calling), and end-of-call-report with full transcript. See Webhooks.
  • Live Call Control API — inject context, speak, transfer, or end a call mid-conversation.
  • Built-in call control — LLM-controlled end_call and call transfer. See Call transfers.
  • Outbound call API.
  • Post-call structured analysis. See Call analysis.
  • Bring-your-own-key (BYOK) providers and SIP trunk support. See SIP trunks.
  • xAI Grok realtime speech-to-speech provider.
  • Whitelabel support — child organizations, per-organization email domains, and branded summaries. See Whitelabel.
  • Usage-based billing in credits, at a flat €0.07 per minute.
  • Multilingual emails and call summaries.

On this page