Changelog
Notable changes to the VoiceDock platform, API, and documentation.
Notable, user-facing changes to the VoiceDock platform and API, newest first. Dates reflect public availability.
June 2026
- Usage-based, at-cost pricing. You pay for actual model usage at cost, plus a flat €0.07 per minute orchestration fee. Vertex AI Live is a flat €0.25 per minute, all-in. See Billing.
- Bring-your-own-key (BYOK) is now optional. Assistants work out of the box on platform-provided keys; add your own provider keys only if you want to. See BYOK setup.
May 2026
- Documentation and full API reference launched — complete guides plus an interactive REST API reference. Start at the Quickstart.
- xAI Grok for realtime voice — low-latency realtime speech, plus a Grok text-to-speech option. See xAI Grok integration.
- Branded end-of-call report emails — per-organization branding, an "Open in dashboard" link, and a recording hint.
- Configurable call duration up to 30 minutes per assistant (
max_duration_seconds). See Assistants.
April 2026
- Google Gemini Live added as a realtime speech-to-speech provider, including a Vertex AI option. See Provider pricing.
- Google Gemini text-to-speech added as a provider.
- Email notifications for web calls.
March 2026
- Official Node.js / TypeScript SDK released — install
hmsovereignfrom npm. See Node SDK. - MCP server for the platform, hosted at
mcp.hmsovereign.com. See MCP server. - Recording consent flow (DTMF) — callers can be asked to press 1 to consent before any processing begins. See Privacy & compliance.
- More providers — Mistral (Voxtral) speech-to-text and Inworld text-to-speech.
- Improved multilingual turn detection and interruption handling.
February 2026
- More provider options — Mistral and xAI Grok as text models, Gladia and ElevenLabs Scribe as speech-to-text.
- GDPR mode for per-assistant data-retention control. See Privacy & compliance.
- Configurable silence timeout with a faster default.
January 2026
- Web calls — browser-based WebRTC calls, with a public embeddable web-calls API and whitelabel support. See Web calls.
- Call recording with signed URLs for secure access.
- Voicemail detection and a configurable voicemail message. See Voicemail detection.
- Autonomous silence handling — recurring prompts when a caller goes quiet. See Autonomous silence handling.
- Outbound campaigns — campaign tracking for outbound calls. See Campaigns.
- Assistants can speak while running a tool, with async tool results fed back into the conversation. See Custom tools.
- Richer webhook events — deterministic end reasons, call timestamps, and phone-number and direction fields.
- Real-time sync webhook API, replacing polling.
- Free local voices for text-to-speech.
- Prompt template variables such as
{{ now }}.
December 2025
- Public API foundations — assistants, calls, and phone numbers as first-class resources, with agent configuration separated from phone numbers.
- Webhooks — assistant-request (pre-call config override), status-update, tool-calls (function calling), and end-of-call-report with full transcript. See Webhooks.
- Live Call Control API — inject context, speak, transfer, or end a call mid-conversation.
- Built-in call control — LLM-controlled
end_calland call transfer. See Call transfers. - Outbound call API.
- Post-call structured analysis. See Call analysis.
- Bring-your-own-key (BYOK) providers and SIP trunk support. See SIP trunks.
- xAI Grok realtime speech-to-speech provider.
- Whitelabel support — child organizations, per-organization email domains, and branded summaries. See Whitelabel.
- Usage-based billing in credits, at a flat €0.07 per minute.
- Multilingual emails and call summaries.