
Enhanced speech recognition model is now available
62% Word Error Rate (WER) improvement for US English

62% Word Error Rate (WER) improvement for US English

Following Google’s release of new Speech API, we are happy to announce improved quality of call records transcription.

OpenAI has recently announced GA version of their Realtime API that Voximplant now fully supports

Check out the latest useful Voximplant Kit updates — we developed chat analytics, improved call history, added new tools for supervisors, expanded scenario capabilities, and updated the softphone. Below is a brief overview of the essential enhancements.

Connect any Voximplant call to ElevenLabs Conversational AI agents

Learn how a Voice AI Orchestration Platform connects LLMs, STT/TTS, turn‑taking, and telephony (PSTN, SIP, WebRTC) to build reliable real‑time voice agents. See benefits, architecture, and how Voximplant helps.

New integrations for Voice AI have arrived: Google's Gemini 2.0 Flash model, featuring seamless voice-to-voice conversation capabilities and ElevenLabs low-latency streaming speech synthesis are now available for Voximplant developers

Voximplant now includes a native Cartesia Line / Agents connector that connects any Voximplant call to a Cartesia Line voice agent for real-time, speech-to-speech conversations—over PSTN, SIP, WebRTC, or WhatsApp Business Calling—without building custom media gateways or WebSocket streaming infrastructure.

Today Ultravox announced they are directly integrating Voximplant into their platform to provide SIP capabilities. The integration builds on Voximplant’s deep telephony and Voice AI tooling

Voximplant has new realtime speech generation for voice AI from Inworld, our latest Voice AI text-to-speech (TTS) partner. Together, we combine state-of-the-art TTS with carrier-grade connectivity so you can build voice agents that sound like your brand, not a generic robot.