New!
Cartesia is available now
Voximplant

Tag: ASR

Introducing OpenAI Realtime API Client

OpenAI has launched the beta of its Realtime API, which gives voice assistants speech-to-speech interaction, low latency, and realistic voices. Voximplant’s integration connects calls to OpenAI's models with minimal setup, enabling natural, human-like conversations.

What is Automatic Speech Recognition?

How many times a day do you talk to a computer? We’re not referring to the exasperated exclamation you direct at your laptop when it overheats and crashes. We want you to think about the moments you speak to a device and it actually listens.

What Is a Voice AI Orchestration Platform?

Learn how a Voice AI Orchestration Platform connects LLMs, STT/TTS, turn‑taking, and telephony (PSTN, SIP, WebRTC) to build reliable real‑time voice agents. See benefits, architecture, and how Voximplant helps.

Voximplant Kit updates. April 2025

The latest Voximplant Kit updates add chat analytics, improve call history, introduce new tools for supervisors, expand scenario capabilities, and update the softphone. Below is a brief overview of the essential enhancements.

Deepgram Voice Agent now available in Voximplant

Voximplant now includes a native Deepgram module that connects any Voximplant call to Deepgram’s Voice Agent API for real-time, speech‑to‑speech conversations. You can stream audio from phone numbers, SIP trunks, WhatsApp, or WebRTC into Deepgram’s unified agent environment—combining STT, LLM reasoning, and TTS—and play responses via Voximplant’s serverless runtime with minimal latency.

Extend Cartesia Line Agents to SIP, WhatsApp, and Global Phone Networks

Voximplant now includes a native Cartesia Line / Agents connector that connects any Voximplant call to a Cartesia Line voice agent for real-time, speech-to-speech conversations—over PSTN, SIP, WebRTC, or WhatsApp Business Calling—without building custom media gateways or WebSocket streaming infrastructure.

Cartesia Realtime TTS now available in Voximplant

Voximplant now includes a native Cartesia module for streaming, low-latency text-to-speech (TTS). With a single VoxEngine API, you can synthesize speech in real time, connect it to any call (PSTN, SIP, WebRTC, WhatsApp), and control playback from a large language model (LLM) or another source.