Realtime client
Ultravox is a multimodal LLM that understands both text and human speech without a separate automatic speech recognition (ASR) stage. It converts audio directly into the LLM's high-dimensional space, so it responds faster than systems that chain separate ASR and LLM components.
You can integrate the Ultravox WebSocket API client into your Voximplant application via the Ultravox VoxEngine module. The module provides a client interface for communicating with Ultravox models in real time and producing both audio and text transcriptions.
Usage
- Require the Modules.Ultravox module in your VoxEngine scenario.
- Create an Ultravox.WebSocketAPIClient instance (via the Ultravox.createWebSocketAPIClient method) and provide your parameters.
- Send media between Call and Ultravox.WebSocketAPIClient (via the *.sendMedia or VoxEngine.sendMediaBetween methods).
- Listen to the Ultravox.WebSocketAPIClient events (via the Ultravox.WebSocketAPIEvents event list) and implement your application business logic.
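The steps above can be sketched roughly as follows. This is a minimal illustration, not the module's reference scenario: the helper name bridgeCallToUltravox, the shape of clientParameters, the onTranscript callback, and the Transcript member of Ultravox.WebSocketAPIEvents are all assumptions made for the example. The platform objects are passed in as arguments so that the flow can be read and exercised outside VoxEngine.

```javascript
// Sketch of the usage steps. Step 1, require(Modules.Ultravox), belongs at the
// top of the real scenario and is shown only in the comment at the end.
// ASSUMPTIONS: the contents of `clientParameters`, the `Transcript` event name,
// and the `onTranscript` callback are illustrative, not confirmed API details.
async function bridgeCallToUltravox({ call, Ultravox, VoxEngine, clientParameters, onTranscript }) {
  // Step 2: create the WebSocket API client with your parameters.
  // `await` is harmless if the method turns out to return the client directly.
  const client = await Ultravox.createWebSocketAPIClient(clientParameters);

  // Step 3: route audio in both directions between the call and the client.
  VoxEngine.sendMediaBetween(call, client);

  // Step 4: subscribe to client events and run your business logic.
  client.addEventListener(Ultravox.WebSocketAPIEvents.Transcript, onTranscript);

  return client;
}

// Inside a VoxEngine scenario this might be wired up along these lines
// (myParameters is a placeholder for your own client parameters):
//
//   require(Modules.Ultravox);
//   VoxEngine.addEventListener(AppEvents.CallAlerting, async ({ call }) => {
//     call.answer();
//     await bridgeCallToUltravox({
//       call, Ultravox, VoxEngine,
//       clientParameters: myParameters,
//       onTranscript: (event) => Logger.write(JSON.stringify(event)),
//     });
//   });
```

Passing the call and platform objects explicitly is a choice made for this sketch only; in a real scenario you would normally use the VoxEngine and Ultravox globals directly.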
Here is the complete scenario example for your reference:
Ultravox WebSocketAPI client
Frequently asked questions
Q: The AI voice on the phone is choppy and high-pitched. What do I do?
A: Set the inputSampleRate property on the Ultravox side to 16000. This should fix the issue.
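As a rough illustration of where that property might go, the sketch below adds inputSampleRate to a call-creation request body. The helper withInputSampleRate is hypothetical, and the medium.serverWebSocket nesting is an assumption about the Ultravox request format; check the Ultravox API reference for the exact location before relying on it.

```javascript
// Hypothetical helper: returns a copy of a request body with the input sample
// rate set, leaving the original object untouched.
// ASSUMPTION: inputSampleRate lives under medium.serverWebSocket.
function withInputSampleRate(body, inputSampleRate = 16000) {
  const medium = body.medium || {};
  const serverWebSocket = medium.serverWebSocket || {};
  return {
    ...body,
    medium: {
      ...medium,
      serverWebSocket: { ...serverWebSocket, inputSampleRate },
    },
  };
}

// Example: the systemPrompt value is a placeholder.
const exampleBody = withInputSampleRate({ systemPrompt: 'You are a helpful assistant.' });
```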