Voximplant Platform is a Voice AI Orchestration Platform and an established Cloud Communications Platform for building programmable voice, video, and messaging applications using serverless call control, SDKs, and APIs.
Connect real-time AI agents, speech systems, and telephony channels with code-driven orchestration.
Run inbound/outbound PSTN, SIP, WebRTC, and WhatsApp voice flows with fine-grained call control.
Use cloud IDE/debugging, multi-platform SDKs, and Management API automation.
Stream live call audio over WebSockets for real-time AI, transcription, and analysis pipelines.
Run globally on serverless infrastructure with multi-region coverage and uptime monitoring.
Realtime and agent-style voice integrations.
Live speech interactions with Gemini APIs.
WebSocket-based speech-native connector.
Native voice-agent connector and examples.
Conversational AI agent integrations.

Line Agents runtime with VoxEngine orchestration.
Grok voice-agent flow and feature support.

Realtime TTS pattern for half-cascade voice pipelines.

Realtime TTS option for half-cascade voice flows.
Streaming/realtime TTS option for voice AI pipelines.
Voximplant AI is a serverless runtime for Voice AI pipelines. It connects real-time agent/LLM systems and speech engines to PSTN, SIP, WebRTC, mobile, and WhatsApp calling, with code-driven orchestration and provider flexibility. See the Voximplant AI page and the Voice AI connectors section of the docs.
Supported vendors (direct agent / real-time LLM connectors)
Native/direct connectivity is positioned for:
In addition to the vendors above, Voximplant AI explicitly supports connecting to other real-time AI systems over a WebSocket interface.
Supported vendors (speech engines: STT / TTS)
Voximplant’s platform speech layer (STT/TTS) includes built-in providers such as:
For realtime / streaming TTS used in Voice AI scenarios, Voximplant also provides native VoxEngine modules and guides for:
Pipeline options (architectures you can run)
Orchestration primitives (what you control)
Real-time media integration (streaming)
Connectivity and endpoints
Serverless call control (VoxEngine)
Conferencing and bridging
Recording, transcription, and speech processing
call.record() in scenarios (supports stereo and additional options)
record(transcribe=true) and retrieval via GetCallHistory (transcription delivered asynchronously)
Speech-to-Text (ASR) modes and features
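The recording options listed above can be sketched as a small scenario fragment. This is a minimal illustration, not the platform's reference code: `buildRecordOptions` and `startRecording` are hypothetical helper names, only the `stereo` and `transcribe` options named in the text are shown, and a real scenario may need further options (for example a transcription language) that are omitted here.

```javascript
// Build the options object passed to call.record().
// Only the two options named in the text are covered; both helpers are
// hypothetical names introduced for this sketch.
function buildRecordOptions({ stereo = true, transcribe = false } = {}) {
  return { stereo, transcribe };
}

// Start recording on an already connected call and return the options used.
function startRecording(call, settings) {
  const options = buildRecordOptions(settings);
  call.record(options);
  return options;
}

// Inside a VoxEngine scenario the globals below exist. The guard only lets
// this file load outside the platform, for example in a local test run.
if (typeof VoxEngine !== "undefined") {
  VoxEngine.addEventListener(AppEvents.CallAlerting, (e) => {
    e.call.addEventListener(CallEvents.Connected, () => {
      // Assumption: recording starts once the call is connected.
      startRecording(e.call, { transcribe: true });
    });
    e.call.addEventListener(CallEvents.Disconnected, () => VoxEngine.terminate());
    e.call.answer();
  });
}
```

Because the transcription is delivered asynchronously, the scenario does not wait for it; the text is fetched later through GetCallHistory, as noted above.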
Answering machine / voicemail / beep detection
Automated outbound calling (call lists + dialing logic)
Cloud IDE and debugging
Local IDE continuous integration
SDKs and client libraries
Management API (HTTP)
VoxEngine.createWebSocket(...)
WebSocket.sendMediaTo(...)
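As a rough sketch of how these two calls might fit together, the fragment below opens a WebSocket and wires audio in both directions between a call and the socket. The helper name `bridgeCallToWebSocket`, the `engine` parameter (standing in for the `VoxEngine` global so the function can be exercised off-platform), and the URL `wss://example.com/audio` are assumptions made for illustration. The use of `call.sendMediaTo(...)` for the outgoing direction is also an assumption about the scenario API.

```javascript
// Hypothetical helper: connect a call's audio to an external WebSocket service.
// `engine` is expected to be the VoxEngine global when run inside a scenario.
function bridgeCallToWebSocket(call, engine, url) {
  const socket = engine.createWebSocket(url);
  call.sendMediaTo(socket); // caller audio -> WebSocket service
  socket.sendMediaTo(call); // audio from the service -> caller
  return socket;
}

// Inside a VoxEngine scenario the globals below exist. The guard only lets
// this file load outside the platform, for example in a local test run.
if (typeof VoxEngine !== "undefined") {
  VoxEngine.addEventListener(AppEvents.CallAlerting, (e) => {
    e.call.addEventListener(CallEvents.Connected, () => {
      // Placeholder URL; replace with the real-time AI or transcription endpoint.
      bridgeCallToWebSocket(e.call, VoxEngine, "wss://example.com/audio");
    });
    e.call.addEventListener(CallEvents.Disconnected, () => VoxEngine.terminate());
    e.call.answer();
  });
}
```

A production scenario would likely wait for the socket's open event before attaching media and handle socket errors and closure; those steps are left out to keep the sketch short.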