For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
Platform docsVideosCommunitySign up
CapabilitiesGetting startedVoice AI OrchestrationVoxEngine PlatformAPI ReferenceFAQ
CapabilitiesGetting startedVoice AI OrchestrationVoxEngine PlatformAPI ReferenceFAQ
  • VoxEngine Development
    • VoxEngine concepts
    • Applications
    • Users
    • Scenarios
    • Routing rules
    • Phone numbers
    • Calls and sessions
    • Video calls
    • Management API
    • Account subusers
    • Integrations
    • Firewall
    • Cloud IDE
    • Type declarations
    • VoxEngine CI
    • Working with API requests
    • Working with the Voximplant's API
    • Remote session management
    • Key-value storage
    • Secret storage
    • Custom data
    • Limits and restrictions
    • Scenarios troubleshooting
    • How billing works
  • Management API
    • Overview
    • Developer Basics
    • Authorization
    • Callbacks
    • Child accounts
    • Accessing secure objects
  • Web and Mobile SDKs
    • iOS: CallKit
    • Android: ConnectionService
    • Screen sharing
    • Custom video sources
    • Mobile SDK statistics
LogoLogo
Platform docsVideosCommunitySign up
VoxEngine Development

Integrations

Use built-in integrations for speech, NLU, and other external services
||View as Markdown|
Was this page helpful?
Edit this page
Previous

Account subusers

Next

Firewall

Built with

On this page
  • Amazon Polly
  • Google WaveNet
  • Dialogflow
  • Yandex SpeechKit
  • T-bank VoiceKit
  • Microsoft Azure

Voximplant seamlessly integrates with third-party software and services, enhancing your communication capabilities.

Here are the currently integrated services:

Amazon Polly

Amazon Polly is a service that converts text into lifelike speech, enabling you to create applications that speak and develop entirely new categories of speech-enabled products. Polly’s Text-to-Speech (TTS) service utilizes advanced deep learning technologies to synthesize natural-sounding human speech.

Google WaveNet

WaveNet technology represents a revolutionary approach to creating synthetic speech. It synthesizes speech with greater human-like emphasis and inflection on syllables, phonemes, and words. This technology is employed to produce speech for Google Assistant, Google Search, and Google Translate.

Dialogflow

Dialogflow is a natural language understanding platform for designing and integrating conversational user interfaces into mobile apps, web applications, devices, bots, interactive voice response systems, and more. Dialogflow operates on Google Cloud Platform, allowing you to scale to hundreds of millions of users.

Yandex SpeechKit

This service enables developers to recognize voice in text across multiple languages. SpeechKit is the driving force behind Alice, the Yandex voice assistant.

T-bank VoiceKit

VoiceKit from T-bank features deep neural network models for speech recognition and synthesis and is used to create a financial voice assistant called Oleg.

Microsoft Azure

Microsoft Azure Text-to-Speech (TTS) provides 116 total new voice options covering 35 languages and 49 unique dialects. These options include lifelike 36 neural options based on the latest deep learning technology, with several that offer even more advanced functionality with Microsoft’s proprietary speaking styles.

Use built-in integrations for speech, NLU, and other external services