ReadSpeaker TTS Engine SDK

Give your content, applications, and devices a natural voice.

Whether you’re developing new corporate e-learning software or taking an online banking system to the next level to boost end-user satisfaction, choose ReadSpeaker to delight your customers. The ReadSpeaker TTS engine SDK allows you to easily build and seamlessly integrate our synthetic voices into your applications. Whether it’s an e-learning application, announcement and notification system, or a set of audio books – in fact, whatever application or device you’re working on our text to speech voices are ready to satisfy the most demanding requirements.


> Exceptional performance from natural-sounding voices
ReadSpeaker provides voices that are extremely accurate, clear and natural, and ready to express your content intelligently. Optimized for your specific platform, they’re designed to deliver the highest quality sound and exceptional performance every time. Communication has never been easier or more effective.

> A world-class family of voices
Engage multilingual customers from across the globe. Today, ReadSpeaker gives your content a voice in 40+ languages and 120+ voices —and more languages are on their way.

> Create engaging content communicated in your voice
With our intuitive content editor, you can effortlessly enter text or SSML to produce, preview, edit and tweak your voice content. Adjust flow and speed to fit your application perfectly. Speed up or slow down voices and incorporate pauses for effect in audio books or training courses. You have complete control over the speed, pitch, volume, and pause. So whether you prefer a Darth Vader voice or a trustworthy business voice, the choice is yours.

> Customizable dictionary for greater accuracy
ReadSpeaker voices come with a pronunciation dictionary that covers most of a language’s pronunciations. However, there are always exceptions and sometimes you need to add regional dialect variations and expand the standard lexicon. Fine-tune pronunciation, manage acronyms, get the nuances of names of people and locations just right. Add technical words that are specific to your business or industry. Whether you’re adding a list of medical terms or a newly coined slang word, it’s easy using the phonetic alphabets we support. These include IPA, X-Sampa, TeleAtlas Sampa, Navteq Sampa, X-Sapi, X-CMU, X-PENTAX, X-PINYIN, X-WORLDBET. Phonetic transcription standards may differ between different voices.

> Flexible footprint
~150MB, ~250MB, ~500MB per voice – the choice is yours, depending on whether you need the highest definition voice for your contact centre or something lighter, ReadSpeaker delivers according to your needs.

> A variety of audio formats and sampling rates
We support the following audio formats and sampling rates: PCM, Wav, Ogg and MP3* (*using third party library) – 8KHz, 11 KHz, 16KHz, 22KHz, 44,1KHz. Sampling depth 16 bit.

Linux. Pick your operating system of choice and create your application using C-based APIs.


Operating System Windows XP SP3, Vista, Server 2003, Server 2007, Windows 7/8/10, Linux CentOS 5, Fedora 5, RSHL5, Android 2.3 (Gingerbread) or later, Mac OS X 10.7 (Lion) or later, iOS 8.0 or later.
CPU 1 GHz or faster
RAM 4 GB or more
Voice footprints (depending on quality) ~150MB, ~250MB, ~500 MB per voice


  • Accessibility
    Make communication easier for people with speech disorders, vision impairments and dyslexia. Provide assistive text to voice applications and improve daily life for millions of users. Online services should also be accessible to user groups who tend to be more computer illiterate, like older citizens.
  • Announcement Systems
    Whether it’s to update passengers about a delayed flight at the airport or information about a famous monument, interactive audio kiosks are a great way to provide information.
  • Audio Publishing
    Make your content accessible to everybody. Enable drivers to listen to it on a reading app on their way home. Let your readers stay up to date on the news as they train for the marathon. And above all, let everyone who has difficulty reading enjoy the power of text to voice.
  • Education
    Improve learning outcomes – speech enable your content for e-learning, professional simulation, induction training, etc.
  • Electronic Gaming
    Immerse gamers in audio-driven storytelling. Assist players stuck in an area with voice prompts that activate over a hotspot. Delight your kids with a talking robot.
  • Transportation
    Deploy voice announcements in trains and buses to keep passengers updated on ETAs, delays and departures, next stops, etc.