GSpeech has AI voice synthesis algorithm, which is some of the most advanced and realistic in the business. Most voice synthesizers (including Apple’s Siri) use what’s called concatenative synthesis, in which a program stores individual syllables — sounds such as “ba,” “sht,” and “oo” — and pieces them together on the fly to form words and sentences. This method has gotten pretty good over the years, but it still sounds stilted.
WaveNet, by comparison, uses machine learning to generate audio from scratch. It actually analyzes the waveforms from a huge database of human speech and re-creates them at a rate of 24,000 samples per second. The end result includes voices with subtleties like lip smacks and accents.
Test it in the live working demo below! Try using GS-Player, to change the voice, make voice tuning(adjust speed and pitch of audio). If you allow your site visitors to choose their desired voice, control speed/pitch, they will most likely return to your website more often! Allow them to download audio, so they can listen to it later. Try GSpeech now! It has completely free version, which uses Parametric technology! Paid versions are using WaveNet technology!
The following figure shows the quality of WaveNets on a scale from 1 to 5, compared with Google’s current best TTS systems (parametric and concatenative), and with human speech using Mean Opinion Scores (MOS).
MOS are a standard measure for subjective sound quality tests, and were obtained in blind tests with human subjects (from over 500 ratings on 100 test sentences). As we can see, WaveNets reduce the gap between the state of the art and human-level performance by over 50% for both US English and Mandarin Chinese.
With GSpeech you can increase user engagement and time spent on your site by allowing your visitors to listen in to the content of your website in the background while they’re working, commuting, eating, or having their hands busy. You can improve your website’s accessibility which is often times forgotten and to empower visitors who have visual impairment and reading disabilities to still completely consume your content without the complications of reading.You can adjust speaker color and shape to fit your website design and color scheme. See some examples of speaker types and styles below.
GSpeech neural technology(paid versions) supports the following languages:
GSpeech basic technology(free version) supports the following additional languages: