Voice font

A voice font is a computer-generated voice that can be controlled by specifying parameters such as speed and pitch and made to pronounce text input. The concept is akin to that of a text font or a MIDI instrument in the sense that the same input may easily be represented in several different ways based on the design of each font. In spite of current shortcomings in the underlying technology for voice fonts, screen readers and other devices used to enhance accessibility of text to persons with disabilities, can benefit from having more than one default voice font. This happens in the same way that users of a traditional computer word processor benefit from having more than one text font.

Shortcomings

The synthesized voice created by using a voice font tends to have a slightly unnatural tone. Human voices are very prone to change with the speaker's mood and several other factors that aren't programmed into computerized voices. Voice font software on the Macintosh system tries to get around this by providing tags to change some components of the voice, such as pitch. The Natural Voices software in the sources section allows defining acronym pronunciation and speech rate, as well as other things. Even though speech synthesis has existed since around 1930, according to that source, and the Speech synthesis article, it is difficult to fool experienced listeners into believing that the voice is indeed human.

This may be similar to the difficulty in achieving true Artificial Intelligence that can actually pass a Turing Test by presenting spectators with something indistinguishable from what it is trying to simulate.

Common uses

Like its text counterpart, each voice font can supply a different experience and provide a selection for different purposes. The simplest one is to select a voice font from a group in order to get the clearest one, or to choose the one with a speed that is appropriate for different settings.

For people who are hard of hearing in the upper range of the hearing spectrum, for example, selecting a voice that uses a lower pitch will deliver deeper sounds.

Another use for voice fonts is in electronic music. A commonly available set of synthetic voices from Macintosh computers can be used to enhance the mood of certain music pieces that need a voice but where the composer feels that providing a human voice is not in their interests. Here, male voices can be combined in a choir to provide the tenor and bass for a particular piece, and female voices can be added to fill in other parts of the ensemble—resulting in a choir that consists of speech synthesis rather than human singers, or to utilize a female voice when none are available.

Certain Macintosh clients of instant messaging services such as AOL Instant Messenger have had the option of reading incoming messages using the system's voice fonts. When message receiver has stepped away from the computer, or temporarily put away the part of the screen showing the incoming text, the computer reads the message out loud. This allows the user to continue with their other tasks without needing to view the incoming text.

Sources

External links

Web-based example of different voice fonts

Speech synthesis

Free software	eSpeak Gnopernicus Gnuspeech Orca Festival Speech Synthesis System FreeTTS Sinsy Automatik Text Reader

Proprietary software	DECtalk Software Automatic Mouth Talk It! Microsoft Agent Microsoft Speech API Microsoft text-to-speech voices Readspeaker Voice browser CoolSpeech BrowseAloud LaLaVoice Vocaloid Cantor Symphonic Choirs IVONA CereProc Utau Voiceroid NIAONiao Virtual Singer Vocalina Realivox CeVIO Creative Studio Chipspeech Alter/Ego PPG Phonem

Machine	Echo 2 Pattern playback Phasor RIAS Texas Instruments LPC Speech Chips TuVox

Applications	AOLbyPhone DialogOS Dr. Sbaitso MBROLA Microsoft Narrator Microsoft Speech Server PlainTalk Voice font

Protocols	Speech Synthesis Markup Language SABLE VoiceXML

Developers/ Researchers	Alan W. Black Catherine Browman Franklin Seaney Cooper Gunnar Fant Haskins Laboratories Wolfgang von Kempelen Ignatius Mattingly Philip Rubin Yamaha

Process	Articulatory synthesis Concatenative synthesis Currah Inverse filter PSOLA Phase vocoder Self-voicing

This article is issued from Wikipedia - version of the 5/6/2014. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.

Voice font

Shortcomings

Common uses

See also

Sources

External links