Text to Speech (TTS) is a speech processing system that converts text into speech. These are text-controlled speech synthesis systems that generate a synthetic language from a text that is intended to sound understandable and natural.
TTS systems analyse the text and compose the smallest meaning-giving speech units, the phonemes, into spoken words and sentences. For this purpose, the text is analyzed linguistically, broken down into the smallest text units, so-called graphemes, which are then converted into phonemes. After the conversion, the synthetic speech is provided with a rhythm and stresses.
Among other applications, the TTS technique is used in the Unified Messaging Service (UMS), which allows end users to manage their messages through a Telephone User Interface (TUI).
The speech processing of text-to-speech is supported by a voice user interface (VUI). With text-to-speech, different types of speakers with different voice tones can be distinguished. The spectrum ranges from the child's voice to the woman's voice to the man's voice, and can take into account different voice pitches and moods.