Speech synthesis is a method for artificially generating speech. A text-to-speech system (TTS) is used, i.e. a device or computer program that converts written text into acoustic signals. Speech synthesis is used, among other things, to help visually impaired people communicate.
Need a translation?
FAQ: More questions about speech synthesis
Text to speech (abbreviation: TTS) refers to a method for converting written text into speech. This is a form of speech synthesis.
Speech synthesis is the artificial generation of human speech. Different devices and programs can be used such as TTS software.
To generate speech signals, a distinction is made between two approaches: the rule-based and the lexicon-based approach, which are used in combination in most text-to-speech systems.
Neural speech synthesis refers to a form of speech generation that is constantly being improved through machine learning. To do this, an artificial neural network is created that learns to predict the phonetics of human speech. The result is a more fluid and natural sounding voice.
While text to speech was initially used mainly to help people with disabilities communicate or to provide them with barrier-free access to content, it can now be used wherever text needs to be converted into speech, e.g. in customer service portals or when using smart devices.