Speech processing

Speech processing is the study of speech signals and the processing methods of these signals.

The signals are usually processed in a digital representation whereby speech processing can be seen as the intersection of digital signal processing and natural language processing.

Speech processing can be divided in the following categories:

Speech recognition, which deals with analysis of the linguistic content of a speech signal.
Speaker recognition, where the aim is to recognise the identity of the speaker.
Enhancement of speech signals, e.g. noise reduction,
Speech coding for compression and transmission of speech. See also telecommunication.
Voice analysis for medical purposes, such as analysis of vocal loading and dysfunction of the vocal cords.
Speech synthesis: the artificial synthesis of speech, which usually means computer generated speech.
Speech compression is important in the telecommunications area for increasing the amount of info which can be transferred, stored, or heard, for a given set of time and space constraints.

[edit] Books

Multilingual Speech Processing, Edited by Tanja Schultz and Katrin Kirchhoff, April 2006--Researchers and developers in industry and academia with different backgrounds but a common interest in multilingual speech processing will find an excellent overview of research problems and solutions detailed from theoretical and practical perspectives.---CH 1: Introduction / CH 2: Language Characteristics / CH 3: Linguistic Data Resources / CH 4: Multilingual Acoustic Modeling / CH 5: Multilingual Dictionaries / CH 6: Multilingual Language Modeling / CH 7: Multilingual Speech Synthesis / CH 8: Automatic Language Identification / CH 9: Other Challenges / CH 10: Speech-to-Speech Translation / CH 11: Multilingual Spoken Dialog Systems / Bibliography

Digital Signal Processing
Theory — Nyquist–Shannon sampling theorem, estimation theory, detection theory
Sub-fields — audio signal processing \| control engineering \| digital image processing \| speech processing \| statistical signal processing
Techniques — Discrete Fourier transform (DFT) \| Discrete-time Fourier transform (DTFT) \| bilinear transform \| Z-transform, advanced Z-transform
Sampling — oversampling \| undersampling \| downsampling \| upsampling \| aliasing \| anti-aliasing filter \| sampling rate \| Nyquist rate/frequency
This box: view • talk • edit