Speech processing
From Wikipedia, the free encyclopedia
Speech processing is the study of speech signals and the processing methods of these signals.
The signals are usually processed in a digital representation whereby speech processing can be seen as the intersection of digital signal processing and natural language processing.
Speech processing can be divided in the following categories:
- Speech recognition, which deals with analysis of the linguistic content of a speech signal.
- Speaker recognition, where the aim is to recognise the identity of the speaker.
- Enhancement of speech signals, e.g. noise reduction,
- Speech coding for compression and transmission of speech. See also telecommunication.
- Voice analysis for medical purposes, such as analysis of vocal loading and dysfunction of the vocal cords.
- Speech synthesis: the artificial synthesis of speech, which usually means computer generated speech.
- Speech compression is important in the telecommunications area for increasing the amount of info which can be transferred, stored, or heard, for a given set of time and space constraints.
[edit] Books
- Multilingual Speech Processing, Edited by Tanja Schultz and Katrin Kirchhoff, April 2006--Researchers and developers in industry and academia with different backgrounds but a common interest in multilingual speech processing will find an excellent overview of research problems and solutions detailed from theoretical and practical perspectives.---CH 1: Introduction / CH 2: Language Characteristics / CH 3: Linguistic Data Resources / CH 4: Multilingual Acoustic Modeling / CH 5: Multilingual Dictionaries / CH 6: Multilingual Language Modeling / CH 7: Multilingual Speech Synthesis / CH 8: Automatic Language Identification / CH 9: Other Challenges / CH 10: Speech-to-Speech Translation / CH 11: Multilingual Spoken Dialog Systems / Bibliography
[edit] See also
[edit] External links
- Compure Audio Technologies
- Center for Language and Speech Processing at JHU
- Speech Processing Group
- Voyce Security Systems
- Speech Processing Group at the Laboratory of Applied Physics
- Speech and Language Processing
- Speech Processing
- Speech Processing Discussion Group
- Philips Speech Recognition Systems
Digital Signal Processing |
---|
Theory — Nyquist–Shannon sampling theorem, estimation theory, detection theory |
Sub-fields — audio signal processing | control engineering | digital image processing | speech processing | statistical signal processing |
Techniques — Discrete Fourier transform (DFT) | Discrete-time Fourier transform (DTFT) | bilinear transform | Z-transform, advanced Z-transform |
Sampling — oversampling | undersampling | downsampling | upsampling | aliasing | anti-aliasing filter | sampling rate | Nyquist rate/frequency |