CMU Sphinx

From Wikipedia, the free encyclopedia

CMU Sphinx sometimes simply known as Sphinx is the general term to describe a group of speech recognition projects. These includes a series of speech recognizers (Sphinx I - IV), acoustic model trainer (SphinxTrain).

In 2000, the Sphinx group at CMU committed to open source several speech recognizer components, including Sphinx II and later Sphinx III (in 2001). Therefore, CMU Sphinx could also mean a group of open source projects related to speech recognition. For most, the speech decoders are perhaps the most well-known.

CMU Sphinx is perhaps the only open source, large vocabulary, continuous speech recognition project which consistently releases its work under the liberal BSD-license.

As for the components of CMU Sphinx, currently it is available in four forms.

1 Sphinx 2
2 Sphinx 3
3 Sphinx 4
4 PocketSphinx
5 External links

[edit] Sphinx 2

The fastest recognizer among all in the project, its focus is on real-time analysis of speech signals. It is widely used in dialogue systems and language learning systems. It can be used in computer based PBX systems, such as Asterisk.