VoxForge

From Wikipedia, the free encyclopedia

VoxForge is a free speech corpus and acoustic model repository for open source speech recognition engines.

VoxForge was set up to collect transcribed speech to create a free GPL speech corpus for use with open source speech recognition engines. The speech audio files will be 'compiled' into acoustic models for use with open source speech recognition engines such as HTK, Julius, ISIP, and Sphinx.

The current focus of VoxForge is on collecting transcribed audio for command and control applications on a PC and for IP PBX-based telephony speech recognition (i.e. IVR - Interactive Voice Response) applications. When enough speech audio has been collected, VoxForge will work to create acoustic and language models for dictation applications.

[edit] External links

  • VoxForge - project home page
  • Julius - large vocabulary CSR engine
  • HTK - CUED's hidden Markov model toolkit
  • ISIP - MSState's Institute for Signal and Information Processing
  • Sphinx - The CMU Sphinx Group speech recognition engines