MBROLA

MBROLA is an algorithm for speech synthesis, and software which is distributed at no financial cost but in binary form only, and a worldwide collaborative project. The MBROLA project web page provides diphone databases for a large number of spoken languages.

The MBROLA software is not a complete text-to-speech system for all those languages; the text must first be transformed into phoneme and prosodic information in MBROLA's format, and separate software to do this is available for some but not all of MBROLA's languages and can require extra setup.

Although diphone-based, the quality of MBROLA's synthesis is considered to be higher than that of most diphone synthesisers; this is due in part to the fact that it is based on a preprocessing of diphones (imposing constant pitch and harmonic phases), which enhances their concatenation while only slightly degrading their segmental quality.

MBROLA is a time-domain algorithm, as PSOLA, which implies very low computational load at synthesis time. Unlike PSOLA, however, MBROLA does not require a preliminary marking of pitch periods. This feature has made it possible to develop the MBROLA project around the MBROLA algorithm, through which many speech research labs, companies, or individuals around the world have provided diphone databases for many languages and voices (the number of which is by far a world record for speech synthesis, but there are some notable omissions such as Chinese).

References