Stable release | 3.1.1 / 21 may 2009 |
---|---|
Written in | C++ |
Operating system | POSIX compatible |
Available in | Multi-lingual |
Type | Machine translation |
License | GNU General Public License |
Website | www.apertium.org |
Apertium is a rule-based machine translation platform. It is free software and released under the terms of the GNU General Public License.
Contents |
Apertium originated as one of the machine translation engines in the project OpenTrad, which was funded by the Spanish government. It was originally designed to translate between closely related languages, although it has recently been expanded to treat more divergent language pairs. To create a new machine translation system, one just has to develop linguistic data (dictionaries, rules) in well-specified XML formats.
Language data developed for it (in collaboration with the Universidade de Vigo, the Universitat Politècnica de Catalunya and the Universitat Pompeu Fabra) currently support (in stable version) the Aragonese, Asturian, Basque, Breton, Bulgarian, Catalan, Danish, English, Esperanto, French, Galician, Icelandic, Italian, Macedonian, Norwegian (Bokmål and Nynorsk), Occitan, Portuguese, Romanian, Spanish, Swedish and Welsh languages. A full list is available below. Several companies are also involved in the development of Apertium, including Prompsit Language Engineering, Imaxin Software and Eleka Ingeniaritza Linguistikoa.
Apertium is a shallow-transfer machine translation system, which uses finite state transducers for all of its lexical transformations, and hidden Markov models for part-of-speech tagging or word category disambiguation. Constraint Grammar taggers are also used for some language pairs (eg. Breton-French).
The project has taken part in the 2009, 2010 and 2011 editions of the Google Summer of Code and the 2010 and 2011 editions of the Google Code-in.[1]
List of currently stable language pairs, hover over the language codes to see the languages that they represent.
an |
ast |
eu |
br |
bg |
ca |
da |
en |
eo |
fr |
gl |
is |
it |
mk |
nb |
nn |
oc |
pt |
ro |
es |
sv |
cy |
|
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Aragonese | — | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | Yes (⇄) | No | No |
Asturian | No | — | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | Yes (⇄) | No | No |
Basque | No | No | — | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | Yes (→) | No | No |
Bretonă | No | No | No | — | No | No | No | No | No | Yes (→) | No | No | No | No | No | No | No | No | No | No | No | No |
Bulgarian | No | No | No | No | — | No | No | No | No | No | No | No | No | Yes (⇄) | No | No | No | No | No | No | No | No |
Catalan | No | No | No | No | No | — | No | Yes (⇄) | Yes (→) | Yes (⇄) | No | No | Yes (←) | No | No | No | Yes (⇄) | Yes (⇄) | No | Yes (⇄) | No | No |
Danish | No | No | No | No | No | No | — | No | No | No | No | No | No | No | No | No | No | No | No | No | Yes (←) | No |
English | No | No | No | No | No | Yes (⇄) | No | — | Yes (⇄) | No | Yes (⇄) | Yes (←) | No | Yes (←) | No | No | No | No | No | Yes (⇄) | No | Yes (←) |
Esperanto | No | No | No | No | No | Yes (←) | No | Yes (⇄) | — | Yes (←) | No | No | No | No | No | No | No | No | No | Yes (←) | No | No |
French | No | No | No | Yes (←) | No | Yes (⇄) | No | No | Yes (→) | — | No | No | No | No | No | No | No | No | No | Yes (⇄) | No | No |
Galician | No | No | No | No | No | No | No | Yes (⇄) | No | No | — | No | No | No | No | No | No | Yes (⇄) | No | Yes (⇄) | No | No |
Icelandic | No | No | No | No | No | No | No | Yes (→) | No | No | No | — | No | No | No | No | No | No | No | No | No | No |
Italian | No | No | No | No | No | Yes (→) | No | No | No | No | No | No | — | No | No | No | No | No | No | No | No | No |
Macedonian | No | No | No | No | Yes (⇄) | No | No | Yes (→) | No | No | No | No | No | — | No | No | No | No | No | No | No | No |
Norwegian (Bokmål) | No | No | No | No | No | No | No | No | No | No | No | No | No | No | — | Yes (⇄) | No | No | No | No | No | No |
Norwegian (Nynorsk) | No | No | No | No | No | No | No | No | No | No | No | No | No | No | Yes (⇄) | — | No | No | No | No | No | No |
Occitan | No | No | No | No | No | Yes (⇄) | No | No | No | No | No | No | No | No | No | No | — | No | No | Yes (⇄) | No | No |
Portuguese | No | No | No | No | No | Yes (⇄) | No | No | No | No | Yes (⇄) | No | No | No | No | No | No | — | No | Yes (⇄) | No | No |
Romanian | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | — | Yes (←) | No | No |
Spanish | Yes (⇄) | Yes (⇄) | Yes (←) | No | No | Yes (⇄) | No | Yes (⇄) | Yes (→) | Yes (⇄) | No | No | No | No | No | No | Yes (⇄) | Yes (⇄) | Yes (←) | — | No | No |
Swedish | No | No | No | No | No | No | Yes (→) | No | No | No | No | No | No | No | No | No | No | No | No | No | — | No |
Welsh | No | No | No | No | No | No | No | Yes (→) | No | No | No | No | No | No | No | No | No | No | No | No | No | — |