Microsoft Translator

Microsoft Translator
Web address www.microsoft.com/translator
Type of site
Machine translation
Registration Optional
Owner Microsoft
Current status Active

Microsoft Translator is a multilingual statistical machine-translation cloud service provided by Microsoft. The Microsoft Translator API is integrated across multiple consumer, developer, and enterprise products; including Bing, Microsoft Office, SharePoint, Microsoft Lync, Yammer, Skype Translator, Visual Studio, and Internet Explorer.

The cloud API is also available as a service for customers, ranging from a free tier supporting two million characters per month to paid tiers supporting billions of characters per month.[1]

The service supports 50 languages as of May 2015.

Development

History

The first version of Microsoft’s machine translation system was developed between 1999 and 2000 within Microsoft Research. This system was based on semantic predicate-argument structures known as logical forms (LF), and was spun from the grammar correction feature developed for Microsoft Word. This system was eventually used to translate the entire Microsoft Knowledge Base into Spanish, French, German, and Japanese.[2]

Microsoft’s approach to machine translation, like most of current modern machine translation systems, is “data-driven[3] —rather than relying on writing explicit rules to translate natural language, algorithms are trained to understand and interpret translated parallel texts, allowing them to automatically learn how to translate new natural language text. Microsoft’s experience with the LF-system led directly to a treelet translation system which simplified the LF to dependency trees, and eventually to an order template model which led to significant improvements in speed and the incorporation of new target languages.

The consumer-facing translation site known as Bing Translator (previously known as Windows Live translator) was launched in 2007 and provides free text and website translations on the web. Text is translated directly within the Bing Translator webpage while websites are translated through the Bilingual Viewer tools.

In 2011, the service was extended to include numerous Microsoft Translator products through a cloud-based API, which supports products available to both consumer and enterprise users.

Translation Methodology and Research

Microsoft Translator uses statistical machine translation to create instantaneous translations from one natural language to another. This system is based on four distinct areas of computer learning research seen below.[4]

Type of Learning Impact on Translation
Syntax-Based SMT Syntax-based translation is based on the idea of translating syntactic units, rather than a word or string of words. Microsoft has used Syntax-based SMT to translate much of its computer-related texts from English into multiple target languages. Ongoing research in this area has produced improvements in word inflections and word ordering.
Phrase-Based SMT In Phrase-Based SMT, the machine learns correspondence between languages from parallel text without the aid of linguist knowledge. This produces better translations in less time than other systems.
Word Alignment SMT systems rely on existing translated data to learn how to automatically translate from one language to another. To train the systems, identifying word correspondences (or word alignments) is crucial. Microsoft has developed work in both discriminative[5] and generative[6] approaches to word alignment, resulting in faster algorithms and higher quality and translations.
Language Modeling Language Modeling uses n-gram models to construct comprehensible translations in the target language. This ensures that the output translation is fluent and readable.

Accuracy

The quality of Microsoft Translator’s machine translation outputs are evaluated using a method called the BLEU score.[7]

BLEU (Bilingual Evaluation Understudy) is an algorithm for evaluating the quality of text which has been machine-translated from one natural language to another. Quality is considered to be the correspondence between a machine's output and that of a human. BLEU was one of the first metrics to achieve a high correlation with human judgments of quality, and remains one of the most popular automated and inexpensive metrics.

Because machine translation is based on statistical algorithms rather than human translators, the automatic translations it produces are not always entirely accurate. Microsoft Translator has introduced various feedback features, such as the Collaborative Translations Framework, into its products to allow users to suggest alternative translations. These alternative translations are then integrated into the Microsoft Translator algorithms to improve future translations.

Core Products

Microsoft Translator is a cloud-based API that is integrated into numerous Microsoft products and services.[8] The Translator API can be used on its own and can be customized for use in a pre-publishing or post-publishing environment. The API, which is available through subscription, is free for lower translation volumes, and is charged according to a tiered payment system for volumes exceeding two million characters per month.[9] The remaining core products are available for free.

Microsoft Translator API

The Microsoft Translator API is a cloud-based automatic translation service that can be used to build applications, websites, and tools requiring multi-language support. The API is available using five interfaces: the Web widget, AJAX, HTTP, SOAP, or OData.[10]

Collaborative Translations Framework (CTF)

The Collaborative Translations Framework (CTF) is an extension of the Microsoft Translator API that allows post-publishing improvement of translated text.[11] By using the CTF, readers have the ability to suggest alternative translations to those provided by the API, or vote on previously offered alternatives. This information is then delivered to the API to improve future translations.

Microsoft Translator Hub

The Microsoft Translator Hub allows enterprises and language service providers to build their own translation systems that understand business- and industry-specific terminology.[12] The Hub can also be used in conjunction with the CTF, allowing administrators to approve CTF results and add them directly to the Hub.

The Hub has also been used for language preservation, allowing communities to create their own language translation systems for language and cultural preservation.[13] The Hub has been used to create translation systems for languages such as Hmong, Mayan, Nepali, and Welsh.

Translator Web Widget

The Translator Web Widget is a translation tool that can be added to web pages by pasting a predefined snippet of JavaScript code into the page.[14] The web widget is offered for free by Microsoft, and supports both pre-publishing customized translations using the Translator Hub, and post-publishing improvements using the Collaborative Translations Framework.

Multilingual App Toolkit (MAT)

The Multilingual App Toolkit (MAT) is an integrated Visual Studio tool, which allows developers to streamline localization workflows of their Windows, Windows Phone and desktop apps.[15] MAT improves localization of file management, translation support, and editing tools.

Supported products

Through its core product offerings, Microsoft Translator supports the translation features of many Microsoft products at the consumer and enterprise levels. These products fall broadly into three categories[16]— communication products, Microsoft Office, and apps.

Communication

Microsoft Office

Apps

Languages

Microsoft Translator supports 50 different languages as of May, 2015.[17] The list of supported languages is available at the Microsoft Translator website and can also be retrieved programmatically through the API.[18]

Arabic
Bosnian
Bulgarian
Catalan
Chinese (Simplified)
Chinese (Traditional)
Croatian
Czech
Danish
Dutch

English
Estonian
Finnish
French
German
Greek
Haitian Creole
Hebrew
Hindi
Hmong Daw

Hungarian
Indonesian
Italian
Japanese
Klingon
Klingon (plqaD)
Korean
Latvian
Lithuanian
Malay

Maltese
Norwegian
Persian
Polish
Portuguese
Querétaro Otomi
Romanian
Russian
Serbian
Slovak

Slovenian
Spanish
Swedish
Thai
Turkish
Ukrainian
Urdu
Vietnamese
Welsh
Yucatec Maya

Community Partners

Microsoft Translator has engaged with community partners to increase the number of languages and to improve overall language translation quality. Below is a list of community partners that Microsoft Translator has teamed with.[19]

Additionally, Microsoft has teamed with the Klingon Language Institute, which promotes the constructed language, Klingon, which is used within the fictional Star Trek universe produced by Paramount and CBS Studios. Klingon has been supported by Microsoft Translator since May, 2013.[20]

See Also

References

External Links