Transliteration
Part of a series on |
Translation |
---|
Types |
Theory |
Technologies |
Localization |
Institutional |
|
Related topics |
|
Transliteration is the conversion of a text from one script to another.[1]
For instance, a Latin transliteration of the Greek phrase "Ελληνική Δημοκρατία", usually translated as 'Hellenic Republic', is "Ellēnikḗ Dēmokratía".
Transliteration is not concerned with representing the sounds of the original, only the characters, ideally accurately and unambiguously. Thus, in the above example, λλ is transliterated as 'll', but pronounced /l/; Δ is transliterated as 'D', but pronounced 'ð'; and η is transliterated as 'ē', though it is pronounced /i/ (exactly like ι) and is not long.
Conversely, transcription notes the sounds but not necessarily the spelling. So "Ελληνική Δημοκρατία" could be transcribed as "elinikí ðimokratía", which does not specify which of the /i/ sounds are written as η and which as ι.
Definitions
Systematic transliteration is a mapping from one system of writing into another, typically grapheme to grapheme. Most transliteration systems are one-to-one, so a reader who knows the system can reconstruct the original spelling.
Transliteration is opposed to transcription, which maps the sounds of one language into a writing system. Still, most systems of transliteration map the letters of the source script to letters pronounced similarly in the target script, for some specific pair of source and target language. If the relations between letters and sounds are similar in both languages, a transliteration may be very close to a transcription. In practice, there are some mixed transliteration/transcription systems that transliterate a part of the original script and transcribe the rest.
For many script pairs, there is one or more standard transliteration systems. However, unsystematic transliteration is common.
Difference from transcription
In Modern Greek (and since the Roman Imperial period), the letters <η> <ι> <υ> and the letter combinations <ει> <oι> <υι> are pronounced [i] (except when pronounced as semivowels), and a modern transcription renders them all as <i>; but a transliteration distinguishes them, for example by transliterating to <ē> <i> <y> and <ei> <oi> <yi>. (As the ancient pronunciation of <η> was [ɛː], it is often transliterated as an <e> with a macron, even for modern texts.) On the other hand, <ευ> is sometimes pronounced [ev] and sometimes [ef], depending on the following sound. A transcription distinguishes them, but this is no requirement for a transliteration. The initial letter 'h' reflecting the historical rough breathing in words such as Hellēnikē should logically be omitted in transcription from Koine Greek on,[2] and from transliteration from 1982 on, but it is nonetheless frequently encountered.
Greek word | Transliteration | Transcription | English translation |
---|---|---|---|
Ελληνική Δημοκρατία | Hellēnikē Dēmokratia | Eliniki Dhimokratia | Hellenic Republic |
Ελευθερία | Eleutheria | Eleftheria | Freedom |
Ευαγγέλιο | Euaggelio | Evangelio | Gospel |
των υιών | tōn uiōn | ton ion | of the sons |
Partial transliteration
There is also another type of transliteration that is not full, but partial or quasi. A source word can be transliterated by first identifying all the applicable prefix and suffix segments based on the letters in the source word. All of these segments, in combination constitute a list of potential partial transliterations. So a partial transliteration can include only prefix or only suffix segments. A partial transliteration will also include some unmapped letters of the source word, namely those letters between the end of the prefix and the beginning of the suffix. The partial transliteration can be “filled in” by applying additional segment maps. Applying the segment maps can produce additional transliterations if more than one segment mapping applies to a particular combination of characters in the source word.[3]
Some examples or "partial transliterations" are words like "bishop" via Anglo-Saxon biscep from the Greek word "episkopos" and the word "deacon", which is partially transliterated from the Greek word "diakonos".
Challenges
A simple example of difficulties in transliteration is the voiceless uvular plosive used in Arabic and other languages. It is pronounced approximately like English [k], except that the tongue makes contact not on the soft palate but on the uvula. Pronunciation varies between different languages, and different dialects of the same language. The consonant is sometimes transliterated into "g", sometimes "k", and sometimes "q" in English.[4] Another example is the Russian letter "Х" (kha). It is pronounced as the voiceless velar fricative /x/, like the Scottish pronunciation of ⟨ch⟩ in "loch". This sound is not present in most forms of English, and is often transliterated as "kh", as in Nikita Khrushchev. Many languages have phonemic sounds, such as click consonants, which are quite unlike any phoneme in the language into which they are being transliterated.
Some languages and scripts present particular difficulties to transcribers. These are discussed on separate pages.
- Ancient Near East
- Transliterating cuneiform languages
- Transliteration of Ancient Egyptian (see also Egyptian hieroglyphs)
- hieroglyphic Luwian
- Armenian language
- Avestan
- Brahmic family
- Chinese language
- Click languages of Africa
- English language
- Greek language
- Japanese language
- Korean language
- Persian language
- Semitic languages
- Slavic languages written in the Cyrillic or Glagolitic alphabets
- Thai language
Adopted
- Buckwalter transliteration
- Devanagari transliteration
- Hans Wehr transliteration
- International Alphabet of Sanskrit Transliteration
- Scientific transliteration of Cyrillic
- Transliteration of Ancient Egyptian
- Transliterations of Manchu
- Wylie transliteration
See also
- Latin script
- List of ISO transliterations
- Orthographic transcription
- Phonemic orthography
- Phonetic transcription
- Romanization
- Substitution cipher
- Transcription (linguistics)
References
- ↑ Kharusi, N. S. & Salman, A. (2011) The English Transliteration of Place Names in Oman. Journal of Academic and Applied Studies Vol. 1(3) September 2011, pp. 1–27 Available online at www.academians.org
- ↑ see Koine Greek phonology
- ↑ Machine Learning For Transliteration - Transliteration
- ↑ Language log
"Translation" citation 15: ^ Kasparek, "The Translator's Endless Toil", pp. 85–86. "Roger Bacon wrote that if a translation is to be true, the translator must know both languages, as well as the science that he is to translate"
External links
Look up transliteration in Wiktionary, the free dictionary. |
Online transliteration
- Cyrillic transliteration www.cesty.in
- Greek and Hebrew transliteration Transliterate.com
- Hindi to Urdu (and vice versa) transliteration Malerkotla.co.in
- Jayapal Chandran Basic Indian language transliteration (Asian scripts -> Latin)
- Perl module and online service covering a variety of writing systems Lingua-Systems, Lingua::Translit
- Sinhala and Tamil keyboard and transliteration
- Tool for transliteration of Asian scripts (Asian scripts -> Asian scripts -> Latin ISO)
- ICU Transform Demonstration
- Translit Transliteration service
- TransLiteration Online transliteration service
Documentation
- Unicode Transliteration Guidelines
- ICU User Guide: Transforms International Components for Unicode transliteration services
- Transliteration of Non-Latin scripts – Collection of transliteration tables for many non-Latin scripts maintained by Thomas T. Pedersen.
- United Nations Group of Experts on Geographical Names (UNGEGN) – working group on Romanization Systems.
- Library of Congress: Romanization Tables
Software
- Softario Typus Free in-place transliteration tool for Russian, Arabic, Greek, Hebrew and other languages for Windows.
- OpenOffice.org, for Indic transliteration in OpenOffice
- SourceForge, AzConvert, open source program for transliterating Latin and Arabic scripts of Azerbaijani language developed using Qt
- Subasa Tamil to Sinhalese language transliteration for Mozilla Firefox users
- Indian Language Transliterator for Mozilla Thunderbird This add-on for Mozilla Thunderbird enables Thunderbird users to compose and send messages in 10 regional Indian languages, using their regular QWERTY keyboard. The languages supported by this add-on are Assamese, Bengali, Hindi, Kannada, Oriya, Malayalam, Marathi, Sanskrit, Tamil and Telugu.
- Unidecode, Python module for ASCII transliteration of Unicode text.
- Translit - Lugons Project - Serbian Cyrilic and Latin language transliteration for Apache OpenOffice and LibreOffice
Others
- Transliteration history – history of the transliteration of Slavic languages into Latin alphabets.
|