User:Sundar/Tamil language
From Wikipedia, the free encyclopedia
This is a scratchpad that I was using to rewrite the Tamil language article conforming to the Wikipedia:WikiProject Language Template.This article is no longer relevant (except for the edit history) as this has been moved to Tamil language and many editors have made significant changes subsequently.
Tamil is one of the classical languages of the world with a rich and long literary tradition. In 1996, it was the 18th most spoken language in the world with over 74 million speakers spread across the world. Tamil belongs to the Dravidian language family which is native to parts of India and Pakistan.
The l (pictured right) in the name Tamil is pronounced like a retroflex r and is often transliterated as zh. This phoneme is believed to be unique to Tamil and Malayalam. See also retroflex approximant.
Tamil தமிழ் |
||
---|---|---|
Spoken in: | India, Sri Lanka, Singapore, Malaysia, Mauritius, with a significant minority in many countries like South Africa, Germany, Netherlands, Fiji, Canada, and Australia | |
Region: | Tamil Nadu state in India | |
Total speakers: | 74 million (as of 1999) | |
Ranking: | 18 (as of 1996)[1] | |
Language family: | Dravidian
Southern |
|
Official status | ||
Official language of: | India, Sri Lanka and Singapore | |
Regulated by: | Language Academy | |
Language codes | ||
ISO 639-1: | ta | |
ISO 639-2: | tam | |
ISO 639-3: | — | |
Note: This page may contain IPA phonetic symbols in Unicode. See IPA chart for English for an English-based pronunciation key. |
Contents |
[edit] History
The origins of Tamil are independent of Sanskrit unlike many other Indian languages. One of the oldest members of the Dravidian language family, it has a very long literary tradition which is believed to be several millennia long. The earliest available records date back to inscriptions from 200 BC. Other earlier works are preserved either in palm leaf manuscripts or through oral transmission. The earliest available text which describes the language of the classical period is the Tolkaappiyam (dating from around 200 BC).
Historical linguists categorise Tamil literature into three periods: ancient (200 BC to 700 AD), medieval (700 AD to 1500 AD) and modern (1500 AD to the present). During the medieval period, a number of Sanskrit loan words were absorbed by Tamil, which were sought to be removed by many purists during the 1900s, notably by Parithimaar kalaignar and Maraimalai adigal. This movement was called thanith thamizh iyakkam (meaning pure tamil movement). As a result of this, Tamil in formal documents, public speeches and scientific discourses is free of Sanskrit loan words largely. Around 800-1000 AD, Malayalam is believed to have evolved into a distinct language.
[edit] Classification
Tamil is a member of the Tamil languages group of languages that includes Irula, Kaikadi, Betta Kurumba, Sholaga, and Yerukula languages. This group is a subgroup of the Tamil-Malayalam languages, which falls under a subgroup of Tamil-Kodagu languages, which in turn is a subgroup of Tamil-Kannada-Telugu languages. The Tamil-Kannada-Telugu languages belong to the Southern branch of the Dravidian language family.
[edit] Geographic distribution
Tamil is spoken in southern India, northern and north-eastern Sri Lanka, Singapore, Malaysia, Mauritius, and also by a significant minority in many countries like South Africa, Germany, Netherlands, Fiji, Canada, and Australia.
[edit] Official status
Tamil is one of the 22 official languages of India. It is the administrative language of the Indian state of Tamil Nadu. Tamil is also an official language of Sri Lanka and Singapore.
[edit] Classical language status
Tamil is the first language to be declared as a classical language by the Government of India. This was announced by the President of India Abdul Kalam in a joint sitting of both houses of the Indian Parliament on June 6, 2004.
See item 41 of his address and the BBC news item on the formal approval by the Indian Cabinet.
Prior to that, several members of the academia had voiced a case for classical status to Tamil, notably Professor George L. Hart, who occupies the Chair in Tamil Studies at the University of California, Berkeley. See his statement.
[edit] Spoken and literary variants
Tamil is a diglossic language. The classic form (chentamil) of the language is different from the colloquial (koduntamil) form. This difference in the language has existed since ancient times.
The classic form is preferred for writing, and is also used for public speaking. While written Tamil is mostly standard across various Tamil speaking regions, spoken form of the language differs widely from the written form.
[edit] Dialects
Dialects of Tamil identified by the Ethnologue are: Adi Dravida, Aiyar, Aiyangar, Arava, Burgandi, Kasuva, Kongar, Korava, Korchi, Madrasi, Parikala, Pattapu Bhasha, Sri Lanka Tamil, Malaya Tamil, Burma Tamil, South Africa Tamil, Tigalu, Harijan, Sanketi, Hebbar, Tirunelveli, Madurai. Other known dialects are Kongu and Kumari.
[edit] Derived languages
Malayalam, spoken by the people around the hilly ranges bordering Kerala and Tamil Nadu states, closely resembles Tamil in vocabulary, syntax and script. Hence it is hypothesised to have been evolved from a dialect of Tamil called Malaithamil (meaning Tamil of the mountains).
[edit] Sounds
Main article: Tamil alphabet
The Tamil alphabet has 12 vowels and 18 consonants. These combine to form 216 compound characters. There is one special character (aaytha ezutthu), giving a total of 247 characters.
[edit] Vowels
The vowels are called uyir ezhuthu (uyir - life, ezhuthu - letter). The vowels are classified into short and long (five of each type) and two diphthongs.
The long ('nedil') vowels are about twice as long as the short ('kuRil') vowels. The diphthongs are usually pronounced about 1.5 times as long as the short vowels, though most grammatical texts place them with the long ('nedil') vowels.
Vowel | Sound | X-SAMPA |
---|---|---|
அ | Short a | /@/ |
ஆ | Long A | /a:/ |
இ | Short i | /I/ |
ஈ | Long I | /i:/ |
உ | Short u | /U/ |
ஊ | Long U | /u:/ |
எ | Short e | /E/ |
ஏ | Long E | /e:/ |
ஐ | Diphthong AI | /Ai:/ |
ஒ | Short o | /O/ |
ஓ | Long O | /o:/ |
ஔ | Diphthong AU | /@u:/ |
[edit] Consonants
The consonants are classified into three categories with 6 in each category: vallinam - hard, mellinam - soft or nasal, and idayinam - medium. Tamil has neither conjunct consonants nor aspirated and voiced stops. Some scholars have suggested that in Chenthamil (which refers to Tamil as it existed before Sanskrit words were borrowed), stops were voiceless when at the start of a word and unvoiced otherwise. However, no such distinction is observed by most modern Tamil speakers.
Following is the listing of consonants in isolated form (sans vowel sounds), indicated by the overdot diacritic.
Consonant | Sound | Category | X-SAMPA |
---|---|---|---|
க் | k | vallinam | /k/ or /g/ |
ங் | ng (N-yuh: one sound) | mellinam | /N/ |
ச் | c | vallinam | /c/ or /s/ |
ஞ் | nj | mellinam | /J/ |
ட் | tt | vallinam | /t/ or /d/ |
ண் | nn | mellinam | /n`/ |
த் | th | vallinam | /T/ or /D/ |
ந் | n | mellinam | /n/ |
ப் | p | vallinam | /p/ or /b/ |
ம் | m | mellinam | /m/ |
ய் | y | idaiyinam | /j/ |
ர் | r | idaiyinam | /4/ |
ல் | l | idaiyinam | /l/ |
வ் | v | idaiyinam | /v\/ |
ழ் | zh | idaiyinam | /r\`/ |
ள் | ll | idaiyinam | /l'/ |
ற் | rr | vallinam | /r/ |
ன் | nnn | mellinam | /n/ |
[edit] Compound form
Using the consonant 'k' as an example.
Compound | Transliteration | X-SAMPA |
---|---|---|
க் | k | |
க | ka | /k@/ |
கா | kA | /ka:/ |
கி | ki | /kI/ |
கீ | kI | /ki:/ |
கு | ku | /kU/ |
கூ | kU | /ku:/ |
கெ | ke | /kE/ |
கே | kE | /ke:/ |
கை | kAI | /k@i:/ |
கொ | ko | /kO/ |
கோ | kO | /ko:/ |
கௌ | kAU | /k@u:/ |
[edit] Special character
The special character 'ஃ' (pronounced 'akh') is rarely used by itself; it normally serves a purely grammatical function as an independent vowel form, the equivalent of the Overdot diacritic of plain consonants. This character is called aayutha ezhuthu (aayutham - weapon or device used in wars, ezhuthu - character) as it resembles the three dots found on a shield used by infantry in wars of olden days. The vowel sound u in aayutham gets dropped often.
[edit] Phonology
Phonologists are divided in their opinion over allophones in Tamil. One point of view is that Tamil has never had either conjunct consonants or aspirated and voiced stops. Hence, unlike Indo-European languages, Tamil did not have the need for either separate characters for these consonants or for allophones. But, there is another theory that, though the script does not have symbols for aspirated consonants, the sounds are present as different allophones of the present consonants. Whichever is true, the Sanskrit loan words are pronounced with aspirated consonants even though there are well-defined rules in Tolkaappiyam for pronouncing such words without the aspirated sounds.
[edit] Grammar
This section uses information from UCLA profile of Tamil.
Tamil, like other Dravidian languages, is an agglutinating language. Morphemes are usually transparently separable and analysable affixes that are attached to roots or stems. Such affixes in Tamil are almost always suffixal. Words are made up of lexical roots, or stems (roots that have been expanded by a derivational suffix), followed by one or more inflectional suffixes which mark categories such as person (grammar), number (grammar), mood, tense, etc. There is no absolute limit on the length and extent of agglutination. Because of this, the words get very long and are difficult to translate at times. According to Today Translations[2], a British translation service, the Tamil word "செல்லாதிருப்பவர்" (sellaathiruppavar, meaning certain type of truancy †) is ranked 8th in The Most Untranslatable Word In The World list.
Peyarchol (roughly corresponding to Noun) is a broad classification in Tamil grammatical terminology , which includes common and proper nouns, numerals, pronouns and some adjectives. They inflect for case, person, number (singular and plural), and gender. There are two genders which are based on the natural gender of the referent. There is a notion of type called thinai, which makes a distinction between human and nonhuman. Nouns may be of type "rational" (e.g., nouns referring to men, deities, women in many dialects) or of type "irrational" (e.g., women and children in a few dialects, animals). The various combinations of gender, number and type are referred usually as aiympaal (five categories) to which a noun can belong. There are 8 cases viz nominative, accusative, dative, sociative, genitive, instrumental, locative, and ablative.
Tamil has no articles. Hence, definiteness and indefiniteness are usually signalled by other grammatical devices, such as the number "one", used as an indefinite article. Compound nouns are used as deictic pronouns (demonstratives), which are used to indicate objects close by, at a distance, and a kind of neutral. Sri Lankan Tamil has a fourth kind indicating medial distance.
Verbs are formally inflected mainly for mood and tense by a grammatical particle suffixed to the stem. Most verbs also mark affective and effective "voice" where the former indicates that the subject undergoes the action named by the stem, and the latter signals that the subject directs the action of the stem. The voices used here are not equivalent to the notions of "transitivity" or "causation". Mood is also reflected implicitly by grammatical formatives which also mark tense categories. These signal whether the verbal event is unreal, possible, potential, or a real, and actual. There are three simple tenses (past, present, and future), and a series of perfects.
Word order is usually Subject-Object-Verb (SOV). Even though case and postpositions are used to mark grammatical relations, word order is not completely free. Even where variation is allowed the verb in simple sentences must always come to the far right of the sentence, the exception being in poetry.
In addition, Tamil has a verbal category called "attitude" which is used to indicate the state of mind of the speaker and subjective attitude about the narrated event. Verb auxiliaries are used for this purpose. Examples of affected states projected are: pejorative opinion, antipathy, relief towards an unpleasant event having ended, undesirability about the result of an event etc.
Much of Tamil grammar is extensively described in the oldest available grammar book for Tamil, the Tolkaappiyam.
[edit] Vocabulary
Modern Tamil vocabulary still retains most of the words from classical Tamil. Due to this and because of the emphasis on learning classical works like Tirukkural, classical Tamil is comprehensible in various degrees to most native speakers. However, a number of Sanskrit loan words have been adapted and used commonly in Tamil. But, unlike some other Dravidian languages, these words are restricted mainly to spiritual terminology and abstract nouns. Besides, Sanskrit, there are a few loan words from Persian and Arabic implying trade ties in ancient times. Of late, English words are also being used freely in colloqial Tamil. Some modern technical terminology is borrowed from English, though attempts are being made to have a pure Tamil technical terminology. Many individuals, and some institutions like the Tamil Virtual University have generated technical dictionaries for Tamil.
There are also many instances of Tamil loan words in other languages. Popular examples are cheroot (churuttu meaning "rolled up"), mango and catamaran (from kattu maram meaning "bundled logs"). For more such words, see here.
[edit] Writing system
Main article: Tamil alphabet
The present script used to write Tamil text is believed to have evolved from the Brahmi script of the Ashokan era. Later, a southern variant of the Brahmi script evolved into the Grantha script, which was used to write both Sanskrit and Tamil texts. Around the 15th century, a new script called vettezhuthu (meaning letters that are cut) evolved in order to make it easy for creating inscriptions on stone. Some people also call this as vattezhuthu (meaning curved letters). Around 1935, Periyar suggested some changes to make it amenable for printing. Some of these suggestions were incorporated by the then MG Ramachandran government in 1975.
While the evolution of the script was happening, many Sanskrit words also began to be used in Tamil. To facilitate writing these words, some alphabets from the grantha script are still being retained. However, there are many purists who would argue against the use of such alphabets as there are well-defined rules in Tolkaappiyam for Tamilising loan words.
[edit] Examples
- Tamil: தமிழ்/Tamizh /tamiɮ/
- hello: வணக்கம்/Vanakkam /vanakːam/
- good-bye: சென்று வருகிறேன்/sentru varukireen /sentu varukireen/
- please: தயவு செய்து/dayavu seithu /tajavu sei/
- thank you: நன்றி/nandri /nantʕi/
- sorry: மன்னிக்கவும்/mannikkavum /
- that one: அது/adhu /a tʰu/
- how much?: எவ்வளவு/evvalavu /evːalavu/
- yes: ஆம்/aam /aːm/
- no: இல்லை/illai /ilːaj/
- I don't understand: எனக்குப் புரியவில்லை!/enakku puriyavillai/puri:yavi:lai/
- Where's the bathroom?: குளியலறை எங்கே உள்ளது!kulial arai enge erukkinrathu/kul:ial araI: enge erukki:nrathu/
- generic toast (not used in formal conversations and to elders): (Hey): டேய்!/dei /de:i/
- English: ஆங்கிலம்/aangilam /aːŋilam/
- Do you speak English?: நீங்கள் ஆங்கிலம் பேசுவீர்களா/'neengal aangilam pesuveerkala?' /nee:gal ang:lam pesuv:rkala/?
[edit] External links
[edit] General
- Ethnologue report
- Omniglot resource
- Rosetta Project Archive
- Language Museum report
- UCLA Tamil Profile
- A brief review of its history and characteristics
- International Forum for Information Technology in Tamil
[edit] Resources for learning
- UPENN course page - University of Pennsylvania's web based courses for learning Tamil
- Tamil virtual University has the largest collection of digitised Tamil literary works and web based courses for learning Tamil
[edit] Fonts and Encodings
- Unicode Chart - Unicode Chart for Tamil (PDF)
- Unicode Converter - Online JavaScript tool to convert text in various Tamil encodings into Unicode
- Tamil fonts - Links to download various Tamil fonts
- NLS Information - NLS Information page for Windows XP