Talk:Languages of India

From Wikipedia, the free encyclopedia


WikiProject_India This article is within the scope of WikiProject India, which aims to improve Wikipedia's coverage of India-related topics. If you would like to participate, please visit the project page.
Stub This article has been rated as stub-Class on the quality scale. (see comments)
Archive
Archives

Contents


[edit] Suggested Improvements

  • Flesh out the main body of the article. How about a set of tables giving the number of people speaking each language as their first tongue?
  • Each state should be listed in a table, with the first (and, if possible second) language(s) listed
  • The detailed work on phonetics and alphabet should be moved to a sub-article once the main article has been expanded. It's really far too detailed to be of any use to a casual reader but I suspect it's not rigorous enough right now to avoid offending an expert.
  • Despite the suggestions above, we should avoid turning this into a massive set of lists (which is what has happened to the Native American languages article)

Sadly, I'm nowhere near knowledgeable enough to act on my own suggestions here, so I'll go and try to be more useful somewhere else. However, I'd love to see this article get on the main page someday... Kayman1uk 10:20, 6 April 2006 (UTC)

[edit] Help add input for Wikipedia:Naming conventions (Indic)

Help add input for Wikipedia:Naming conventions (Indic)--Dangerous-Boy 04:49, 4 May 2006 (UTC)

[edit] tamil aspirates

"This classification is observed in all the languages under discussion" - what about tamil, which doesn't even have aspirated consonants except when special characters are used for writing Sanskrit (Granthakshara)?--Grammatical error 06:47, 31 May 2006 (UTC)

Yes. Please reword it. See Tamil language#Phonology. I'm not an expert on such issues; but, one can ask Arvind for any clarifications. -- Sundar \talk \contribs 08:05, 31 May 2006 (UTC)

[edit] Cleanup

I've refactored the lead per WP:LEAD. This article needs a lot of improvement. Currently, beyond the lead, there's nothing except the alphabets. We need to somehow shed our inclination to mention data about individual languages and create a proper encyclopedic article on the subject at hand. We need good maps as in African languages and the layout could be a modified version of Gbe languages. Because all the Gbe languages are linguistically related, they were able to talk about language features, whereas, we need to have smaller summary subsections talking about features of the 4 linguistic families plus Andamanese languages. A good test for not wavering beyond the topic is the extent to which we avoid mention of individual languages in favour of language families. The lead section should ideally be the only place for their mention. -- Sundar \talk \contribs 08:27, 31 May 2006 (UTC)

Yes, you've hit the nail on the head, and good job on the lead, that looks very good. The alphabets section needs to be shortened (and/or split into writing systems and phonology which is what much of it is really about), and then you're right the article needs some expansion on the various families. They should get space relative to the number of speakers of each family, though not exactly proportional. The Gbe languages article is a good model for what to cover, but we should work on a proposed outline of what the article should ideally cover, then we can go do some research to get good sources to cite. General linguistis topics would be history, writing system, phonology, morphology, syntax (grammar), and maybe a bit on corpus linguistics and translation. I fear that if we cover that four times the article may be unwieldy, though maybe not it we don't create that many subsections for each of the language families. The smaller families could just have one or two paragraphs that summarize all of that. Should the major subsections be the topics I listed above or should it be the 4 or so language families and then cover those topics in each section? - Taxman Talk 11:46, 1 June 2006 (UTC)
Taxman, thanks for your outline above and the copyedit done by you. The outline sounds good. We could add a distribution map if there's a definitive source. This book and others from CIIL can be useful. This, being an important main article related to India, merits good attention. Hope more editors join in the effort to improve it. -- Sundar \talk \contribs 12:31, 1 June 2006 (UTC)
Ok, but which outline do you think would work better (my last question)? It would be hard to switch. - Taxman Talk 12:58, 1 June 2006 (UTC)
Thinking of the semantics, I lean in favour of the former. But, we need not have subsections for each language family; we could just have paragraphs. The other outline doesn't sound bad either. -- Sundar \talk \contribs 13:10, 1 June 2006 (UTC)
In principle, I support the former. The article will need to discuss how the language families have influenced each other in grammar (e.g. the Tolkappiyam's rather strained identification of seven cases), phonology (retroflexes in Indo-Aryan), morphology and vocabulary, and it will need to do so in the context of theories such as Murray Emeneau's model of the Indian linguistic area. It seems to me we can best do this with a structure that discusses the families together, rather than separately. -- Arvind 16:07, 5 June 2006 (UTC)

What would be the scope of the article? Languages native to India? If no,t we can also include Portuguese, French, (I don't know if Dutch was ever spoken in Kerala), and Aramaic. Pali seems to be absent, so too NE languages. =Nichalp «Talk»= 15:14, 1 June 2006 (UTC)

Good question. I suppose it depends on the data for the number of speakers. I would propose the coverage should be balanced by that and importance/ other factors. The article would be remiss without mention of English's role, but it seems like it would be better off without a linguistic coverage of it, instead just a survey of the role it has/had. Pali is Indo-European, so it should be covered in that context. What do you mean by NE - Northern European? And please comment on which approach to the layout you prefer based on the above. - Taxman Talk 15:28, 1 June 2006 (UTC)
NE = North East India, one of the most poorly documented regions of India. Sundar, maps shouldn't be a problem anymore since we've got a featured SVG map. Basic drawing using inkscape would solve the problem. Having subsections for each language family might lead to the page becoming too cluttered. =Nichalp «Talk»= 16:25, 1 June 2006 (UTC)
If we get some data on Portugeese, French, Dutch and possibly Hebrew speakers, we could, in the interests of completeness, make a mention in a single line or a short paragraph. Yes, even I've observed the poor coverage on NE here and elsewhere. Let's do proper justice to those language families as well. Glad to know that maps are not a problem. -- Sundar \talk \contribs 10:11, 3 June 2006 (UTC)
The Cochini jews had a special dialect, often called Judaeo-Malayalam which might be interesting enough to mention. We should also at least mention the existence of pockets of native speakers of Goan Portuguese (does it differ from "Standard Portuguese"?) and English since those have deep roots. Chinese and other immigrant languages probably don't merit a mention. -- Arvind 16:07, 5 June 2006 (UTC)

[edit] Removed section

I removed the following text as too detailed on one language and innacurate anyway:

Urdu is unique among Indian languages. Grammatically it is 'genetically' linked to the older language of Prakrit. Much of its vocabulary is derived from neighboring Arabic, Turkish, Farsi and Sanskrit. Indeed, Urdu is the Turkish word for "camp", "tent", or "military encampment". Urdu arose due to contact between the Mughal armies and speakers of the local derivatives of Sanskrit and Prakrit. It has since evolved into a rich independent language. The modern Urdu script evolved from the Arabic script. It was introduced via Persia by invading Mughal armies, and was fitted to the local Indian phonology. Thus, even though Urdu is deeply connected with other Indian languages, and its phonology differes from that of Hindi by only six sounds, its script shows no influence from neighboring Indian alphabets.

It has grains of truth but makes it sound like Urdu is unrelated to Hindi, which no scholars would support. Besides it's too much detail for one language and I'm not sure it should be in even if properly balanced. I made other changes to start working towards what was discussed above. I'll need to go get some more sources to do much more. - Taxman Talk 18:27, 2 June 2006 (UTC)

Agree with you, Taxman. -- Sundar \talk \contribs 10:12, 3 June 2006 (UTC)

[edit] Telugu

I have removed "italian of the east" . Telugu is much more sweeter and italian does not stand anywhere near it. There is no need to add such old colonial phrases in the languages of India article.Bharatveer 14:11, 5 June 2006 (UTC)

[edit] Maps

Can Nichalp or someone else go to this site, select "culture" in the "journey highlights" and grab the information required for creating maps for linguistic distribution, by clicking on "modern language distribution" etc., Since it's in flash, I'm not able to get absolute URLs. -- Sundar \talk \contribs 14:14, 5 June 2006 (UTC)

I've uploaded the screenshot of an enlarged version (showing Asia) here. -- thunderboltza.k.a.Deepu Joseph |TALK 16:05, 5 June 2006 (UTC)

[edit] Going forward

Let's start working with the layout suggested by Taxman above. Each of us shall take up some tasks and take the article forward.

  • For writing systems, which are the ones prevailing here? Brahmic scripts, Konyak orthography[1] then?
  • Language mutual influence (examples have been cited by Arvind above)

A number of ebooks are available at CIIL's site.

By the way, another article languages in India would have a different scope and perspective, wouldn't it? I can imagine that article talking about the language movements, influence on our polity, states reorganisation, political integration, language law [2] etc., Pretty interesting, isn't it? -- Sundar \talk \contribs 07:06, 17 June 2006 (UTC)

Oughtn't we to also discuss some of that here? I'd think the article would be incomplete if it didn't at least summarise the basics of the legal and social status of the various languages in India today.
And shall we try to put together a more detailed outline here first, before going on to actually write it? -- Arvind 10:12, 17 June 2006 (UTC)
Sure. Can you place a tentative outline at /Draft? -- Sundar \talk \contribs 13:43, 17 June 2006 (UTC)

Shouldn't classification be the first section in order to introduce the families? That could use a nice table of languages and their classification and perhaps a map or a chart too. Anyone? -- Sundar \talk \contribs 12:33, 8 July 2006 (UTC)

[edit] Notes

  1. ^ http://www.ciil-ebooks.net/html/konyak/index1.html
  2. ^ http://www.ciil-ebooks.net/html/langLaw/coverpage.html

[edit] Gender vs. measure words

Bengali language has numerical classifiers similar to the East Asian languages, and does not have masculine/feminine grammatical gender like most Indian and European languages do. Is this also true of other languages in eastern India? --JWB 17:40, 14 August 2006 (UTC)

[edit] Less mentions of Marathi language

Surprised to see that there are very few mentions of Marathi language in this article.There's very few information given about Marathi.Im not a expert but perhaps what's relevant to Hindi,Bengali,Punjabi and Gujarati is also obvious to Marathi,but those mentions have not been given.

Marathi is an important language hence please give an comprehensive information about it here(just like Tamil/Kannada and Hindi). (mahawiki 20:02, 27 August 2006 (UTC))

In an article on a country with so many languages there is very little space for each individual langauge. The answer is probably not to add more information on a particluar language, but to remove some of the mentions of others and replace it with general information about language families. The only time a specific language should be mentioned is when some unique feature of them is important enough to justify it. - Taxman Talk 14:01, 28 August 2006 (UTC)