Talk:ISO 11940

From Wikipedia, the free encyclopedia

	This article falls within the scope of WikiProject Writing systems, a WikiProject interested in improving the encyclopaedic coverage and content of articles relating to writing systems on Wikipedia. If you would like to help out, you are welcome to drop by the project page and/or leave a query at the project’s talk page.
???	This article has not yet been assigned a rating on the Project’s quality scale.
???	This article has not yet been assigned a rating on the Project’s importance scale.

[edit] Transliteration

Good thing there is now some info on this standard. Since ISO charges high amounts for their standards, I never had the opportunity of seeing it. It seems to be a transliteration in the narrow sense, from which the original spelling can be reconstructed. Apart from that it seems much stricter than, but quite compatible with RTGS. Perhaps it would be useful to add some comments on the comparison.

I have never seen this standard used in practice anywhere. Do you know of any instances?

Small note: perhaps you missed a diacritic on one of the "y"s.

Do you have any information on the vowels? −Woodstone 14:41, 8 April 2007 (UTC)

I have no idea what I am doing here... As you say, the standard is not in wide use, and I had a hard time googling it. All that turns up are sites trying to sell you the standard for 56 dollars or so. I ask you! My only source for this is the xml file at unicode. It contains more information, and it conscientiously tells us when it is deviating from ISO. Is it is, this is all unverified. From what I found elsewhere on the net, yes, this is definitely intended as a standard allowing the reconstruction of the original spelling from the romanization (unlike RTGS). dab (𒁳) 23:29, 9 April 2007 (UTC)

Thanks for finding this file. Very informative. I see you started on the vowel section. I guess that the vowels are written in the same sequence as in Thai? Relative to the consonants vowels are not spoken in the sequence as written in Thai script and many vowel sounds are written with several symbols, located before, after, below and above the consonant (cluster) that precedes them in pronunciation. So deciphering any text from this transliteration will be a challenge. Finding the syllable boundaries will be a nightmare. −Woodstone 09:56, 10 April 2007 (UTC)

no, I am sure the vowels in transliteration will be in phonological order. The xml file has some special rules for this afaics. Anything else would be insane, and without precedent in ISO 15919. It's p̣hās̄ʹāthịy, not p̣hās̄ʹāịthy. dab (𒁳) 13:29, 10 April 2007 (UTC)

I hope you guys find my additions helpful. The 'xml file' (ICU or CLDR to me) has several divergences from the standard - that's what impelled me to buy a copy of the standard. (I'm now working on a bug report against it.) It's based on someone's documentation of the system, not on the standard itself. I added the example of Chiangmai, because that looks bizarre whichever option you choose for vowel reordering.

I'd've liked to have compared the transliterations of ชุมนุม spelt with nikkhahit, but Windows XP doesn't render the sequence <sara u, nikhahit>. The two implementations give ch̊un̊u and chůnů respectively!

You may wonder why I mentioned dandas. The reason is that the Unicode Technical Committee, or at least, senior members, have agreed that using dandas in Romanised text is perfectly reasonable. The dandas to use are the ones named as Devanagari - the Unicode Character Database categorises them as 'common', i.e. used in multiple scripts.

The table for 'other diacritic marks' looks a mess. Perhaps we need a bearer - น/n probably has fewest typographical problems.RichardW57 20:41, 16 May 2007 (UTC)

Good work. I've always wondered why ISO tries to keep their standards secret by charging high amounts for them. As I guessed above, the vowels are in spelling order (not phonetic), making the transliterations quite difficult to read. In the vowel section the อ is missing: does it re-use the symbol x given for the silent consonant? The other weird choice is v for ฤ, mentioned both as consonant and as vowel. Quaint is also keeping the h as class indicator. The only deviation from RTGS in consonants, apart from the diacritics, seems to be c for จ (RTGS has ch). I need more time to let it sink in. As bearer for some vowels and diacritics, we have used "–"(–) in other Thai spelling articles. −Woodstone 22:54, 16 May 2007 (UTC)

I didn't follow the standard's partitioning of consonants and vowels. It classifies them into the exclusive categories of the 46 consonants (i.e including ฤ and ฦ), the 9 free-standing vowels (including sara am and lakkhangyao), 3 'vowels' below the line (sara u, sara uu and phinthu), 7 vowels above the line (including maitaikhu and nikkhahit, but not yamakkan), the 5 marks that can appear above others (tone marks and thanthakhat/karan), the digits, and the 'special markers'. The latter are all puncutation except yamakkan, which I think should have been included in the vowels above the line. I thought it more useful to put ฤ and ฦ in both the consonant and the vowel lists, and less misleading to only show lakkhangyao in the combinations ฟๅ and ฦๅ. However, it wouldn't surprise me if some minority language had an orthography in which lakkhangyao is a proper vowel. There's one language which uses karan as a sort of tone mark!

Mechanically identifying the function of อ as consonant, vowel or class indicator is not easy. Although it is a vowel in the word ผงอบ /phàʔ ŋɔ̀ːp/ 'extremely weak', it's not obvious why one can't translate 'baking powder' as ผงอบ /phǒŋ ʔòp/. I've always wondered about the old spelling 'Yuthia' of 'Ayutthaya'. Was อ being misinterpreted as a class indicator? The consonant combination อย used to be much commoner, and represented a pre-glottalised palatal glide (source: Fang-Kuei Li's 'A Handbook of Comparative Tai'). Similarly, there is no reason to disbelieve that the class indicator ห was not once a full consonant, as in Old English hnutu 'nut' or hlud 'loud'. Again, the difference cannot always be detected - there are two words แหน, pronounced /nɛ̌ː/ and /hɛ̌ːn/ respectively!
(preceding unsigned edit is marked 2007-05-18T00:07:54 RichardW57)

I suspect the choice of v for ฤ is inspired by the Thai cursive (Roman) 'r'. The latter can look rather like 'seagull' (e.g. U+033C), possibly following the example of cursive kho khuat, which can look like a cross between small gamma and ram's horns (ɤ). I don't know how to verify this idea. RichardW57 18:39, 18 May 2007 (UTC)

For the transliterations of ศ and ษ, see extract for ว ศ and ษ.

How do you feeling about displaying the consonants with the extended vargas lined up? I'm tempted to copy the sibilants to their rightful places, thus:

Thai		ก	ข	ฃ	ค	ฅ	ฆ	ง			จ	ฉ	ศ	ช	ซ	ฌ	ญ
ISO		k	k̄h	ḳ̄h	kh	k̛h	ḳh	ng			c	c̄h	ṣ̄	ch	s	c̣h	ỵ

Thai	ฎ	ฏ	ฐ	ษ	ฑ		ฒ	ณ		ด	ต	ถ	ส	ท		ธ	น
ISO	ḍ	ṭ	ṭ̄h	s̛̄	ṯh		t̛h	ṇ		d	t	t̄h	s̄	th		ṭh	n

Thai	บ	ป	ผ	ฝ	พ	ฟ	ภ	ม
ISO	b	p	p̄h	f̄	ph	f	p̣h	m

Thai	ย	ร	ฤ	ล	ฦ	ว	ศ	ษ	ส	ห	ฬ	อ	ฮ
ISO	y	r	v	l	ł	w	ṣ̄	s̛̄	s̄	h̄	ḷ	x	ḥ

Having one varga (วรรค) per line would make the table too tall. RichardW57 09:35, 20 May 2007 (UTC)

[edit] Tabular form

I have found the following layout of Thai consonants useful to see patterns. I think it would show the principles of the ISO system quite well, but I have no time now to work them in. They could replace (or be added to) the RTGS columns.

		labial		dental		guttural
class	forming	rtgs	thai	rtgs	thai	rtgs	thai
high	aspirated	ph	พ ภ	th	ท ฑ ฒ ธ	kh	ค ฆ
	fricative	f	ฟ	s	ซ	h	ฮ
	nasal	m	ม	n	น ณ	ng	ง
	semivowel	w	ว	y	ย ญ
	liquid			l	ล ฬ	r	ร
	palatalised			ch	ช ฌ
mid	voiceless	p	ป	t	ต ฎ	k	ก
	voiced	b	บ	d	ด ฏ	kh
	palatalised			j	จ	-	อ
low	aspirated	ph	ผ	th	ถ ฐ	kh	ข
	fricative	f	ฝ	s	ส ศ ษ	h	ห
	palatalised			ch	ฉ

−Woodstone 13:28, 20 May 2007 (UTC)

I think you're looking for something like this:

class	manner	Thai	ISO	Thai	ISO	Thai	ISO	Thai	ISO	Thai	ISO	Thai	ISO
		velar		palatal		former retroflex		dental		labial		glottal
mid	voiced					ฎ	ḍ	ด	d	บ	b
mid	voiceless	ก	k	จ	c	ฏ	ṭ	ต	t	ป	p	อ	x
high	aspirated	ข	k̄h	ฉ	c̄h	ฐ	ṭ̄h	ถ	t̄h	ผ	p̄h	ห	h̄
high	once fricative	ข	ḳ̄h	ศ	ṣ̄	ษ	s̛̄	ส	s̄	ฝ	f̄	ห	h̄
low (once all voiced)	formerly unaspirated	ค	kh	ช	ch	ฑ	ṯh	ท	th	พ	ph	ฮ	ḥ
	fricative	ฅ	k̛h	ซ	s					ฟ	f	ฮ	ḥ
	forever aspirated	ฆ	ḳh	ฌ	c̣h	ฒ	t̛h	ธ	ṭh	ภ	p̣h
	nasal	ง	ng	ญ	ỵ	ณ	ṇ	น	n	ม	m
	semivowel			ย	y	ร	r	ล	l	ว	w
	semivowel 2					ฬ	ḷ

Including a glottal order might actually be unhelpful. Note that this table doesn't include ฤ and ฦ. RichardW57 23:37, 20 May 2007 (UTC)

Talk:ISO 11940

From Wikipedia, the free encyclopedia

[edit] Transliteration

[edit] Tabular form

Views

Navigation

Interaction

Search