ISO 639-3

From Wikipedia, the free encyclopedia

ISO 639-3 (ISO 639-3:2007) is an international standard for language codes. The standard describes three‐letter codes for identifying languages. It extends the ISO 639-2 alpha-3 codes with an aim to cover all known natural languages. The standard was published by ISO on 5 February 2007^[1].

It's intended for use in a wide range of applications, in particular computer systems where many languages need to be supported. It provides an enumeration of languages as complete as possible, including living and extinct, ancient and constructed, major and minor, written and unwritten.^[1] However, it does not include reconstructed languages such as Proto-Indo-European.^[2]

It is a superset of ISO 639-1 and of the individual languages in ISO 639-2. ISO 639-1 and ISO 639-2 focused on major languages, most frequently represented in the total body of the world's literature. Since ISO 639-2 also includes language collections, whereas Part 3 does not, ISO 639-3 is not a superset of ISO 639-2. Where B and T codes exist in ISO 639-2, it uses the T-codes.

Examples:

language	639-1	639-2 (B/T)	type	639-3
English	en	eng	individual	eng
German	de	ger/deu	individual	deu
Arabic	ar	ara	macro	arb + several others
Minnan	(zh-min-nan)		individual	nan

The final standard contains 7589 entries^[3]. The inventory of languages is based on a number of sources including: the individual languages contained in 639-2, modern languages from the Ethnologue 15th edition, historic varieties, ancient languages and artificial languages from Anthony Aristar at the Linguist List as well as languages recommended within a public commenting period.

A transition from ISO 639-1 could be done with List of ISO 639-1 codes.

1 Code space
2 Macrolanguages
3 Collective languages
4 History
5 ISO specifications that recommend ISO 639-3
6 See also
7 References
8 External links

[edit] Code space

Since the code is three-letter alphabetic, one upper bound for the number of languages that can be represented is 26 × 26 × 26 = 17576. Since ISO 639-2 defines special codes (2), a reserved range (520) and B-only codes (23), 545 codes cannot be used in part 3. Therefore a lower upper bound is 17576 - 545 = 17032.

The upper bound gets even lower if one subtracts the language collections defined in 639-2 and the ones yet to be defined in ISO 639-5.

[edit] Macrolanguages

Main article: ISO 639 macrolanguage

There are 56 languages in ISO 639-2 which are considered, for the purposes of the standard, to be "macrolanguages" in 639-3 ^[4].

Some of these macrolanguages had no individual language as defined by 639-3 in ISO 639-2, e.g. 'ara' (Generic Arabic). Others like 'nor' (Norwegian) had their two individual parts ('nno' (Nynorsk), 'nob' (Bokmål)) already in 639-2.

That means some languages (e.g. 'arb', Standard Arabic) that were considered by ISO 639-2 to be dialects of one language ('ara') are now in ISO 639-3 in certain contexts considered to be individual languages themselves.

This is an attempt to deal with varieties that may be linguistically distinct from each other, but are treated by their speakers as two forms of the same language, e.g. in cases of diglossia.

For example:

http://www.sil.org/iso639-3/documentation.asp?id=ara (Generic Arabic, 639-2)
http://www.sil.org/iso639-3/documentation.asp?id=arb (Standard Arabic, 639-3)

See ^[5] for the complete list.

[edit] Collective languages

Some ISO 639-2 codes that are commonly used for languages do not precisely represent a particular language or some related languages (as the above macrolanguages). They are regarded as collective languages (or collectives)^[6] and are excluded from ISO 639-3.

See also: ISO 639-2#Collective languages

[edit] History

Stages [1]:

2006-07-14 FDIS 50.00
2007-02-05 60.60

[edit] ISO specifications that recommend ISO 639-3

lexical markup framework

[edit] See also

ISO 639
List of ISO 639-3 codes

[edit] References

[edit] External links

Categories: ISO standards | ISO 639

ISO 639-3

From Wikipedia, the free encyclopedia

Contents

[edit] Code space

[edit] Macrolanguages

[edit] Collective languages

[edit] History

[edit] ISO specifications that recommend ISO 639-3

[edit] See also

[edit] References

[edit] External links

Views

Navigation

Interaction

Search

Languages