List of Latin-script alphabets

The tables below summarize and compare the letter inventory of some of the Latin-script alphabets. In this article, the scope of the word "alphabet" is broadened to include letters with tone marks, and other diacritics used to represent a wide range of orthographic traditions, without regard to whether or how they are sequenced in their alphabet or the table.

Letters contained in the ISO basic Latin alphabet

Alphabets that contain all ISO basic Latin letters

The International Phonetic Alphabet (IPA) includes all 26 letters in their lowercase forms, although g is always single-storey (ɡ) in the IPA and never double-storey ().

Among alphabets for natural languages the Afrikaans,[54] Aromanian, Aymara, Basque,[4] Breton, British, Catalan,[6] Cornish, Czech,[8] Danish,[9] Dutch,[10] Emilian-Romagnol, English,[36] Estonian, Extremaduran, Fala, Filipino,[11] Finnish, French,[12], Fula,[41] Galician,[33] German,[13] Greenlandic, Hungarian,[15] Indonesian, Inari Sami, Iranian Turkic, Italian, Karakalpak,[23] Kazakh,[38] Kurdish, Modern Latin, Lule Sami, Malay, Mirandese, Norwegian,[9], Portuguese, Quechua, Rhaeto-Romance, Romanian, Skolt Sami, Slovak,[24], Southern Sami, Spanish,[25] Swedish, Tswana,[52] Tunisian Arabic,[58] Venda,[51] Võro, Walloon,[27] West Frisian, Xhosa, Zazaki, and Zulu alphabets include all 26 letters, at least in their largest version.

Among alphabets for constructed languages the Ido, Interglossa, Interlingua, Occidental alphabets include all 26 letters.

Alphabets that do not contain all ISO basic Latin letters

This list is based on official definitions of each alphabet. Still, missing letters might occur in non-integrated loan words and place names.

Reduced usage of the letters of the ISO basic Latin alphabet[1] (A–Z) in various alphabets:
Alphabet A B C D E F G H I J K L M N O P Q R S T U V W X Y Z #
Classical Latin[2] ABCDEFGHI KLMNOPQRST V XYZ23
Albanian[3] ABCDEFGHIJKLMNOPQRSTUV XYZ25
Anglo-Saxon ABCDEFGHI KLMNOPQRSTU XYZ23
Arbëresh ABCDEFGHIJKLMNOPQRSTUV X Z24
Asturian ABCDEFGHI LMNOPQRSTUV XYZ23
Azeri[53] ABCDEFGHIJKLMNOPQRSTUV XYZ25
Bambara[39] ABCDEFGHIJKLMNOP RSTU W YZ23
Belarusian[5] ABCDEFGHIJKLMNOP RSTUV YZ23
Berber ABCDEFGHIJKLMN QRSTU WXYZ23
Bislama[45] AB DEFGHIJKLMNOP RSTUVW Y 22
Chamorro[43] ABCDEFGHI KLMNOP RSTU Y 20
Chewa ABCDEFGHIJKLMNOP RSTUVW YZ25
Corsican[31] ABCDEFGHIJ LMNOPQRSTUV Z22
Crimean Tatar ABCDEFGHIJKLMNOPQRSTUV YZ24
Croatian[7] ABCDEFGHIJKLMNOP RSTUV Z22
Dakelh ABCDEFGHIJKLMNO RSTU W YZ22
Cypriot Arabic[59] ABCDEFG IJKLMNOP RSTUVWXYZ23
Dakota ABCDEFGHIJK MNOP STU W YZ20
Dalecarlian AB DEFGHIJKLMNOP RSTUVW Y 22
Dinka[40] ABCDE GHIJKLMNOP R TU W Y 20
Esperanto ABCDEFGHIJKLMNOP RSTUV Z22
Traditional Fala ABCDEFGHIJ LMNOPQRSTUV X Z23
Faroese AB DEFGHIJKLMNOP RSTUV Y 21
Friulian ABCDEFGHIJ LMNOPQRSTUV Z22
Gagauz ABCDEFGHIJKLMNOP RSTUV YZ23
Traditional Galician[33] ABCDEFGHI LMNOPQRSTUV X Z22
Gilbertese AB E I K MNO R TU W 12
Traditional Greenlandic A EFGHIJKLMNOPQRSTUV 19
Glosa ABCDEFGHIJKLMNOPQRSTUVWX Z25
Guaraní[14] ABCDE GHIJKLMNOP RSTUV Y 18
Hän ABCDE GHIJKLMNOP RSTU W YZ22
Haitian ABCDEFGHIJKLMNOP RSTUVW YZ24
Hausa[30] ABCDEFGHIJKLMNO RSTU W YZ22
Hawaiian A E HI KLMNOP U W 12
Icelandic AB DEFGHIJKLMNOP RSTUV XY 22
Igbo[42] ABCDEFGHIJKLMNOP RSTUVW YZ24
Irish[16] ABCDEFGHI LMNOP RSTUV Z20
Traditional Italian[17] ABCDEFGHI LMNOPQRSTUV Z21
Kashubian ABCDEFGHIJKLMNOP RSTU W YZ23
Latvian[18] ABCDEFGHIJKLMNOP RSTUV YZ23
Lithuanian[19] ABCDEFGHIJKLMNOP RSTUV YZ23
Livonian[46] AB DEFGHIJKLMNOP RSTUV YZ23
Malagasy AB DEFGHIJKLMNOP RST V YZ21
Maltese[20] AB DEFGHIJKLMNOPQRSTUVWX Z24
Māori[34] A E GHI K MNOP R TU W 14
Marshallese[47] AB DE IJKLMNOP R TU W Y 17
Traditional Mirandese ABCDEFGHIJ LMNOPQRSTU XYZ23
Mohawk[35] A E HI K NO RST W Y 12
Montenegrin[7] ABCDEFGHIJKLMNOP RSTUV Z22
Na'vi[57][2] A EFGHI KLMNOP RSTUVWXYZ21
Navajo ABCDE GHIJKLMNO ST WXYZ20
Northern Sami ABCDEFGHIJKLMNOP RSTUV Z22
Nuxalk A C HI KLMN PQ STU WXY 16
Occitan ABCDEFGHIJ LMNOPQRSTUV X Z23
Pan-Nigerian ABCDEFGHIJKLMNOP RSTUVW YZ24
Piedmontese ABCDEFGHIJ LMNOPQRSTUV Z22
Pirahã AB GHI K OP ST X 11
Pinyin[32] ABCDEFGHIJKLMNOPQRSTU WXYZ25
Polish[22] ABCDEFGHIJKLMNOP RSTU W YZ23
Traditional Portuguese ABCDEFGHIJ LMNOPQRSTUV X Z23
Romani[29] ABCDEFGHIJKLMNOP RSTUV X Z23
Rotokas A E G I K OP RSTUV 12
Samoan A EFG I LMNOP STUV 14
Sardinian ABCDEFGHIJKLMNOP RSTUV XYZ24
Scots Gaelic ABCDEFGHI LMNOP RSTU 18
Serbian[7] ABCDEFGHIJKLMNOP RSTUV Z22
Shona ABCDEFGHIJKLMNOP RSTUVW YZ24
Sicilian ABCDEFGHIJ LMNOPQRSTUV Z22
Slovenian ABCDEFGHIJKLMNOP RSTUV Z22
Somali ABCDEFGHIJKLMNO QRSTU WXY 23
Sorbian ABCDEFGHIJKLMNOP RSTU W YZ23
Swahili ABCDEFGHIJKLMNOP RSTUVW YZ24
Tagalog[11] AB DE GHI KLMNOP RSTU W Y 19
Tahitian A EF HI MNOP R TUV 13
Tetum AB DEFGHIJKLMNOP RSTUVWX Z23
Tongan A EFGHI KLMNOP STUV 16
Turkish ABCDEFGHIJKLMNOP RSTUV YZ23
Turkmen[55] AB DEFGHIJKLMNOP RSTU W YZ22
Ulithian[49] ABCDEFGHI KLMNOP RSTU W Y 21
Uzbek[25] ABCDEFGHIJKLMNOPQRSTUV XYZ25
Vietnamese[26] ABCDE GHI KLMNOPQRSTUV XY 22
Volapük ABCDEFGHIJKLMNOP RSTUVWXYZ25
Welsh[28] ABCDEFGHIJ LMNOP RSTU W Y 21
Wolof ABCDEFGHIJKLMNOPQRSTU WXY 24
Yapese[50] ABCDEFGHIJKLMNOPQRSTU W Y 23
Yoruba[44] AB DEFGHIJKLMNOP RSTU W Y 21
count 7264546270596769725258656970706523656871 674730244450

Note: The I is used in two distinct versions in Turkic languages, dotless (I ı) and dotted (İ i). They are considered different letters, and case conversion must take care to preserve the distinction. Note that Irish traditionally does not write the dot, or tittle, over the small letter i, but the language makes no distinction here if a dot is displayed, so no specific encoding and special case conversion rule is needed like for Turkic alphabets.

Statistics

The chart above lists a variety of alphabets that do not officially contain all 26 letters of the ISO basic Latin alphabet. In this list, two letters are used by all of them: A and I. For each of the 26 basic ISO Latin alphabet letters, the number of alphabets in the list above using it is as follows:

Letter A I T E N O H M S G U L P R B D F K C J Z V Y W X Q
Alphabets 72 72 71 70 70 70 69 69 68 67 67 65 65 65 64 62 59 58 54 52 50 47 44 30 24 23

Letters not contained in the ISO basic Latin alphabet

Some languages have extended the Latin alphabet with ligatures, modified letters, or digraphs. These symbols are listed below. The characters in the following tables may not all render, depending on which operating system and browser version are used, and the presence or absence of Unicode fonts.

Additional letters by type

Independent letters and ligatures

Additional base letters Æ Ð Ǝ Ə Ɛ Ɣ I IJ Ɩ Ŋ Œ Ɔ Ʊ (K‘) [13] Ʃ[13] Þ Ʋ Ƿ Ȝ Ʒ ʔ
æɑðǝəɛɣıijɩŋœɔʊĸßʃþʋƿȝʒʔ
Anglo-Saxon Æ Ð Œ Þ ǷȜ
Azeri[53] Ə
Bambara[39] Ɛ Ŋ Ɔ
Berber Ɛ
Crimean Tatar ı
Dalecarlian Ð
Danish[9]
Norwegian[9]
Swedish[21]
Southern Sami (Norway)
Æ
Dinka ƐƔ Ŋ Ɔ
Dutch[10] IJ
Faroese Æ Ð
Greenlandic Æ ĸ
German[13]
Luxembourgish
ß
Icelandic
Norn
Æ Ð Þ
Iranian Turkic Ə Ŋ
Celtic British
English[36]
French[12]
Latin[2]
Æ Œ
Inari Sami
Northern Sami
Fula[41]
Ŋ
Skolt Sami Ŋ Ʒ
Pan-Nigerian Ǝ
Turkish ı
Alphabet of Cameroon Æ ƏƐ ŊŒƆ
Alphabet of Benin Ǝ ƐƔ Ŋ ƆƱ Ʋ
Alphabet of Burkina Faso Ǝ Ɛ ƖŊ Ɔ Ʋ
Alphabet of Chad ƏƐ Ŋ Ɔ
Alphabet of Côte d'Ivoire Ɛ ƖŊ ƆƱ ʔ
Scientific Alphabet of Gabon ÐƎ ƐƔ Ŋ Ɔ Ʃ Ʒʔ
Alphabet of Mali Ǝ ƐƔ Ŋ Ɔ ʔ
Alphabet of Niger Ǝ Ɣ Ŋ
Alphabet of Zaïre Ɛ Ɔ

Letter–diacritic combinations: connected or overlaid

Modified letters Ą Ɓ Ç Đ Ɗ Ɖ Ę Ȩ Ə̧ Ɛ̧ Ƒ Ǥ Ɠ Ħ Į Ɨ Ɨ̧ Ƙ Ł Ɲ Ǫ Ø Ơ Ɔ̧ Ɍ Ş Ţ Ŧ Ų Ư Ʉ Ƴ
ąɓçđɗɖęȩə̧ɛ̧ƒǥɠħɦįɨɨ̧ƙłɲǫøơɔ̧ɍşţŧųưʉƴ
Hän
Navajo
Ą Ę Į Ł Ǫ
Lithuanian[19] Ą Ę Į Ų
Dalecarlian Ą Ę Į Ų
Kashubian
Polish[22]
Ą Ę Ł
Pan-Nigerian Ɓ Ɗ Ƙ
Bambara[39]
Dinka[40]
Ɲ
Marshallese[47]
Fula[41] Ɓ Ɗ Ɠ Ɲ Ƴ
Hausa[30] Ɓ Ɗ Ƙ Ƴ
Alphabet of Cameroon Ɓ Ɗ ȨƏ̧Ɛ̧ ƗƗ̧ Ø Ɔ̧ Ƴ
Alphabet of Benin Ɖ Ƒ
Alphabet of Burkina Faso ƁÇ Ɗ Ƴ
Alphabet of Chad Ɓ Ɗ Ɨ Ƴ
Scientific Alphabet of Gabon Ɖ Ɍ
Alphabet of Mali Ɓ Ɗ Ɲ Ƴ
Alphabet of Niger Ɓ Ɗ Ƙ Ɲ Ɍ Ƴ
Albanian[3]
Arbëresh
Basque,[4] Catalan[6]
English[36]
Extremaduran
Fala
French[12]
Friulian
German[13]
Mirandese
Norwegian,[9]
Occitan
Portuguese[23]
Spanish[25]
Walloon[27]
Ç
Azeri[53]
Crimean Tatar
Kazakh[38]
Kurdish
Turkish
Turkmen[55]
Zazaki
Ç Ş
Gagauz Ç ŞŢ
Croatian[7]
Inari Sami
Montenegrin[7]Slovenian
Đ
Northern Sami Đ Ŧ
Skolt Sami Đ Ǥ
Vietnamese[26] Đ Ơ Ư
Maltese[20] Ħ
Tunisian Arabic[58 Đ Ħ Ɍ Ŧ
Cypriot Arabic[59] Ş
Belarusian[5]
Sorbian
Ł
Danish[9]
Faroese
Greenlandic
Norn
Norwegian[9]Southern Sami (Norway)
Ø
Romanian[10] ŞŢ

Other letters in collation order

Note: The tables below are a work in progress. Eventually, table cells with light blue shading will indicate letter forms which do not constitute distinct letters in their associated alphabets. Please help with this task if you have the required linguistic knowledge and technical editing skill.

For the order in which the characters are sorted in each alphabet, see Collating sequence.

Letters derived from A–H

Letter-diacritic combinations (detached) in various Latin alphabets (A–H)
Alphabet Á À Ȧ Â Ä Ǟ Ǎ Ă Ā Ã Å Ǻ Ǽ Ǣ Ć Ċ Ĉ Č Ď É È Ė Ê Ë Ě Ĕ Ē Ġ Ĝ Ǧ Ğ Ģ Ĥ
áàȧâäǟǎăāãåǻǽǣćċĉčďéèėêëěĕēġĝǧğģĥ
Latin[2] ĂĀ ĔĒ
Afrikaans[54] Á ÉÈ ÊË
Albanian[3] Â ÊË
Alemannic ÁÀ ÂÄ Å ÉÈ Ê
Anglo-Saxon Ā Ǣ Ē
Arbëresh Á É Ë
Aromanian Ã
Asturian Á É
Austro-Bavarian ÁÀ ÂÄ Å ÉÈ Ê
Aymara Ä
Azeri[53] Ä Ğ
Belarusian[5] Ć Č
Berber Â Č Ġ Ǧ
Bislama[45] É
Breton Â É Ê
Catalan[6] À ÉÈ
Celtic British ĂĀ ĔĒ
Chamorro[43] Å
Corsican[31] À È
Crimean Tatar  Ğ
Croatian[7] Ć Č
Cypriot Arabic[59] Ċ Ġ
Czech[8] Á ČĎ É Ě
Dalecarlian Ä Å
Danish[9] Á ÅǺǼ É
Dutch[10] ÁÀ ÂÄ ÉÈ ÊË
Emilian-Romagnol À ÂÄ Å ÉÈ ÊË Ē
English[36] À Â Ä Å ÉÈ ÊË
Esperanto Ĉ Ĝ Ĥ
Estonian Ä
Fala Á Ã É
Faroese Á
Filipino[11] ÁÀ Â ÉÈ Ê
Finnish Ä Å
French[12] À Â ÉÈ ÊË
Friulian À Â È Ê
Gagauz Ä Ê
Galician[33] Á É
German[13] ÁÀ ÂÄ ÉÈ Ê
Greenlandic Á Â ÃÅ É Ê
Guaraní[14] Á Ã É
Haitian À È
Hän À ÂÄ Ǎ È ÊËĚ
Hawaiian Ā Ē
Hungarian[15] Á É
Icelandic Á É
Igbo ÁÀ ÉÈ
Inari Sami Á ÂÄ Å Č
Iranian Turkic Ā Č Ē Ǧ
Irish[16] Á É
Italian[17] ÁÀ ÉÈ
Kashubian Ã Ć É Ë
Kazakh[38] Ä É Ğ
Kurdish Ê
Kurdish (IS) É
Latvian[18] Ā Č Ē Ģ
Lithuanian[19] Č Ė
Livonian[46] ÄǞ Ā Ē
Lule Sami Á Ä Å
Luxembourgish ÂÄ ÉÈ ÊË
Malagasy ÁÀ Â È Ê
Maltese[20] À Ċ È Ġ
Māori Ā Ē
Marshallese[47] Ā
Mirandese Á É Ê
Mohawk ÁÀ ÉÈ
Montenegrin[7] Ć Č
Na'vi[57] Ä
Navajo Á É
Norn Á Å É
Northern Sami Á Č
Norwegian[9] ÁÀ ÂÄ Å ÉÈ ÊË
Occitan ÁÀ ÉÈ
Pan-Nigerian
Piedmontese[37] À ÉÈ Ë
Pinyin[32] ÁÀ Ǎ Ā ÉÈ Ě Ē
Polish[22] Ć
Portuguese[23] ÁÀ Â Ã ÉÈ Ê
Rhaeto-Romance À ÉÈ
Romani[29] Č
Romanian  Ă
Samoan Á Ā É Ē
Sardinian ÁÀ ÉÈ
Scots Gaelic ÁÀ ÉÈ
Serbian[7] Ć Č
Sicilian À Â È Ê
Skolt Sami ÂÄ Å Č Ǧ
Slovak[24] Á Ä ČĎ É Ě
Slovenian ÁÀ Ä Ć Č ÉÈ Ê
Sorbian Ć Č Ě
Southern Sami (Norway) Å
Southern Sami (Sweden) Ä Å
Spanish[25] Á É
Swedish[21] ÁÀ Ä Å ÉÈ Ë
Tahitian Ā Ē
Tetum Á É
Tongan Á Ā É Ē
Tswana[52] Ê
Tunisian Arabic[58 À Ċ È Ġ
Turkish  Ğ
Turkmen[55] Ä
Ulithian[49] Ȧ Ė
Venda[51] Á É
Vietnamese[26] ÁÀ Â Ă Ã ÉÈ Ê
Volapük Ä
Võro Ä
Walloon[27] À Â Å ÉÈ ÊË
Welsh[28] ÁÀ ÂÄ ÉÈ ÊË
West Frisian ÂÄ É ÊË
Wolof À É Ë
Xhosa ÁÀ ÂÄ ÉÈ ÊË
Yapese[50] Ä Ë
Yoruba[56] ÁÀ Â Ǎ Ã ÉÈ Ê Ě
Zazaki Ê Ğ

Letters derived from I–O

Letter-diacritic combinations (detached) in various Latin alphabets (I–O)
Alphabet Í Ì İ Î Ï Ǐ Ĭ Ī Ĩ Ĵ Ķ Ǩ Ĺ Ļ Ľ Ŀ ʼN Ń Ň Ñ Ņ Ó Ò Ȯ Ȱ Ô Ö Ȫ Ǒ Ŏ Ō Õ Ȭ Ő Ǿ Ơ
íìiîïǐĭīĩĵķǩĺļľŀʼnńňñņóòôȯȱöȫǒŏōõȭőǿơ
Latin[2] ĬĪ ŎŌ
Afrikaans[54] Í ÎÏ ʼn Ó Ô
Albanian[3] Î Ô
Alemannic ÍÌ Î ÓÒ ÔÖ
Anglo-Saxon Ī Ō
Arbëresh Í ÓÒ Ô
Asturian Í Ñ Ó
Austro-Bavarian ÍÌ Î ÓÒ ÔÖ
Aymara Ï Ñ
Azeri[53] İ Ö
Basque[4] Ñ
Belarusian[5] Ń
Berber
Bislama[45] Ï
Breton Î Ñ Ô
Catalan[6] Í Ï Ŀ Ñ ÓÒ
Celtic British ĬĪ ŎŌ
Chamorro[43] Ñ
Corsican[31] Ì Ï Ò
Crimean Tatar İ Ñ Ö
Czech[8] Í Ň Ó
Dalecarlian Ö
Danish[9] Í Ó Ǿ
Dutch[10] ÍÌ ÎÏ ÓÒ ÔÖ
Emilian-Romagnol Ì Î ÓÒ ÔÖ Ō
English[36] ÎÏ Ó ÔÖ
Esperanto Ĵ
Estonian Ö Õ
Extremaduran Ñ
Fala Í Ĩ Ó Õ
Faroese Í Ó
Filipino[11] ÍÌ Î Ñ ÓÒ
Finnish Ö
French[12] ÎÏ Ô
Friulian Ì Î Ò Ô
Fula[41] Ñ
Gagauz İ Ñ Ö
Galician[33] Í Ñ Ó
German[13] Ñ Ö
Greenlandic Í Î Ĩ Ô
Guaraní[14] Í Ĩ Ñ Ó Õ
Haitian Ò
Hän Ì Î Ǐ Ò Ô Ǒ
Hawaiian Ī Ō
Hungarian[15] Í Ó Ö Ő
Icelandic Í Ó Ö
Igbo ÍÌ ÓÒ
Inari Sami Ö
Iranian Turkic Ī Ö Ō
Irish[16] Í Ó
Italian[17] ÍÌ ÎÏ ÓÒ
Kashubian Ń ÓÒ Ô
Kazakh[38] İ Ï Ñ ÔÖ
Kurdish Î
Kurdish (IS) Í
Latvian[18] Ī Ķ Ļ Ņ Ō
Livonian[46] Ī Ļ Ņ ȮȰ ÖȪ ŌÕȬ
Lule Sami Ñ
Luxembourgish Î ÔÖ
Malagasy Ì Ñ Ò Ô
Maltese[20] Ì Î Ò
Māori Ī Ō
Marshallese[47] Ļ Ņ Ō
Mirandese Í Ó Ô
Mohawk ÍÌ ÓÒ
Na'vi[57] Ì
Navajo Í Ó
Norn Í Ó
Norwegian[9] ÍÌ Î ÓÒ ÔÖ
Occitan Í ÓÒ
Pan-Nigerian
Piedmontese[37] Ì Ò
Pinyin[32] ÍÌ Ǐ Ī ÓÒ Ǒ Ō
Polish[22] Ń Ó
Portuguese[23] Í ÓÒ Ô Õ
Quechua Ñ
Rhaeto-Romance Ì Î Ò
Romanian Î
Samoan Í Ī Ó Ō
Sardinian ÍÌ ÓÒ
Scots Gaelic Ì ÓÒ
Sicilian Ì Î Ò Ô
Skolt Sami Ǩ Ö Õ
Slovak[24] Í Ĺ Ľ Ň Ó ÔÖ
Slovenian ÍÌ ÓÒ ÔÖ
Sorbian Ń Ó
Southern Sami (Norway) Ï
Southern Sami (Sweden) Ï Ö
Spanish[25] Í Ï Ñ Ó
Swedish[21] Ö
Tahitian Ī Ō
Tetum Í Ñ Ó
Tongan Í Ī Ó Ō
Tswana[52] Ô
Tunisian Arabic[58 Ì Ò
Turkish İÎ Ö
Turkmen[55] Ň Ö
Ulithian[49] Ȯ
Venda[51] Í Ó
Vietnamese[26] ÍÌ Ĩ ÓÒ Ô Õ Ơ
Volapük Ö
Võro Ö Õ
Walloon[27] Ì Î ÔÖ
Welsh[28] ÍÌ ÎÏ ÓÒ ÔÖ
West Frisian Ï ÔÖ
Wolof Ñ Ó
Xhosa ÍÌ ÎÏ ÓÒ ÔÖ
Yapese[50] Ö
Yoruba[56] ÍÌ Î Ǐ Ĩ Ń ÓÒ Ô Ǒ Õ
Zazaki İ

Letters derived from R–Z

Letter-diacritic combinations (detached) in various Latin alphabets (P–Z)
Alphabet Ŕ Ř Ŗ Ś Ŝ Š Ș Ť Ț Ú Ù Û Ü Ǔ Ŭ Ū Ũ Ű Ů Ŵ Ý Ŷ Ÿ Ȳ Ź Ż Ž Ǯ
ŕřŗśŝšşťțúùûüǔŭūũűůŵýŷÿȳźżžǯ
Latin[2] ŬŪ
Afrikaans[54] Ú Û Ý
Albanian[3] Û Ŷ
Alemannic ÚÙÛÜ
Anglo-Saxon Ū Ȳ
Arbëresh ÚÙÛ
Asturian Ú Ü
Austro-Bavarian ÚÙÛÜ
Aymara Ü
Azeri[53] Ü
Basque[4] Ü
Belarusian[5] Ś Š Ŭ Ź Ž
Berber Ř
Bislama[45] Ü
Breton ÙÛÜ
Catalan[6] Ú Ü
Celtic British ŬŪ
Chewa Ŵ
Corsican[31] Ù
Crimean Tatar Ü
Croatian[7] Š Ž
Czech[8] Ř Š Ť Ú Ü Ů Ý Ž
Danish[9] Ú Ý
Dutch[10] ÚÙÛÜ
Emilian-Romagnol ÙÛÜ Ż
English[36] ÛÜ
Esperanto Ŝ Ŭ
Estonian Š Ü Ž
Fala Ú Ü Ũ
Faroese Ú Ý
Filipino[11] ÚÙÛ
Finnish Š Ž
French[12] ÙÛÜ Ÿ
Friulian ÙÛ
Gagauz Ü
Galician[33] Ú Ü
German[13] Ü
Greenlandic Ú Û Ũ
Guaraní[14] Ú Ũ Ý
Hän ÙÛ Ǔ
Hawaiian Ū
Hungarian[15] Ú Ü Ű
Icelandic Ú Ý
Igbo ÚÙ
Inari Sami Š Ž
Iranian Turkic Š Ü Ū Ž
Irish[16] Ú
Italian[17] ÚÙ
Kashubian Ś Ù ŹŻ
Kazakh[38] Ü
Kurdish (IS) ÚÙ
Kurdish Û
Latvian[18] Ŗ Š Ū Ž
Lithuanian[19] Š Ū Ž
Livonian[46] Ŗ Š Ț Ū Ȳ Ž
Luxembourgish ÛÜ
Malagasy
Maltese[20] Ù Ż
Māori Ū
Marshallese[47] Ū
Mirandese Ú Ũ
Montenegrin[7] Ś Š Ź Ž
Norn Ú Ý
Northern Sami Š Ž
Norwegian[9] ÚÙÛÜ ÝŶ
Occitan Ú
Pan-Nigerian
Piedmontese[37] Ù
Pinyin[32] ÚÙ ÜǓ Ū
Polish[22] Ś ŹŻ
Portuguese[23] Ú Ü
Rhaeto-Romance Ù
Romani[29] Š Ž
Romanian Ș Ț
Samoan Ú Ū
Sardinian ÚÙ
Scots Gaelic Ù
Serbian[7] Š Ž
Sicilian ÙÛ
Skolt Sami Š Ž Ǯ
Slovak[24] ŔŘ Š Ť Ú Ü Ý Ž
Slovenian Š ÚÙ Ü Ž
Sorbian ŔŘ Ś Š Ź Ž
Spanish[25] Ú Ü
Swedish Ü
Tahitian Ū
Tetum Ú
Tongan Ú Ū
Tswana[52] Š
Tunisian Arabic[58 Ù Ż
Turkish ÛÜ
Turkmen[55] Ü Ý Ž
Venda[51] Ú
Vietnamese[26] ÚÙ Ũ Ý
Volapük Ü
Võro Š Ü Ž
Walloon[27] ÙÛ
Welsh[28] ÚÙÛÜ ŴÝŶŸ
West Frisian Ú ÛÜ
Xhosa ÚÙÛÜ
Yoruba[56] ÚÙÛ Ǔ Ũ
Zazaki Û

Notes

  1. In classical Latin, the digraphs CH, PH, RH, TH were used in loanwords from Greek, but they were not included in the alphabet. The ligatures Æ, Œ and W, as well as lowercase letters, were added to the alphabet only in Middle Ages. The letters J and U were used as typographical variants of I and V, respectively, roughly until the Enlightenment.
  2. In Afrikaans, C and Q are only (and X and Z almost only) used in loanwords.
  3. Albanian officially has the digraphs dh, gj, ll, nj, rr, sh, th, xh, zh, which is sufficient to represent the Tosk dialect. The Gheg dialect supplements the official alphabet with 6 nasal vowels, namely â, ê, î, ô, û, ŷ.
  4. Arbëresh apparently requires the digraphs dh, gj, hj, ll, nj, rr, sh, th, xh, zh. Arbëresh has the distinctive hj, which is considered as a letter in its own right.
  5. A with diaeresis is used only as a replacement character for schwa if the latter cannot be used (it was replaced by the schwa one year later because it is the most common letter). These cases should be avoided!
  6. Basque has several digraphs: dd, ll, rr, ts, tt, tx, tz. The ü, which is pronounced as /ø/, is required for various words in its Zuberoan dialect. C, Ç, Q, V, W, Y are used in foreign words and are not usually considered part of the alphabet.
  7. Bambara also has the digraphs: kh(only present in loanwords), sh (also written as ʃ; only present in some dialects). Historically, è was used instead of ɛ, ny was used instead of ɲ, and ò was used instead of ɔ in Mali
  8. Belarusian also has several digraphs: ch, dz, dź, dž.
  9. Bislama also has the digraph ng.
  10. Breton also has the digraphs ch, c'h, zh. C, Q, X are used in foreign words or digraphs only.
  11. Catalan also has a large number of digraphs: dj, gu, gü, ig, ix, ll, l·l, ny, qu, qü, rr, ss, tg, tj, ts, tx, tz. The letters K, Ñ, Q W, and Y are only used in loanwords or the digraphs mentioned.
  12. Chamorro also has the digraphs ch, ng. C used only in digraphs.
  13. Corsican has the trigraphs: chj, ghj.
  14. Croatian also has the digraphs: , lj, nj. It can also be written with four tone markers above on top of the vowels. Note that Croatian Latin is the same as Serbian Latin and they both map 1:1 to Serbian Cyrillic, where the three digraphs map to Cyrillic letters џ, љ and њ, respectively. Rarely and non-standardly, digraph dj is used instead of đ (Cyrillic ђ).
  15. Cypriot Arabic also has the letters Θ and Δ.
  16. Czech also has the digraph ch, which is considered a separate letter and is sorted between h and i. While á, ď, é, ě, í, ň, ó, ť, ú, ů, and ý are considered separate letters, in collation they are treated merely as letters with diacritics. However, č, ř, š, and ž are actually sorted as separate letters. Q, W, X, Ö occur only in loanwords.
  17. The Norwegian alphabet is currently identical with the Danish alphabet. C is part of both alphabets and is used in native Danish, but not in native Norwegian. Norwegian and Danish uses é in "én" and more uses, although é is considered a diacritic mark, while å, æ and ø are letters. Q, W, X, Z are not used except for names and some foreign words.
  18. Dinka also has the digraphs: dh, nh, ny, th. H is only present in these digraphs. Dinka also used the letters Ää, Ëë, Ïï, Öö, Ɛ̈ɛ̈, Ɔ̈ɔ̈ (the last two which do not exist as precomposed characters in Unicode)
  19. The status of ij as a letter, ligature or digraph in Dutch is disputed. C (outside the digraph ch) Q, X, and Y occur mostly in foreign words. Letters with grave and letters with circumflex occur only in loanwords.
  20. English generally now uses extended Latin letters only in loan words, such as fiancé, fiancée, and résumé. Rare publication guides may still use the dieresis on words, such as "coöperate", rather than the now-more-common "co-operate" (UK) or "cooperate" (US). For a fuller discussion, see articles branching from Lists of English words of international origin, which was used to determine the diacritics needed for more unambiguous English. However, an é or è is sometimes used in poetry to show that a normally silent vowel is to be pronounced, as in "blessèd".
  21. Filipino also known as Tagalog also uses the digraph ng, even originally with a large tilde that spanned both n and g (as in n͠g) when a vowel follows the digraph. (The use of the tilde over the two letters is now rare). Only ñ is required for everyday use (only in loanwords). The accented vowels are used in dictionaries to indicate pronunciation, g with tilde is only present in older works.
  22. Uppercase diacritics in French are often (incorrectly) thought of as being optional, but the official rules of French orthography designate accents on uppercase letters as obligatory in most cases. Many pairs or triplets are read as digraphs or trigraphs depending on context, but are not treated as such lexicographically: consonants ph, (ng), th, gu/gü, qu, ce, ch/(sh/sch), rh; vocal vowels (ee), ai/ay, ei/ey, eu, au/eau, ou; nasal vowels ain/aim, in/im/ein, un/um/eun, an/am, en/em, om/on; the half-consonant -(i)ll-; half-consonant and vowel pairs oi, oin/ouin, ien, ion. When rules that govern the French orthography are not observed, they are read as separate letters, or using an approximating phonology of a foreign language for loan words, and there are many exceptions. In addition, most final consonants are mute (including those consonants that are part of feminine, plural, and conjugation endings). Y with diaeresis and U with diaeresis are only used in certain geographical names and proper names plus their derivatives, or, in the case of U with diaeresis, newly proposed reforms. Eg. capharnaüm `shambles' is derived from the proper name Capharnaüm. Æ occurs only in Latin or Greek loanwords.
  23. Fula has X as part of the alphabet in all countries except Guinea, Guinea-Bissau, Liberia, and Sierra Leone (used only in loanwords in these countries). Ɠ, which is used only in loanwords (but still part of the alphabet), is used in Guinea only. Fula also uses the digraphs mb (In Guinea spelled nb), nd, ng, and nj. aa, ee, ii, oo, and uu are part of the alphabet in all countries except Guinea, Guinea-Bissau, Liberia, and Sierra Leone. Ƴ is used in all countries except for Nigeria, where it is written ('y). Ŋ is used in all countries except for Nigeria. Ɲ is used in Guinea, Mali, and Burkina Faso, Ñ is used in Senegal, Gambia, Mauritania, Guinea-Bissau, Liberia, and Sierra Leone, and the digraph ny is used in Niger, Cameroon, Chad, Central African Republic, and Nigeria. The apostrophe is a letter (representing the glottal stop) in Guinea-Bissau, Liberia, and Sierra Leone.
  24. Galician. The standard of 1982 set also the digraphs gu, qu (both always before e and i), ch, ll, nh and rr. In addition, the standard of 2003 added the grapheme ao as an alternative writing of ó. Although not marked (or forgotten) in the list of digraphs, they are used to represent the same sound, so the sequence ao should be considered as a digraph. Note also that nh represents a velar nasal (not a palatal as in Portuguese) and is restricted only to three feminine words, being either demonstrative or pronoun: unha ('a' and 'one'), algunha ('some') and ningunha ('not one'). The Galician reintegracionismo movement uses it as in Portuguese.
  25. German also retains most original letters in French loan words. Swiss German does not use ß any more. The long s (ſ) was in use until the mid-20th century. Sch is usually not treated like a true trigraph, neither are ch, ck, st, sp, th, (ph, rh) and qu digraphs. Q only appears in the sequence qu and in loanwords, while x and y are found almost only in loan words. The capital ß () is almost never used, ß is replaced with SS when writing all-caps. The accented letters (other than the letters ä, ö, ü, and ß) are used only in loanwords.
  26. Guaraní also uses digraphs ch, mb, nd, ng, nt, rr and the glottal stop ' . B, C, and D are only used in these digraphs.
  27. Hausa has the digraphs: sh, ts. Vowel length and tone are usually not marked. Textbooks usually use macron or doubled vowel to mark the length, grave to mark the low tone and circumflex to mark the falling tone. Therefore, in some systems, it is possible that macron is used in combination with grave or circumflex over a, e, i, o or u.
  28. Hungarian also has the digraphs: cs, dz, gy, ly, ny, sz, ty, zs; and the trigraph: dzs. Letters á, é, í, ó, ő, ú, and ű are considered separate letters, but are collated as variants of a, e, i, o, ö, u, and ü.
  29. Irish formerly used the dot diacritic in ḃ, ċ, ḋ, ḟ, ġ, ṁ, ṗ, ṡ, ṫ. These have been replaced by the digraphs: bh, ch, dh, fh, gh, mh, ph, sh, th except for in formal instances. V only occurs in onomotopoeia, such as vácarnach, vác, or vrác, or in rare alternative spellings víog and vís (usually spelled bíog and bís), or in loanwords. Z only occurs in the West Muskerry dialect in the digraph zs (a rare eclipsis of s, spelled s in other dialects and the language proper) or in loanwords.
  30. Igbo writes Ṅṅ alternatively as N̄n̄. Igbo has the digraphs: ch, gb, gh, gw, kp, kw, nw, ny, sh. C is only used in the digraph before. Also, vowels take a grave accent, an acute accent, or no accent, depending on tone.
  31. Italian also has the digraphs: ch, gh, gn, gl, sc. J, K, W, X, Y are used in foreign words. X is also used for native words derived from Latin and Greek; J is also used for just a few native words, mainly names of persons (as in Jacopo) or of places (as in Jesolo and Jesi), in which is always pronounced as letter I. While it does not occur in ordinary running texts, geographical names on maps are often written only with acute accents. The circumflex is used on an -i ending that was anciently written -ii (or -ji, -ij, -j, etc.) to distinguish homograph plurals and verb forms: e.g. e.g. principî form principi, genî from geni.
  32. Karakalpak also has the digraphs: ch, sh. A', G', I', N', O', U' are considered as letters. C, F, V are used in foreign words.
  33. Kazakh also has the digraphs: ya, yo, yw. C, Ç, É, F, H, V, X and the digraph yo are used in foreign words.
  34. Latvian also has the digraphs: dz, dž, ie. Dz and are occasionally considered separate letters of the alphabet in more archaic examples, which have been published as recently as the 1950s; however, modern alphabets and teachings discourage this due to an ongoing effort to set decisive rules for Latvian and eliminate barbaric words accumulated during the Soviet occupation. The digraph "ie" is never considered a separate letter. Ō, Ŗ, and the digraphs CH (only used in loanwords) and UO are no longer part of the alphabet, but are still used in certain dialects and newspapers that use the old orthography. Y is used only in certain dialects and not in the standard language. F and H are only used in loanwords.
  35. Lithuanian also has the digraphs: ch, dz, dž, ie, uo. However, these are not considered separate letters of the alphabet. F, H, and the digraph CH are only used in loanwords. Demanding publications such as dictionaries, maps, schoolbooks etc need additional diacritical marks to differentiate homographs.Using grave accent on A, E, I, O, U, acute accent on all vowels, and tilde accent on all vowels and on L, M, N and R. Small E and I (also with ogonek) must retain the dot when additional accent mark is added to the character; the use of ì and í (note the missing dot) is considered unacceptable.
  36. In Livonian, the letters Ö, Ȫ, Y, Ȳ were used by the older generation, but the younger generation merged these sounds; Around the late 1990's, these letters were removed from the alphabet.
  37. Marshallese often uses the old orthography (because people did not approve of the new orthography), which writes ļ as l, as m, ņ as n, p as b, as o at the ends of words or in the word yokwe (also spelled iakwe under the old orthography; under the new orthography, spelled io̧kwe), but a at other places, and d as dr before vowels, or r after vowels. The old orthography writes ā as e in some words, but ā in others; it also writes ū as i between consonants. The old orthography writes geminates and long vowels as two letters instead. Allophones of /ɘ/, written as only e o ō in the new orthography, are also written as i u and very rarely, ū. The letter Y only occurs in the words yokwe or the phrase yokwe yuk (also spelled iakwe iuk in the old orthography or io̧kwe eok in the new orthography).
  38. Maltese also has the digraphs: ie, għ.
  39. Māori uses g only in ng digraph. Wh is also a digraph.
  40. Some Mohawk speakers use orthographic i in place of the consonant y. The glottal stop is indicated with an apostrophe and long vowels are written with a colon :.
  41. [Na'vi] uses the letters ~ and ʼ and the digraphs aw, ay, ew, ey, kx, ll, ng (sometimes written as G), px, rr, ts (sometimes written as C), tx. G (in standard orthography) and X used only in digraphs.
  42. Piedmontese also uses the letter n- to indicate a velar nasal N-sound (pronounced as the gerundive termination in going), which usually precedes a vowel, as in lun-a [moon].
  43. Pinyin has four tone markers that can go on top of any of the six vowels (a, e, i, o, u, ü); e.g.: macron (ā, ē, ī, ō, ū, ǖ), acute accent (á, é, í, ó, ú, ǘ), caron (ǎ, ě, ǐ, ǒ, ǔ, ǚ), grave accent (à, è, ì, ò, ù, ǜ). It also uses the digraphs: ch, sh, zh.
  44. Polish also has the digraphs: ch, cz, dz, dż, dź, sz, rz.
  45. Portuguese also uses the digraphs ch, lh, nh, rr, ss. The trema on ü was used in Brazilian Portuguese before 2009. The grave accent was used on e, i, o, and u, until 1973. The letters èand ò are used in geographical names outside Europe and not part of the language proper. The now abandoned practice was to indicate underlying stress in words ending in -mente -- sòmente, ùltimamente etc. Neither the digraphs nor accented letters are considered part of the alphabet.
  46. Romanian normally uses a comma diacritic below the letters s and t (ș, ț), but it is frequently replaced with an attached cedilla below these letters (ş, ţ) due to past lack of standardization. K, Q, W, X, and Y occur only in loanwords.
  47. Romani has the digraphs: čh, dž, kh, ph, th.
  48. Slovak also has the digraphs dz, dž, and ch, which are considered separate letters. While á, ä, ď, é, í, ĺ, ň, ó, ô, ŕ, ť, ú, and ý are considered separate letters, in collation they are treated merely as letters with diacritics. However, č, ľ, š, and ž, as well as the digraphs, are actually sorted as separate letters. Q, W, X, Ě, Ö, Ř, Ü occur only in loanwords.
  49. Spanish uses several digraphs to represent single sounds: ch, gu (preceding e or i), ll, qu, rr; of these, the digraphs ch and ll were traditionally considered individual letters with their own name (che, elle) and place in the alphabet (after c and l, respectively), but in order to facilitate international compatibility the Royal Spanish Academy decided to cease this practice in 1994 and all digraphs are now collated as combinations of two separate characters. While cedilla is etymologically Spanish diminutive of ceda (z) and Sancho Pança is the original form in Cervantes books, C with cedilla ç is now completely displaced by z in contemporary language. In poetry, the diaeresis may be used to break a diphthong into separate vowels. Regarding that usage, Ortografía de la lengua española states that "diaeresis is usually placed over the closed vowel [i.e. 'i' or 'u'] and, when both are closed, generally over the first". In this context, the use of ï is rare, but part of the normative orthography.
  50. Swedish uses é in well integrated loan words like idé and armé, although é is considered a modified e, while å, ä, ö are letters. á and à are rarely used words. W and z are used in some integrated words like webb and zon. Q, ü, è, and ë are used for names only, but exist in Swedish names. For foreign names ó, ç, ñ and more are sometimes used, but usually not. Swedish has many digraphs and some trigraphs. ch, dj, lj, rl, rn, rs, sj, sk, si, ti, sch, skj, stj and others are usually pronounced as one sound.
  51. Tunisian Arabic also uses the digraph: . Usually Tunisian Arabic is written in the Arabic script; this Latin alphabet, which is based on Maltese, was not created until 2015.
  52. Tswana also has the digraphs: kg, kh, ng, ph, th, tl, tlh, ts, tsh, tš, tšh. The letters C, Q, and X only appear in onomotopeia and loanwords. The letters V and Z only appear in loanwords.
  53. Turkmen had a slightly differrent alphabet before 1993 (which used some unusual letters) Ýý was written as ¥ÿ, Ňň was written as Ññ, and Şş was written as , and Žž was written £⌠ (so that all characters were available in Code page 437). In the new alphabet, all characters are available in ISO/IEC 8859-2.
  54. Ulithian also has the digraphs: ch, [l'], mw, ng. C used only in digraphs.
  55. Uzbek also has the digraphs: ch, ng, sh considered as letters. C used only in digraphs. G', O' and apostrophe (') are considered as letters. These letters have preferred typographical variants: Gʻ, Oʻ and ʼ respectively.
  56. Venda also has the digraphs and trigraphs: bv, bw, dz, dzh, dzw, fh, hw, kh, khw, ng, ny, nz, ṅw, ph, pf, pfh, sh, sw, th, ts, tsh, tsw, ty, ṱh, vh, zh, zw. C, J, Q are used in foreign words.
  57. Vietnamese has seven additional base letters: ă â đ ê ô ơ ư. It uses five tone markers that can go on top (or below) any of the 12 vowels (a, ă, â, e, ê, i, o, ô, ơ, u, ư, y); e.g.: grave accent (à, ằ, ầ, è, ề, ì, ò, ồ, ờ, ù, ừ, ỳ), hook above (ả, ẳ, ẩ, ẻ, ể, ỉ, ỏ, ổ, ở, ủ, ử, ỷ), tilde (ã, ẵ, ẫ, ẽ, ễ, ĩ, õ, ỗ, ỡ, ũ, ữ, ỹ), acute accent (á, ắ, ấ, é, ế, í, ó, ố, ớ, ú, ứ, ý), and dot below (ạ, ặ, ậ, ẹ, ệ, ị, ọ, ộ, ợ, ụ, ự, ỵ). It also uses several digraphs and trigraphs  ch, gh, gi, kh, ng, ngh, nh, ph, th, tr  but they are no longer considered letters.
  58. Walloon has the digraphs and trigraphs: ae, ch, dj, ea, jh, oe, oen, oi, sch, sh, tch, xh. The letter X outside the digraph xh is in some orthographies, but not the default two. The letter Q is in some orthographies (including one default orthography), but not in the other default orthography. Also in some orthographies are À, Ì, Ù, Ë, Ö, and even (which is not available as a precomposed character in Unicode)
  59. Welsh has the digraphs ch, dd, ff, ng, ll, ph, rh, th. It also occasionally uses circumflexes, diaereses, acute accents and grave accents on its seven vowels (a, e, i, o, u, w, y), but accented characters are not regarded as separate letters of the alphabet.
  60. Xhosa has a large number of digraphs, trigraphs, and even one tetragraph are used to represent various phonemes: bh, ch, dl, dy, dz, gc, gq, gr, gx, hh, hl, kh, kr, lh, mb, mf, mh, nc, ndl, ndz, ng, ng', ngc, ngh, ngq, ngx, nh, nkc, nkq, nkx, nq, nx, ntl, ny, nyh, ph, qh, rh, sh, th, ths, thsh, ts, tsh, ty, tyh, wh, xh, yh, zh. It also occasionally uses acute accents, grave accents, circumflexes, and diaereses on its five vowels (a, e, i, o, u), but accented characters are not regarded as separate letters of the alphabet.
  61. Yapese has the digraphs and trigraphs: aa, ae, ch, ea, ee, ii, [k'], [l'], [m'], [n'], ng, [ng'], oe, oo, [p'], [t'], th, [th'], uu, [w'], [y']. Q, representing the glottal stop, is not used. Often an apostrophe is used to represent the glottal stop instead. C used only in digraphs. H used only in digraphs and loanwords. J used only in loanwords.
  62. Yoruba uses the digraph gb. Also, vowels take a grave accent, an acute accent, or no accent, depending on tone. Although the "dot below" diacritic is widely used, purists prefer a short vertical underbar (Unicode COMBINING VERTICAL LINE BELOW U+0329) - this resembles the IPA notation for a syllabic consonant, attached to the base of the letter (E, O or S). The seven Yoruba vowels (A, E, E underbar, I, O, O underbar, U) can be uttered in three different tones: high (acute accent); middle (no accent) and low (grave accent). The letters M and N, when written without diacritics, indicate nasalisation of the preceding vowel. M and N also occur as syllabics - in these circumstances, they take acute or grave tonal diacritics, like the vowels. Middle tone is marked with a macron to differentiate it from the unmarked nasalising consonants. A tilde was used in older orthography (still occasionally used) to indicate a double vowel. This is tonally ambiguous, and has now been replaced by showing the paired vowels, each marked with the appropriate tones.

However, where a double vowel has the tonal sequence high-low or low-high, it may optionally be replaced by a single vowel with a circumflex (high-low) or caron (low-high), eg. á + à = â; à + á = ǎ.

Miscellanea

See also

Categories

Footnotes

  1. As defined in ISO/IEC 646 based on ASCII which was based on the 26 letters of the English alphabet and previous telecommunications standards, and used in later ISO standards, see Latin characters in Unicode.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.