Arabic (Unicode block)

Arabic
Range: U+0600..U+06FF (256 code points)
Plane: BMP
Scripts: Arabic (237 char.), Common (6 char.), Inherited (12 char.)
Major alphabets: Arabic, Pashto, Persian, Urdu
Assigned: 255 code points
Unused: 1 reserved code point; 1 deprecated
Source standards: ISO 8859-6
Unicode version history (version, assigned code points, change):
1.0.0 169 (+169)
1.1 194 (+25)
3.0 206 (+12)
3.2 208 (+2)
4.0 227 (+19)
4.1 235 (+8)
5.1 250 (+15)
6.0 252 (+2)
6.1 253 (+1)
6.3 254 (+1)
7.0 255 (+1)

Note: [1][2]

Arabic is a Unicode block containing the standard letters and the most common diacritics of the Arabic script, together with the Arabic-Indic digits.[3]
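As a minimal sketch of what block membership means in practice, the following Python snippet tests whether a character falls in the range U+0600..U+06FF and looks up its name and digit value with the standard `unicodedata` module. The names `ARABIC_BLOCK`, `in_arabic_block` and `describe` are illustrative and not part of any standard API. Results from `unicodedata` depend on the Unicode version bundled with the Python interpreter, which may be newer than the version described in this article.

```python
import unicodedata

# The Arabic block as stated above: U+0600..U+06FF (256 code points).
ARABIC_BLOCK = range(0x0600, 0x0700)


def in_arabic_block(ch):
    """Return True if the single character ch lies in the Arabic block."""
    return ord(ch) in ARABIC_BLOCK


def describe(ch):
    """Return 'U+XXXX NAME' for a character, or '<unnamed>' if it has no name."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch, '<unnamed>')}"


if __name__ == "__main__":
    alef = "\u0627"
    print(in_arabic_block(alef))          # True
    print(describe(alef))                 # U+0627 ARABIC LETTER ALEF
    # Arabic-Indic digits (U+0660..U+0669) carry decimal digit values.
    print(unicodedata.digit("\u0663"))    # 3
    print(int("\u0663\u0664"))            # 34
```

A range check of this kind only identifies the block. It says nothing about the script of a character, since the block also holds characters whose script property is Common or Inherited.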

Block

Arabic[1][2][3]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+060x  ؀   ؁   ؂   ؃   ؄   ؅  ؆ ؇ ؈ ؉ ؊ ؋ ، ؍ ؎ ؏
U+061x ؐ ؑ ؒ ؓ ؔ ؕ ؖ ؗ ؘ ؙ ؚ ؛ ALM (unassigned) ؞ ؟
U+062x ؠ ء آ أ ؤ إ ئ ا ب ة ت ث ج ح خ د
U+063x ذ ر ز س ش ص ض ط ظ ع غ ػ ؼ ؽ ؾ ؿ
U+064x ـ ف ق ك ل م ن ه و ى ي ً ٌ ٍ َ ُ
U+065x ِ ّ ْ ٓ ٔ ٕ ٖ ٗ ٘ ٙ ٚ ٛ ٜ ٝ ٞ ٟ
U+066x ٠ ١ ٢ ٣ ٤ ٥ ٦ ٧ ٨ ٩ ٪ ٫ ٬ ٭ ٮ ٯ
U+067x ٰ ٱ ٲ ٳ ٴ ٵ ٶ ٷ ٸ ٹ ٺ ٻ ټ ٽ پ ٿ
U+068x ڀ ځ ڂ ڃ ڄ څ چ ڇ ڈ ډ ڊ ڋ ڌ ڍ ڎ ڏ
U+069x ڐ ڑ ڒ ړ ڔ ڕ ږ ڗ ژ ڙ ښ ڛ ڜ ڝ ڞ ڟ
U+06Ax ڠ ڡ ڢ ڣ ڤ ڥ ڦ ڧ ڨ ک ڪ ګ ڬ ڭ ڮ گ
U+06Bx ڰ ڱ ڲ ڳ ڴ ڵ ڶ ڷ ڸ ڹ ں ڻ ڼ ڽ ھ ڿ
U+06Cx ۀ ہ ۂ ۃ ۄ ۅ ۆ ۇ ۈ ۉ ۊ ۋ ی ۍ ێ ۏ
U+06Dx ې ۑ ے ۓ ۔ ە ۖ ۗ ۘ ۙ ۚ ۛ ۜ  ۝  ۞ ۟
U+06Ex ۠ ۡ ۢ ۣ ۤ ۥ ۦ ۧ ۨ ۩ ۪ ۫ ۬ ۭ ۮ ۯ
U+06Fx ۰ ۱ ۲ ۳ ۴ ۵ ۶ ۷ ۸ ۹ ۺ ۻ ۼ ۽ ۾ ۿ
Notes
1.^ As of Unicode version 10.0
2.^ Grey areas in the official chart indicate non-assigned code points; in this block the only one is U+061D
3.^ Unicode code point U+0673 is deprecated as of Unicode version 6.0

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Arabic block:

Version Final code points[note 1] Count L2 ID WG2 ID Document
1.0.0 U+060C, 061B, 061F, 0621..063A, 0640..0652, 0660..066C, 0670..06B7, 06BA..06BE, 06C0..06CE, 06D0..06D5, 06F0..06F9 169 (to be determined)
L2/01-270 Hosken, Martin (2001-06-19), How U+06D5 works in Uighur, Some technical information collected 
L2/04-290 Karlsson, Kent (2004-07-16), Updating the Arabic Shaping normative data 
L2/04-419 Davis, Mark (2004-11-18), ArabicShaping suggestion e-mail 
L2/09-146 Pournader, Roozbeh (2009-04-15), Moving dots and Arabic script shaping: Farsi Yeh's and Jawi Nya 
L2/10-045 Allawi, Adil (2010-01-27), Proposal for changes to ArabicShaping.txt to allow machine generation of Arabic fonts and glyphs 
L2/10-168 Mansour, Kamal (2010-05-04), Problems with the joining behavior of Arabic Letter Yeh Barree (U+06D2) 
L2/10-108 Moore, Lisa (2010-05-19), "B.13.2", UTC #123 / L2 #220 Minutes 
L2/11-092 Pournader, Roozbeh (2011-03-08), Changes to schematic names of Arabic letters 
L2/11-206 N4066 Proposing to Supplement with the Script and Character of Chaghatay Language, 2011-04-25 
N4067 Proposal to Encode Special Scripts and Characters in UCS for Uighur language, 2011-05-15 
L2/11-245 N4113 Aalto, Tero (2011-06-08), Ad hoc report on Uighur 
L2/12-063 N4218 Proposal to add a Named UCS Sequence Identifier UYGHUR LETTERS, 2012-02-02 
L2/12-101 N4231 Pournader, Roozbeh; Anderson, Deborah (2012-02-09), Comments on N4218 Proposal to add a Named UCS Sequence Identifier UYGHUR LETTERS 
L2/12-098 N4254 "RESOLUTION M59.05 (Named USIs for characters for Uyghur and Chaghatay)", WG2 Resolutions, 2012-02-17 
L2/12-381 Pournader, Roozbeh (2012-11-03), Initial and medial forms of Arabic Letter Noon Ghunna 
L2/12-343R2 Moore, Lisa (2012-12-04), "B.1.1.5", UTC #133 Minutes 
L2/13-119 Pournader, Roozbeh (2013-05-08), Dot positioning of U+06A3 Arabic Letter Feh with Dot Below 
N4463 Silamu, Wushour; Anderson, Deborah; Constable, Peter (2013-06-28), User Guidelines for Uyghur, Kazakh, Kyrgyz, and Chagatai 
L2/13-226 Milo, Thomas (2013-11-26), Arabic Amphibious Characters 
L2/14-109 Milo, Thomas (2014-05-01), Koranic and Classic orthography in Unicode and computer typography 
L2/14-136 Pournader, Roozbeh (2014-05-08), The right hehs for Arabic script orthographies of Sorani Kurdish and Uighur 
1.1 U+066D, 06D6..06ED 25 (to be determined)
L2/01-428 Kew, Jonathan (2001-11-01), Request for clarification regarding U+06DD ARABIC END OF AYAH and other Arabic enclosing marks 
L2/03-112 Pournader, Roozbeh (2003-03-05), New Arabic controls and Arabic joining 
L2/05-150 Freytag, Asmus (2005-05-05), Arabic errata 
L2/05-151 Milo, Thomas (2005-05-12), Annotations to the printing of the 1924 Azhar Qur'an 
L2/05-203 McGowan, Rick (2005-08-04), Public Review Issue #73: Representative Glyphs for Arabic Characters U+06DF, U+06E0, and U+06E1 
L2/05-231 Mansour, Kamal (2005-08-11), Regarding the proposed changes for the representative glyphs for 06DF, 06E0, and 06E1 
L2/06-324R2 Moore, Lisa (2006-11-29), "B.14.2", UTC #109 Minutes 
L2/09-358R Pournader, Roozbeh (2009-10-28), Discussion document for polishing Koranic support in Unicode 
L2/10-209 Pournader, Roozbeh (2010-06-07), Public Review Issue #171: Changing the properties of U+06DE from a combining mark to a spacing symbol 
3.0 U+0653..0655, 06B8..06B9, 06BF, 06CF, 06FA..06FE 12 (to be determined)
3.2 U+066E..066F 2 L2/00-354 Davis, Mark; Mansour, Kamal (2000-10-12), Proposal For Addition To Arabic repertoire 
L2/01-150 N2357 Proposal to encode two Arabic characters to the UCS, 2001-04-04 
4.0 U+0600..0602, 060D..060E, 0610..0614, 0656..0658 13 L2/00-135 Nelson, Paul; Farhan, Ashhar; Hisam, Arif; Hisam, Kashif; Clews, John (2000-04-07), Proposal to Add Urdu Epethit and Abbreviation Diacritics to the Arabic Block 
L2/01-303 Vikas, Om (2001-07-26), Letter from the Government from India on "Draft for Unicode Standard for Indian Scripts" 
L2/01-304 Feedback on Unicode Standard 3.0, 2001-08-02 
L2/01-305 McGowan, Rick (2001-08-08), Draft UTC Response to L2/01-304, "Feedback on Unicode Standard 3.0" 
L2/01-425 N2483 Kew, Jonathan (2001-11-01), Proposal to add Arabic-script honorifics and other marks 
L2/01-426 Kew, Jonathan (2001-11-01), Proposal to add Arabic-script honorifics and other marks, Appendix: Examples of usage 
L2/01-428 Kew, Jonathan (2001-11-01), Request for clarification regarding U+06DD ARABIC END OF AYAH and other Arabic enclosing marks 
L2/01-439 Milo, Tom (2001-11-02), Arabic Year-sign examples 
L2/01-430R McGowan, Rick (2001-11-20), UTC Response to L2/01-304, “Feedback on Unicode Standard 3.0” 
L2/02-061 N2482 Kew, Jonathan (2002-01-29), Bidi committee consensus on Arabic additions from L2/01-425 
L2/02-227 N2487 Proposal to add 16 Arabic characters, 2002-05-21 
L2/03-102 Vikas, Om (2003-03-04), Unicode Standard for Indic Scripts 
L2/03-101.10 Proposed Changes in Indic Scripts [Urdu, Sindhi, and Kashmiri document], 2003-03-04 
L2/03-112 Pournader, Roozbeh (2003-03-05), New Arabic controls and Arabic joining 
L2/04-196 N2653 Umamaheswaran, V. S. (2004-06-04), "a-3", Unconfirmed minutes of WG 2 meeting 44 
L2/06-332 Esfahbod, Behdad; Pournader, Roozbeh (2006-10-15), Proposal to change the Bidi category of five Arabic characters from AL to AN 
L2/06-372 Lata, Swaran (2006-11-04), Issues Pertinent to Kashmiri 
L2/06-324R2 Moore, Lisa (2006-11-29), "B.14.2", UTC #109 Minutes 
L2/15-183R Pournader, Roozbeh (2015-07-28), Candidate characters for Grapheme_Cluster_Break=Prepend 
U+0603, 060F, 0615 3 N2413 Proposal for Incorporation of Urdu in ISO/IEC 10646 and Unicode, 2002-01-23 
L2/02-005 Hussain, Sarmad; Afzal, Muhammad (2001-12-18), Urdu Computing Standards (Charts and Exhibits) 
L2/02-006, L2/02-006 N2413-1 Zia, Khaver (2002-01-10), Towards Unicode Standard for Urdu 
L2/02-003 N2413-2 Afzal, Muhammad; Hussain, Sarmad (2001-12-28), Urdu Computing Standards: Development of Urdu Zabta Takhti (UZT) 1.01 
L2/02-004 N2413-3 Hussain, Sarmad; Afzal, Muhammad (2001-12-28), Urdu Computing Standards: Urdu Zabta Takhti (UZT) 1.01 
L2/02-163 N2413-4 Proposal to add Marks and Digits in Arabic Code Block (for Urdu), 2002-04-30 
L2/02-011R Kew, Jonathan (2002-01-12), Comments on L2/02-006: Towards Unicode Standard for Urdu 
L2/02-197 Freytag, Asmus (2002-05-01), Urdu Feedback from Bidi Committee 
L2/02-166R2 Moore, Lisa (2002-08-09), UTC #91 Minutes 
L2/02-372 N2453 Umamaheswaran, V. S. (2002-10-30), "7.9", Unconfirmed minutes of WG 2 meeting 42 
L2/03-034 Nelson, Paul; Ross, Fiona; Holloway, Tim; Hudson, John (2003-02-10), Proposal to change character properties of ARABIC SIGN SAFHA (U+0603) 
L2/04-196 N2653 Umamaheswaran, V. S. (2004-06-04), "a-3", Unconfirmed minutes of WG 2 meeting 44 
U+06EE..06EF, 06FF 3 L2/01-427 N2481 Kew, Jonathan (2001-11-01), Proposal to add Parkari letters to Arabic block 
L2/02-227 N2487 Proposal to add 16 Arabic characters, 2002-05-21 
4.1 U+060B 1 N2523 Everson, Michael (2002-11-20), Proposal to encode the AFGHANI SIGN in the UCS 
L2/03-330 N2640 Everson, Michael (2003-10-01), Revised proposal to encode the AFGHANI SIGN in the UCS 
U+061E, 065A..065C 4 L2/98-274 Davis, Mark; Mansour, Kamal (1998-07-28), Proposed Arabic Script Additions for Minority Languages 
L2/98-409 Davis, Mark; Mansour, Kamal (1998-12-01), Proposal to add 25 Arabic characters to the BMP 
L2/02-021 Davis, Mark; Mansour, Kamal (2002-01-17), Proposal To Amend Arabic repertoire 
L2/03-154 Kew, Jonathan; Mansour, Kamal; Davis, Mark (2003-05-16), Proposal to encode productive Arabic-script modifier marks 
L2/03-168 Kew, Jonathan (2003-06-02), Proposal to encode Arabic-script letters for African languages 
L2/03-210 Kew, Jonathan (2003-06-12), Draft chart showing UTC #95 additions to Arabic blocks 
L2/03-223 N2598 Kew, Jonathan (2003-07-10), Proposal to encode additional Arabic-script characters 
U+0659 1 L2/03-133R N2581R2 Everson, Michael; Pournader, Roozbeh (2003-05-29), Proposal to encode the ARABIC ZWARAKAY in the UCS 
U+065D..065E 2 L2/04-025R N2723 Kew, Jonathan (2004-03-15), Proposal to encode Additional Arabic script characters 
5.1 U+0606..060A 5 L2/05-318 Lazrek, Azzeddine (2005-10-24), Proposals for Unicode Consortium [Arabic mathematical symbols] 
L2/05-320 Lazrek, Azzeddine (2005-07-10), Arabic Mathematical Diverse Symbols, Additional characters proposed to Unicode 
L2/06-125 N3086, N3086-1 Lazrek, Azzeddine (2006-03-30), Diverse Arabic Mathematical Symbols 
U+0616, 063B..063F 6 L2/06-345R N3180R Everson, Michael; Pournader, Roozbeh; Sarbar, Elnaz (2006-10-24), Proposal to encode eight Arabic characters for Persian and Azerbaijani in the UCS 
L2/07-221 Hallissy, Bob (2007-07-19), Shaping behavior of Arabic characters based on Farsi Yeh [2007.07.19] 
L2/07-225 Moore, Lisa (2007-08-21), "B.14.3.1", UTC #112 Minutes 
U+0617..061A 4 L2/06-358R N3185R Everson, Michael; Pournader, Roozbeh (2006-11-01), Proposal to encode four Qur'anic Arabic characters in the UCS 
6.0 U+0620, 065F 2 L2/98-274 Davis, Mark; Mansour, Kamal (1998-07-28), Proposed Arabic Script Additions for Minority Languages 
L2/98-409 Davis, Mark; Mansour, Kamal (1998-12-01), Proposal to add 25 Arabic characters to the BMP 
L2/02-021 Davis, Mark; Mansour, Kamal (2002-01-17), Proposal To Amend Arabic repertoire 
L2/09-406 N3686 Proposal to add one character in the Arabic block for representation of Kashmiri and annotation of existing characters, 2008-10-24 
L2/09-176 Aazim, Muzaffar; Mansour, Kamal; Pournader, Roozbeh (2009-04-30), Proposal to add two Kashmiri characters and one annotation to the Arabic block 
L2/09-215 Pournader, Roozbeh; Anderson, Deborah (2009-05-14), Proposal to add two Kashmiri characters 
L2/10-169 Lata, Swaran (2010-05-06), Comments on the Proposed Arabic Letter Kashmiri Yeh 
6.1 U+0604 1 L2/09-144R3 N3734 Pandey, Anshuman (2009-11-20), Proposal to Encode the Samvat Date Sign for Arabic 
6.3 U+061C 1 L2/03-159 Kew, Jonathan (2003-05-28), Proposal to encode Arabic triple dot punctuation mark 
L2/11-005 Allouche, Matitiahu; Mohie, Mohamed (2011-01-16), Proposal to encode an Arabic-Letter Mark (ALM) 
L2/11-016 Moore, Lisa (2011-02-15), "Scripts and Symbols — Arabic letter mark", UTC #126 / L2 #223 Minutes 
L2/11-278 Allouche, Matitiahu; Mohie, Mohamed (2011-07-17), Proposal to encode an Arabic-Letter Mark (ALM) 
L2/11-397 Edberg, Peter (2011-10-25), Proposed addition of AL MARK and LEVEL DIRECTION MARK (PRI #205 background) 
L2/11-398 Edberg, Peter (2011-10-25), Accumulated Feedback on PRI #205 (moderated) 
L2/11-330 N4181 Anderson, Deborah (2011-11-04), Proposed Additions to ISO/IEC 10646 
L2/11-353 Moore, Lisa (2011-11-30), "B.11.18", UTC #129 / L2 #226 Minutes 
L2/11-432R N4180 Allouche, Matitiahu; Mohie, Mohamed (2012-02-15), Proposal to encode the Arabic Letter Mark (ALM) 
L2/13-040 Pournader, Roozbeh; Lanin, Aharon (2013-01-29), Fasttracking Arabic Letter Mark (ALM) 
L2/13-011 Moore, Lisa (2013-02-04), UTC #134 Minutes 
L2/13-240 Davis, Mark (2013-12-12), Reconciling Script and Script_Extensions 
7.0 U+0605 1 L2/09-163R Pandey, Anshuman (2009-09-15), Proposal to Encode Coptic Numerals in ISO/IEC 10646 
L2/10-114 N3786 Pandey, Anshuman (2010-04-10), Towards an Encoding for Coptic Numbers in the UCS 
L2/10-206R N3843R Pandey, Anshuman (2010-06-21), Final Proposal to Encode Coptic Numbers 
L2/10-421R N3958R Pandey, Anshuman (2010-11-01), Request to Rename ‘Coptic Numbers’ to ‘Coptic Epact Numerals’ 
L2/11-062R N3990 Pandey, Anshuman (2011-02-14), Final Proposal to Encode Coptic Epact Numbers 
  1. Proposed code points and character names may differ from final code points and names
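The code point lists in the table use a compact notation in which `0621..063A` denotes an inclusive range. The following Python sketch expands that notation and checks it against the per-version totals given in the version history. The helper `parse_ranges` and the `ADDITIONS` mapping are illustrative names. The data is transcribed from the table above, with the several rows belonging to one version merged into a single string.

```python
def parse_ranges(spec):
    """Expand a list such as 'U+060C, 061B, 0621..063A' into code points."""
    points = []
    for item in spec.replace("U+", "").split(","):
        item = item.strip()
        if not item:
            continue
        first, _, last = item.partition("..")
        start = int(first, 16)
        end = int(last, 16) if last else start
        points.extend(range(start, end + 1))  # ranges are inclusive
    return points


# Code points added per Unicode version, as listed in the History table.
ADDITIONS = {
    "1.0.0": "U+060C, 061B, 061F, 0621..063A, 0640..0652, 0660..066C, "
             "0670..06B7, 06BA..06BE, 06C0..06CE, 06D0..06D5, 06F0..06F9",
    "1.1": "U+066D, 06D6..06ED",
    "3.0": "U+0653..0655, 06B8..06B9, 06BF, 06CF, 06FA..06FE",
    "3.2": "U+066E..066F",
    "4.0": "U+0600..0602, 060D..060E, 0610..0614, 0656..0658, "
           "0603, 060F, 0615, 06EE..06EF, 06FF",
    "4.1": "U+060B, 061E, 065A..065C, 0659, 065D..065E",
    "5.1": "U+0606..060A, 0616, 063B..063F, 0617..061A",
    "6.0": "U+0620, 065F",
    "6.1": "U+0604",
    "6.3": "U+061C",
    "7.0": "U+0605",
}

# Running total of assigned code points after each version.
assigned = set()
totals = {}
for version, spec in ADDITIONS.items():
    assigned.update(parse_ranges(spec))
    totals[version] = len(assigned)

# Code points in U+0600..U+06FF that no listed version assigned.
unassigned = sorted(set(range(0x0600, 0x0700)) - assigned)

if __name__ == "__main__":
    for version, total in totals.items():
        print(version, total)
    print([f"U+{cp:04X}" for cp in unassigned])  # ['U+061D']
```

The running totals reproduce the version history figures (169 for version 1.0.0 up to 255 for version 7.0), and the single code point left over, U+061D, matches the one reserved code point reported for the block.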

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.
  3. The Unicode Consortium (2011). The Unicode Standard, Version 6.0.0. Mountain View, CA: The Unicode Consortium. ISBN 978-1-936213-01-6. Chapter 8.