YUSCII

From Wikipedia, the free encyclopedia

YUSCII was an informal name for JUS I.B1.002, national variant of ISO 646, 7-bit Latinic character encoding standard, and used in Yugoslavia before widespread use of later ISO-8859-2, Microsoft and Unicode standards. It was named after ASCII, having the first word "American" replaced with "Yugoslav": "Yugoslav Standard Code for Information Interchange". It maintained the same codes for all essential characters and replaced a number of symbols with letters specific to languages spoken in Yugoslavia.

YUSCII was originally developed for teletype telegraphs but it also spread for computer use. This was widely considered a bad idea among software developers who needed the original ASCII such as {, [, }, ], ^, ~, |, \ in their source code. Numerous attempts to replace it with something better kept failing due to limited support. Eventually, Microsoft's introduction of code pages, appearance of Unicode and availability of fonts finally spelled sure (but nevertheless still slow) end of YUSCII.

[edit] Codepage layout

Code points remained largely the same as in ASCII to maintain maximum compatibility. Following table shows allocation of character codes in YUSCII. Red entries are (somewhat) different from ASCII. Both Latin and Cyrillic glyphs are shown:

Binary Dec Hex Glyph
0010 0000 32 20 (blank) (␠)
0010 0001 33 21 !
0010 0010 34 22 "
0010 0011 35 23 #
0010 0100 36 24 $
0010 0101 37 25 %
0010 0110 38 26 &
0010 0111 39 27 '
0010 1000 40 28 (
0010 1001 41 29 )
0010 1010 42 2A *
0010 1011 43 2B +
0010 1100 44 2C ,
0010 1101 45 2D -
0010 1110 46 2E .
0010 1111 47 2F /
0011 0000 48 30 0
0011 0001 49 31 1
0011 0010 50 32 2
0011 0011 51 33 3
0011 0100 52 34 4
0011 0101 53 35 5
0011 0110 54 36 6
0011 0111 55 37 7
0011 1000 56 38 8
0011 1001 57 39 9
0011 1010 58 3A :
0011 1011 59 3B ;
0011 1100 60 3C <
0011 1101 61 3D =
0011 1110 62 3E >
0011 1111 63 3F ?
 
Binary Dec Hex Glyph
0100 0000 64 40 Ž/Ж
0100 0001 65 41 A/А
0100 0010 66 42 B/Б
0100 0011 67 43 C/Ц
0100 0100 68 44 D/Д
0100 0101 69 45 E/Е
0100 0110 70 46 F/Ф
0100 0111 71 47 G/Г
0100 1000 72 48 H/Х
0100 1001 73 49 I/И
0100 1010 74 4A J/Ј
0100 1011 75 4B K/К
0100 1100 76 4C L/Л
0100 1101 77 4D M/М
0100 1110 78 4E N/Н
0100 1111 79 4F O/О
0101 0000 80 50 P/П
0101 0001 81 51 Q/Љ
0101 0010 82 52 R/Р
0101 0011 83 53 S/С
0101 0100 84 54 T/Т
0101 0101 85 55 U/У
0101 0110 86 56 V/В
0101 0111 87 57 W/Њ
0101 1000 88 58 X/Џ
0101 1001 89 59 Y/Ѕ
0101 1010 90 5A Z/З
0101 1011 91 5B Š/Ш
0101 1100 92 5C Đ/Ђ
0101 1101 93 5D Ć/Ћ
0101 1110 94 5E Č/Ч
0101 1111 95 5F _
 
Binary Dec Hex Glyph
0110 0000 96 60 ž/ж
0110 0001 97 61 a/а
0110 0010 98 62 b/б
0110 0011 99 63 c/ц
0110 0100 100 64 d/д
0110 0101 101 65 e/е
0110 0110 102 66 f/ф
0110 0111 103 67 g/г
0110 1000 104 68 h/х
0110 1001 105 69 i/и
0110 1010 106 6A j/ј
0110 1011 107 6B k/к
0110 1100 108 6C l/л
0110 1101 109 6D m/м
0110 1110 110 6E n/н
0110 1111 111 6F o/о
0111 0000 112 70 p/п
0111 0001 113 71 q/љ
0111 0010 114 72 r/р
0111 0011 115 73 s/с
0111 0100 116 74 t/т
0111 0101 117 75 u/у
0111 0110 118 76 v/в
0111 0111 119 77 w/њ
0111 1000 120 78 x/џ
0111 1001 121 79 y/ѕ
0111 1010 122 7A z/з
0111 1011 123 7B š/ш
0111 1100 124 7C đ/ђ
0111 1101 125 7D ć/ћ
0111 1110 126 7E č/ч

[edit] Control characters

Control characters are the same as in ASCII:

Binary Oct Dec Hex Abbr PR1 CS2 Description
0000 0000 000 0 00 NUL ^@ Null character
0000 0001 001 1 01 SOH ^A Start of Header
0000 0010 002 2 02 STX ^B Start of Text
0000 0011 003 3 03 ETX ^C End of Text
0000 0100 004 4 04 EOT ^D End of Transmission
0000 0101 005 5 05 ENQ ^E Enquiry
0000 0110 006 6 06 ACK ^F Acknowledgment
0000 0111 007 7 07 BEL ^G Bell
0000 1000 010 8 08 BS ^H Backspace3,7
0000 1001 011 9 09 HT ^I Horizontal Tab
0000 1010 012 10 0A LF ^J Line feed
0000 1011 013 11 0B VT ^K Vertical Tab
0000 1100 014 12 0C FF ^L Form feed
0000 1101 015 13 0D CR ^M Carriage return6
0000 1110 016 14 0E SO ^N Shift Out
0000 1111 017 15 0F SI ^O Shift In
0001 0000 020 16 10 DLE ^P Data Link Escape
0001 0001 021 17 11 DC1 ^Q Device Control 1 (oft. XON)
0001 0010 022 18 12 DC2 ^R Device Control 2
0001 0011 023 19 13 DC3 ^S Device Control 3 (oft. XOFF)
0001 0100 024 20 14 DC4 ^T Device Control 4
0001 0101 025 21 15 NAK ^U Negative Acknowledgement
0001 0110 026 22 16 SYN ^V Synchronous Idle
0001 0111 027 23 17 ETB ^W End of Trans. Block
0001 1000 030 24 18 CAN ^X Cancel
0001 1001 031 25 19 EM ^Y End of Medium
0001 1010 032 26 1A SUB ^Z Substitute
0001 1011 033 27 1B ESC ^[ Escape5
0001 1100 034 28 1C FS ^\ File Separator
0001 1101 035 29 1D GS ^] Group Separator
0001 1110 036 30 1E RS ^^ Record Separator
0001 1111 037 31 1F US ^_ Unit Separator
0111 1111 177 127 7F DEL ^? Delete4,7
  1. Printable Representation, the Unicode glyphs reserved for representing control characters when it is necessary to print or display them rather than have them perform their intended function.
  2. Control key Sequence, the traditional key sequences for inputting control characters. The caret (^) represents the "Control" or "Ctrl" key that must be held down while pressing the second key in the sequence. The caret-key representation is also used by some software to represent control characters.
  3. The Backspace character can also be entered by pressing the "Backspace", "Bksp", or ← key on some systems.
  4. The Delete character can also be entered by pressing the "Delete" or "Del" key. It can also be entered by pressing the "Backspace", "Bksp", or ← key on some systems.
  5. The Escape character can also be entered by pressing the "Escape" or "Esc" key on some systems.
  6. The Carriage Return character can also be entered by pressing the "Return", "Ret", "Enter", or ↵ key on most systems.
  7. The ambiguity surrounding the Backspace key comes from systems that translated the DEL control character into a BS (backspace) before transmitting it. Some software was unable to process the character and would display "^H" instead.

[edit] See also

Languages