Octet (computing)
The octet is a unit of digital information in computing and telecommunications that consists of eight bits. The term is often used when the term byte might be ambiguous, as the byte has historically been used for storage units of a variety of sizes.
The term octad(e) for eight bits is no longer common.[1][2]
Definition
A variable-length sequence of octets, as in Abstract Syntax Notation One (ASN.1), is referred to as an octet string.
The international standard IEC 60027-2, chapter 3.8.2, states that a byte is an octet of bits. However, the unit byte has historically been platform-dependent and has represented various storage sizes in the history of computing. Due to the influence of several major computer architectures and product lines, the byte became overwhelmingly associated with eight bits. This meaning of byte is codified in such standards as ISO/IEC 80000-13. While to most people presently, byte and octet are synonymous, those working with certain legacy systems are careful to avoid ambiguity.
Octets are often expressed and displayed using a variety of representations, for example in the hexadecimal, decimal, or octal number systems. The binary value of all eight bits set (or activated) is 2, equal to the hexadecimal value 1111111116, the decimal value FF 10, and the octal value 255 8. One octet can be used to represent decimal values ranging from 0 to 255. 377
The term octet (symbol: o) is often used when the use of byte might be ambiguous. It is frequently used in the Request for Comments (RFC) publications of the Internet Engineering Task Force to describe storage sizes of network protocol parameters. The earliest example is RFC 635 from 1974. In 2000, Bob Bemer claimed to have proposed the usage of the term octet for "8-bit bytes" when he headed software operations for Cie. Bull in France in 1965 to 1966 already.[3]
In France, French Canada and Romania, octet is used in common language instead of byte when the 8-bit sense is required, for example, a megabyte (MB) is termed a megaoctet (Mo).
Octad
Historically, the term octad (or octade) was used to specifically denote 8 bits as well at least in Western Europe;[2][1] however, this usage is no longer common today. The exact origin of this term is unclear, but it can be found in British,[2] Dutch and German sources of the 1960s and 1970s, and throughout the documentation of Philips mainframe computers.[1] Similar terms exist in common English such as triad for a grouping of three and decade for ten.
Unit multiples
Unit multiples of the octet may be formed with SI prefixes and binary prefixes (power of 2 prefixes) as standardized by the International Electrotechnical Commission during 1998.
SI Prefixes | |||
---|---|---|---|
1 kilooctet (ko) | = 103 octets | = 1000 octets | |
1 megaoctet (Mo) | = 106 octets | = 1000 ko | = 1000000 octets |
1 gigaoctet (Go) | = 109 octets | = 1000 Mo | = 1000000000 octets |
1 teraoctet (To) | = 1012 octets | = 1000 Go | = 1000000000000 octets |
1 petaoctet (Po) | = 1015 octets | = 1000 To | = 1000000000000000 octets |
1 exaoctet (Eo) | = 1018 octets | = 1000 Po | = 1000000000000000000 octets |
1 zettaoctet (Zo) | = 1021 octets | = 1000 Eo | = 1000000000000000000000 octets |
1 yottaoctet (Yo) | = 1024 octets | = 1000 Zo | = 1000000000000000000000000 octets |
Binary Prefixes | |||
---|---|---|---|
1 kibioctet (Kio) | = 210 octets | = 1024 octets | |
1 mebioctet (Mio) | = 220 octets | = 1024 Kio | = 1048576 octets |
1 gibioctet (Gio) | = 230 octets | = 1024 Mio | = 1073741824 octets |
1 tebioctet (Tio) | = 240 octets | = 1024 Gio | = 1099511627776 octets |
1 pebioctet (Pio) | = 250 octets | = 1024 Tio | = 1125899906842624 octets |
1 exbioctet (Eio) | = 260 octets | = 1024 Pio | = 1152921504606846976 octets |
1 zebioctet (Zio) | = 270 octets | = 1024 Eio | = 1180591620717411303424 octets |
1 yobioctet (Yio) | = 280 octets | = 1024 Zio | = 1208925819614629174706176 octets |
Use in internet protocol addresses
The octet is used in the representation of Internet Protocol computer network addresses.[4] An IPv4 address consists of four octets, usually shown individually as a series of decimal values ranging from 0 to 255, each separated by a full stop (dot). Using octets with all eight bits set, the representation of the highest numbered IPv4 address is 255.255.255.255.
An IPv6 address consists of sixteen octets, shown using hexadecimal representation (two digits per octet) and using a colon character (:) after each pair of octets (16 bits also known as hextet) for readability, like this FE80:0000:0000:0000:0123:4567:89AB:CDEF. If a pair or more consecutive octets equal zero it may be replaced by two following colon characters (::) but this can be used only once in a given IPv6 address to avoid ambiguity. The previously given IPv6 address can thus also be written as FE80::0123:4567:89AB:CDEF. In addition leading zeroes may also be omitted as they are not significant bits in the address. Applying this to the previous example mentioned will result in an IPv6 address of FE80::123:4567:89AB:CDEF.
References
- 1 2 3 "Philips - Philips Data Systems' product range - April 1971" (PDF). Philips. 1971. Archived from the original (PDF) on 2016-03-03. Retrieved 2016-10-03.
- 1 2 3 Williams, R. H. (1969-01-01). "British Commercial Computer Digest: Pergamon Computer Data Series". Pergamon Press. ISBN 1483122107. 978-1483122106. Retrieved 2015-08-03.
- ↑ Bemer, Robert William (2000-08-08). "Why is a byte 8 bits? Or is it?". Computer History Vignettes. Retrieved 2017-05-15.
[…] I came to work for IBM, and saw all the confusion caused by the 64-character limitation. Especially when we started to think about word processing, which would require both upper and lower case. […] I even made a proposal (in view of STRETCH, the very first computer I know of with an 8-bit byte) that would extend the number of punch card character codes to 256 […]. So some folks started thinking about 7-bit characters, but this was ridiculous. With IBM's STRETCH computer as background, handling 64-character words divisible into groups of 8 (I designed the character set for it, under the guidance of Dr. Werner Buchholz, the man who DID coin the term "byte" for an 8-bit grouping). […] It seemed reasonable to make a universal 8-bit character set, handling up to 256. In those days my mantra was "powers of 2 are magic". And so the group I headed developed and justified such a proposal […] The IBM 360 used 8-bit characters, although not ASCII directly. Thus Buchholz's "byte" caught on everywhere. I myself did not like the name for many reasons. The design had 8 bits moving around in parallel. But then came a new IBM part, with 9 bits for self-checking, both inside the CPU and in the tape drives. I exposed this 9-bit byte to the press in 1973. But long before that, when I headed software operations for Cie. Bull in France in 1965-66, I insisted that "byte" be deprecated in favor of "octet". […]
- ↑ Kozierok, Charles M. (2005-09-20) [2001]. "The TCP/IP Guide - Binary Information and Representation: Bits, Bytes, Nibbles, Octets and Characters - Byte versus Octet". 3.0. Archived from the original on 2017-04-03. Retrieved 2017-04-03.
External links
- The dictionary definition of octet at Wiktionary