Intel HEX
From Wikipedia, the free encyclopedia
Intel HEX is a file format for conveying binary information for applications like programming microcontrollers, EPROMs, and other kinds of chips. It is one of the oldest file formats available for this purpose.
The format is a text file, with each line containing hexadecimal values encoding a sequence of data and their starting offset or absolute address.
There are three types of Intel HEX: 8-bit, 16-bit, and 32-bit. They are distinguished by their byte order.
Each line of Intel HEX file consists of six parts:
- Start code, one character, an ASCII colon ':'.
- Byte count, two hex digits, a number of bytes (hex digit pairs) in the data field. 16 (0x10) or 32 (0x20) bytes of data are the usual compromise values between line length and address overhead.
- Address, four hex digits, a 16-bit address of the beginning of the memory position for the data. Limited to 64 kilobytes, the limit is worked around by specifying higher bits via additional record types.
- Record type, two hex digits, 00 to 05, defining the type of the data field.
- Data, a sequence of n bytes of the data themselves. 2n hex digits.
- Checksum, two hex digits - the least significant byte of the two's complement sum of the values of all fields except fields 1 and 6. It is calculated by adding together the hex-encoded bytes (hex digit pairs), taking only the LSB, and either subtracting the byte from 0x100 or inverting the bytes (XORing 0xFF) and adding one (1). If the checksum is correctly calculated, adding all the bytes (the Byte count, both bytes in Address, the Record type, each Data byte and the Checksum) together will always result in a value wherein the least significant byte is zero (0x00).
There are six record types:
- 00, data record, contains data and 16-bit address. The format described above.
- 01, End Of File record, a file termination record. No data. Has to be the last line of the file, only one per file permitted. Usually ':00000001FF'.
- 02, Extended Segment Address Record, segment-base address. Used when 16 bits is not enough, identical to 80x86 real mode addressing. The address specified by the 02 record is multiplied by 16 (shifted 4 bits left) and added to the subsequent 00 record addresses. This allows addressing of up to a megabyte of address space. The address field of this record has to be 0000, the byte count is 02 (the segment is 16-bit). The least significant hex digit of the segment address is always 0.
- 03, Start Segment Address Record. For 80x86 processors, it specifies the initial content of the CS:IP registers. The address field is 0000, the byte count is 04, the first two bytes are the CS value, the latter two are the IP value.
- 04, Extended Linear Address Record, allowing for fully 32 bit addressing. The address field is 0000, the byte count is 02. The two data bytes represent the upper 16 bits of the 32 bit address, when combined with the address of the 00 type record.
- 05, Start Linear Address Record. The address field is 0000, the byte count is 04. The 4 data bytes represent the 32-bit value loaded into the EIP register of the 80386 and higher CPU.
To confuse the matter, there are various format subtypes:
- I8HEX or INTEL 8, 8-bit format.
- I16HEX or INTEL 16, 16-bit format. Allows usage of 02 records. The data field endianness may be byte-swapped.
- I32HEX or INTEL 32, 32-bit format. Allows usage of 03, 04, and 05 records. The data field endianness may be byte-swapped.
Beware of byte-swapped data. Some programmers tend to misinterpret the byte order in case of I16HEX and I32HEX.
[edit] External links
- Hexplorer for editing of Intel HEX files. For Windows, free.
- SRecord, multi-platform GPL'ed tool for manipulating EPROM load files.
- Binex, a converter between Intel HEX and binary.
- Format description at PIC List