Intel HEX

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Template:Short description Script error: No such module "Distinguish". Script error: No such module "redirect hatnote". Template:Use dmy dates Template:Use list-defined references Template:Cs1 config Script error: No such module "Infobox".Template:Template otherScript error: No such module "Check for unknown parameters".

Intel hexadecimal object file format, Intel hex format or Intellec Hex is a file format that conveys binary information in ASCII text form,[1] making it possible to store on non-binary media such as paper tape, punch cards, etc., to display on text terminals or be printed on line-oriented printers.[2] The format is commonly used for programming microcontrollers, EPROMs, and other types of programmable logic devices and hardware emulators. In a typical application, a compiler or assembler converts a program's source code (such as in C or assembly language) to machine code and outputs it into a object or executable file in hexadecimal (or binary) format. In some applications, the Intel hex format is also used as a container format holding packets of stream data.[3] Common file extensions used for the resulting files are .HEX[4] or .H86.[5][6] The HEX file is then read by a programmer to write the machine code into a PROM or is transferred to the target system for loading and execution.[2][7] There are various tools to convert files between hexadecimal and binary format (i.e. HEX2BIN), and vice versa (i.e. OBJHEX, OH, OHX, BIN2HEX).

History

The Intel hex format was originally designed for Intel's Intellec Microcomputer Development Systems[8]Template:Rp (MDS) in 1973 in order to load and execute programs from paper tape. It was also used to specify memory contents to Intel for ROM production,[9] which previously had to be encoded in the much less efficient BNPF (Begin-Negative-Positive-Finish) format.[8]Template:Rp In 1973, Intel's "software group" consisted only of Bill Byerly and Kenneth Burgett, and Gary Kildall as an external consultant doing business as Microcomputer Applications Associates (MAA) and founding Digital Research in 1974.[10][11][12][13] Beginning in 1975, the format was utilized by Intellec Series II ISIS-II systems supporting diskette drives, with files using the file extension HEX.[14] Many PROM and EPROM programming devices accept this format.

Format

Intel HEX consists of lines of ASCII text that are separated by line feed or carriage return characters or both. Each text line contains uppercase hexadecimal characters that encode multiple binary numbers. The binary numbers may represent data, memory addresses, or other values, depending on their position in the line and the type and length of the line. Each text line is called a record.

Record structure

A record (line of text) consists of six fields (parts) that appear in order from left to right:[2]

  1. Start code, one character, an ASCII colon 'Template:Background color'. All characters preceding this symbol in a record should be ignored.[9][5][15][16][17][18] In fact, very early versions of the specification even asked for a minimum of 25 NUL characters to precede the first record and follow the last one, owing to the format's origins as a paper tape format which required some tape leadin and leadout for handling.[9][19][16][17] However, as this was a little known part of the specification, not all software written copes with this correctly. It allows to store other related information in the same file (and even the same line),[9][18] a facility used by various software development utilities to store symbol tables or additional comments,[20][9][16][21][13][22] and third-party extensions using other characters as start code like the digits 'Template:Background color'..'Template:Background color' by Intel[23] and Keil,[21] 'Template:Background color' by Mostek,[24][25] or 'Template:Background color', 'Template:Background color', 'Template:Background color', 'Template:Background color', 'Template:Background color' and 'Template:Background color' by TDL.[25][26] By convention, 'Template:Background color' is often used for comments.[27][28] Neither of these extensions may contain any ':' characters as part of the payload.
  2. Byte count, two hex digits (one hex digit pair), indicating the number of bytes (hex digit pairs) in the data field. The maximum byte count is 255 (0xFF). The values of 8 (0x08),[13] 16 (0x10)[13] and 32 (0x20) are commonly used byte counts. Not all software copes with counts larger than 16.[29]
  3. Address, four hex digits, representing the 16-bit beginning memory address offset of the data. The physical address of the data is computed by adding this offset to a previously established base address, thus allowing memory addressing beyond the 64 kilobyte limit of 16-bit addresses. The base address, which defaults to zero, can be changed by various types of records. Base addresses and address offsets are always expressed as big endian values.
  4. Record type (see record types below), two hex digits, Template:Intel HEX to Template:Intel HEX, defining the meaning of the data field.
  5. Data, a sequence of n bytes of data, represented by 2n hex digits. Some records omit this field (n equals zero). The meaning and interpretation of data bytes depends on the application. (4-bit data will either have to be stored in the lower or upper half of the bytes, that is, one byte holds only one addressable data item.[9])
  6. Checksum, two hex digits, a computed value that can be used to verify the record has no errors.

Color legend

As a visual aid, the fields of Intel HEX records are colored throughout this article as follows:

Template:Legend inline Template:Legend inline Template:Legend inline Template:Legend inline Template:Legend inline Template:Legend inline

Checksum calculation

A record's checksum byte is the two's complement of the least significant byte (LSB) of the sum of all decoded byte values in the record preceding the checksum. It is computed by summing the decoded byte values and extracting the LSB of the sum (i.e., the data checksum), and then calculating the two's complement of the LSB (e.g., by inverting its bits and adding one).

For example, in the case of the record Template:Intel HEX, the sum of the decoded byte values is Template:Intel HEX + Template:Intel HEX + Template:Intel HEX + Template:Intel HEX + Template:Intel HEX + Template:Intel HEX + Template:Intel HEX = E2, which has LSB value E2. The two's complement of E2 is Template:Intel HEX, which is the checksum byte appearing at the end of the record.

The validity of a record can be checked by computing its checksum and verifying that the computed checksum equals the checksum appearing in the record; an error is indicated if the checksums differ. Since the record's checksum byte is the two's complement — and therefore the additive inverse — of the data checksum, this process can be reduced to summing all decoded byte values, including the record's checksum, and verifying that the LSB of the sum is zero. When applied to the preceding example, this method produces the following result: Template:Intel HEX + Template:Intel HEX + Template:Intel HEX + Template:Intel HEX + Template:Intel HEX + Template:Intel HEX + Template:Intel HEX + Template:Intel HEX = 100, which has LSB value 00.

Text line terminators

Intel HEX records are usually separated by one or more ASCII line termination characters so that each record appears alone on a text line. This enhances readability by visually delimiting the records and it also provides padding between records that can be used to improve machine parsing efficiency. However, the line termination characters are optional, as the 'Template:Background color' is used to detect the start of a record.[9][5][19][15][16][17][18]

Programs that create HEX records typically use line termination characters that conform to the conventions of their operating systems. For example, Linux programs use a single LF (line feed, hex value 0A) character to terminate lines, whereas Windows programs use a CR (carriage return, hex value 0D) followed by a LF.

Record types

Intel HEX has six standard record types:[2]

Hex code Record type Description Example
Template:Intel HEX Data The byte count specifies number of data bytes in the record. The example has Template:Intel HEX (eleven) data bytes. The 16-bit starting address for the data (in the example at addresses beginning at Template:Intel HEX) and the data (Template:Intel HEX, Template:Intel HEX, Template:Intel HEX, Template:Intel HEX, Template:Intel HEX, Template:Intel HEX, Template:Intel HEX, Template:Intel HEX, Template:Intel HEX, Template:Intel HEX, Template:Intel HEX). Template:Intel HEX
Template:Intel HEX End Of File Must occur exactly once per file in the last record of the file. The byte count is Template:Intel HEX, the address field is typically Template:Intel HEX and the data field is omitted. Template:Intel HEX
Template:Intel HEX Extended Segment Address The byte count is always Template:Intel HEX, the address field (typically Template:Intel HEX) is ignored and the data field contains a 16-bit segment base address. This is multiplied by 16 and added to each subsequent data record address to form the starting address for the data. This allows addressing up to one mebibyte (1048576 bytes) of address space. Template:Intel HEX
Template:Intel HEX Start Segment Address For 80x86 processors, specifies the starting execution address. The byte count is always Template:Intel HEX, the address field is Template:Intel HEX and the first two data bytes are the CS value, the latter two are the IP value. The execution should start at this address. Template:Intel HEX
Template:Intel HEX Extended Linear Address Allows for 32 bit addressing (up to 4 GiB). The byte count is always Template:Intel HEX and the address field is ignored (typically Template:Intel HEX). The two data bytes (big endian) specify the upper 16 bits of the 32 bit absolute address for all subsequent type Template:Intel HEX records; these upper address bits apply until the next Template:Intel HEX record. The absolute address for a type Template:Intel HEX record is formed by combining the upper 16 address bits of the most recent Template:Intel HEX record with the low 16 address bits of the Template:Intel HEX record. If a type Template:Intel HEX record is not preceded by any type Template:Intel HEX records then its upper 16 address bits default to 0000. Template:Intel HEX
Template:Intel HEX Start Linear Address The byte count is always Template:Intel HEX, the address field is Template:Intel HEX. The four data bytes represent a 32-bit address value (big endian). In the case of CPUs that support it, this 32-bit address is the address at which execution should start. Template:Intel HEX

Other record types have been used for variants, including Template:Intel HEX ('blinky' messages / transmission protocol container) by Wayne and Layne,[30] Template:Intel HEX (block start), Template:Intel HEX (block end), Template:Intel HEX (padded data), Template:Intel HEX (custom data) and Template:Intel HEX (other data) by the BBC/Micro:bit Educational Foundation,[31] and Template:Intel HEX (data in code segment), Template:Intel HEX (data in data segment), Template:Intel HEX (data in stack segment), Template:Intel HEX (data in extra segment), Template:Intel HEX (paragraph address for absolute code segment), Template:Intel HEX (paragraph address for absolute data segment), Template:Intel HEX (paragraph address for absolute stack segment) and Template:Intel HEX (paragraph address for absolute extra segment) by Digital Research.[6][15]

Script error: No such module "anchor".Named formats

The original 4-bit/8-bit Intellec Hex Paper Tape Format and Intellec Hex Computer Punched Card Format in 1973/1974 supported only one record type Template:Intel HEX.[32][33][20] This was expanded around 1975Template:When to also support record type Template:Intel HEX.[9] Sometimes called symbolic hexadecimal format,[34] it could include an optional header containing a symbol table for symbolic debugging,[20][23][21][13] all characters in a record preceding the colon are ignored.[9][5]

Around 1978Template:When, Intel introduced the new record types Template:Intel HEX and Template:Intel HEX (to add support for the segmented address space of the then-new 8086/8088 processors) in their Extended Intellec Hex Format.Template:When

Special names are sometimes used to denote the formats of HEX files that employ specific subsets of record types. For example:

File example

This example shows a file that has four data records followed by an end-of-file record:

Template:Intel HEX
Template:Intel HEX
Template:Intel HEX
Template:Intel HEX
Template:Intel HEX

Template:Legend inline Template:Legend inline Template:Legend inline Template:Legend inline Template:Legend inline Template:Legend inline

Script error: No such module "anchor".Variants

Besides Intel's own extension, several third-parties have also defined variants and extensions of the Intel hex format, including Digital Research (as in the so-called "Digital Research hex format"[6][15]), Zilog, Mostek,[24][25] TDL,[25][26] Texas Instruments, Microchip,[35][36] c't, Wayne and Layne,[30] and BBC/Micro:bit Educational Foundation (with its "Universal Hex Format"[31]). These can have information on program entry points and register contents, a swapped byte order in the data fields, fill values for unused areas, fuse bits, and other differences.

The Digital Research hex format for 8086 processors supports segment information by adding record types to distinguish between code, data, stack, and extra segments.[5][6][15]

Most assemblers for CP/M-80 (and also XASM09 for the Motorola 6809) don't use record type 01h to indicate the end of a file, but use a zero-length data type 00h entry instead.[37][4] This eases the concatenation of multiple hex files.[38][39][4]

Texas Instruments defines a variant where addresses are based on the bit-width of a processor's registers, not bytes.

Script error: No such module "anchor".Microchip defines variants INTHX8S[40] (INHX8L,[4] INHX8H[4]), INHX8M,[40][4][41] INHX16[40] (INHX16M[4]) and INHX32[42] for their PIC microcontrollers.

Alfred Arnold's cross-macro-assembler AS,[4] Werner Hennig-Roleff's 8051-emulator SIM51,[21] and Matthias R. Paul's cross-converter BINTEL[43] are also known to define extensions to the Intel hex format.

See also

References

Script error: No such module "Unsubst". Template:Reflist

Further reading

  • Script error: No such module "citation/CS1".
  • Script error: No such module "citation/CS1".
  • Script error: No such module "citation/CS1".
  • Script error: No such module "citation/CS1". (32 pages)
  • Script error: No such module "citation/CS1". (8 pages)

External links

  • binex - a converter between Intel HEX and binary for Windows.
  • SRecord, a converter between Intel HEX and binary for Linux (usage), C++ source code.
  • kk_ihex, open source C library for reading and writing Intel HEX
  • libgis, open source C library that converts Intel HEX, Motorola S-Record, Atmel Generic files.
  • bincopy is a Python package for manipulating Intel HEX files.
  • SwiftIntelHex - a Swift package to parse Intel HEX files for iOS and macOS.
  1. a b Cite error: Invalid <ref> tag; no text was provided for refs named Intel_1980_FU-UG
  2. a b c d Cite error: Invalid <ref> tag; no text was provided for refs named Intel_1988_Spec
  3. Cite error: Invalid <ref> tag; no text was provided for refs named Analog_2021
  4. a b c d e f g h Cite error: Invalid <ref> tag; no text was provided for refs named Arnold_2020_AS
  5. a b c d e Cite error: Invalid <ref> tag; no text was provided for refs named DRI_1981_CPM86-SG
  6. a b c d Cite error: Invalid <ref> tag; no text was provided for refs named DRI_1983_CPM86-PG
  7. Cite error: Invalid <ref> tag; no text was provided for refs named ARM_hex
  8. a b Cite error: Invalid <ref> tag; no text was provided for refs named Crosby_1994
  9. a b c d e f g h i Cite error: Invalid <ref> tag; no text was provided for refs named Intel_1977
  10. Cite error: Invalid <ref> tag; no text was provided for refs named Kildall_1980_CPM
  11. Cite error: Invalid <ref> tag; no text was provided for refs named Kildall_1993
  12. Cite error: Invalid <ref> tag; no text was provided for refs named Burgett_2017
  13. a b c d e Cite error: Invalid <ref> tag; no text was provided for refs named Roche_2020
  14. Cite error: Invalid <ref> tag; no text was provided for refs named Feichtinger_1987
  15. a b c d e Cite error: Invalid <ref> tag; no text was provided for refs named DRI_1984_CCPM-PRG
  16. a b c d Cite error: Invalid <ref> tag; no text was provided for refs named DataIO_1980_OG
  17. a b c Cite error: Invalid <ref> tag; no text was provided for refs named DataIO_1987_TFF
  18. a b c Cite error: Invalid <ref> tag; no text was provided for refs named Fujitsu_2001
  19. a b Cite error: Invalid <ref> tag; no text was provided for refs named DRI_1982_CPM-M
  20. a b c Cite error: Invalid <ref> tag; no text was provided for refs named Intel_1974_PLM
  21. a b c d Cite error: Invalid <ref> tag; no text was provided for refs named Hennig-Roleff_1993
  22. Cite error: Invalid <ref> tag; no text was provided for refs named GeorgG_2021
  23. a b Cite error: Invalid <ref> tag; no text was provided for refs named Intel_1979_2920
  24. a b Cite error: Invalid <ref> tag; no text was provided for refs named Formaniak-Leitch_1977
  25. a b c d Cite error: Invalid <ref> tag; no text was provided for refs named Ogdin-Colvin-Pittman-Tubb_1977
  26. a b Cite error: Invalid <ref> tag; no text was provided for refs named Kreidl_1981
  27. Cite error: Invalid <ref> tag; no text was provided for refs named Rüger_2022
  28. Cite error: Invalid <ref> tag; no text was provided for refs named Bull-Dean-Rüger-Wunsch_2023
  29. Cite error: Invalid <ref> tag; no text was provided for refs named Xilinx_2010_MCS
  30. a b Cite error: Invalid <ref> tag; no text was provided for refs named Beckler_2016
  31. a b Cite error: Invalid <ref> tag; no text was provided for refs named BBC_2021_Universal
  32. Cite error: Invalid <ref> tag; no text was provided for refs named Intel_1973_Intellec8
  33. Cite error: Invalid <ref> tag; no text was provided for refs named Intel_1974_MOD8
  34. Cite error: Invalid <ref> tag; no text was provided for refs named Intel_1988_DevTools
  35. Cite error: Invalid <ref> tag; no text was provided for refs named Kanda_2012
  36. Cite error: Invalid <ref> tag; no text was provided for refs named Microchip_2018
  37. Cite error: Invalid <ref> tag; no text was provided for refs named Kildall_1978
  38. Cite error: Invalid <ref> tag; no text was provided for refs named Zschocke_1987
  39. Cite error: Invalid <ref> tag; no text was provided for refs named Prior_1989
  40. a b c Cite error: Invalid <ref> tag; no text was provided for refs named Microchip_1994
  41. Cite error: Invalid <ref> tag; no text was provided for refs named Lucid_INHX8M
  42. Cite error: Invalid <ref> tag; no text was provided for refs named Lucid_INHX32
  43. Cite error: Invalid <ref> tag; no text was provided for refs named Paul_1992