ISO 15924
Template:Short description ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts (a "set of graphic characters used for the written form of one or more languages"). Each script is given both a four-letter code and a numeric code.[1]
Where possible the codes are derived from ISO 639-2, where the name of a script and the name of a language using the script are identical (example: Gujarātī ISO 639 guj, ISO 15924 Gujr). Preference is given to the 639-2 Bibliographical codes, which is different from the otherwise often preferred use of the Terminological codes.[1]
4-letter ISO 15924 codes are incorporated into the IANA Language Subtag Registry for IETF language tags and so can be used in file formats that make use of such language tags. For example, they can be used in HTML and XML to help Web browsers determine which typeface to use for foreign text. This way one could differentiate, for example, between Serbian written in the Cyrillic (sr-Cyrl) or Latin (sr-Latn) script, or mark romanized or transliterated text as such.
Maintenance
ISO appointed the Unicode Consortium as the Registration Authority (RA) for the standard. The RA is responsible for appointing a registrar who works with a Joint Advisory Committee (JAC) in developing and implementing the standard. The registrar from 2004 to 2018 was Michael Everson, and from January 2019 the registrar has been Markus Scherer of Google.[2] The JAC consists of six members: one representative of the RA (Markus Scherer), one representative of ISO 639-2 (Randall K. Barry of the Library of Congress), one representative of ISO/TC 37 (Christian Galinski), one representative of ISO/TC 46 (Peeter Päll), and two representatives of ISO/IEC JTC 1/SC 2 (Rick McGowan and Jan Kučera, both affiliated with the Unicode Consortium).[3]
Script codes
Numeric ranges
- <templatestyles src="Mono/styles.css" />000–099 Hieroglyphic and cuneiform scripts
- <templatestyles src="Mono/styles.css" />100–199 Right-to-left alphabetic scripts
- <templatestyles src="Mono/styles.css" />200–299 Left-to-right alphabetic scripts
- <templatestyles src="Mono/styles.css" />300–399 Alphasyllabic scripts
- <templatestyles src="Mono/styles.css" />400–499 Syllabic scripts
- <templatestyles src="Mono/styles.css" />500–599 Ideographic scripts
- <templatestyles src="Mono/styles.css" />600–699 Undeciphered scripts
- <templatestyles src="Mono/styles.css" />700–799 Shorthands and other notations[4]
- <templatestyles src="Mono/styles.css" />800–899 (unassigned)
- <templatestyles src="Mono/styles.css" />900–999 Private use, alias, special codes[5]
Special codes
- <templatestyles src="Mono/styles.css" />Qaaa–Qabx (900–949): 50 Codes reserved for private use (for example, <templatestyles src="Mono/styles.css" />Qaag is defined in LDML to mark Burmese text encoded for the Zawgyi font)[6]
- <templatestyles src="Mono/styles.css" />Zsye 993: Emoji
- <templatestyles src="Mono/styles.css" />Zinh 994: Code for inherited scriptTemplate:Efn
- <templatestyles src="Mono/styles.css" />Zmth 995: Mathematical notation
- <templatestyles src="Mono/styles.css" />Zsym 996: Symbols
- <templatestyles src="Mono/styles.css" />Zxxx 997: Code for unwritten documents
- <templatestyles src="Mono/styles.css" />Zyyy 998: Code for undetermined script
- <templatestyles src="Mono/styles.css" />Zzzz 999: Code for uncoded script
Exceptionally reserved codes
Two four-letter codes are reserved at the request of the Common Locale Data Repository (CLDR) project:[7]
- <templatestyles src="Mono/styles.css" />Root: Reserved for the language-neutral base of the CLDR locale tree
- <templatestyles src="Mono/styles.css" />True: Reserved for the Boolean value "true"
List of codes
This list of codes is from the ISO 15924 standard.[7] Template:ISO 15924 script codes and related Unicode data
Relations to other standards
The following standards are referred to as indispensable by ISO 15924.
- ISO 639-2:1998 Codes for the representation of names of languages — Part 2: Alpha-3 code
- ISO/IEC 9541-1:1991 Information technology — Font information interchange — Part 1: Architecture
- ISO/IEC 10646-1:2020 Information technology — Universal Multiple-Octet Coded Character Set (UCS)
For definition of font and glyph the standard refers to
- ISO/IEC 9541-1:1991
Around 160 scripts are defined in Unicode. Through a linkpin called "Property Value Alias", Unicode has made a 1:1 connection between a script defined, and its ISO 15924 standard. See Script (Unicode).
See also
Notes
References
<templatestyles src="Reflist/styles.css" />
- ↑ a b Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ In July, 2010, Duployan shorthand was assigned code 755, even though the 700-799 range still carried its original designation of (unassigned). Shortly thereafter, Revision 1.1 clarified that codes in the 700s were reserved for "Shorthands and other notations", although that revision is only provisional until it can be confirmed by governing committees.
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ a b Script error: No such module "citation/CS1".
Script error: No such module "Check for unknown parameters".