Z-variant

From Wikipedia, the free encyclopedia
(Redirected from Zvariant)
Jump to navigation Jump to search

Template:Short description Template:SpecialChars In Unicode, two glyphs are said to be Z-variants (often spelled zVariants) if they share the same etymology but have slightly different appearances and different Unicode code points. For example, the Unicode characters Template:Mono Script error: No such module "Lang". and Template:Mono Script error: No such module "Lang". are Z-variants. The notion of Z-variance is only applicable to the "CJKV scripts"—Chinese, Japanese, Korean and Vietnamese—and is a subtopic of Han unification.

Differences on the Z-axis

The Unicode philosophy of code point allocation for CJK languages is organized along three "axes." The X-axis represents differences in semantics; for example, the Latin capital A (Template:Mono A) and the Greek capital alpha (Template:Mono Α) are represented by two distinct code points in Unicode, and might be termed "X-variants" (though this term is not common). The Y-axis represents significant differences in appearance though not in semantics; for example, the traditional Chinese character māo "cat" (Template:Mono Script error: No such module "Lang".) and the simplified Chinese character (Template:Mono Script error: No such module "Lang".) are Y-variants.[1]

The Z-axis represents minor typographical differences. For example, the Chinese characters (Template:Mono Script error: No such module "Lang".) and (Template:Mono Script error: No such module "Lang".) are Z-variants, as are (Template:Mono Script error: No such module "Lang".) and (Template:Mono Script error: No such module "Lang".). The glossary at Unicode.org defines "Z-variant" as "Two CJK unified ideographs with identical semantics and unifiable shapes,"[1] where "unifiable" is taken in the sense of Han unification.

Thus, were Han unification perfectly successful, Z-variants would not exist. They exist in Unicode because it was deemed useful to be able to "round-trip" documents between Unicode and other CJK encodings such as Big5 and CCCII. For example, the character Script error: No such module "Lang". has CCCII encoding 21552D, while its Z-variant Script error: No such module "Lang". has CCCII encoding 2D552D. Therefore, these two variants were given distinct Unicode code points, so that converting a CCCII document to Unicode and back would be a lossless operation.

Confusion

There is some confusion over the exact definition of "Z-variant." For example, in an Internet Draft (of Template:IETF RFC) dated 2002,[2] one finds Template:Transliteration "no" (Template:Mono Script error: No such module "Lang".) and (Template:Mono Script error: No such module "Lang".) described as "font variants," the term "Z-variant" being apparently reserved for interlanguage pairs such as the Mandarin Chinese Template:Transliteration "rabbit" (Template:Mono Script error: No such module "Lang".) and the Japanese Template:Transliteration "rabbit" (Template:Mono Script error: No such module "Lang".). However, the Unicode Consortium's Unihan database[3]Script error: No such module "Unsubst". treats both pairs as Z-variants.

See also

Template:Sister project

References

Template:Reflist

Template:Unicode navigation

  1. a b Script error: No such module "citation/CS1".
  2. Script error: No such module "Citation/CS1".
  3. Script error: No such module "citation/CS1".