Search results
Jump to navigation
Jump to search
- ...t|square brackets]] or [[slash (punctuation)|slashes]], which are not part of the alphabet proper and merely signify that it is phonetic as opposed to re Today, officially, SAMPA has been developed for all the sounds of the following languages: ...6 KB (763 words) - 13:45, 28 April 2025
- {{Short description|Thai language character set and encoding}} ...the Royal Thai Government, and is the sole official standard for encoding Thai in [[Thailand]]. ...17 KB (2,289 words) - 03:55, 29 March 2025
- {{Short description|Thai character encoding, based on ASCII}} ...ly referred to as '''Latin/Thai'''. It is nearly identical to the national Thai standard [[TIS-620]] (1990). The sole difference is that ISO/IEC 8859-11 al ...36 KB (4,886 words) - 09:05, 1 March 2025
- {{Short description|Digital representation of text characters}} ...nge|ASCII]], the upper 128 codepoints are TSCII-specific. After long years of being used on the Internet by private agreement only, it was successfully r ...13 KB (1,866 words) - 14:38, 30 April 2025
- ...ter encoding", or "charset", used to encode a given document as a sequence of bytes. ...fferent platforms. The external character encoding is chosen by the author of the document (or the software the author uses to create the document) and d ...22 KB (3,374 words) - 17:04, 14 October 2025
- ...is a code point reserved in some [[coded character set]]s for the purpose of breaking words across lines by inserting visible [[hyphen]]s if they fall o ...11-04-08}}</ref><ref name="kuhn">{{cite web| title= Unicode interpretation of SOFT HYPHEN breaks ISO 8859-1 compatibility| url= https://www.unicode.org/ ...10 KB (1,543 words) - 00:23, 1 June 2024
- {{Short description|Sets of characters used in the 1980s & 90s}} '''Windows code pages''' are sets of characters or [[code pages]] (known as [[character encoding]]s in other ope ...45 KB (6,196 words) - 19:21, 24 March 2025
- ...ape]] with the word "Wikipedia" encoded in [[ASCII]]. Presence and absence of a hole represents 1 and 0, respectively; for example, W is encoded as {{cod ...[[control character]]s and [[Whitespace character|whitespace]]. Character encodings have also been defined for some [[constructed language]]s. When encoded, ch ...31 KB (4,430 words) - 01:16, 5 November 2025
- ...ntaining Unicode characters are transcoded to a subset of ASCII consisting of letters, digits, and hyphens, which is called the letter–digit–hyphen (LDH) ...tracker.ietf.org/doc/html/rfc3492 3492], ''Punycode: A Bootstring encoding of Unicode for Internationalized Domain Names in Applications (IDN)'', A. Cost ...13 KB (1,965 words) - 17:40, 30 April 2025
- {{Short description|Series of standards for 8-bit character encodings}} ...access=subscription }}</ref> The ISO working group maintaining this series of standards has been disbanded. ...48 KB (5,214 words) - 15:50, 14 October 2025
- ...ansformation Format]], [[extended ASCII]],{{efn|Not in the strictest sense of the term, as ASCII bytes can appear as trail bytes.}} [[variable-width enco ...ode of 0x80 in Microsoft's later versions of CP936/GBK and a two byte code of A2 E3 in GB18030.<!--any other exceptions? anyone suicidal enough to check? ...44 KB (6,224 words) - 18:26, 4 May 2025
- ...ay when the product’s actual origin is disclosed. The International Review of Retail, Distribution and Consumer Research, 27(1): 43-60.</ref> * [[Berghaus]], a British outdoor equipment company, converted the name of its first premises (LD Mountain Centre) roughly into German to market its o ...16 KB (2,304 words) - 23:57, 5 April 2025
- {{Short description|Dated classifications of computing character sets}} ...a [[character encoding]] and as such it is a specific association of a set of printable [[character (computing)|character]]s and [[control character]]s w ...93 KB (11,953 words) - 08:23, 4 February 2025
- {{Short description|Type of internet domain name}} [[Image:IDN-utopia-greek.jpg|thumbnail|Example of Greek IDN with domain name in non-[[Latin alphabet]]: ουτοπία.δπθ.gr ([[Pun ...42 KB (5,916 words) - 10:45, 21 June 2025
- {{short description|10th letter of the Latin alphabet}} {{About|the tenth letter of the Latin alphabet}} ...26 KB (3,554 words) - 11:08, 29 June 2025
- | caption = Logo of the [[Unicode Consortium]] | lang = 168 scripts ''([[Script (Unicode)#List of scripts in Unicode|list]])'' ...111 KB (15,416 words) - 01:03, 18 November 2025
- ...rs written from 0 to 9|The ten digits of the [[Arabic numerals]], in order of value]] ...|base]], the number of different digits required is the [[absolute value]] of the base. For example, decimal (base 10) requires ten digits (0 to 9), ...34 KB (4,871 words) - 21:23, 23 April 2025
- ...ic component, generally based on the [[rebus principle]], and the addition of a phonetic component to pure [[ideographs]] is considered to be a key innov ==Types of logographic systems== ...31 KB (4,339 words) - 21:00, 2 October 2025
- The usual name of the script is given first; the name of the [[language]]s in which the script is written follows (in brackets), par ...ll expressive capacity of a language. Unger disputes claims made on behalf of [[Blissymbols]] in his 2004 book ''Ideogram''. ...54 KB (6,599 words) - 14:17, 28 June 2025
- ...anish Argentinian, Swedish, Tagalog, Tajik Cyrillic, Tamil, Tatar, Telugu, Thai, Turkish, Ukrainian, Urdu, Uyghur, Uzbek, Uzbek Cyrillic, Venetian, Vietnam ...crosoft Windows application; the author considered, but rejected, the idea of using [[wxWidgets]] to [[Porting|port]] it to the [[macOS|Mac OS X]] and [[ ...29 KB (3,711 words) - 08:49, 19 June 2025