Two dots (diacritic)
Template:Short description Template:Redirects Template:Infobox diacritic Template:Orthography notation Diacritical marks of two dots Template:Char, placed side-by-side over or under a letter, are used in several languages for several different purposes. The most familiar to English-language speakers are the diaeresis and the umlaut, though there are numerous others. For example, in Albanian, Template:Char represents a schwa. Such diacritics are also sometimes used for stylistic reasons (as in the family name Brontë or the band name Mötley Crüe).
In modern computer systems using Unicode, the two-dot diacritics are almost always encoded identically, having the same code point.[1] For example, Template:Unichar represents both o-umlaut and o-diaeresis. Their appearance in print or on screen may vary between typefaces but rarely within the same typeface.
The word trema (Template:Langx), used in linguistics and also classical scholarship, describes the form of both the umlaut diacritic and the diaeresis rather than their function and is used in those contexts to refer to either.
Uses
Diaeresis
Template:Main article As the "diaeresis" diacritic, it is used to mark the separation of two distinct vowels in adjacent syllables when an instance of diaeresis (or hiatus) occurs, so as to distinguish from a digraph or diphthong. For example, in the obsolete spelling "coöperate", the diaeresis reminded the reader that the word has four syllables co-op-er-ate, not three. It is used in several languages of western and southern Europe, though rarely now in English.[2] One well-known usage is in French - the diaeresis is used in naïve, which is commonly spelled in English without the diaeresis. It is, however, obligatory in French, to show that it is pronounced [na.iv] rather than [nev].
Umlaut
Template:Main article As the "umlaut" diacritic, it indicates a sound shift Template:Snd also known as umlautTemplate:Snd in which a back vowel becomes a front vowel. It is a specific feature of German and other Germanic languages, affecting the graphemes Template:Angle bracket, Template:Angle bracket, Template:Angle bracket and Template:Angle bracket, which are modified to Template:Angle bracket, Template:Angle bracket, Template:Angle bracket and Template:Angle bracket.
It can be seen in the Sutterlin script, formerly used widely in German handwriting, in which the letter e is formed as two short parallel vertical lines very close together (see under Sütterlin#Characteristics).
Stylistic use
The two dot diacritic is also sometimes used for purely stylistic reasons. For example, the Brontë family, whose surname was derived from Gaelic and had been anglicised as "Prunty", or "Brunty": At some point, the father of the sisters, Patrick Brontë (born Brunty), decided on the alternative spelling with a diaeresis diacritic over the terminal Template:Angbr to indicate that the name had two syllables.
Similarly the "metal umlaut" is a diacritic that is sometimes used gratuitously or decoratively over letters in the names of hard rock or heavy metal bandsTemplate:Sndfor example, those of Motörhead and Mötley Crüe, and of parody bands, such as Spın̈al Tap.
Other uses by language
A double dot is also used as a diacritic in cases where it functions as neither a diaeresis nor an umlaut. In the International Phonetic Alphabet (IPA), a double dot above a letter is used for a centralized vowel, a situation more similar to umlaut than to diaeresis. In other languages it is used for vowel length, nasalization, tone, and various other uses where diaeresis or umlaut was available typographically. The IPA uses a double dot below a letter to indicate breathy (murmured) voice.[3]Template:Efn
Vowels
- In Albanian, Tagalog, Kashubian, and Luxembourgish Template:Angle bracket represents a schwa [ə].
- In Aymara, a double dot is used on Template:Angle bracket Template:Angle bracket Template:Angle bracket for vowel length.
- In the Basque dialect of Soule, Template:Angle bracket represents Script error: No such module "IPA".
- In the DMG romanization of Tunisian Arabic, Template:Angle bracket, Template:Angle bracket, Template:Angle bracket, Template:Angle bracket, and Template:Angle bracket represent Script error: No such module "IPA"., Script error: No such module "IPA"., Script error: No such module "IPA"., Script error: No such module "IPA"., and Script error: No such module "IPA"..
- In Ligurian official orthography, Template:Angle bracket is used to represent the sound Script error: No such module "IPA"..
- In Māori, a diaeresis (e.g. Script error: No such module "Lang".) was often used on computers in the past instead of the macron to indicate long vowels, as the diaeresis was relatively easy to produce on many systems, and the macron difficult or impossible.[4][5]
- In Seneca, Template:Angle bracket Template:Angle bracket are nasal vowels, though Template:Angle bracket is Script error: No such module "IPA"., as in German umlaut.
- In Vurës (Vanuatu), Template:Angle bracket and Template:Angle bracket encode respectively Script error: No such module "IPA". and Script error: No such module "IPA"..
- In the Pahawh Hmong script, a double dot is used as one of several tone marks.
- The double dot was used in the early Cyrillic alphabet, which was used to write Old Church Slavonic. The modern Cyrillic Belarusian and Russian alphabets include the letter Template:Angle bracket (yo), although replacing it with the letter Template:Angle bracket without the diacritic is allowed in Russian.
- Since the 1870s, Template:Angbr, Template:Angbr (Cyrillic letter yi) has been used in the Ukrainian alphabet for iotated Script error: No such module "IPA".; plain і is not iotated Script error: No such module "IPA".. In Udmurt, ӥ is used for uniotated Script error: No such module "IPA"., with и for iotated Script error: No such module "IPA"..
- The form Template:Angbr is common in Dutch handwriting and also occasionally used in printed text – but is a form of the digraph "ij" rather than a modification of the letter Template:Angbr.
- Komi and Udmurt use Template:Angle bracket (a Cyrillic O with two dots) for Script error: No such module "IPA"..
- The Swedish, Finnish and Estonian languages use Template:Angle bracket and Template:Angle bracket to represent Script error: No such module "IPA". and Script error: No such module "IPA".
- In the languages of J.R.R. Tolkien's Middle-Earth novels, a diaeresis is used to separate vowels belonging to different syllables (e.g. in Eärendil) and on final e to mark it as not a schwa or silent (e.g. in Manwë, Aulë, Oromë, etc.). (There is no schwa in these languages but Tolkien wanted to make sure that readers wouldn't mistakenly pronounce one when speaking the names aloud.)Script error: No such module "Unsubst".
Consonants
Jacaltec (a Mayan language) and Malagasy are among the very few languages with a double dot on the letter "n"; in both, n̈ is the velar nasal Script error: No such module "IPA"..
In Udmurt, a double dot is also used with the consonant letters ӝ Script error: No such module "IPA". (from ж Script error: No such module "IPA".), ӟ Script error: No such module "IPA". (from з Script error: No such module "IPA".) and ӵ Script error: No such module "IPA". (from ч Script error: No such module "IPA".).
When distinction is important, Ḧ and ẍ are used for representing Script error: No such module "IPA". and Script error: No such module "IPA". in the Kurdish Kurmanji alphabet (which are otherwise represented by "h" and "x"). These sounds are borrowed from Arabic.
Ẅ and ÿ: Ÿ is generally a vowel, but it is used as the (semi-vowel) consonant Script error: No such module "IPA". (a Script error: No such module "IPA". without the use of the lips) in Tlingit. This sound is also found in Coast Tsimshian, where it is written ẅ.
A number of languages in Vanuatu use double dots on consonants, to represent linguolabial (or "apicolabial") phonemes in their orthography. Thus Araki contrasts bilabial p Script error: No such module "IPA". with linguolabial p̈ Script error: No such module "IPA".; bilabial m Script error: No such module "IPA". with linguolabial m̈ Script error: No such module "IPA".; and bilabial v Script error: No such module "IPA". with linguolabial v̈ Script error: No such module "IPA"..
Seneca uses Template:Angle bracket for Script error: No such module "IPA"..
In Arabic the letter ẗ is used in the ISO 233 transliteration for the tāʾ marbūṭah [ة], used to mark feminine gender in nouns and adjectives.
Syriac uses a two dots above a letter, called Siyame, to indicate that the word should be understood as plural. For instance, Script error: No such module "Lang". (Template:Transliteration) means "house", while Script error: No such module "Lang". (Template:Transliteration) means "houses". The sign is used especially when no vowel marks are present, which could differentiate between the two forms. Although the origin of the Siyame is different from that of the diaeresis sign, in modern computer systems both are represented by the same Unicode character. This, however, often leads to wrong rendering of the Syriac text.
The N'Ko script, used to write the Mandé languages of West Africa uses a two-dot diacritic (among others) to represent non-native sounds. The dots are slightly larger than those used for diaeresis or umlaut.
Diacritic underneath
Script error: No such module "Labelled list hatnote". The IPA specifies a "subscript umlaut", for example Hindi Script error: No such module "IPA". "potter";[3]Template:Rp the ALA-LC romanization system provides for its use and is one of the main schemes to romanize Persian (for example, rendering Template:Angbr as Template:Angbr). The notation was used to write some Asian languages in Latin script, for example Red Karen.
The double-dot underneath a vowel is still used in Fuzhou romanization of Eastern Min to indicate a modified vowel sound; placing the modifier diacritic underneath the vowel letter makes it easier to combine it with tonal diacritics above the letter, as in the word Mìng-dĕ̤ng-ngṳ̄ ("Eastern Min language").
Side dots
The diacritics 〮 and 〯 , known as Bangjeom (Script error: No such module "Lang".), were used to mark pitch accents in Hangul for Middle Korean. They were written to the left of a syllable in vertical writing and above a syllable in horizontal writing.
Computer encodings
In Unicode
Character encoding generally treats the umlaut and the diaeresis as the same diacritic mark. Unicode refers to both as diaereses without making any distinction, although the term itself has a more precise literary meaning. For example, Template:Unichar represents both o-umlaut and o-diaeresis, while similar codes are used to represent all such cases.
Unicode encodes a number of cases of "letter with a two dots diacritic" as precomposed characters and these are displayed below. (Unicode uses the term "Diaeresis" for all two-dot diacritics, irrespective of the actual term used for the language in question.) In addition, many more symbols may be composed using the combining character facility, Template:Unichar, that may be used with any letter or other diacritic to create a customised symbol but this does not mean that the result has any real-world application and are not shown in the table. Template:Letters with diacritic/headerTemplate:HlistTemplate:Letters with diacritic/footer
Both the combining character Template:Tt and the pre-composed codepoints may be regarded as an umlaut or a diaeresis according to context. Compound diacritics are possible, for example Template:Unichar, used as a tonal marks for Hanyu Pinyin, which uses both a two dots diacritic with a caron diacritic. Conversely, when the letter to be accented is an Template:Angbr, the diacritic replaces the tittle, thus: Template:Angbr.
Sometimes, there's a need to distinguish between the umlaut sign and the diaeresis sign. For instance, either may appear in a German name. ISO/IEC JTC 1/SC 2/WG 2 recommends the following for these cases:[6]
- To represent the umlaut use the Combining Diaeresis (U+0308)
- To represent the diaeresis use Combining Grapheme Joiner (CGJ, U+034F) + Combining Diaeresis (U+0308)
The same advice can be found in the official Unicode FAQ.[7]
Since version 3.2.0, Unicode also provides Template:Unichar which can produce the older umlaut typography.
Unicode provides a combining double dot below as Template:Unichar.
Finally, for use with the N'Ko script, there is Template:Unichar.
Pre-Unicode
Template:Broader ASCII, a seven-bit code with just 95 "printable" characters, has no provision for any kind of dot diacritic. Subsequent standardisation treated ASCII as the US national variant of ISO/IEC 646: the French, German and other national variants reassigned a few code points to specific vowels with diacritics, as precomposed characters. Some of these variants also defined the sequence Template:Tt,backspace,Template:Tt as producing Template:Tt but few terminals supported this.
The subsequent (eight bit) ISO 8859-1 character encoding includes the letters ä, ë, ï, ö, ü, and their respective capital forms, as well as ÿ in lower case only, with Ÿ added in the revised edition ISO 8859-15 and Windows-1252.
Computer usage
Character encoding generally treats the umlaut and the diaeresis as the same diacritic mark. Unicode refers to both as diaereses without making any distinction, although the term itself has a more precise literary meaning. For example, Template:Unichar represents both o-umlaut and o-diaeresis, while similar codes are used to represent all such cases.
In countries where the local language(s) routinely include letters with diacritics, local keyboards are typically engraved with those symbols. If letters with double dots are not present on the keyboard, there are a number of ways to input them into a computer system. (For details, see local sources, computer system documentation and the article Unicode input.)
See also
Notes
References
External links
Script error: No such module "Navbox". Template:Latin script
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ a b Script error: No such module "citation/CS1"..
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".