Cedilla
Template:Short description Script error: No such module "redirect hatnote". Script error: No such module "Distinguish". Script error: No such module "For". Template:Expand French Template:Infobox diacritic Template:Orthography notation A cedilla (Template:IPAc-en Template:Respell; from Spanish Template:Linktext, "small ceda", i.e. small "z"), or cedille (from French Script error: No such module "Lang"., Script error: No such module "IPA".), is a hook or tail (Template:Char) added under certain letters (as a diacritical mark) to indicate that their pronunciation is modified. In Catalan (where it is called Script error: No such module "Lang".), French, and Portuguese (where it is called a Script error: No such module "Lang".) it is used only under the letter Template:Angbr (to form Template:Angbr), and the entire letter is called, respectively, Script error: No such module "Lang". (i.e. "broken C"), Script error: No such module "Lang"., and Script error: No such module "Lang". (or Script error: No such module "Lang"., colloquially). It is used to mark vowel nasalization in many languages of Sub-Saharan Africa, including Vute from Cameroon.
This diacritic is not to be confused with the ogonek (◌̨), which resembles the cedilla but mirrored. It looks also very similar to the diacritical comma, which is used in the Romanian and Latvian alphabet, and which is misnamed "cedilla" in the Unicode standard.
There is substantial overlap between the cedilla and a diacritical comma. The cedilla is traditionally centered on the letter, and when there is no stroke for it to attach to in that position, as in Ņ ņ, the connecting stroke is omitted, taking the form of a comma. However, the cedilla may instead be shifted left or right to attach to a descending leg. In some orthographies the comma form has been generalized even in cases where the cedilla could attach, as in Ḑ ḑ, but is still considered to be a cedilla. This produces a contrast between attached and non-attached (comma) glyphs, which is usually left to the font but in the cases of Ş ş Ţ ţ and Ș ș Ț ț is formalized by Unicode.
Origin
The tail originated in Spain as the bottom half of a miniature cursive z. The word cedilla is the diminutive of the Old Spanish name for this letter, Script error: No such module "Lang". (Script error: No such module "Lang".).[1] Modern Spanish and isolationist Galician no longer use this diacritic, although it is used in Reintegrationist Galician, Portuguese,[2] Catalan, Occitan, and French, which gives English the alternative spellings of cedille, from French "Script error: No such module "Lang".", and the Portuguese form Script error: No such module "Lang".. An obsolete spelling of cedilla is cerilla.[2] The earliest use in English cited by the Oxford English Dictionary[2] is a 1599 Spanish-English dictionary and grammar.[3] Chambers' Cyclopædia[4] is cited for the printer-trade variant ceceril in use in 1738.[2] Its use in English is not universal and applies to loan words from French and Portuguese such as façade, limaçon and cachaça (often typed facade, limacon and cachaca because of lack of ç keys on English-language keyboards).
With the advent of typeface modernism, the calligraphic nature of the cedilla was thought somewhat jarring on sans-serif typefaces, and so some designers instead substituted a comma design, which could be made bolder and more compatible with the style of the text.Template:Efn This reduces the visual distinction between the cedilla and the diacritical comma.
C
Script error: No such module "Labelled list hatnote". The most frequent character with cedilla is "ç" ("c" with cedilla, as in façade). It was first used for the sound of the voiceless alveolar affricate Script error: No such module "IPA". in old Spanish and stems from the letter Template:Angbr (the Visigothic form of the letter Template:Angbr), whose upper loop was lengthened and reinterpreted as a "c", whereas its lower loop became the diminished appendage, the cedilla.
It represents the "soft" sound Script error: No such module "IPA"., the voiceless alveolar sibilant, where a "c" would normally represent the "hard" sound Script error: No such module "IPA". (before "a", "o", "u", or at the end of a word) in English and in certain Romance languages such as Catalan, Galician, French (where ç appears in the name of the language itself, Script error: No such module "Lang".), Ligurian, Occitan, and Portuguese. In Occitan, Friulian, and Catalan, ç can also be found at the beginning of a word (Script error: No such module "Lang"., Script error: No such module "Lang".) or at the end (Script error: No such module "Lang".).
It represents the voiceless postalveolar affricate Script error: No such module "IPA". (as in English "church") in Albanian, Azerbaijani, Crimean Tatar, Friulian, Kurdish, Tatar, Turkish (as in Script error: No such module "Lang"., Script error: No such module "Lang"., Script error: No such module "Lang"., Script error: No such module "Lang".), and Turkmen. It is also sometimes used this way in Manx, to distinguish it from the velar fricative.
In the International Phonetic Alphabet, ⟨ç⟩ represents the voiceless palatal fricative.
S
Script error: No such module "Labelled list hatnote". The character "ş" represents the voiceless postalveolar fricative Script error: No such module "IPA". (as in "show") in several languages, including many belonging to the Turkic languages, and included as a separate letter in their alphabets:
- Turkish
- Azerbaijani
- Crimean Tatar
- Gagauz
- Tatar
- Turkmen
- Romanian (substitution use when S-comma [Ș] was missing from pre-3.0 Unicode standards, and older standards, still frequent, but an error)
- Kurdish
In HTML character entity references Ş and ş can be used.
T
Script error: No such module "Labelled list hatnote". Gagauz uses Ţ (T with cedilla), one of the few languages to do so, and Ş (S with cedilla). Besides being present in some Gagauz orthographies, T with Cedilla also exists in the General Alphabet of Cameroon Languages, in the Kabyle language, in the Manjak and Mankanya languages, and possibly elsewhere.
In 1868, Ambroise Firmin-Didot suggested in his book Script error: No such module "Lang". (Observations on French Spelling) that French phonetics could be better regularized by adding a cedilla beneath the letter "t" in some words. For example, the suffix Script error: No such module "Lang". is usually not pronounced as Script error: No such module "IPA". but as Script error: No such module "IPA".. It has to be distinctly learned that in words such as Script error: No such module "Lang". (but not Script error: No such module "Lang".), it is pronounced Script error: No such module "IPA".. A similar effect occurs with other prefixes or within words. Firmin-Didot surmised that a new character could be added to French orthography. A letter with the same description, T-cedilla (majuscule: Ţ, minuscule: ţ), is used in Gagauz. A similar letter, the T-comma (majuscule: Ț, minuscule: ț), exists in Romanian, but it has a comma accent, not a cedilla.
Languages with other characters with cedillas
Latvian
Comparatively, some consider the diacritics on the palatalized Latvian consonants "ģ", "ķ", "ļ", "ņ", and formerly "ŗ" to be cedillas. Although their Adobe glyph names are commas, their names in the Unicode Standard are "g", "k", "l", "n", and "r" with a cedilla. The letters were introduced to the Unicode standard before 1992, and their names cannot be altered. The uppercase equivalent "Ģ" sometimes has a regular cedilla.
Marshallese
Script error: No such module "Unsubst". In Marshallese orthography, four letters in Marshallese have cedillas: Template:Angle bracket. In standard printed text they are always cedillas, and their omission or the substitution of comma below and dot below diacritics are nonstandard.Script error: No such module "Unsubst".
Template:As of, many font rendering engines do not display any of these properly, for two reasons:
- "Script error: No such module "Lang"." and "Script error: No such module "Lang"." usually do not display properly at all, because of the use of the cedilla in Latvian. Unicode has precombined glyphs for these letters, but most quality fonts display them with comma below diacritics to accommodate the expectations of Latvian orthography. This is considered nonstandard in Marshallese. The use of a zero-width non-joiner between the letter and the diacritic can alleviate this problem: "Script error: No such module "Lang"." and "Script error: No such module "Lang"." may display properly, but may not; see below.
- "Script error: No such module "Lang"." and "Script error: No such module "Lang"." do not currently exist in Unicode as precombined glyphs, and must be encoded as the plain Latin letters "Script error: No such module "Lang"." and "Script error: No such module "Lang"." with the combining cedilla diacritic. Most Unicode fonts issued with Windows do not display combining diacritics properly, showing them too far to the right of the letter, as with Tahoma ("m̧" and "o̧") and Times New Roman ("m̧" and "o̧"). This mostly affects "Script error: No such module "Lang".", and may or may not affect "Script error: No such module "Lang".". But some common Unicode fonts like Arial Unicode MS ("m̧" and "o̧"), Cambria ("m̧" and "o̧") and Lucida Sans Unicode ("m̧" and "o̧") do not have this problem. When "Script error: No such module "Lang"." is properly displayed, the cedilla is either underneath the center of the letter, or is underneath the right-most leg of the letter, but is always directly underneath the letter wherever it is positioned.
Because of these font display issues, it is not uncommon to find nonstandard ad hoc substitutes for these letters. The online version of the Marshallese-English Dictionary (the only complete Marshallese dictionary in existence)Script error: No such module "Unsubst". displays the letters with dot below diacritics, all of which do exist as precombined glyphs in Unicode: "Script error: No such module "Lang".", "Script error: No such module "Lang".", "Script error: No such module "Lang"." and "Script error: No such module "Lang".". The first three exist in the International Alphabet of Sanskrit Transliteration, and "Script error: No such module "Lang"." exists in the Vietnamese alphabet, and both of these systems are supported by the most recent versions of common fonts like Arial, Courier New, Tahoma and Times New Roman. This sidesteps most of the Marshallese text display issues associated with the cedilla, but is still inappropriate for polished standard text.
Vute
Vute, a Mambiloid language from Cameroon, uses cedilla for the nasalization of all vowel qualities (cf. the ogonek used in Polish and Navajo for the same purpose). This includes unconventional Roman letters that are formalized from the IPA into the official writing system. These include <i̧ ȩ ɨ̧ ə̧ a̧ u̧ o̧ ɔ̧>.
Hebrew
The ISO 259 romanization of Biblical Hebrew uses Ȩ (E with cedilla) and Ḝ (E with cedilla and breve).
Saanich
Saanich uses a spacing cedilla Template:Angle bracket as a letter. For fonts created or updated after 2025, a cedilla in the middle of a word should not trigger a word break at the end of a line, to accommodate Saanich.
Diacritical comma
Languages such as Romanian, Latvian and Livonian add a comma (virgula) to some letters, such as Script error: No such module "Lang"., which looks somewhat like a cedilla, but is more precisely a diacritical comma. This is particularly confusing with letters which can take either diacritic: for example, the consonant Script error: No such module "IPA". is written as "ş" in Turkish but as "ș" in Romanian, and Romanian writers will sometimes use the former instead of the latter because of insufficient computer support.
Adobe names of the Latvian letters (Script error: No such module "Lang". and formerly Script error: No such module "Lang".) use the word "comma", but in the Unicode Standard they are named "g", "k", "l", "n", and "r" with cedilla. The letters were introduced to the Unicode standard before 1992, and their names cannot be altered. Influenced by Latvian, Livonian has the same problem for "d̦", "ļ", "ņ", "ŗ" and "ț". The Polish letters Script error: No such module "Lang". and Script error: No such module "Lang". and Lithuanian letters Script error: No such module "Lang". and Script error: No such module "Lang". are not made with the cedilla either, but with the unrelated ogonek diacritic.
Unicode
Unicode encodes a number of cases of "letter with cedilla" (so called, as explained above) as precomposed characters. In addition, several more letters in language orthographies are composed using the combining character facility (Template:Unichar and Template:Unichar).
Template:Letters with diacritic/headerTemplate:HlistTemplate:Letters with diacritic/footer In ambiguous cases, typeface designers must choose whether to use a cedilla diacritic or comma-below diacritic for these codepoints, leaving it to others to provide the user with a method to achieve the other form (i.e., that relies on the combining character method). Here are three popular faces that demonstrate the choices made:
- Arial: Ç ç Ḉ ḉ Ḑ ḑ Ȩ ȩ Ḝ ḝ Ģ ģ Ḩ ḩ Ķ ķ Ļ ļ M̧ m̧ Ņ ņ O̧ o̧ Ŗ ŗ Ş ş Ţ ţ Z̧ z̧
- Times New Roman: Ç ç Ḉ ḉ Ḑ ḑ Ȩ ȩ Ḝ ḝ Ģ ģ Ḩ ḩ Ķ ķ Ļ ļ M̧ m̧ Ņ ņ O̧ o̧ Ŗ ŗ Ş ş Ţ ţ Z̧ z̧
- Courier New: Ç ç Ḉ ḉ Ḑ ḑ Ȩ ȩ Ḝ ḝ Ģ ģ Ḩ ḩ Ķ ķ Ļ ļ M̧ m̧ Ņ ņ O̧ o̧ Ŗ ŗ Ş ş Ţ ţ Z̧ z̧
In each case, the diacritic displayed with D, G, K, L, N and R is a comma-below; in the other cases it is displayed as a cedilla. It may be that computer fonts are sold in the Romanian and Turkish markets that favour the national standard form of this diacritic.
References
Template:Reflist Template:Notelist
External links
- ScriptSource—Positioning the traditional cedilla
- Diacritics Project—All you need to design a font with correct accents
- Keyboard Help—Learn how to make world language accent marks and other diacriticals on a computer
Script error: No such module "Navbox". Template:Latin script
- ↑ For Script error: No such module "Lang". being the diminutive of Script error: No such module "Lang"., see definition of cedilla, Script error: No such module "Lang"., 22nd edition, Real Academia Española Template:In lang, which can be seen in context by accessing the site of the Real Academia and searching for Script error: No such module "Lang".. (This was accessed 27 July 2006.)
- ↑ a b c d Template:OED
- ↑ Minsheu, John (1599) Percyvall's (R.) Dictionarie in Spanish and English (as enlarged by J. Minsheu) Edm. Bollifant, London, Template:OCLC
- ↑ Chambers, Ephraim (1738) Cyclopædia; or, an universal dictionary of arts and sciences (2nd ed.) Template:OCLC