Cedilla: Difference between revisions
{{Short description|Diacritic used in Latin alphabets}}
{{dablinks|date=December 2025}}
{{Infobox diacritic
| mark = ¸
| name = Cedilla
| unicode = U+0327 (combining diacritic)<br/>U+00B8 (spacing symbol, ISO/IEC 8859)
| transcription =
}}
The '''cedilla''' ( ◌̧ ) (from [[Spanish language|Spanish]] ''cedilla'', "little ''[[z]]''") is a [[diacritic]] of the [[Latin alphabet]]. In [[French language|French]], it is used only under the [[letter (alphabet)|letter]] ''[[c]]'', both in [[lowercase]] and [[capital letter|uppercase]] forms: ''ç'', ''Ç''. | |||
It is also used in several other languages under different letters. It bears a visual resemblance to the numeral ''5'' with its upper stroke removed. | |||
Historically, the cedilla in Spanish (and, by geographic extension, in [[Portuguese language|Portuguese]] and [[Catalan language|Catalan]], then in French and [[Occitan language|Occitan]]) was placed only under a ''c'' derived, among other possibilities, from a [[Latin]] ''c'' that had undergone [[palatalization]]. It then formed the letter '''''ç''''' (“''c'' with cedilla”), originally pronounced /ts/ and later /s/ (and sometimes /z/ between [[vowel]]s).
== History ==
The modern [[grapheme]] of the cedilla derives from medieval [[Gothic script]] or [[Visigothic script]] ‹ ꝣ ›. The use of this sign arose from the limitations of the Latin alphabet. Its name comes from Spanish and appeared in the 17th century, meaning “little z” (as ''c'' replaced ''z'' in Spanish before ''e'').
Under the letter ''c'', the handwritten cedilla developed through three successive forms: a diacritic ''z'', then a ''z'' with a cedilla (subscript and sometimes superscript), and finally the modern ''c'' with cedilla. By contrast, the evolution of the ''e caudata'' is considered unrelated to that of the cedilla.
== In manuscripts ==
=== ''C with cedilla'' ===
The palatal [[phoneme]] /ts/ of the [[Romance languages]] derives from a [[Latin]] /k/ ''c'' that was [[palatalization|palatalized]] and then [[assibilation|assibilated]]. Before vowels that would otherwise trigger a non-palatalized (and therefore incorrect) pronunciation (/k/ before ''a'', ''o'', and ''u''), scribes used various spellings to indicate the “new” pronunciation: simply ''c'', ''ce'', or ''cz'' (with ''e'' and ''z'' functioning as [[diacritic]] letters).<ref> | |||
{{cite book |last=Riegel |first=Martin |title=Grammaire méthodique du français |author2=Pellat, Jean-Christophe |author3=Rioul, René |publisher=Presses universitaires de France |year=2009 |language=French |trans-title=Methodical grammar of French}}</ref> | |||
Thus ''ceo'' and ''czo'' were read /tso/: the diacritic ''e'' and ''z'' prevented the reading /ko/. | |||
This latter notation appears in French as early as the first literary manuscript in the French language, the ''[[Sequence of Saint Eulalia]]'' (dated to 881 and consisting of 29 verses), where it occurs only once, in verse 21. | |||
According to [[Algirdas Julien Greimas|Greimas]] (2001), the neuter demonstrative ''ço'' appears in the ''Sequence of Saint Eulalia''.<ref> | |||
{{cite book |last=Greimas |first=Algirdas Julien |title=Dictionnaire de l'ancien français |publisher=Larousse |year=2001 |isbn=2035320488 |language=French |trans-title=Dictionary of Old French}}</ref> | |||
However, Greimas gives only the form ''ço'' for this text, although the manuscript, rediscovered in the 19th century, contains no cedilla in any of its 29 verses. Moreover, the manuscript dates to 881 rather than broadly to the “10th century”. | |||
By contrast, {{Interlanguage link|Pierre Ivart|fr|Ivar Ch'Vavar}} analyzes the form ''czo'' as follows:<ref>
{{cite web
|last=Ivart
|first=Pierre
|title=La séquence de Sainte Eulalie est-elle un texte picard ?
|language=French
|trans-title=Is the Sequence of Saint Eulalia a Picard text?
|url=http://ysagnier.free.fr/langues/eulalie_picard.htm
|access-date=15 December 2025
}}</ref>
{{blockquote|
On '''tc''' and '''cz'''. These graphemes therefore represent ''ts''. The case of “czo” (v. 21) is fairly clear: the scribe could not use ''c'' alone before ''o'' to represent ''tch'', since before ''o'', ''c'' always has the value ''k'' (except in the ''Oaths of Strasbourg''). He therefore resorts to an experimental grapheme ''cz''.}}
The ''z'' in ''czo'' is thus interpreted as a diacritic ''z'' which, once placed beneath the ''c'', would become the cedilla. | |||
The Visigothic script is indeed thought to have abbreviated this grapheme around the 11th century in Spain. Initially, the ''c'' was written above the ''z'' in its form ''ʒ''; later, the ''c'' regained its full size while ''{{IPA|ʒ}}'' was reduced to a subscript sign. Thus, the Spanish word ''lanʒa'' /lantsa/ (“lance”) came to be written ''lança''. The usefulness of such a sign, and an early attempt to systematize the notation of /ts/, led (depending on scribes) to the extension of the cedilla before the vowels ''i'' and ''e'' (''çinco'', “five”). This was later regarded as a form of [[hypercorrection]], since ''c'' alone was sufficient (''cinco'' and ''çinco'' are pronounced identically).
Maria Selig confirms this Visigothic origin:<ref>
{{cite book |last=Selig |first=Maria |title=Le passage à l'écrit des langues romanes |author2=Frank, Barbara |author3=Hartmann, Jörg |publisher=Gunter Narr Verlag |year=1993 |isbn=3823342614 |pages=127 |language=French |trans-title=The transition to writing in the Romance languages}}</ref>
{{blockquote|
The history of the cedilla and its diffusion is well known today, so I shall confine myself to a very brief synthesis. As a result of the application of Visigothic script to new Spanish sounds, <ç>, with a subscript (sometimes superscript) <z>, appears from the earliest monuments of Castilian. The use of the cedilla has also been observed in the oldest charters written in Provençal (hence its later presence in Catalan) and in French.}}
Selig also notes that the diacritic spread across Europe more slowly than its phonetic value, and was in some cases “reappropriated” by different languages to represent sounds unrelated to its original function. | |||
In French, according to [[Jean Dubois (linguist)|Jean Dubois]], the cedilla appears “as early as the 8th century in Visigothic manuscripts, but was little used by scribes, who preferred to add an extra letter to indicate the sibilant sound of ''c'' (they wrote ''receut'', ''aperceut'')”.<ref>
{{cite book |last=Dubois |first=Jean |title=Dictionnaire de linguistique |publisher=Larousse |year=2002 |isbn=203532047X |language=French |trans-title=Dictionary of linguistics}}</ref>
Accordingly, in the manuscripts of ''[[The Song of Roland]]'', the cedilla is not used, although modern transcriptions add it for ease of reading. | |||
== Paleographic cedilled ''e'' (''e caudata'') ==
{{main|E caudata}}
[[File:Sacrecon.png|thumb|left|Excerpt from a Latin book published in [[Rome]] in 1632. An ''e caudata'' appears in the words ''Sacrę'', ''propagandę'', ''prædictę'', and ''grammaticę'' (alongside the form ''grammaticæ''). The typographic ''e caudata'' was adopted during the Renaissance from the much older manuscript ''e caudata'' (as in ''The Song of Roland'').]] | |||
A form resembling a cedilla can therefore be found beneath the letter ''e'' in medieval [[manuscript]]s, with usage attested as early as the 6th century in [[uncial script]]. The resulting letter is known as ''[[e caudata]]'' (“''e'' with a tail”, also called “tailed e”). It more or less frequently replaces the Latin [[Digraph (orthography)|digraph]] ''ae'' (often written as the [[ligature (typography)|ligature]] ''æ'', a convention that later spread more widely). This digraph generally represented an open {{IPA|/ɛ/}} (originally long, until distinctions of [[vowel length]] disappeared), derived from the Classical Latin [[diphthong]] {{IPA|/ae̯/}}, which was [[monophthong]]ized from the 2nd century onward. | |||
This usage continued in manuscripts until the 18th century but did not survive the advent of [[printing press|printing]]:<ref name="auto">{{cite book |last=Mortier |first=Raoul |title=Les textes de la Chanson de Roland — La version d'Oxford |publisher=Éditions de La geste francor |year=1940 |language=French |trans-title=The texts of The Song of Roland — The Oxford version}}</ref>
{{blockquote|
[The scribe of ''The Song of Roland'' wrote] ''ciel'' or ''cel'' with a cedilled ''e'' because, until the 13th century, Latin words in ''æ'' or ''œ'' were often written with a cedilled ''e''; recognizing the Latin ''cælum'' beneath the French ''cel'', he allowed himself (which has no meaning in French) to use a cedilled ''e'' (vv. 545, 646, 723, 1156, and 1596).}}
It is noteworthy that this letter, represented here as ''ę'' (with an [[ogonek]]) or ''ȩ'' (with a cedilla), has been preserved in [[Romance linguistics|Romance philological]] transcription, whereas the digraph ''ae'' (in its ligatured form ''[[æ]]'', known as ''ash'') has been retained in the transcription of [[Germanic languages]]. ''ę'' was used in manuscripts of [[Old English]] written in Insular Irish uncial.
Although this sign is often referred to as a “cedilla”, this is an anachronism: it has no connection with the letter ''z'', and it more likely derives from a subscript ''a''. | |||
This cedilla-like mark, whose use varied before the spread of printing, can therefore serve as an indicator for the dating of manuscripts by [[palaeography|palaeographers]]. For example, according to the ''Dictionnaire de paléographie'' by Louis Mas Latrie (1854), “manuscripts in which one finds the cedilled ''e'' rather than ''œ'' must be placed between five and seven hundred years ago”, that is, between 1150 and 1350:<ref name="auto"/> | |||
{{blockquote|
The letter ''e'' with a cedilla for ''æ'' therefore seems to characterize the eleventh century. Mabillon, ''De Re Diplomatica'', p. 367, supports this thesis. He already shows ''ę'' for ''ae'' in the tenth century, e.g. ''suę'' for ''suae'', ''ex sacramentario Ratoldi'', no. 587. But he also shows that this usage was not yet general and cites ''Galliae'', ''ex ms. codice Remigio''. His citations of fragments from the eleventh century generally contain ''ę'' for ''ae''. “''Ex codice nostro S. Germani'', 527: ''sapię'' for ''sapientiae''.” In the twelfth century, the same scholar shows ''ę'' for ''oe'', while plain ''e'' is used for ''ae''. “''Ex Flora Corb.'' nos. 488 and 489, ''pęno'' for ''poena'' (beginning of the twelfth century); ''dicte ecclesie'' for ''dictae ecclesiae''.” Charters provide the most compelling evidence and seem to prove that ''e'' with a cedilla used for ''ae'', when the usage is general, denotes the eleventh century. | |||
}}
=== Early printing ===
Manuscript usage was taken up in [[printing]], first by Spanish and Portuguese printers, and then imitated by the French printer [[Geoffroy Tory]]. According to Auguste Bernard, as early as 1509,<ref name="Bernard"> | |||
{{cite book |last=Bernard |first=Auguste |title=Geofroy Tory, peintre et graveur, premier imprimeur royal, réformateur de l'orthographie et de la typographie sous François Ier |publisher=E. Tross |year=1857 |series=Bulletin de la Société du Protestantisme Français |language=French |trans-title=Geoffroy Tory, painter and engraver, first royal printer, reformer of orthography and typography under Francis I |issn=1141-054X}}</ref> “Tory proposed writing with a cedilla the penultimate ''e'' of the third person plural of the perfect tense of verbs of the third conjugation (''emere'', ''contendere'', etc.) in order to distinguish it from the infinitive,” following the model already used shortly before 1509 in the ''Psalterium quintuplex''. If Bernard's account is followed, the cedilla would therefore have been used in Latin printing by Tory from the very beginning of the 16th century. | |||
The cedilla in French, in the form of ''c-cedilla'', was first explicitly advocated by the same author in the introduction to his book ''{{Interlanguage link|Champ fleury|fr}}'', published in 1529 (with printing privilege dated {{date|5 September 1526}}).<ref name="Bernard" />
Its subtitle clearly expresses its purpose: ''l’art et la science de la due et vraie proportion de la lettre'' (“the art and science of the proper and true proportion of the letter”). This work is, moreover, the first typographical treatise written in French:
{{blockquote|
''C'' before ''o'', in French pronunciation and language, is sometimes hard, as in ''coquin'', ''coquard'', ''coq'', ''coquillard''; sometimes it is soft, as in ''garcon'', ''macon'', ''francois'', and other similar words.
}}
[[File:Latin letter Çç.svg|thumb|class=skin-invert-image|Çç Çç]] | |||
[[File:Buchdruck-15-jahrhundert 1.jpg|thumb|A printing workshop in the 15th century.]] | |||
This defense of the cedilla was not immediately put into practice. In Tory's system, the cedilla was intended to mark /s/ (and no longer /ts/, since this [[phoneme]] had simplified in French by the 13th century and in [[Old Castilian]] between the 14th and 16th centuries). The cedilla formed part of Geoffroy Tory's typographical innovations (along with the [[comma]] and the [[apostrophe]]), whose aim was likely to facilitate the commercialization of the first books printed in French rather than Latin. | |||
He used the cedilla in French for the first time in ''Le sacre et coronnement de la royne'' by Guillaume Bochetel, published in 1531.<ref> | |||
{{cite web | |||
|title=L'apparition de la cédille en français | |||
|trans-title=The appearance of the cedilla in French | |||
|publisher=Musée d’Ecouen / Bibliothèque nationale de France | |||
|year=2011 | |||
|url=http://expositions.bnf.fr/tory/grand/2015.htm | |||
|language=French | |||
|access-date=15 December 2025 | |||
}}</ref> | |||
According to many authors, Tory generalized the use of ''c-cedilla'' in his edition of ''L’Adolescence Clémentine'' by [[Clément Marot]], the fourth edition of the work, published in 1533. The book had first appeared on 12 August 1532 in Paris, published by Roffet, without cedillas, and then on 7 June 1533 by Tory, this time with cedillas.<ref> | |||
{{cite book |last=Roudaut |first=François |title=L'Adolescence clémentine |publisher=Librairie générale française |year=2005 |isbn=2-253-08699-1 |location=Paris |pages=470 |language=French |trans-title=The Clementine Adolescence}}</ref> | |||
In reality, Tory had already introduced the cedilla at the beginning of 1530<ref> | |||
{{cite book |last=Rickard |first=Peter |title=La langue française au seizième siècle: étude suivie de textes |publisher=Cambridge University Press |year=1968 |pages=38 |language=French |trans-title=The French language in the sixteenth century: study followed by texts}}</ref> in his pamphlet ''Le sacre et le coronnement de la Royne, imprime par le commandement du Roy nostre Sire'', where it appears three times, in the words ''façon'', ''commença'', and ''Luçon''. | |||
The 1533 edition of ''L’Adolescence Clémentine'' nevertheless represents the first true generalization of the cedilla in a work that enjoyed success and was intended for a relatively large [[print run]] for the period. Tory justified the use of the cedilla in the introduction to this edition using the same arguments already advanced in ''Champ fleury'': | |||
{{blockquote|
[published] with certain marked accents, namely on the masculine ''é'' as distinct from the feminine, on words joined together by synaloephas, and under the ''ç'' when it takes on the pronunciation of ''s'', which until now, through lack of consideration, had not been done in the French language, although it was and remains very necessary.
}}
The practical application of Tory's orthographic system is irregular: apostrophes are missing in ''par faulte dadvis'', and oddly placed in ''combien q’uil''—likely a typographical error. As Bernard observes, this was the first work in which Tory applied his orthographic system, and the inexperience of his compositors is evident in the mistakes made by omission or transposition.<ref name="Bernard" /> | |||
From this point onward, the cedilla was adopted by all printers.<ref name="Bernard" /> Before this, supporters of etymological [[orthography]] wrote ''francoys''. Usage initially remained unstable. For example, in the ''Œuvres poétiques'' of [[Louise Labé]] (published by [[Jean de Tournes]] in 1555), one finds the cedilla in ''aperçu'' but not in ''perſa'' (modern ''perça''), which is instead written with an ''s'' to avoid ''perca''. | |||
From there, the use of the “''c with a tail''” (its earliest name) spread throughout France, but it was not until the 17th century that its use became truly common. | |||
In [[Spanish language|Spanish]], the cedilla was abandoned in the 18th century (''ç'' being replaced by ''z'' or simple ''c'' before ''e'' and ''i''), while /ts/ had simplified to /s/ between the 14th and 16th centuries and then to /θ/ in the 17th century. Other related languages ([[Catalan language|Catalan]], [[French language|French]], [[Portuguese language|Portuguese]]) nevertheless retained it. | |||
=== After the Renaissance ===
The introduction (and subsequent retention) of such a character in written French was an effective and broadly accepted way of definitively resolving the problem of the ambiguous pronunciation of the Latin letter ''c''. Indeed, when ''c'' precedes ''a'', ''o'', or ''u'', it is pronounced /k/; when it precedes any other vowel, it is pronounced /s/. The sign therefore makes it possible to preserve links with the past and to maintain the graphic coherence of the language by making spelling less ambiguous. The presence of a cedilla in a word or form keeps visible the relationships with the [[Etymology|etymon]] and with [[Derivation (linguistics)|derived forms]] or related forms. | |||
For [[Albert Dauzat]], “the simplification of an irrational orthography was in keeping with the tendencies of the 17th century, enamoured of clarity and reason. Many writers called for reform […]”.<ref>{{cite book |last=Dauzat |first=Albert |title=Phonétique et grammaire historiques de la langue française |publisher=Librairie Larousse |year=1950 |page=129 |language=French |trans-title=Historical phonetics and grammar of the French language}}</ref> The cedilla therefore became a stake in the many projects for [[Reforms of French orthography|orthographic reform of the French language]]. | |||
==== T-cedilla in French ====
With regard to these attempts at orthographic reform, the history of the ''t''-cedilla in French is exemplary.
In 1663, in ''Rome la ridicule, Caprice'' by Saint-Amant, the printer and proofreader for the [[Elzevir|Elzeviers]] in Amsterdam, Simon Moinet, used the cedilla under the letter ''t'' in French (for example, he wrote ''invanţion'').<ref>{{cite book |last=Firmin-Didot |first=Ambroise |title=Observations sur l'orthographe, ou ortografie, française |publisher= |year=1868 |page=84 |language=French |trans-title=Observations on French orthography, or ortografie}}</ref> | |||
In 1766, {{Interlanguage link|Jean-Raymond de Petity|fr}}, preacher to the queen, proposed the use of the cedilla under ''t'' to distinguish cases where it is read /t/ from those where it is pronounced /s/:<ref name="auto"/> | |||
{{blockquote|
One could still derive another benefit from the cedilla in favour of children and foreigners, who are often embarrassed about how they should pronounce the letter ''t'' in certain words; this would be to apply this sign to that letter when it has the value of ''s'', as in the words ''minutie'', ''portion'', ''faction'', ''quotien'', etc. By this expedient its pronunciation would be regulated, and one would no longer confuse the cases where it has its natural value, as in the words ''partie'', ''question'', ''digestion'', ''chrétien''. When it costs so little to remedy imperfections, it is gratuitously wishing to perpetuate them to allow them to subsist.
}}
[[File:Unicode 0x0327.svg|thumb|upright|class=skin-invert-image|The cedilla in French, a diacritic that nearly enabled a major simplification of French orthography.]] | |||
[[Ambroise Firmin Didot|Ambroise Firmin-Didot]], in his ''Observations sur l'orthographe, ou ortografie, française'' (1868), proposed to the [[Académie française]] a similar reform project aiming to introduce a ''t''-cedilla, ''ţ'' (depending on configuration, this may appear as a comma rather than a cedilla), in words where ''t'' is pronounced /s/ before ''i''. This would have eliminated a large number of irregularities in spelling (''nous adoptions'' ~ ''les adoptions'', ''pestilence'' ~ ''pestilentiel'', ''il différencie'' ~ ''il balbutie''). One would thus have written: ''les adopţions'', ''pestilenciel'' (with ''c'' preferred in order to agree better with the base ''pestilence''), ''il différencie'', ''il balbuţie''. | |||
In fact, as the author himself notes, the grammarians of [[Port-Royal Logic|Port-Royal]] had already proposed such an improvement before him (by means of a ''t'' with a [[Underdot|subscript dot]]: ''les adopṭions''). The project ultimately remained a dead letter. | |||
==== ''Açhille'', ''çhien'', ''çheval'': the proposals of Nicolas Beauzée ====
In the same spirit as that of Firmin-Didot, the generalization of ''c''- and ''t''-cedilla was defended by an Enlightenment [[grammarian]] such as [[Nicolas Beauzée]]. Thus, according to the 19th-century encyclopedist B. Jullien:<ref>{{cite book |last1=Glaire |first1=Jean-Baptiste |title=Encyclopédie catholique, répertoire universel et raisonné des sciences, des lettres, des arts et des métiers, formant une bibliothèque universelle, avec la biographie des hommes célèbres |last2=Walsh |first2=Joseph-Alexis |last3=Chantrel |first3=Joseph |last4=Orse |first4=Abbé |last5=Alletz |first5=Édouard |publisher=P. Desbarres |year=1843 |volume=6 |page=99 |language=French |trans-title=Catholic Encyclopedia, a universal and reasoned repertory of sciences, letters, arts, and trades, forming a universal library, with biographies of famous men}}</ref> | |||
{{blockquote|
The celebrated grammarian Nicolas Beauzée, who devoted himself extensively to the modifications to be introduced into our orthography in order to regularize it, wished to generalize the use of the cedilla, and derived such advantage from it that one can only regret that the [[Académie française|Academy]] did not take up this project in order to introduce it into our [[writing system|writing]]. According to Beauzée, the cedilla should indicate not only for the letter ''c'', but also for other letters when appropriate, and notably for ''t'', the transition from a hard sound to a [[fricative consonant|sibilant]] sound. This being the case, a simple cedilla would eliminate certain spelling differences that nothing justifies.
Thus one writes ''monarque'' with ''qu'', and ''monarchie'' with ''ch''; Beauzée proposed that ''ch'' written without a cedilla should always be pronounced /k/, and that the sibilant ''ch'', that of ''chien'' and ''cheval'', should be written with a cedilla, ''çhien'', ''çheval'': one would then write ''monarche'' and ''monarçhie''. The [[etymology]] would be preserved, and the pronunciation exactly represented. | |||
We write ''chœur'' and pronounce /kœʁ/; we write and pronounce ''chose''; and this diversity of pronunciation is often a difficulty for those who do not know French. According to Beauzée, one should write ''chœur'' and ''çhose'', ''Achaïe'' and ''Açhille'', ''Michel-Ange'' and ''arçhevêque'', and so on; note that this would hardly be a change in spelling, merely an extremely slight addition, which nevertheless would have the happiest results for all. How is it that the body established to guide the French language does not make every effort to adopt such wise corrections? | |||
Beauzée quite reasonably extended the application of the cedilla to the letter ''t''. Indeed, this letter very often takes in French the sibilant sound of ''s'', without there being any general rule for this. Thus ''nous portions'' and ''des portions'', ''nous inventions'' and ''des inventions'', are written exactly the same way and pronounced differently; Beauzée proposed placing the cedilla under the ''t'' pronounced as ''s''. Immediately all difficulty would disappear, and etymology would be preserved. The same was to apply in all such words as ''minutie'', ''calvitie'', etc., where ''t'' takes the sound of ''s''. By reciprocity, one could later restore the ''c''-cedilla in some words from which it has been improperly removed to make room for plain ''c''. Such is the case, for example, with ''mince'' derived from ''minutus'', ''accourcir'' derived from ''court'', where the ''c'' not present in the [[Root (linguistics)|root]] was substituted, under the influence of pronunciation, for the ''t'' required by etymology. | |||
One could multiply such examples; it suffices for me to have shown how one might successively introduce into our orthography some perfectly rational changes which, after a short time, would render it regular, while not offending usage. Assuredly this would be a fine service rendered to our language. | |||
}}
Moreover, it would have been possible to write the words ''lança'' and ''français'' using the letter ''s'', since the phoneme /ts/ no longer existed at the time of the [[loanword|borrowing]] of the cedilla. The [[phoneme]] had even merged with the other /s/ sounds. However, it was the visual and etymologizing appearance of the word that prevailed. The spelling ''*lansa'' would have introduced an awkward alternation: ''*il lansa'' ~ ''ils lancèrent''. In other languages, such as Spanish, the spelling of a conjugated verb may be inconsistent: one now writes ''lanzar'', thus “cutting oneself off” from the Latin etymology ''lanceare'', which was more explicitly reflected in ''lançar'' (though it reappears in alternation with ''lance'' in the present [[subjunctive]]). | |||
In addition to maintaining visual etymological coherence, the cedilla also makes it possible, in certain cases, to resolve spelling problems for the sound /s/ derived from /k/. For example, ''reçu'' retains a link with ''recevoir'', but above all could not be written in any other way: ''*resu'' would be read /ʁəzy/ and ''*ressu'' /resy/. The same applies to ''leçon'' and other words in which a [[schwa]] is followed by the phoneme /s/. In other cases, plain ''c'' without a cedilla is retained. The retention of ''c'' in such words is explained by an orthographic [[archaism]]: the Latin or French [[etymon]] remains visible, allowing greater visual coherence by preserving a link between the cedilla-marked [[Derivation (linguistics)|derived form]] and the [[Root (linguistics)|root]] from which it originates. In this way, ''lança'' and ''lançons'' remain clearly and visually connected to the root ''lanc-'' /lɑ̃s/ of ''lancer'', ''lance'', etc. Likewise, ''reçu'' retains a link with ''recevoir''. Conversely, when the sound /k/ must be obtained before the graphic vowels ''e'', ''i'', and ''y'', a ''u'' is used as a diacritic letter following ''c'': ''accueil''. | |||
Used as a diacritic detached from its original ''c'', the cedilla was extended to other letters in other languages from the 19th century onward. | |||
=== Chronology of the appearance of the cedilla ===
* Before the 9th century, [[occurrence]]s of the Visigothic cedilla (ʒ), which was shortened to ''ç'' in the 11th century.
* In parallel, the palaeographic cedilled ''e'' (e caudata) is attested as early as the 6th century.
* 9th century – ''[[Sequence of Saint Eulalia|Cantilène de sainte Eulalie]]'': a [[hapax]] of the diacritic ''z'', intended to be shortened to ''ç'', appears in the form ''czo''.
* 1480 – Birth of [[Geoffroy Tory]] in [[Bourges]].
* Before 1500 – Spanish and Portuguese printers create [[typeface]]s for the cedilla; these enter France via [[Toulouse]].
* 1509 – Tory innovates in Latin printing (cedillas on the ''e'' of the verbs ''emere'', ''contendere'').
* 1529 (completed in 1526) – Tory argues for the introduction of the cedilla into French in ''Champ fleury''.
* Early 1530 – Tory introduces the cedilla in ''Le sacre et le coronnement de la Royne, imprime par le commandement du Roy nostre Sire''.
* 7 June 1533 – Publication of the fourth edition of ''L'Adolescence clémentine'' by Tory, representing a major dissemination of the cedilla.
* October 1533 – Death of Geoffroy Tory.
* 18th century – The cedilla disappears from Spanish; it is used by all printers in France. Nicolas Beauzée proposes its generalization in place of ''s''. Numerous spelling reform attempts follow: some call for abandoning the cedilla, others for generalizing it. However, this diacritic, successfully established by Tory shortly before his death in 1533, has retained essentially the same rules of use down to the present day.
=== Etymology ===
Although the cedilla appeared in French [[manuscript]]s as early as the 9th century and in French [[printing]] from 1530 onward, the word ''cédille'' itself is attested<ref name="Rey">{{cite book |last=Rey |first=Alain |title=Dictionnaire historique de la langue française |publisher=Le Robert |year=1992 |isbn=2-85036-532-7 |location=Paris |language=French |trans-title=Historical Dictionary of the French Language}}</ref> only in 1611, in the altered form ''cerille'', and then as ''cédille'' in 1654–1655. The word ''cerilla'' had, however, already been borrowed from [[Spanish language|Spanish]] in 1492, and the form ''cedilla'' is attested in 1558. In Spanish, ''cedilla'' means “little z” and is the diminutive of the name of the letter ''z'' in Spanish, ''zeda'' (now obsolete, like ''ceda'';<ref>{{cite web |author=Asale |last2=Rae |title=cedilla |trans-title=cedilla |url=http://dle.rae.es/ |access-date=December 15, 2025 |website=Diccionario de la lengua española – Tricentennial Edition |language=Spanish}}</ref> the current name being ''[[:es:Z|zeta]]''), itself derived from the Latin ''zeta'', from Greek ''zêta'', “the sixth letter of the [[Greek alphabet]]”. Greek ''zêta'' is itself “borrowed from [[Semitic languages|Phoenician]] (cf. [[Hebrew]] ''zajit'', [[Arabic]] ''zayn'')”.<ref name="Rey" /> | |||
In his article in the ''[[Encyclopédie ou Dictionnaire raisonné des sciences, des arts et des métiers|Encyclopédie]]'',<ref>{{cite book |title=Encyclopédie ou Dictionnaire raisonné des sciences, des arts et des métiers |volume=2 |page=796 |language=French |trans-title=Encyclopedia, or a Systematic Dictionary of the Sciences, Arts, and Crafts}}</ref> and later in his ''Œuvres'',<ref name="oeuvres">{{cite book |last=Dumarsais |first=César Chesneau |title=Œuvres |publisher=Pougin |year=1797 |editor-last=Millon |editor-first=Charles |volume=4 |page=298 |language=French |trans-title=Works |editor2-last=Duchosal |editor2-first=Marie-Emile-Guillaume}}</ref> [[César Chesneau Dumarsais|Dumarsais]] mistakenly interpreted the term ''cedilla'' as meaning “little c” rather than “little z”, on account of the shape of the cedilla:
{{blockquote|
The term ''cédille'' comes from the Spanish ''cedilla'', which means “little c”; for the Spaniards also have, like us, the ''c'' without a cedilla, which then has a hard sound before the three letters ''a'', ''o'', ''u''; and when they wish to give a soft sound to the ''c'' preceding one of these three letters, they subscript the cedilla to it, which they call ''c con cedilla'', that is, ''c with cedilla''. Moreover, this character might well derive from the Greek [[sigma]] represented thus [[Stigma|Ϛ]], as we have noted under the letter c (''sic''); for the ''c'' with cedilla is pronounced like ''s'' at the beginning of the words ''sage'', ''second'', ''si'', ''sobre'', ''sucre''.
}}
== Current usage == | |||
=== Romance languages === | |||
In French, [[Catalan language|Catalan]], [[Occitan language|Occitan]] (more widespread in the classical orthography), and [[Portuguese language|Portuguese]], the Hispanic cedilla is used under the letter ''c'' to indicate /s/ before ''a'', ''o'', and ''u''. In Catalan and Occitan (classical orthography only), ''-ç'' is also used word-finally to indicate /s/, for example in ''dolç'' (“sweet”). | |||
[[Friulian language|Friulian]] uses a cedilled ''c'' to represent {{IPA|[tʃ]}}. | |||
==== Romanian ==== | |||
[[File:Virguliţa şi sedila.svg|thumb|class=skin-invert-image|''T'' and ''s'' with subscript comma and ''t'' and ''s'' with cedilla in the [[Times New Roman]] font.]] | |||
In [[Romanian language|Romanian]], the diacritic plays a much more prominent role: [[Ș|Ș ș]] (formerly: Ş ş) {{IPA|[ʃ]}}, and [[Ț|Ț ț]] (formerly: Ţ ţ) {{IPA|[ts]}}. After being written in the [[Romanian Cyrillic alphabet|Cyrillic script]] of the Church Slavonic tradition until the 19th century, Romanian has since been written in the Latin alphabet. Its orthography drew partly on Italian and French models, and partly, especially with regard to letters bearing diacritics, on transliteration practices close to those of the Balkan linguistic area. The most recent major reforms date from 1953, followed by more chaotic changes after the end of communism. Modern Romanian normally uses two letters with a [[Comma#Subscript comma|subscript comma]]. | |||
In 2003, the [[Romanian Academy]] specified that the letters ''ș'' and ''ț'' share the same diacritic: a comma placed a short distance beneath the letters ''s'' and ''t'', rather than a cedilla.<ref>{{cite book |last=Sala |first=Marius |url=http://www.secarica.ro/std/InstitLingvTastatura-20031008.pdf |title=Adresă către Academia Română |date=October 7, 2003 |language=Romanian |trans-title=Address to the Romanian Academy |access-date=December 15, 2025}}</ref> | |||
Because the [[ISO/IEC 8859-2]] and [[Unicode]] standards initially treated the Romanian subscript comma as merely a graphic variant of the cedilla, cedilled ''s'' (U+015E, U+015F) became widespread in computing, especially since it also exists in Turkish (allowing a single ISO [[Character encoding|character set]] for both languages). Cedilled ''t'' (U+0162, U+0163), however, has most often continued to be represented as a ''t'' with a subscript comma, primarily for aesthetic reasons. As a result, modern fonts most often display an ''s'' with a cedilla and a ''t'' with a cedilla shaped like a comma. | |||
Unicode now distinguishes the two characters, as shown in the illustration. The characters named “Latin capital letter ''S'' with comma below” (U+0218) and “Latin small letter ''s'' with comma below” (U+0219), as well as “Latin capital letter ''T'' with comma below” (U+021A) and “Latin small letter ''t'' with comma below” (U+021B), are preferred in careful typography. | |||
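The distinction between the two sets of code points can be checked programmatically. The following is a minimal sketch in Python, assuming that a text is known to be Romanian; the helper name <code>to_comma_below</code> is hypothetical, and the mapping should not be applied to Turkish text, where ''ş'' with cedilla is the correct letter.

```python
import unicodedata

# Legacy cedilla code points mapped to the comma-below code points
# listed above (U+0218, U+0219, U+021A, U+021B).
CEDILLA_TO_COMMA = str.maketrans({
    "\u015E": "\u0218",  # S with cedilla -> S with comma below
    "\u015F": "\u0219",  # s with cedilla -> s with comma below
    "\u0162": "\u021A",  # T with cedilla -> T with comma below
    "\u0163": "\u021B",  # t with cedilla -> t with comma below
})


def to_comma_below(text: str) -> str:
    """Replace cedilla forms of S/T by comma-below forms (Romanian text only)."""
    return text.translate(CEDILLA_TO_COMMA)


# The character names show that Unicode treats the two forms as distinct.
for ch in "\u015F\u0219\u0163\u021B":
    print(f"U+{ord(ch):04X}", unicodedata.name(ch))
```

Because the mapping is one code point to one code point, the length of the text is unchanged.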
For [[alphabetical order|alphabetical sorting]], the two Romanian letters with subscript comma (or cedilla) are considered distinct letters, ordered after ''s'' and ''t''. | |||
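As an illustration of this ordering, the sketch below sorts words letter by letter according to the Romanian alphabet. It is a simplified assumption-laden example, not a full collation algorithm: the names <code>ROMANIAN_ALPHABET</code> and <code>romanian_key</code> are hypothetical, and the legacy cedilla forms are simply given the same rank as the comma-below letters.

```python
# Romanian alphabet in dictionary order; the comma-below letters
# (U+0219, U+021B) come directly after s and t.
ROMANIAN_ALPHABET = "a\u0103\u00e2bcdefghi\u00eejklmnopqrs\u0219t\u021buvwxyz"
RANK = {ch: i for i, ch in enumerate(ROMANIAN_ALPHABET)}

# Treat the legacy cedilla forms as equivalent to the comma-below letters.
RANK["\u015f"] = RANK["\u0219"]
RANK["\u0163"] = RANK["\u021b"]


def romanian_key(word: str) -> list:
    """Sort key: rank of each letter; unknown characters sort last."""
    return [RANK.get(ch, len(RANK)) for ch in word.lower()]


words = ["\u021bar\u0103", "tunet", "\u0219arpe", "soare", "sus"]
print(sorted(words, key=romanian_key))
# -> ['soare', 'sus', 'șarpe', 'tunet', 'țară']
```

A plain code-point sort would instead place ''șarpe'' after ''tunet'', since U+0219 is greater than the code point of ''t''.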
=== Turkic languages === | |||
:Çç {{IPA|[tʃ]}}, Şş {{IPA|[ʃ]}} | |||
Both letters have been used in the orthography of [[Turkish language|Turkish]] since the [[romanization]] adopted on November 1, 1928. They are regarded as distinct letters, ordered respectively after ''c'' and ''s'', and not as variants of those letters. The use of ''ç'' for {{IPA|[t͡ʃ]}} may have been inspired by [[Albanian language|Albanian]] usage, while ''ş'' appears to follow Romanian practice. | |||
The [[Turkmen alphabet]], adopted in 1991 following the independence of [[Turkmenistan]], is largely inspired by Western alphabets, and particularly by Turkish. As in Turkish, it includes Çç {{IPA|[tʃ]}} and Şş {{IPA|[ʃ]}}. | |||
==== Azerbaijani ==== | |||
:Çç {{IPA|[t͡ʃ]}} | |||
:Şş {{IPA|[ʃ]}} | |||
In [[Azerbaijani language|Azerbaijani]], the cedilla is used, for example, in ''içmək'' {{IPA|[ˈit͡ʃmæk]}} (“to drink”) and ''danışmak'' {{IPA|[daniʃmak]}} (“to consult”). | |||
==== Tatar ==== | |||
In the Tatar Latin alphabet ''Jaᶇalif'' (''Yañalif'') or ''Yañalatinitsa'' (“new Latin alphabet”), which was adopted in 1999 and is commonly used on the Internet, two letters with a cedilla are employed: | |||
:Çç {{IPA|[ɕ]}}, {{IPA|[t͡ʃ]}} or {{IPA|[t͡s]}} | |||
:Şş {{IPA|[ʃ]}} | |||
In the literary Tatar language (in [[Kazan]]), the letter {{lang|tt|ç}} is pronounced {{IPA|[ɕ]}}, while {{lang|tt|c}} is {{IPA|[ʑ]}}. In the western and southern parts of the Tatar-speaking area (''Mişär''), {{lang|tt|ç}} is {{IPA|[t͡ʃ]}}, or {{IPA|[t͡s]}} in the north, and {{lang|tt|c}} is {{IPA|[d͡ʒ]}}. | |||
In Siberia, in the eastern part of the Tatar-speaking area, {{lang|tt|ç}} is {{IPA|[ts]}}, and {{lang|tt|c}} is {{IPA|[ʒ]}}. | |||
=== Albanian === | |||
:Çç {{IPA|[tʃ]}} | |||
In the current orthography of [[Albanian language|Albanian]], adopted in 1908 at the [[Congress of Monastir]], the letter ''ç'' is used to represent {{IPA|[t͡ʃ]}}.<ref>{{cite journal |last=Trix |first=Frances |year=1997 |title=Alphabet conflict in the Balkans: Albanian and the Congress of Monastir |trans-title=Alphabet conflict in the Balkans: Albanian and the Congress of Monastir |journal=International Journal of the Sociology of Language |language=English |issue=128 |pages=1–23 |issn=0165-2516}}</ref> | |||
=== Latvian === | |||
*Ģģ {{IPA|[ɟ]}} | |||
*Ķķ {{IPA|[c]}} | |||
*Ļļ {{IPA|[ʎ]}} | |||
*Ņņ {{IPA|[ɲ]}} | |||
*Ŗŗ {{IPA|[r]}} | |||
[[Latvian language|Latvian]] uses a cedilla in the form of a “subscript comma” to indicate the [[palatalization (phonetics)|palatalization]] of the consonants /g/, /k/, /l/, /n/, and /r/, written as ''ģ'', ''ķ'', ''ļ'', ''ņ'', and ''ŗ''. For reasons of legibility, this diacritic is placed above the lowercase ''g'', where it may take several forms, including a curved [[quotation mark]], an inverted [[comma]], or an [[acute accent]]. For the uppercase ''G'', where legibility is not an issue, the diacritic remains below: ''Ģ''. | |||
[[File:Latvian Ergonomic Keyboard Layout.png|thumb|center|600px|Latvian keyboard layout (rarely used).]] | |||
As the pronunciation of ''r'' and ''ŗ'' is no longer distinguished in standard Latvian, the latter letter was removed from the orthography during the years of Soviet occupation. This reform was generally not accepted by Latvians in exile. After Latvia regained independence in 1991, ''ŗ'' was nevertheless not reinstated in the official orthography. | |||
Latvian orthography, derived from German, introduced cedillas and [[ogonek]]s in order to enrich an alphabet of German origin that was insufficient to represent all Latvian sounds. Thus, Ģ, Ķ, Ļ, and Ņ still denote the palatalized equivalents of ''G'', ''K'', ''L'', and ''N''. Until the beginning of the 20th century, Latvian orthography was highly irregular. | |||
=== Other alphabets === | |||
Some recently created alphabets directly inspired by the Latin alphabet have added numerous diacritics to address mismatches between sounds and letters. A well-known example is [[Vietnamese language|Vietnamese]], which does not use the cedilla. By contrast, the [[Marshallese alphabet]] does include it, and is often cited as a notable example of an alphabet devised by linguists studying the language. | |||
==== Kurdish ==== | |||
:Çç {{IPA|[t͡ʃ]}} | |||
:Şş {{IPA|[ʃ]}} | |||
In [[Kurdish language|Kurdish]], examples include ''şer'' (“war”) and ''piçûk'' (“small”). | |||
==== Marshallese ==== | |||
*Ļ, ļ {{IPA|/ɫ/}} | |||
*m̧ {{IPA|/mʷ/}} | |||
*Ņ, ņ {{IPA|/ɳ/}} | |||
*o̧ {{IPA|/oː/}} | |||
[[Marshallese language|Marshallese]] (a [[Malayo-Polynesian languages|Malayo-Polynesian language]] spoken in the [[Marshall Islands]]) is written using a Latin alphabet that includes four unusual cedilled letters: ''ļ'', ''m̧'', ''ņ'', and ''o̧''. Of these, only ''ļ'' and ''ņ'' exist as precomposed Unicode characters (as of Unicode version 4); ''m̧'' and ''o̧'' must be composed using the combining cedilla U+0327. Care should be taken not to encode ''o'' with cedilla as ''o'' with an [[ogonek]] (''ǫ''). | |||
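The difference between precomposed and composed letters can be observed with Unicode normalization. The sketch below, in Python, appends the combining cedilla U+0327 to each base letter and normalizes to NFC; the function name <code>compose_with_cedilla</code> is hypothetical.

```python
import unicodedata

COMBINING_CEDILLA = "\u0327"


def compose_with_cedilla(base: str) -> str:
    """Return the base letter followed by a combining cedilla, in NFC form."""
    return unicodedata.normalize("NFC", base + COMBINING_CEDILLA)


for base in "lmno":
    composed = compose_with_cedilla(base)
    print(base, [f"U+{ord(c):04X}" for c in composed])
# l ['U+013C']
# m ['U+006D', 'U+0327']
# n ['U+0146']
# o ['U+006F', 'U+0327']
```

Only ''l'' and ''n'' collapse to a single code point; ''m'' and ''o'' remain sequences of base letter plus combining cedilla, and the result for ''o'' is distinct from ''o'' with ogonek (U+01EB).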
According to a foundational grammar available online,<ref>{{cite book | |||
|last=Cook | |||
|first=Richard | |||
|title=Peace Corps Marshall Islands: Marshallese Language Training | |||
|trans-title=Peace Corps Marshall Islands: Marshallese Language Training | |||
|language=English | |||
|year=1992 | |||
|publisher=Peace Corps | |||
|url=http://www.linguistics.berkeley.edu/~rscook/pdf/PCMLT-JejeinM.pdf | |||
|access-date=December 15, 2025 | |||
}}</ref> ''ļ'' would correspond to {{IPA|/ɫ/}}, ''m̧'' to {{IPA|/mʷ/}} (labialized /m/), ''ņ'' to {{IPA|/ɳ/}} (retroflex /n/), and ''o̧'' to a type of long /oː/. These values are not confirmed by a study of Marshallese phonology,<ref>{{cite journal | |||
|last=Willson | |||
|first=Heather | |||
|title=A Brief Introduction to Marshallese Phonology | |||
|trans-title=A Brief Introduction to Marshallese Phonology | |||
|language=English | |||
|journal=UCLA Working Papers | |||
|year=2000 | |||
|url=http://www.bol.ucla.edu/~hwillson/ABriefIntroductiontoMarshallesePhonology.pdf | |||
|access-date=December 15, 2025 | |||
}}</ref> which does not discuss the current orthography. | |||
==== Cameroonian languages ==== | |||
The [[General Alphabet of Cameroonian Languages]] recommends avoiding diacritics above graphemes to modify phonetic value, reserving that position for tone marking. Diacritics below graphemes are therefore preferred for phonetic modification. The cedilla is one such diacritic, indicating [[nasalization]] in practice, notably in [[Dii language|Dii]], [[Kako language|Kako]], [[Karang language|Karang]], [[Maka language|Maka]], [[Mbodomo language|Mbodomo]], [[Mundani language|Mundani]], [[Pana language|Pana]], and [[Vute language|Vute]]. | |||
Nasalized vowels marked with a cedilla include: | |||
*A̧, a̧ | |||
*Ȩ, ȩ | |||
*Ɛ̧, ɛ̧ | |||
*Ə̧, ə̧ | |||
*I̧, i̧ | |||
*Ɨ̧, ɨ̧ | |||
*O̧, o̧ | |||
*Ɔ̧, ɔ̧ | |||
*U̧, u̧ | |||
==== Kinande ==== | |||
In [[Kinande language|Kinande]], the cedilla is used to indicate [[advanced tongue root]] articulation in vowels, notably ''i'' and ''u'': | |||
*I̧, i̧ | |||
*U̧, u̧ | |||
==== Indigenous languages of the Americas ==== | |||
In the orthographies developed by the New Tribes Mission for [[Jodï language|Jodï]], [[Maco language|Maco]], and [[Piaroa language|Piaroa]], the cedilla is used to indicate nasalized vowels. | |||
=== Languages with ogoneks === | |||
The cedilla should not be confused with the [[ogonek]], which is not discussed in this article. Languages such as [[Navajo language|Navajo]], [[Apache languages|Apache]], [[Polish language|Polish]], and, as in the example below, [[Lithuanian language|Lithuanian]], do not use cedillas but ogoneks: | |||
*Ą, ą | |||
*Ę, ę | |||
*Į, į | |||
*Ų, ų | |||
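In Unicode the two marks are separate combining characters (cedilla U+0327, ogonek U+0328), so a letter can be classified by inspecting its canonical decomposition. The following Python sketch illustrates this; the function name <code>hook_below</code> is hypothetical and the check covers only these two marks.

```python
import unicodedata

COMBINING_CEDILLA = "\u0327"
COMBINING_OGONEK = "\u0328"


def hook_below(ch: str) -> str:
    """Classify a character as carrying a cedilla, an ogonek, or neither."""
    decomposed = unicodedata.normalize("NFD", ch)
    if COMBINING_CEDILLA in decomposed:
        return "cedilla"
    if COMBINING_OGONEK in decomposed:
        return "ogonek"
    return "none"


print(hook_below("\u0119"))  # e with ogonek (Lithuanian, Polish) -> ogonek
print(hook_below("\u0229"))  # e with cedilla -> cedilla
print(hook_below("\u00e7"))  # c with cedilla -> cedilla
```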
=== Phonetic transcription === | |||
In the [[International Phonetic Alphabet]], {{IPA|ç}} represents the [[voiceless palatal fricative]]. This sound does not occur in French. | |||
Alan Timberlake uses the cedilla to indicate consonant [[palatalization (phonetics)|palatalization]] in a Russian grammar published in 2004:{{sfn|Timberlake|2004|p=53}} p̧ b̧ ţ ḑ ķ ģ ç̆ ʒ̧̆ ş ş̆ x̧ v̧ z̧ z̧̆ m̧ ņ ļ ŗ. | |||
== ASCII and ISO 646 transcription == | |||
Basic [[ASCII]] (the American version of the [[ISO/IEC 646]] standard encoding characters from 0 to 127) does not include letters with diacritics. At a time when it was often the only available [[code page]], some users simulated the cedilla by placing a [[comma]] after the letter; for example, writing <code>c,a</code> for ''ça''. | |||
However, national variants of ISO 646 used the few non-invariant positions of the standard to encode additional punctuation marks and diacritics: | |||
* The French version<ref>{{cite web |title=ISO registry record no. 69 |trans-title=ISO registry record no. 69 |url=http://www.itscj.ipsj.or.jp/ISO-IR/069.pdf |publisher=ISO |language=fr |access-date=15 December 2025}}</ref> (standard NF Z 62010-1982, deposited with ECMA by AFNOR) encodes the lowercase ''c with cedilla'' at position 92, replacing the '''\''' character of the American version. | |||
* An earlier French version<ref>{{cite web |title=ISO registry record no. 25 |trans-title=ISO registry record no. 25 |url=http://www.itscj.ipsj.or.jp/ISO-IR/025.pdf |publisher=ISO |language=fr |access-date=15 December 2025}}</ref> (standard NF Z 62010-1973, obsolete since 1985) required the use of the ''backspace'' control character (BS, code 8) to overstrike characters and simulate the addition of a diacritic, except for letters already encoded with diacritics in the national variant; thus the cedilla could be encoded as <BS ; comma> following an uppercase ''C''. | |||
* The Spanish,<ref>{{cite web |title=ISO registry record no. 85 |trans-title=ISO registry record no. 85 |url=http://www.itscj.ipsj.or.jp/ISO-IR/085.pdf |publisher=ISO |language=es |access-date=15 December 2025}}</ref> Catalan, and Basque versions of ISO 646 (registered with ECMA by IBM or Olivetti) encode uppercase and lowercase ''c with cedilla'' at positions 93 and 125 respectively, replacing the ASCII characters ''']''' and '''}'''. | |||
* The Portuguese versions<ref>{{cite web |title=ISO registry record no. 84 |trans-title=ISO registry record no. 84 |url=http://www.itscj.ipsj.or.jp/ISO-IR/084.pdf |publisher=ISO |language=pt |access-date=15 December 2025}}</ref><ref>{{cite web |title=ISO registry record no. 16 |trans-title=ISO registry record no. 16 |url=http://www.itscj.ipsj.or.jp/ISO-IR/016.pdf |publisher=ISO |language=pt |access-date=15 December 2025}}</ref> (registered with ECMA by IBM or Olivetti) encode uppercase and lowercase ''c with cedilla'' at positions 92 and 124 respectively, replacing the ASCII characters '''\''' and '''|'''. | |||
* The Italian version<ref>{{cite web |title=ISO registry record no. 15 |trans-title=ISO registry record no. 15 |url=http://www.itscj.ipsj.or.jp/ISO-IR/015.pdf |publisher=ISO |language=it |access-date=15 December 2025}}</ref> (registered with ECMA by Olivetti) encodes the lowercase ''c with cedilla'' at position 92, replacing the ASCII '''\'''. | |||
* The French, Spanish, Portuguese, German,<ref>{{cite web |title=ISO registry record no. 21 |trans-title=ISO registry record no. 21 |url=http://www.itscj.ipsj.or.jp/ISO-IR/021.pdf |publisher=ISO |language=de |access-date=15 December 2025}}</ref> Hungarian,<ref>{{cite web |title=ISO registry record no. 86 |trans-title=ISO registry record no. 86 |url=http://www.itscj.ipsj.or.jp/ISO-IR/086.pdf |publisher=ISO |language=hu |access-date=15 December 2025}}</ref> Norwegian,<ref>{{cite web |title=ISO registry record no. 60 |trans-title=ISO registry record no. 60 |url=http://www.itscj.ipsj.or.jp/ISO-IR/060.pdf |publisher=ISO |language=no |access-date=15 December 2025}}</ref> Swedish,<ref>{{cite web |title=ISO registry record no. 11 |trans-title=ISO registry record no. 11 |url=http://www.itscj.ipsj.or.jp/ISO-IR/011.pdf |publisher=ISO |language=sv |access-date=15 December 2025}}</ref> and Greek<ref>{{cite web |title=ISO registry record no. 88 |trans-title=ISO registry record no. 88 |url=http://www.itscj.ipsj.or.jp/ISO-IR/088.pdf |publisher=ISO |language=el |access-date=15 December 2025}}</ref> variants of ISO 646 continue to refer to the cedilla as a possible representation of the comma (although they do not prescribe any specific use of a control character for this purpose). | |||
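The effect of these national variants can be sketched as a small decoding table. The Python example below uses only the positions given above for the Spanish, Portuguese, and Italian variants; the names <code>CEDILLA_POSITIONS</code> and <code>decode_cedillas</code> are hypothetical, and the other national replacements made by each variant are deliberately ignored, so this is an illustration rather than a complete decoder.

```python
# Positions of c with cedilla in three ISO 646 national variants,
# as listed above (7-bit code -> character). Other national
# replacements in these variants are not modelled here.
CEDILLA_POSITIONS = {
    "es": {93: "\u00c7", 125: "\u00e7"},  # replaces ] and }
    "pt": {92: "\u00c7", 124: "\u00e7"},  # replaces \ and |
    "it": {92: "\u00e7"},                 # replaces \
}


def decode_cedillas(data: bytes, variant: str) -> str:
    """Decode 7-bit data, mapping only the cedilla positions of a variant."""
    overrides = CEDILLA_POSITIONS[variant]
    chars = []
    for byte in data:
        if byte > 127:
            raise ValueError("ISO 646 is a 7-bit code")
        chars.append(overrides.get(byte, chr(byte)))
    return "".join(chars)


print(decode_cedillas(b"cabe|a", "pt"))  # cabeça
```

The same byte therefore displays differently depending on the variant in use: code 124 is ''ç'' in the Portuguese variant but '''|''' in ASCII.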
== Notes and references == | |||
{{reflist}} | |||
== Bibliography == | |||
* {{cite journal |last=Bernard |first=Auguste |year=1837 |title=Du premier emploi dans l'imprimerie et dans la langue française, de l'apostrophe, de l'accent et de la cédille |trans-title=On the first use in printing and in the French language of the apostrophe, the accent, and the cedilla |journal=Bulletin du bibliophile belge |language=fr}} | |||
* {{cite book |last=Firmin-Didot |first=Ambroise |year=1868 |title=Observations sur l'orthographe, ou ortografie, française |trans-title=Observations on French orthography, or ortografie |publisher=Ambroise Firmin Didot |location=Paris |language=fr}} | |||
* {{cite book |last1=Daniels |first1=Peter T. |last2=Bright |first2=William |year=1996 |title=The World's Writing Systems |publisher=Oxford University Press |location=Oxford and New York |pages=xlvi + 920 |isbn=978-0-19-507993-7 |language=en}} | |||
* {{cite book |last=Huchon |first=Mireille |year=2002 |title=Histoire de la langue française |trans-title=History of the French language |location=Paris |language=fr}} | |||
* {{cite book |last=Steffens |first=Franz |year=1910 |title=Paléographie latine |trans-title=Latin paleography |publisher=Honoré Champion |location=Paris |language=fr}} | |||
* {{cite book |last=Timberlake |first=Alan |year=2004 |title=A reference grammar of Russian |publisher=Cambridge University Press |isbn=978-0-521-77292-1 |language=en}} | |||
== See also == | |||
* [[Diacritic]] | |||
* [[French orthography]] | |||
{{Navbox diacritical marks}} | |||
[[Category:Latin-script diacritics]] | |||
[[Category:Turkish language]] | |||
[[Category:Cyrillic-script diacritics]] | |||
[[Category:History of the French language]] | |||
Latest revision as of 17:43, 23 December 2025
The cedilla ( ◌̧ ) (from Spanish cedilla, "little z") is a diacritic of the Latin alphabet. In French, it is used only under the letter c, both in lowercase and uppercase forms: ç, Ç.
It is also used in several other languages under different letters. It bears a visual resemblance to the numeral 5 with its upper stroke removed.
Historically, the Spanish cedilla (and, by geographic extension, in Portuguese and Catalan, then in French and Occitan) was placed only under a c derived, among other possibilities, from a Latin c that had undergone palatalization. It then formed the letter ç (“c with cedilla”), originally pronounced /ts/ and later /s/ (and sometimes /z/ between vowels).
History
The modern grapheme of the cedilla derives from medieval Gothic script or Visigothic script ‹ ꝣ ›. The use of this sign arose from the limitations of the Latin alphabet. Its name comes from Spanish and appeared in the 17th century, meaning “little z” (as c replaced z in Spanish before e).
Under the letter c, the handwritten cedilla developed through three successive forms: a diacritic z, then a z with a cedilla (subscript and sometimes superscript), and finally the modern c with cedilla. By contrast, the evolution of the e caudata is considered unrelated to that of the cedilla.
In manuscripts
C with cedilla
The palatal phoneme /ts/ of the Romance languages derives from a Latin /k/ c that was palatalized and then assibilated. Before vowels that would otherwise trigger a non-palatalized (and therefore incorrect) pronunciation (/k/ before a, o, and u), scribes used various spellings to indicate the “new” pronunciation: simply c, ce, or cz (with e and z functioning as diacritic letters).[1]
Thus ceo and czo were read /tso/: the diacritic e and z prevented the reading /ko/.
This latter notation appears in French as early as the first literary manuscript in the French language, the Sequence of Saint Eulalia (dated to 881 and consisting of 29 verses), where it occurs only once, in verse 21.
According to Greimas (2001), the neuter demonstrative ço appears in the Sequence of Saint Eulalia.[2]
However, Greimas gives only the form ço for this text, although the manuscript, rediscovered in the 19th century, contains no cedilla in any of its 29 verses. Moreover, the manuscript dates to 881 rather than broadly to the “10th century”.
By contrast, the form czo has been analyzed as follows:[3]
On tc and cz. These graphemes therefore represent ts. The case of “czo” (v. 21) is fairly clear: the scribe could not use c alone before o to represent tch, since before o, c always has the value k (except in the Oaths of Strasbourg). He therefore resorts to an experimental grapheme cz.
The z in czo is thus interpreted as a diacritic z which, once placed beneath the c, would become the cedilla.
The Visigothic script is indeed thought to have abbreviated this grapheme around the 11th century in Spain. Initially, the c was written above the z in its form ʒ; later, the c regained its full size while the ʒ was reduced to a subscript sign. Thus, the Spanish word lanʒa /lantsa/ (“lance”) came to be written lança. The usefulness of such a sign, and an early attempt to systematize the notation of /ts/, led (depending on scribes) to the extension of the cedilla before the vowels i and e (çinco, “five”). This was later regarded as a form of hypercorrection, since c alone was sufficient (cinq and çinq are pronounced identically).
Maria Selig confirms this Visigothic origin:[4]
The history of the cedilla and its diffusion is well known today, so I shall confine myself to a very brief synthesis. As a result of the application of Visigothic script to new Spanish sounds, <ç>, with a subscript (sometimes superscript) <z>, appears from the earliest monuments of Castilian. The use of the cedilla has also been observed in the oldest charters written in Provençal (hence its later presence in Catalan) and in French.
Selig also notes that the diacritic spread across Europe more slowly than its phonetic value, and was in some cases “reappropriated” by different languages to represent sounds unrelated to its original function.
In French, according to Jean Dubois, the cedilla appears “as early as the 8th century in Visigothic manuscripts, but was little used by scribes, who preferred to add an extra letter to indicate the sibilant sound of c (they wrote receut, aperceut)”.[5]
Accordingly, in the manuscripts of The Song of Roland, the cedilla is not used, although modern transcriptions add it for ease of reading.
Paleographic cedilled e (e caudata)
A form resembling a cedilla can also be found beneath the letter e in medieval manuscripts, with usage attested as early as the 6th century in uncial script. The resulting letter is known as e caudata (“e with a tail”, also called “tailed e”). It more or less frequently replaces the Latin digraph ae (often written as the ligature æ, a convention that later spread more widely). This digraph generally represented an open e sound (originally long, until distinctions of vowel length disappeared), derived from the Classical Latin diphthong written ae, which was monophthongized from the 2nd century onward.
This usage continued in manuscripts until the 18th century but did not survive the advent of printing:[6]
[The scribe of The Song of Roland wrote] ciel or cel with a cedilled e because, until the 13th century, Latin words in æ or œ were often written with a cedilled e; recognizing the Latin cælum beneath the French cel, he allowed himself (which has no meaning in French) to use a cedilled e (vv. 545, 646, 723, 1156, and 1596).
It is noteworthy that this letter, represented here as ę (with an ogonek) or ȩ (with a cedilla), has been preserved in Romance philological transcription, whereas the digraph ae (in its ligatured form æ, known as ash) has been retained in the transcription of Germanic languages. ę was used in manuscripts of Old English written in Insular Irish uncial.
Although this sign is often referred to as a “cedilla”, this is an anachronism: it has no connection with the letter z, and it more likely derives from a subscript a.
This cedilla-like mark, whose use varied before the spread of printing, can therefore serve as an indicator for the dating of manuscripts by palaeographers. For example, according to the Dictionnaire de paléographie by Louis Mas Latrie (1854), “manuscripts in which one finds the cedilled e rather than œ must be placed between five and seven hundred years ago”, that is, between 1150 and 1350:[6]
The letter e with a cedilla for æ therefore seems to characterize the eleventh century. Mabillon, De Re Diplomatica, p. 367, supports this thesis. He already shows ę for ae in the tenth century, e.g. suę for suae, ex sacramentario Ratoldi, no. 587. But he also shows that this usage was not yet general and cites Galliae, ex ms. codice Remigio. His citations of fragments from the eleventh century generally contain ę for ae. “Ex codice nostro S. Germani, 527: sapię for sapientiae.” In the twelfth century, the same scholar shows ę for oe, while plain e is used for ae. “Ex Flora Corb. nos. 488 and 489, pęno for poena (beginning of the twelfth century); dicte ecclesie for dictae ecclesiae.” Charters provide the most compelling evidence and seem to prove that e with a cedilla used for ae, when the usage is general, denotes the eleventh century.
Early printing
Manuscript usage was taken up in printing, first by Spanish and Portuguese printers, and then imitated by the French printer Geoffroy Tory. According to Auguste Bernard, as early as 1509,[7] “Tory proposed writing with a cedilla the penultimate e of the third person plural of the perfect tense of verbs of the third conjugation (emere, contendere, etc.) in order to distinguish it from the infinitive,” following the model already used shortly before 1509 in the Psalterium quintuplex. If Bernard's account is followed, the cedilla would therefore have been used in Latin printing by Tory from the very beginning of the 16th century.
The cedilla in French, in the form of c-cedilla, was first explicitly advocated by the same author in the introduction to his book Champ fleury, published in 1529 (with printing privilege dated 5 September 1526).[7]
Its subtitle clearly expresses its purpose: l’art et la science de la due et vraie proportion de la lettre (“the art and science of the proper and true proportion of the letter”). This work is, moreover, the first typographical treatise written in French:
C before o, in French pronunciation and language, is sometimes hard, as in coquin, coquard, coq, coquillard; sometimes it is soft, as in garcon, macon, francois, and other similar words.
This defense of the cedilla was not immediately put into practice. In Tory's system, the cedilla was intended to mark /s/ (and no longer /ts/, since this phoneme had simplified in French by the 13th century and in Old Castilian between the 14th and 16th centuries). The cedilla formed part of Geoffroy Tory's typographical innovations (along with the comma and the apostrophe), whose aim was likely to facilitate the commercialization of the first books printed in French rather than Latin.
He used the cedilla in French for the first time in Le sacre et coronnement de la royne by Guillaume Bochetel, published in 1531.[8]
According to many authors, Tory generalized the use of c-cedilla in his edition of L’Adolescence Clémentine by Clément Marot, the fourth edition of the work, published in 1533. The book had first appeared on 12 August 1532 in Paris, published by Roffet, without cedillas, and then on 7 June 1533 by Tory, this time with cedillas.[9]
In reality, Tory had already introduced the cedilla at the beginning of 1530[10] in his pamphlet Le sacre et le coronnement de la Royne, imprime par le commandement du Roy nostre Sire, where it appears three times, in the words façon, commença, and Luçon.
The 1533 edition of L’Adolescence Clémentine nevertheless represents the first true generalization of the cedilla in a work that enjoyed success and was intended for a relatively large print run for the period. Tory justified the use of the cedilla in the introduction to this edition using the same arguments already advanced in Champ fleury:
[published] with certain marked accents, namely on the masculine é as distinct from the feminine, on words joined together by synaloephas, and under the ç when it takes on the pronunciation of s, which until now, through lack of consideration, had not been done in the French language, although it was and remains very necessary.
The practical application of Tory's orthographic system is irregular: apostrophes are missing in par faulte dadvis, and oddly placed in combien q’uil—likely a typographical error. As Bernard observes, this was the first work in which Tory applied his orthographic system, and the inexperience of his compositors is evident in the mistakes made by omission or transposition.[7]
From this point onward, the cedilla was adopted by all printers.[7] Before this, supporters of etymological orthography wrote francoys. Usage initially remained unstable. For example, in the Œuvres poétiques of Louise Labé (published by Jean de Tournes in 1555), one finds the cedilla in aperçu but not in perſa (modern perça), which is instead written with an s to avoid perca.
From there, the use of the “c with a tail” (its earliest name) spread throughout France, but it was not until the 17th century that its use became truly common.
In Spanish, the cedilla was abandoned in the 18th century (ç being replaced by z or simple c before e and i), while /ts/ had simplified to /s/ between the 14th and 16th centuries and then to /θ/ in the 17th century. Other related languages (Catalan, French, Portuguese) nevertheless retained it.
After the Renaissance
The introduction (and subsequent retention) of such a character in written French was an effective and broadly accepted way of definitively resolving the problem of the ambiguous pronunciation of the Latin letter c. Indeed, when c precedes a, o, or u, it is pronounced /k/; when it precedes any other vowel, it is pronounced /s/. The sign therefore makes it possible to preserve links with the past and to maintain the graphic coherence of the language by making spelling less ambiguous. The presence of a cedilla in a word or form keeps visible the relationships with the etymon and with derived forms or related forms.
For Albert Dauzat, “the simplification of an irrational orthography was in keeping with the tendencies of the 17th century, enamoured of clarity and reason. Many writers called for reform […]”.[11] The cedilla therefore became a stake in the many projects for orthographic reform of the French language.
T-cedilla in French
With regard to these attempts at orthographic reform, the history of the t-cedilla in French is exemplary.
In 1663, in Rome la ridicule, Caprice by Saint-Amant, the printer and proofreader for the Elzeviers in Amsterdam, Simon Moinet, used the cedilla under the letter t in French (for example, he wrote invanţion).[12]
In 1766, a preacher to the queen proposed the use of the cedilla under t to distinguish cases where it is read /t/ from those where it is pronounced /s/:[6]
One could still derive another benefit from the cedilla in favour of children and foreigners, who are often embarrassed about how they should pronounce the letter t in certain words; this would be to apply this sign to that letter when it has the value of s, as in the words minutie, portion, faction, quotien, etc. By this expedient its pronunciation would be regulated, and one would no longer confuse the cases where it has its natural value, as in the words partie, question, digestion, chrétien. When it costs so little to remedy imperfections, it is gratuitously wishing to perpetuate them to allow them to subsist.
Ambroise Firmin-Didot, in his Observations sur l'orthographe, ou ortografie, française (1868), proposed to the Académie française a similar reform project aiming to introduce a t-cedilla, ţ (depending on configuration, this may appear as a comma rather than a cedilla), in words where t is pronounced /s/ before i. This would have eliminated a large number of irregularities in spelling (nous adoptions ~ les adoptions, pestilence ~ pestilentiel, il différencie ~ il balbutie). One would thus have written: les adopţions, pestilenciel (with c preferred in order to agree better with the base pestilence), il différencie, il balbuţie.
In fact, as the author himself notes, the grammarians of Port-Royal had already proposed such an improvement before him (by means of a t with a subscript dot: les adopṭions). The project ultimately remained a dead letter.
Açhille, çhien, çheval: the proposals of Nicolas Beauzée
In the same spirit as Firmin-Didot, the Enlightenment grammarian Nicolas Beauzée defended the generalization of c- and t-cedilla. Thus, according to the 19th-century encyclopedist B. Jullien:[13]
The celebrated grammarian Nicolas Beauzée, who devoted himself extensively to the modifications to be introduced into our orthography in order to regularize it, wished to generalize the use of the cedilla, and derived such advantage from it that one can only regret that the Academy did not take up this project in order to introduce it into our writing. According to Beauzée, the cedilla should indicate not only for the letter c, but also for other letters when appropriate, and notably for t, the transition from a hard sound to a sibilant sound. This being the case, a simple cedilla would eliminate certain spelling differences that nothing justifies.
Thus one writes monarque with qu, and monarchie with ch; Beauzée proposed that ch written without a cedilla should always be pronounced /k/, and that the sibilant ch, that of chien and cheval, should be written with a cedilla, çhien, çheval: one would then write monarche and monarçhie. The etymology would be preserved, and the pronunciation exactly represented.
We write chœur and pronounce /kœʁ/; we write and pronounce chose; and this diversity of pronunciation is often a difficulty for those who do not know French. According to Beauzée, one should write chœur and çhose, Achaïe and Açhille, Michel-Ange and arçhevêque, and so on; note that this would hardly be a change in spelling, merely an extremely slight addition, which nevertheless would have the happiest results for all. How is it that the body established to guide the French language does not make every effort to adopt such wise corrections?
Beauzée quite reasonably extended the application of the cedilla to the letter t. Indeed, this letter very often takes in French the sibilant sound of s, without there being any general rule for this. Thus nous portions and des portions, nous inventions and des inventions, are written exactly the same way and pronounced differently; Beauzée proposed placing the cedilla under the t pronounced as s. Immediately all difficulty would disappear, and etymology would be preserved. The same was to apply in all such words as minutie, calvitie, etc., where t takes the sound of s. By reciprocity, one could later restore the c-cedilla in some words from which it has been improperly removed to make room for plain c. Such is the case, for example, with mince derived from minutus, accourcir derived from court, where the c not present in the root was substituted, under the influence of pronunciation, for the t required by etymology.
One could multiply such examples; it suffices for me to have shown how one might successively introduce into our orthography some perfectly rational changes which, after a short time, would render it regular, while not offending usage. Assuredly this would be a fine service rendered to our language.
Moreover, it would have been possible to write the words lança and français using the letter s, since the phoneme /ts/ no longer existed at the time of the borrowing of the cedilla. The phoneme had even merged with the other /s/ sounds. However, it was the visual and etymologizing appearance of the word that prevailed. The spelling *lansa would have introduced an awkward alternation: *il lansa ~ ils lancèrent. In other languages, such as Spanish, the spelling of a conjugated verb may be inconsistent: one now writes lanzar, thus “cutting oneself off” from the Latin etymology lanceare, which was more explicitly reflected in lançar (though it reappears in alternation with lance in the present subjunctive).
In addition to maintaining visual etymological coherence, the cedilla also makes it possible, in certain cases, to resolve spelling problems for the sound /s/ derived from /k/. For example, reçu retains a link with recevoir, but above all could not be written in any other way: *resu would be read /ʁəzy/ and *ressu /resy/. The same applies to leçon and other words in which a schwa is followed by the phoneme /s/. In other cases, the retention of c is explained by an orthographic archaism: the Latin or French etymon remains visible, allowing greater visual coherence by preserving a link between the cedilla-marked derived form and the root from which it originates. In this way, lança and lançons remain clearly and visually connected to the root lanc- /lɑ̃s/ of lancer, lance, etc. Conversely, when the sound /k/ must be obtained before the graphic vowels e, i, and y, a u is used as a diacritic letter following c: accueil.
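The spelling rule described above (c is read /k/ before a, o, and u unless it carries a cedilla) can be sketched in a few lines of Python. This is only an illustration: `soften_c` is a hypothetical helper, the stems and endings are passed in by hand, and nothing here amounts to a real French conjugator.

```python
# Illustrative sketch: `soften_c` is a hypothetical helper, not a conjugator.
# Vowels (with common accented forms) before which a plain c would be read /k/.
BACK_VOWELS = set("a\u00e0\u00e2o\u00f4u\u00f9\u00fb")  # a à â o ô u ù û


def soften_c(stem: str, ending: str) -> str:
    """Join stem and ending, writing c-cedilla when a stem-final c must stay /s/."""
    if stem.endswith("c") and ending and ending[0].lower() in BACK_VOWELS:
        stem = stem[:-1] + "\u00e7"  # replace c with ç
    return stem + ending


for ending in ("er", "ons", "a", "\u00e8rent"):
    print(soften_c("lanc", ending))
```

With the stem lanc-, the cedilla appears only before the endings -ons and -a, which matches the alternation lancer ~ lançons ~ lança ~ lancèrent discussed in the text.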
Used as a diacritic detached from its original c, the cedilla was extended to other letters in other languages from the 19th century onward.
Chronology of the appearance of the cedilla
- Before the 9th century, occurrences of the Visigothic cedilla (ʒ), which was shortened to ç in the 11th century.
- In parallel, the palaeographic cedilled e (e caudata) is attested as early as the 6th century.
- 9th century – Cantilène de sainte Eulalie: a hapax of the diacritic z, intended to be shortened to ç, appears in the form czo.
- 1480 – Birth of Geoffroy Tory in Bourges.
- Before 1500 – Spanish and Portuguese printers create typefaces for the cedilla; these enter France via Toulouse.
- 1509 – Tory innovates in Latin printing (cedillas on the e of the verbs emere, contendere).
- 1529 (completed in 1526) – Tory argues for the introduction of the cedilla into French in Champ fleury.
- Early 1530 – Tory introduces the cedilla in Le sacre et le coronnement de la Royne, imprime par le commandement du Roy nostre Sire.
- June 7, 1533 – Tory publishes the fourth edition of Clément Marot's L'Adolescence clémentine, representing a major dissemination of the cedilla.
- October 1533 – Death of Geoffroy Tory.
- 18th century – The cedilla disappears from Spanish; it is used by all printers in France. Nicolas Beauzée proposes its generalization in place of s. Numerous spelling reform attempts follow: some call for abandoning the cedilla, others for generalizing it. However, this diacritic, successfully established by Tory shortly before his death in 1533, has retained essentially the same rules of use down to the present day.
Etymology
Although the cedilla appeared in French manuscripts as early as the 9th century and in French printing from 1530 onward, the word cédille itself is attested[14] only in 1611, in the altered form cerille, and then as cédille in 1654–1655. The word cerilla had, however, already been borrowed from Spanish in 1492, and the form cedilla is attested in 1558. In Spanish, cedilla means “little z” and is the diminutive of the name of the letter z in Spanish, zeda (now obsolete, like ceda;[15] the current name being zeta), itself derived from the Latin zeta, from Greek zêta, “the sixth letter of the Greek alphabet”. Greek zêta is itself “borrowed from Phoenician (cf. Hebrew zajit, Arabic zayn)”.[14]
In his article in the Encyclopédie,[16] and later in his Œuvres,[17] Dumarsais mistakenly interpreted the term cedilla as meaning “little c” rather than “little z”, on account of the shape of the cedilla:
The term cédille comes from the Spanish cedilla, which means “little c”; for the Spaniards also have, like us, the c without a cedilla, which then has a hard sound before the three letters a, o, u; and when they wish to give a soft sound to the c preceding one of these three letters, they subscript the cedilla to it, which they call c con cedilla, that is, c with cedilla. Moreover, this character might well derive from the Greek sigma represented thus Ϛ, as we have noted under the letter c (sic); for the c with cedilla is pronounced like s at the beginning of the words sage, second, si, sobre, sucre.
Current usage
Romance languages
In French, Catalan, Occitan (more widespread in the classical orthography), and Portuguese, the Hispanic cedilla is used under the letter c to indicate /s/ before a, o, and u. In Catalan and Occitan (classical orthography only), -ç is also used word-finally to indicate /s/, for example in dolç (“sweet”).
Friulian uses a cedilled c to represent /tʃ/.
Romanian
In Romanian, the diacritic plays a much more prominent role: Ș ș (formerly: Ş ş) /ʃ/, and Ț ț (formerly: Ţ ţ) /ts/. After having been written in the Cyrillic script inherited from Church Slavonic until the 19th century, Romanian has since been written in the Latin alphabet. Its orthography then drew partly on Italian and French models, and partly, especially with regard to letters bearing diacritics, on transliteration practices close to those of the Balkan linguistic area. The most recent major reforms date from 1953, followed by more chaotic changes after the end of communism. Modern Romanian normally uses two letters with a subscript comma.
In 2003, the Romanian Academy specified that the letters ș and ț share the same diacritic: a comma placed a short distance beneath the letters s and t, rather than a cedilla.[18]
Because the ISO/IEC 8859-2 and Unicode standards initially treated the Romanian subscript comma as merely a graphic variant of the cedilla, cedilled s (U+015E, U+015F) became widespread in computing, especially since it also exists in Turkish (allowing a single ISO character set for both languages). Cedilled t (U+0162, U+0163), however, has most often continued to be represented as a t with a subscript comma, primarily for aesthetic reasons. As a result, modern fonts most often display an s with a cedilla and a t with a cedilla shaped like a comma.
Unicode now distinguishes the two characters. The characters named “Latin capital letter S with comma below” (U+0218) and “Latin small letter s with comma below” (U+0219), as well as “Latin capital letter T with comma below” (U+021A) and “Latin small letter t with comma below” (U+021B), are preferred in careful typography.
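As a minimal sketch of what this distinction means in practice, the following Python snippet maps the legacy cedilla code points to the comma-below code points named above and prints their Unicode names. The function name `to_comma_below` is an assumption made for this example, and the mapping should only be applied to text known to be Romanian, since ş with a cedilla is the correct letter in Turkish.

```python
import unicodedata

# Cedilla code points -> comma-below code points, as listed in the text.
CEDILLA_TO_COMMA = str.maketrans({
    "\u015E": "\u0218",  # S with cedilla -> S with comma below
    "\u015F": "\u0219",  # s with cedilla -> s with comma below
    "\u0162": "\u021A",  # T with cedilla -> T with comma below
    "\u0163": "\u021B",  # t with cedilla -> t with comma below
})


def to_comma_below(text: str) -> str:
    """Hypothetical helper: rewrite legacy Romanian text with comma-below letters."""
    return text.translate(CEDILLA_TO_COMMA)


legacy = "\u015Ftiin\u0163\u0103"  # the word for "science", typed with cedillas
print(legacy, "->", to_comma_below(legacy))

for ch in "\u015F\u0219\u0163\u021B":
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)}")
```

The two spellings look nearly identical in many fonts but compare as different strings, which is why searching and sorting Romanian text can fail when the two encodings are mixed.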
For alphabetical sorting, the two Romanian letters with subscript comma (or cedilla) are considered distinct letters, ordered after s and t.
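The ordering rule can be illustrated with a small sort key in Python. This is a sketch under stated assumptions: the alphabet string below is the standard Romanian alphabet as assumed for this example, `romanian_key` is a hypothetical name, and production code would normally rely on a locale-aware collation library instead.

```python
# Assumed ordering: the standard Romanian alphabet, with s-comma after s
# and t-comma after t (written here with escapes to avoid encoding ambiguity).
ROMANIAN_ALPHABET = "a\u0103\u00e2bcdefghi\u00eejklmnopqrs\u0219t\u021Buvwxyz"
RANK = {ch: i for i, ch in enumerate(ROMANIAN_ALPHABET)}
# Legacy cedilla forms are given the same rank as the comma-below letters.
RANK["\u015F"] = RANK["\u0219"]
RANK["\u0163"] = RANK["\u021B"]


def romanian_key(word: str) -> list:
    """Hypothetical sort key: rank each letter by its position in the alphabet."""
    # Characters outside the alphabet sort after all known letters.
    return [RANK.get(ch, len(RANK) + ord(ch)) for ch in word.lower()]


words = ["zi", "\u021Bar\u0103", "tata", "\u0219arpe", "soare"]
print(sorted(words))                    # plain code-point order
print(sorted(words, key=romanian_key))  # alphabet-aware order
```

A plain code-point sort places the comma-below letters after z, whereas the key above keeps each of them immediately after its base letter.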
Turkic languages
- Çç /tʃ/, Şş /ʃ/
Both letters have been used in the orthography of Turkish since the romanization adopted on November 1, 1928. They are regarded as distinct letters, ordered respectively after c and s, and not as variants of those letters. The use of ç for /tʃ/ may have been inspired by Albanian usage, while ş appears to follow Romanian practice.
The Turkmen alphabet, adopted in 1991 following the independence of Turkmenistan, is largely inspired by Western alphabets, and particularly by Turkish. As in Turkish, it includes Çç /tʃ/ and Şş /ʃ/.
Azerbaijani
- Çç /tʃ/
- Şş /ʃ/
In Azerbaijani, the cedilla is used, for example, in içmək (“to drink”) and danışmaq (“to speak”).
Tatar
In the Tatar Latin alphabet Jaᶇalif (Yañalif) or Yañalatinitsa (“new Latin alphabet”), which was adopted in 1999 and is commonly used on the Internet, two letters with a cedilla are employed:
- Çç [ɕ], [tʃ] or [ts]
- Şş [ʃ]
In the literary Tatar language (in Kazan), the letter ç is pronounced [ɕ], while c is [ʑ]. In the western and southern parts of the Tatar-speaking area (Mişär), ç is [tʃ], or [ts] in the north, and c is [dʒ]. In Siberia, in the eastern part of the Tatar-speaking area, ç is [ts], and c is [j].
Albanian
- Çç /tʃ/
In the current orthography of Albanian, adopted in 1908 at the Congress of Monastir, the letter ç is used to represent /tʃ/.[19]
Latvian
- Ģģ /ɟ/
- Ķķ /c/
- Ļļ /ʎ/
- Ņņ /ɲ/
- Ŗŗ /rʲ/
Latvian uses a cedilla in the form of a “subscript comma” to indicate the palatalization of the consonants /g/, /k/, /l/, /n/, and /r/, written as ģ, ķ, ļ, ņ, and ŗ. For reasons of legibility, this diacritic is placed above the lowercase g, where it may take several forms, including a curved quotation mark, an inverted comma, or an acute accent. For the uppercase G, where legibility is not an issue, the diacritic remains below: Ģ.
As the pronunciation of r and ŗ is no longer distinguished in standard Latvian, the latter letter was removed from the orthography during the years of Soviet occupation. This reform was generally not accepted by Latvians in exile. After Latvia regained independence in 1991, ŗ was nevertheless not reinstated in the official orthography.
Latvian orthography, derived from German, introduced cedillas and ogoneks in order to enrich an alphabet of German origin that was insufficient to represent all Latvian sounds. Thus, Ģ, Ķ, Ļ, and Ņ still denote the palatalized equivalents of G, K, L, and N. Until the beginning of the 20th century, Latvian orthography was highly irregular.
Other alphabets
Some recently created alphabets directly inspired by the Latin alphabet have added numerous diacritics to address mismatches between sounds and letters. A well-known example is Vietnamese, which does not use the cedilla. By contrast, the Marshallese alphabet does include it, and is often cited as a notable example of an alphabet devised by linguists studying the language.
Kurdish
- Çç /tʃ/
- Şş /ʃ/
In Kurdish, examples include şer (“war”) and piçûk (“small”).
Marshallese
- Ļ, ļ
- m̧
- Ņ, ņ
- o̧
Marshallese (a Malayo-Polynesian language spoken in the Marshall Islands) is written using a Latin alphabet that includes several unusual cedilled letters: l, m, n, and o, namely ļ, m̧, ņ, and o̧. Of these, only l and n exist as precomposed Unicode characters (as of Unicode version 4). The others must be composed using the combining cedilla U+0327. Care should be taken not to encode o with cedilla as o with an ogonek (ǫ).
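The difference between precomposed and composed letters can be checked with Python's standard `unicodedata` module. The sketch below is illustrative only; `compose_cedilla` is a hypothetical helper that appends the combining cedilla U+0327 to a base letter and applies NFC normalization, so that a precomposed character is used whenever Unicode defines one.

```python
import unicodedata

CEDILLA = "\u0327"  # COMBINING CEDILLA
OGONEK = "\u0328"   # COMBINING OGONEK, a different diacritic


def compose_cedilla(base: str) -> str:
    """Hypothetical helper: base letter plus combining cedilla, NFC-normalized."""
    return unicodedata.normalize("NFC", base + CEDILLA)


for base in "lmno":
    composed = compose_cedilla(base)
    points = " ".join(f"U+{ord(c):04X}" for c in composed)
    print(base, composed, points)

# The ogonek yields a distinct precomposed character for o.
with_ogonek = unicodedata.normalize("NFC", "o" + OGONEK)
print(f"U+{ord(with_ogonek):04X} {unicodedata.name(with_ogonek)}")
```

After normalization, l and n collapse to a single code point each, while m and o remain two code points (base letter plus U+0327), in line with the statement that only l and n exist as precomposed characters.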
According to a foundational grammar available online,[20] ļ would correspond to Script error: No such module "IPA"., m̧ to Script error: No such module "IPA". (labialized /m/), ņ to Script error: No such module "IPA". (retroflex /n/), and o̧ to a type of long /oː/. These values are not confirmed by a study of Marshallese phonology,[21] which does not discuss the current orthography.
Cameroonian languages
The General Alphabet of Cameroonian Languages recommends avoiding diacritics above graphemes to modify phonetic value, reserving that position for tone marking. Diacritics below graphemes are therefore preferred for phonetic modification. The cedilla is one such diacritic, indicating nasalization in practice, notably in Dii, Kako, Karang, Maka, Mbodomo, Mundani, Pana, and Vute.
Nasalized vowels marked with a cedilla include:
- A̧, a̧
- Ȩ, ȩ
- Ɛ̧, ɛ̧
- Ə̧, ə̧
- I̧, i̧
- Ɨ̧, ɨ̧
- O̧, o̧
- Ɔ̧, ɔ̧
- U̧, u̧
Kinande
In Kinande, the cedilla is used to indicate advanced tongue root articulation in vowels, notably i and u:
- I̧, i̧
- U̧, u̧
Indigenous languages of the Americas
In the orthographies developed by the New Tribes Mission for Jodï, Maco, and Piaroa, the cedilla is used to indicate nasalized vowels.
Languages with ogoneks
The cedilla should not be confused with the ogonek, which is not discussed in this article. Languages such as Navajo, Apache, Polish, and, as in the example below, Lithuanian, do not use cedillas but ogoneks:
- Ą, ą
- Ę, ę
- Į, į
- Ų, ų
Phonetic transcription
In the International Phonetic Alphabet, [ç] represents the voiceless palatal fricative. This sound does not occur in French.
Alan Timberlake uses the cedilla to indicate consonant palatalization in a Russian grammar published in 2004: p̧ b̧ ţ ḑ ķ ģ ç̆ ʒ̧̆ ş ş̆ x̧ v̧ z̧ z̧̆ m̧ ņ ļ ŗ.
ASCII and ISO 646 transcription
Basic ASCII (the American version of the ISO/IEC 646 standard encoding characters from 0 to 127) does not include letters with diacritics. At a time when it was often the only available code page, some users simulated the cedilla by placing a comma after the letter; for example, writing c,a for ça.
However, national variants of ISO 646 used the few non-invariant positions of the standard to encode additional punctuation marks and diacritics:
- The French version[22] (standard NF Z 62010-1982, deposited with ECMA by AFNOR) encodes the lowercase c with cedilla at position 124, replacing the | character of the American version.
- An earlier French version[23] (standard NF Z 62010-1973, obsolete since 1985) required the use of the backspace control character (BS, code 8) to overstrike characters and simulate the addition of a diacritic, except for letters already encoded with diacritics in the national variant; thus the cedilla could be encoded as <BS ; comma> following an uppercase C.
- The Spanish,[24] Catalan, and Basque versions of ISO 646 (registered with ECMA by IBM or Olivetti) encode uppercase and lowercase c with cedilla at positions 93 and 125 respectively, replacing the ASCII characters ] and }.
- The Portuguese versions[25][26] (registered with ECMA by IBM or Olivetti) encode uppercase and lowercase c with cedilla at positions 92 and 124 respectively, replacing the ASCII characters \ and |.
- The Italian version[27] (registered with ECMA by Olivetti) encodes the lowercase c with cedilla at position 92, replacing the ASCII \.
- The French, Spanish, Portuguese, German,[28] Hungarian,[29] Norwegian,[30] Swedish,[31] and Greek[32] variants of ISO 646 continue to refer to the cedilla as a possible representation of the comma (although they do not prescribe any specific use of a control character for this purpose).
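To make the position assignments above concrete, here is a minimal Python sketch that decodes 7-bit data using only the cedilla positions listed for each national variant. It rests on several assumptions: the variant labels and the function name `decode_iso646` are invented for this example, the table covers only the c-cedilla positions stated in the text, and every other position is left as in ASCII, so this is not a complete decoder for any national variant.

```python
# Positions taken from the list above; the variant labels are chosen for
# this sketch and are not official registration names.
ISO646_CEDILLA = {
    "french_1982": {124: "\u00e7"},               # | -> c-cedilla (lowercase)
    "spanish": {93: "\u00c7", 125: "\u00e7"},     # ] -> uppercase, } -> lowercase
    "portuguese": {92: "\u00c7", 124: "\u00e7"},  # \ -> uppercase, | -> lowercase
    "italian": {92: "\u00e7"},                    # \ -> lowercase
}


def decode_iso646(data: bytes, variant: str) -> str:
    """Hypothetical helper: decode 7-bit data, remapping only cedilla positions."""
    overrides = ISO646_CEDILLA[variant]
    chars = []
    for b in data:
        if b > 127:
            raise ValueError(f"not a 7-bit code: {b}")
        # Positions not listed in the table keep their ASCII meaning here.
        chars.append(overrides.get(b, chr(b)))
    return "".join(chars)


print(decode_iso646(b"gar|on", "french_1982"))
print(decode_iso646(b"cabe|a", "portuguese"))
print(decode_iso646(b"Bar}a", "spanish"))
```

The same byte can therefore mean different things depending on the variant: position 124 is a vertical bar in ASCII but a lowercase c with cedilla in the French and Portuguese tables given above.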
Notes and references
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ a b c Script error: No such module "citation/CS1".
- ↑ a b c d Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ a b Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "Citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "Citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".