Czech orthography

From Wikipedia, the free encyclopedia
(Redirected from Czech Orthography)
Jump to navigation Jump to search

Template:Short description Template:More citations needed Script error: No such module "Infobox".Template:Template otherScript error: No such module "Check for unknown parameters". Czech orthography is a system of rules for proper formal writing (orthography) in Czech. The earliest form of separate Latin script specifically designed to suit Czech was devised by Czech theologian and church reformist Jan Hus, the namesake of the Hussite movement, in one of his seminal works, De orthographia bohemica (On Bohemian orthography).

The modern Czech orthographic system is diacritic, having evolved from an earlier system which used many digraphs (although one digraph has been kept - ch). The caron (known as háček in Czech) is added to standard Latin letters to express sounds which are foreign to Latin. The acute accent is used for long vowels.

The Czech orthography is considered the model for many other Balto-Slavic languages using the Latin alphabet; Slovak orthography being its direct revised descendant, while the Croatian Gaj's Latin alphabet and its Slovene and Serbian descendant system are largely based on it. The Baltic languages, such as Latvian and Lithuanian, are also largely based on it. All of them make use of similar diacritics and also have a similar, usually interchangeable, relationship between the letters and the sounds they are meant to represent.[1]

Alphabet

The Czech alphabet consists of 42 letters.

Czech alphabet
Majuscule forms (uppercase/capital letters)
A Á B C Č D Ď E É Ě F G H Ch I Í J K L M N
Ň O Ó P Q R Ř S Š T Ť U Ú Ů V W X Y Ý Z Ž
Minuscule forms (lowercase/small letters)
a á b c č d ď e é ě f g h ch i í j k l m n
ň o ó p q r ř s š t ť u ú ů v w x y ý z ž
Czech alphabet (detail)
Letter Name Letter Name
Uppercase Lowercase Uppercase Lowercase
A a á Ň ň
Á á dlouhé á; á s čárkou O o ó
B b Ó óTemplate:Efn dlouhé ó; ó s čárkou
C c P p
Č č čé Q q kvé
D d R r er
Ď ď ďé Ř ř
E e é S s es
É é dlouhé é; é s čárkou Š š
ĚTemplate:Efn ě ije; é s háčkem T t
F fTemplate:Efn ef Ť ť ťé
G gTemplate:Efn U u ú
H h Ú ú dlouhé ú; ú s čárkou
Ch ch chá ŮTemplate:Efn ů ů s kroužkem
I i í; měkké i V v
Í í dlouhé í; dlouhé měkké í;
í s čárkou; měkké í s čárkou
W w dvojité vé
J j X x iks
K k Y y ypsilon; krátké tvrdé ý
L l el ÝTemplate:Efn ý dlouhé ypsilon; dlouhé tvrdé ý;
ypsilon s čárkou; tvrdé ý s čárkou
M m em Z z zet
N n en Ž ž žet

Template:Notelist

The letters Q, W, and X are used exclusively in foreign words, and the former two are respectively replaced with KV and V once the word becomes "naturalized" (assimilated into Czech); the digraphs dz and are also used mostly for foreign words and are not considered to be distinct letters in the Czech alphabet.

Orthographic principles

Czech orthography is primarily phonemic (rather than phonetic) because an individual grapheme usually corresponds to an individual phoneme (rather than a sound). However, some graphemes and letter groups are remnants of historical phonemes which were used in the past but have since merged with other phonemes. Some changes in the phonology have not been reflected in the orthography.

Vowels
Grapheme IPA value Notes
a Template:IPAslink
á Template:IPAslink
e Template:IPAslink
é Template:IPAslink
ě Template:IPAslink, Script error: No such module "IPA". Marks palatalization of preceding consonant; see usage rules below
i Template:IPAslink Palatalizes preceding Template:Angbr, Template:Angbr, or Template:Angbr; see usage rules below
í Template:IPAslink Palatalizes preceding Template:Angbr, Template:Angbr, or Template:Angbr; see usage rules below
o Template:IPAslink
ó Template:IPAslink Occurs mostly in words of foreign origin.
u Template:IPAslink
ú Template:IPAslink See usage rules below
ů Template:IPAslink See usage rules below
y Template:IPAslink See usage rules below
ý Template:IPAslink See usage rules below
Consonants
Grapheme IPA value Notes
b Template:IPAslink
c Template:IPAslink [n 1]
č Template:IPAslink [n 1]
d Template:IPAslink Represents Template:IPAslink before Template:Angbr; see below
ď Template:IPAslink
f Template:IPAslink Occurs mostly in words of foreign origin.
g Template:IPAslink Occurs mostly in words of foreign origin. Script error: No such module "Unsubst".
h Template:IPAslink
ch Template:IPAslink
j Template:IPAslink
k Template:IPAslink
l Template:IPAslink
m Template:IPAslink
n Template:IPAslink Represents Template:IPAslink before Template:Angbr; see below
ň Template:IPAslink
p Template:IPAslink
r Template:IPAslink
ř Template:IPAslink [n 2]
s Template:IPAslink
š Template:IPAslink
t Template:IPAslink Represents Template:IPAslink before Template:Angbr; see below
ť Template:IPAslink
v Template:IPAslink
x Script error: No such module "IPA". Occurs only in words of foreign origin; pronounced Script error: No such module "IPA". in words with the prefix 'ex-' before vowels or voiced consonants.
z Template:IPAslink
ž Template:IPAslink
  1. a b Unofficial ligatures are sometimes used for the transcription of affricates: Script error: No such module "IPA".. The actual IPA version supports using two separate letters which can be joined by a tiebar.
  2. The "long-leg R" Template:Angbr IPA is sometimes used to transcribe voiced Template:Angbr (unofficially). This character was withdrawn from the IPA and replaced by the "lower-case R" with the "up-tack" diacritic mark, which denotes "raised alveolar trill".

Voicing assimilation

Script error: No such module "Labelled list hatnote". All the obstruent consonants are subject to voicing (before voiced obstruents except Template:Angbr) or devoicing (before voiceless consonants and at the end of words); spelling in these cases is morphophonemic (i.e. the morpheme has the same spelling as before a vowel). An exception is the cluster Template:Angbr, in which the Script error: No such module "IPA". is voiced to Script error: No such module "IPA". only in Moravian dialects, while in Bohemia the Script error: No such module "IPA". is devoiced to Script error: No such module "IPA". instead (e.g. shodit Script error: No such module "IPA"., in Moravia Script error: No such module "IPA".). Devoicing Script error: No such module "IPA". changes its articulation place: it becomes Script error: No such module "IPA".. After unvoiced consonants Template:Angbr is devoiced: for instance, in Template:Wikt-lang 'three', which is pronounced {{errorTemplate:Main other|Audio file "Cs-tři.ogg" not found}}Template:Category handlerTemplate:Category handler. Written voiced or voiceless counterparts are kept according to the etymology of the word, e.g. odpadnout Script error: No such module "IPA". (to fall away) - od- is a prefix; written Script error: No such module "IPA". is devoiced here because of the following voiceless Script error: No such module "IPA"..

For historical reasons, the consonant Script error: No such module "IPA". is written k in Czech words like kde ('where', < Proto-Slavic *kъdě) or kdo ('who', < Proto-Slavic *kъto). This is because the letter g was historically used for the consonant Script error: No such module "IPA".. The original Slavic phoneme Script error: No such module "IPA". changed into Script error: No such module "IPA". in the Old-Czech period. Thus, Script error: No such module "IPA". is not a separate phoneme (with a corresponding grapheme) in words of domestic origin; it occurs only in foreign words (e.g. graf, gram, etc.).

Final devoicing

Unlike in English but like German, Dutch and Russian, voiced consonants are pronounced voicelessly in the final position in words. In declension, they are voiced in cases where the words take on endings.

Compare:

led Script error: No such module "IPA".ledy Script error: No such module "IPA". (ice – ices)
let Script error: No such module "IPA".lety Script error: No such module "IPA". (flight – flights)


"Soft" I and "hard" Y

The letters Template:Vr and Template:Vr are both pronounced Script error: No such module "IPA"., while Template:Vr and Template:Vr are both pronounced Script error: No such module "IPA".. Template:Vr was originally pronounced Script error: No such module "IPA". as in contemporary Polish. However, in the 14th century, this difference in standard pronunciation disappeared, though it has been preserved in some Moravian dialects.[2] In words of native origin "soft" Template:Vr and Template:Vr cannot follow "hard" consonants, while "hard" Template:Vr and Template:Vr cannot follow "soft" consonants; "neutral" consonants can be followed by either vowel:

Hard and soft consonants
Soft ž, š, č, ř, c, j, ď, ť, ň
Neutral b, f, l, m, p, s, v, z
Hard h, ch, k, r, d, t, n, g

When Template:Vr or Template:Vr is written after Template:Vr in native words, these consonants are soft, as if they were written Template:Vr. That is, the sounds Script error: No such module "IPA". are written Template:Vr instead of Template:Vr, e.g. in čeština Script error: No such module "IPA".. The sounds Script error: No such module "IPA". are denoted, respectively, by Template:Vr. In words of foreign origin, Template:Vr are pronounced Script error: No such module "IPA".; that is, as if they were written Template:Vr, e.g. in diktát, dictation.

Historically the letter Template:Vr was hard, but this changed in the 19th century. However, in some words it is still followed by the letter Template:Vr: tác (plate) – tácy (plates).

Because neutral consonants can be followed by either Template:Vr or Template:Vr, in some cases they distinguish homophones, e.g. být (to be) vs. bít (to beat), mýt (to wash) vs. mít (to have). At school pupils must memorize word roots and prefixes where Template:Vr is written; Template:Vr is written in other cases. Writing Template:Vr or Template:Vr in endings is dependent on the declension patterns.

Letter Ě

The letter Template:Vr is a vestige of Old Czech palatalization. The originally palatalizing phoneme /ě/ Script error: No such module "IPA". became extinct, changing to Script error: No such module "IPA". or Script error: No such module "IPA"., but it is preserved as a grapheme which can never appear in the initial position.

  • Script error: No such module "IPA". are written Template:Vr instead of Template:Vr, analogously to Template:Vr
  • Script error: No such module "IPA". are usually written Template:Vr instead of Template:Vr
    • In words like vjezd (entry, drive-in) objem (volume), Template:Vr are written because in such cases –je- is etymologically preceded by the prefixes v- or ob-
  • Script error: No such module "IPA". is usually written Template:Vr instead of Template:Vr, except for morphological reasons in some words (jemný, soft -> jemně, softly)
    • The first-person singular pronouns (for the genitive and accusative cases) and mně (for the dative and locative) are homophones Script error: No such module "IPA".—see Czech declension

Letter Ů

There are two ways in Czech to write long Script error: No such module "IPA".: Template:Vr and Template:Vr. Template:Vr cannot occur in an initial position, while Template:Vr occurs almost exclusively in the initial position or at the beginning of a word root in a compound.

Historically, long Template:Vr changed into the diphthong Template:Vr Script error: No such module "IPA". (as also happened in the English Great Vowel Shift with words such as "house"), though not in word-initial position in the prestige form. In 1848 Template:Vr at the beginning of word-roots was changed into Template:Vr in words like Template:Wikt-lang to reflect this. Thus, the letter Template:Vr is written at the beginning of word-roots only: úhel (angle), trojúhelník (triangle), except in loanwords: skútr (scooter).

Meanwhile, historical long Template:Vr Script error: No such module "IPA". changed into the diphthong Template:Vr Script error: No such module "IPA".. As was common with scribal abbreviations, the letter Template:Vr in the diphthong was sometimes written as a ring above the letter Template:Vr, producing Template:Vr, e.g. kóň > kuoň > kůň (horse), like the origin of the German umlaut. Later, the pronunciation changed into Script error: No such module "IPA"., but the grapheme Template:Vr has remained. It never occurs at the beginning of words: dům (house), domů (home, homeward).

The letter Template:Vr now has the same pronunciation as the letter Template:Vr (long Script error: No such module "IPA".), but alternates with a short Template:Vr when a word is inflected (e.g. nom. kůň → gen. koně, nom. dům → gen. domu), thus showing the historical evolution of the language.

Agreement between the subject and the predicate

The predicate must be always in accordance with the subject in the sentence - in number and person (personal pronouns), and with past and passive participles also in gender. This grammatical principle affects the orthography (see also "Soft" I and "Hard" Y) – it is especially important for the correct choice and writing of plural endings of the participles.

Examples:

Gender Sg. Pl. English
masculine animate pes byl koupen psi byli koupeni a dog was bought/dogs were bought
masculine inanimate hrad byl koupen hrady byly koupeny a castle was bought/castles were bought
feminine kočka byla koupena kočky byly koupeny a cat was bought/cats were bought
neuter město bylo koupeno města byla koupena a town was bought/towns were bought

The mentioned example shows both past (byl, byla ...) and passive (koupen, koupena ...) participles. The accordance in gender takes effect in the past tense and the passive voice, not in the present and future tenses in active voice.

If the complex subject is a combination of nouns of different genders, masculine animate gender is prior to others and the masculine inanimate and feminine genders are prior to the neuter gender.

Examples:

muži a ženy byli - men and women were
kočky a koťata byly - cats and kittens were
my jsme byli (my = we all/men) vs. my jsme byly (my = we women) - we were

Priority of genders:

masculine animate > masculine inanimate & feminine > neuter

Punctuation

The use of the full stop (.), the colon (:), the semicolon (;), the question mark (?) and the exclamation mark (!) is similar to their use in other European languages. The full stop is placed after a number if it stands for ordinal numerals (as in German), e.g. 1. den (= první den) – the 1st day.

The comma is used to separate individual parts in complex-compound sentences, lists, isolated parts of sentences, etc. Its use in Czech is different from English. Subordinate (dependent) clauses must be always separated from their principal (independent) clauses, for instance. A comma is not placed before a (and), i (as well as), ani (nor) and nebo (or) when they connect parts of sentences or clauses in copulative conjunctions (on a same level). It must be placed in non-copulative conjunctions (consequence, emphasis, exclusion, etc.). A comma can, however, occur in front of the word a (and) if the former is part of comma-delimited parenthesis: Jakub, můj mladší bratr, a jeho učitel Filip byli příliš zabráni do rozhovoru. Probírali látku, která bude u zkoušky, a též, kdo na ní bude. A comma also separates subordinate conjunctions introduced by composite conjunctions a proto (and therefore) and a tak (and so).

Examples:

  • otec a matka – father and mother, otec nebo matka – father or mother (coordinate relation – no commas)
  • Je to pravda, nebo ne? – Is it true, or not? (exclusion)
  • Pršelo, a proto nikdo nepřišel. – It was raining, and so no one came. (consequence)
  • Já vím, kdo to je. – I know who it is.
  • Myslím, že se mýlíš. – I think you are mistaken. (subordinate relation)
  • Jak se máš, Anno? – How are you, Anna? (addressing a person)
  • Karel IV., římský císař a český král, založil hrad Karlštejn.Charles IV, Roman Emperor and Bohemian king, founded the Karlštejn Castle. (comma-delimited parenthesis)

Quotation marks. The first one preceding the quoted text is placed to the bottom line:

  • Petr řekl: „Přijdu zítra.“ – Peter said: "I'll come tomorrow."

Other types of quotation marks: ‚‘ »«

Apostrophes are used rarely in Czech. They can denote a missing sound in non-standard speech, but it is optional, e.g. řek' or řek (= řekl, he said).

Capital letters

The first word of every sentence and all proper names are capitalized. Special cases are:

  • Respect expression – optional: Ty (you sg.), Tvůj (your sg.), Vy (you pl.), Váš (your pl.); Bůh (God), Mistr (Master), etc.
  • Headings – The first word is capitalized.
  • Cities, towns and villages – All words are capitalized, except for prepositions: Nové Město nad Metují (New-Town-upon-Metuje).
  • Geographical or local names – The first word is capitalized, common names as ulice (street), náměstí (square) or moře (sea) are not capitalized: ulice Svornosti (Concordance Street), Václavské náměstí (Wenceslas Square), Severní moře (North Sea). Since 1993, the initial preposition and the first following word are capitalized: lékárna U Černého orla (Black Eagle Pharmacy).
  • Official names of institutions – The first word is capitalized: Městský úřad v Kolíně (The Municipal Office in Kolín) vs. městský úřad (a municipal office). In some cases, an initial common name is not capitalized even if it is factually a part of the name: okres Semily (Semily District), náměstí Míru (Peace Square).
  • Names of nations and nationality nouns are capitalized: Anglie (England), Angličan (Englishman), Německo (Germany), Němec (German). Adjectives derived from geographical names and names of nations, such as anglický (English – adjective) and pražský (Prague – adjective, e.g. pražské metro, Prague subway), are not. Names of languages are not capitalized: angličtina (English).
  • Possessive adjectives derived from proper names are capitalized: Pavlův dům (Paul's house).
  • Brands are capitalized as a trademark or company name, but usually not as product names: přijel trabant a několik škodovek but přijelo auto značky Trabant a několik aut značky Škoda, zákaz vjezdu segwayů but zákaz vjezdu vozítek Segway
  • If a proper name contains other proper names, the inner proper names keep their orthography: Poslanecká sněmovna Parlamentu České republiky, Kostelec nad Černými lesy, Filozofická fakulta Jihočeské univerzity v Českých Budějovicích

History

In the 9th century, the Glagolitic script was used, during the 11th century it was replaced by Latin script. There are five periods in the development of the Czech Latin-based orthographic system:

Primitive orthography
For writing sounds which are foreign to the Latin alphabet, letters with similar sounds were used. The oldest known written notes in Czech originate from the 11th century. The literature was written predominantly in Latin in this period. Unfortunately, it was very ambiguous at times, with c, for example, being used for c, č, and k.
Digraphic orthography
Various digraphs were used for non-Latin sounds. The system was not consistent and it also did not distinguish long and short vowels. It had some features that Polish orthography has kept, such as cz, rz instead of č, ř, but was still crippled by ambiguities, such as spelling both s and š as s/ss, z and ž as z, and sometimes even c and č both as cz, only distinguishing by context. Long vowels such as á were sometimes (but not always) written double as aa. Other features of the day included spelling j as g and v as w, as the early modern Latin alphabet had not by then distinguished j from i or v from u.
Diacritic orthography
Introduced probably by Jan Hus. Using diacritics for long vowels ("virgula", an acute, "čárka" in Czech) and "soft" consonants ("punctus rotundus", a dot above a letter, which has survived in Polish ż) was suggested for the first time in "De orthographia Bohemica" around 1406. Diacritics replaced digraphs almost completely. It was also suggested that the Prague dialect should become the standard for Czech. Jan Hus is considered to be the author of that work but there is some uncertainty about this.
Brethren orthography
The Bible of Kralice (1579–1593), the first complete Czech translation of the Bible from the original languages by the Czech Brethren, became the model for the literary form of the language. The punctus rotundus was replaced by the caron ("háček"). There were some differences from the current orthography, e.g. the digraph ſſ was used instead of š; ay, ey, au instead of aj, ej, ou; v instead of u (at the beginning of words); w instead of v; g instead of j; and j instead of í (Script error: No such module "Lang". = její, hers). Y was written always after c, s and z (e.g. cizí, foreign, was written cyzý) and the conjunction i (as well as, and) was written y.
Modern orthography
During the period of the Czech National Renaissance (end of the 18th century and the first half of the 19th century), Czech linguists (Josef Dobrovský et al.) codified some reforms in the orthography. These principles have been effective up to the present day. The later reforms in the 20th century mostly referred to introducing loanwords into Czech and their adaptation to the Czech orthography.

Computer encoding

In computing, several different coding standards have existed for this alphabet, among them:

See also

References

Template:Reflist

External links

Template:Language orthographies

  1. Script error: No such module "citation/CS1".
  2. Script error: No such module "citation/CS1".
  3. Script error: No such module "citation/CS1".