Wiki143:Indic transliteration

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Template:Subcat guideline This is a guideline for the transliteration (or Romanization) of writings from Indic languages and Indic scripts for use in the English-language Wikipedia. It is based on ISO 15919, and is applicable to all languages of south Asia that are written in Indic scripts.

All transliteration should be from the written form in the original script of the original language of the name or term. The original text in the original script may also be included for reference and checking.

Formal transliteration

The formal transliteration may be used to accurately and unambiguously present the phonetic content of the original script. It should be provided for reference whenever reference to the original source is needed.

The scheme is based on ISO 15919 for Indic scripts. This is very close to IAST with minor differences to accommodate non-Devanagari scripts. The differences are:

  • ए - IAST: e, ISO: ē
  • ओ - IAST: o, ISO: ō
  • अं - IAST: Script error: No such module "lang"., ISO: ṁ (ṃ is used to specifically represent Gurmukhi Tippi ੰ)
  • ऋ - IAST: Script error: No such module "lang"., ISO: r̥
  • ॠ - IAST: Script error: No such module "lang"., ISO: r̥̄

Simplified transliteration

A set of simplified transliteration symbols is provided here. These are not part of the ISO standard. They have been devised for Wikipedia, and they may be used to avoid the use of diacritic marks. Simplified transliterations should not be considered to be authoritative, and may result in ambiguous transliteration.

Inherent vowel

When the source script does not indicate the removal of the inherent 'a' and it is not pronounced in the original source language, such unpronounced 'a's are removed.

The inherent vowel is always transliterated as 'a' in the formal ISO 15919 transliteration. In the simplified transliteration, 'a' is also normally used except in the Bengali, Assamese, and Odia languages, where 'o'/'ô' is used. See Romanization of Bengali for the transliteration scheme set for Bengali on Wikipedia.

In certain instances, the inherent vowel is not pronounced. The rules for such differ among languages. In some instances, the removal of an inherent vowel is explicitly marked by the presence of a virama.

Devanagari क्
Bengali ক্
Gurmukhi ਕ੍
Gujarati ક્
Oriya କ୍
Tamil க்
Telugu క్
Kannada ಕ್
Malayalam ക്
Sinhala ක්

Vowels

Vowels are presented in their independent form on the left of each column, and combined with the corresponding consonant ka on the right. An asterisk indicates that the letter or ligature exists, but has not been encoded in unicode or is archaic/obsolete.

ISO 15919 Simplified IPA Devanagari Bengali/Assamese Gurmukhi Gujarati Oriya Tamil Telugu Kannada Malayalam Sinhala
a a ə/Script error: No such module "IPA"./ä/Script error: No such module "IPA"./o
ā a Script error: No such module "IPA"./a का কা ਕਾ કા କା கா కా ಕಾ കാ කා
æ ae Script error: No such module "IPA". कॅ - - - - - - - - - - - - - - - - කැ
ǣ ae Script error: No such module "IPA". - - - - - - - - - - - - - - - - - - කෑ
i i i कि কি ਕਿ કિ କି கி కి ಕಿ കി කි
ī i iː/i की কী ਕੀ કી କୀ கீ కీ ಕೀ കീ කී
u u u कु কু ਕੁ કુ କୁ கு కు ಕು കു කු
ū u uː/u कू কূ ਕੂ કૂ କୂ கூ కూ ಕೂ കൂ කූ
ĕ e æ/Script error: No such module "IPA". कॅ - - - - કૅ - - - - - - - - - - - -
e e e कॆ - - - - - - - - கெ కె ಕೆ കെ කෙ
ē e eː/e/Script error: No such module "IPA". के কে ਕੇ કે କେ கே కే ಕೇ കേ කේ
ai ai Script error: No such module "IPA"./əj/æ/Script error: No such module "IPA"./Script error: No such module "IPA". कै কৈ ਕੈ કૈ କୈ கை కై ಕೈ കൈ කෛ
ŏ o Script error: No such module "IPA". कॉ - - - - કૉ - - - - - - - - - - - -
o o o कॊ - - - - - - - - கொ కొ ಕೊ കൊ කො
ō o oː/o को কো ਕੋ કો କୋ கோ కో ಕೋ കോ කෝ
au au Script error: No such module "IPA"./əw/Script error: No such module "IPA"./ow कौ কৌ ਕੌ કૌ କୌ கௌ కౌ ಕೌ കൌ කෞ
ri Script error: No such module "IPA"./ri/ru कृ কৃ - - કૃ କୃ - - కృ ಕೃ കൃ කෘ
r̥̄ ri Script error: No such module "IPA"./riː/ruː/ri/ru कॄ কৄ - - કૄ କୄ - - కౄ ಕೄ കൄ කෲ
li Script error: No such module "IPA"./li/lu कॢ কৢ - - કૢ କୢ - - కౢ ಕೢ കൢ කෟ
l̥̄ li Script error: No such module "IPA"./liː/luː/li/lu कॣ কৣ - - કૣ କୣ - - కౣ ಕೣ കൣ කෳ

Consonants

See also Brahmic family#Consonants.

ISO 15919 Simplified IPA Devanagari Bengali/
Assamese
Gurmukhi Gujarati Oriya Tamil Telugu Kannada Malayalam Sinhala
k k Script error: No such module "IPA".
kh kh Script error: No such module "IPA". கஃ
g g g
gh gh Script error: No such module "IPA". [1] கஃ
n ŋ
c ch Script error: No such module "IPA"./s
ch chh Script error: No such module "IPA"./s சஃ
j j Script error: No such module "IPA"./z
jh jh Script error: No such module "IPA"./z [2] ஜஃ
ñ n Script error: No such module "IPA"./n/-
t Script error: No such module "IPA"./t
ṭh th Script error: No such module "IPA"./tʰ டஃ
d Script error: No such module "IPA"./d
ḍh dh Script error: No such module "IPA"./dʱ [3] டஃ
n Script error: No such module "IPA"./n
t t Script error: No such module "IPA"./t
th th Script error: No such module "IPA"./tʰ தஃ
d d Script error: No such module "IPA"./d
dh dh Script error: No such module "IPA"./dʱ [4] தஃ
n n Script error: No such module "IPA"./n[5]
n n - ਨ਼ ન઼ - - - න.[6]
p p p
ph ph Script error: No such module "IPA"./f பஃ
b b b
bh bh Script error: No such module "IPA". [7] பஃ
m m m
y y j
r r r/Script error: No such module "IPA".[8] র/ৰ[9]
r r - ਰ਼ ર઼ - ර.[10]
[11] r r र्‍ - - - - - - - - -
l l l
l Script error: No such module "IPA". - ਲ਼
l Script error: No such module "IPA". - - ળ઼ - ළ.[12]
v v Script error: No such module "IPA"./Script error: No such module "IPA".[13] [14]
ś sh Script error: No such module "IPA"./Script error: No such module "IPA"./Script error: No such module "IPA"./Script error: No such module "IPA". ਸ਼ [15]
sh Script error: No such module "IPA"./Script error: No such module "IPA"./Script error: No such module "IPA"./Script error: No such module "IPA". -
s s s/Script error: No such module "IPA"./Script error: No such module "IPA".
h h Script error: No such module "IPA". [16]
q q q क़ ক় ਕ਼ ક઼ କ଼ க̡ - - - -
ḵẖ kh x ख़ খ় ਖ਼ ખ઼ ଖ଼ - - - - -
ġ g ɣ ग़ গ় ਗ਼ ગ઼ ଗ଼ - - - - -
z z z ज़ জ় ਜ਼ જ઼ ଜ଼ ஃஜ - ಜ಼ - -
r ɽ ड़ ড় ડ઼ ଡ଼ - - - - -
ṛh rh ɽʱ ढ़ ঢ় ੜ੍ਹ ઢ઼ ଢ଼ - - - - -
f f f फ़ ফ় ਫ਼ ફ઼ ଫ଼ ஃப - ಫ಼
y j/e य़ য় ਯ਼ ય઼ - - - - -
t Script error: No such module "IPA". त़ ত় ਤ਼ ત઼ ତ଼ - - - - -
s s स़ স় - સ઼ ସ଼ - - - - -
h Script error: No such module "IPA". ह़ হ় ਹ਼ હ઼ ହ଼ - - - - -
w w w व़ [17] ਵ਼ વ઼ வ̡ - - - -
t t - - - - - - - റ്റ[18] (ഺ) -
- khy kʰj - ক্ষ[19] - - - - - - - -
  • <templatestyles src="Citation/styles.css"/>^ See special notes for Punjabi, specifically voiced aspirates.
  • <templatestyles src="Citation/styles.css"/>^ In Indo-Aryan languages, this letter is theoretically pronounced as a dental nasal, but it is actually alveolar. In Tamil and Malayalam, it is a dental nasal and the alveolar nasal has a separate letter (: see note below).
  • <templatestyles src="Citation/styles.css"/>^ This letter is obsolete. See the Malayalam language article for further details.
  • <templatestyles src="Citation/styles.css"/>^ In languages that contrast two rhotic consonants, this is generally Script error: No such module "IPA".. In Indo-Aryan languages that do not make this distinction but have Script error: No such module "IPA". and [r] as allophones, the /r/ phoneme is generally pronounced Script error: No such module "IPA". when following a voiced consonant (although there are exceptions, such as the consonant j Script error: No such module "IPA".) and [r] in most other environments.
  • <templatestyles src="Citation/styles.css"/>^ Use when the distinction between the reph and eyelash form of Ra is required; otherwise transliterate as 'r'.
  • <templatestyles src="Citation/styles.css"/>^ Used when writing Tamil in Sinhala script.
  • <templatestyles src="Citation/styles.css"/>^ Use Script error: No such module "Lang". for Bengali and Manipuri, and Script error: No such module "Lang". for Assamese.
  • <templatestyles src="Citation/styles.css"/>^ Assamese and Manipuri only.
  • <templatestyles src="Citation/styles.css"/>^ May be pronounced 'w' in some languages.
  • <templatestyles src="Citation/styles.css"/>^ Also the Tamil ligature SRI (Script error: No such module "Lang". = Script error: No such module "Lang". or, prior to Unicode 4.1, Script error: No such module "Lang". = Script error: No such module "Lang".) should be transliterated as śrī with ś, although srī may be also acceptable. See [20] and [21].
  • <templatestyles src="Citation/styles.css"/>^ See special notes for Punjabi. Specifically 'ha'.
  • <templatestyles src="Citation/styles.css"/>^
  • <templatestyles src="Citation/styles.css"/>^ This is the symbol for the geminate consonant - the letter for the single [t], PNG Image, has become obsolete.
  • <templatestyles src="Citation/styles.css"/>^ Only in Assamese. Script error: No such module "Lang". in Assamese is not a composite but an individual letter with a phonetic value unlike in other languages.

Assamese velar fricatives

ISO 15919 Simplified IPA Assamese
ś x x
x x
s x x

Sinhalese half-nasals

ISO 15919 Simplified IPA Sinhala
n̆g ng ng
[22] jn gn
n̆j nj
n̆ḍ nd
n̆d nd nd̪
m̆b mb mb
  • <templatestyles src="Citation/styles.css"/>^ This character is technically a conjunct, but is encoded separately in Unicode.

Sindhi/Punjabi consonants

ISO 15919 Simplified IPA Devanagari Gurmukhi Shahmukhi Saraiki
gg[23] gg ɠ ॻ (ग॒) ੱਗ Template:Uninastaliq Template:Uninastaliq
jj[24] jj ʄ ॼ (ज॒) ੱਜ Template:Uninastaliq Template:Uninastaliq
ḍḍ[25] dd ॾ (ड॒) ੱਡ Template:Uninastaliq Template:Uninastaliq
bb[26] bb ɓ ॿ (ब॒) ੱਬ Template:Uninastaliq Template:Uninastaliq
  • <templatestyles src="Citation/styles.css"/>^ Represents Sindhi/Western Punjabi bbē (ٻ).
  • <templatestyles src="Citation/styles.css"/>^ Represents Sindhi/Western Punjabi jjē (ڄ).
  • <templatestyles src="Citation/styles.css"/>^ Represents Sindhi dd.ē (ڏ) or Western Punjabi dd.āl (ڋ).
  • <templatestyles src="Citation/styles.css"/>^ Represents Sindhi ggē (ڳ) or Western Punjabi ggāf (ڰ).

Special notes for Punjabi

Punjabi is rather unique for an Indo-European language in that tones are a prominent feature of speech. As such, the IPA conversion is not accurate for Punjabi. Fortunately, there is a direct correlation between certain aspirated consonants and use of subscript /ha/ to represent different tones.

Voiced aspirates

The consonants that are employed for voiced aspirates in other Indian languages are not pronounced as such in Punjabi. In Punjabi these consonants are used to mark changes in tone. The table below indicates how each consonant is pronounced based on its position within a word.

Consonant Beginning of word All other positions
Script error: No such module "Lang". Script error: No such module "Lang".
Script error: No such module "IPA".
Script error: No such module "Lang".
Script error: No such module "IPA".
Script error: No such module "Lang". Script error: No such module "Lang".
Script error: No such module "IPA".
Script error: No such module "Lang".
Script error: No such module "IPA".
Script error: No such module "Lang". Script error: No such module "Lang".
Script error: No such module "IPA".
Script error: No such module "Lang".
Script error: No such module "IPA".
Script error: No such module "Lang". Script error: No such module "Lang".
Script error: No such module "IPA".
Script error: No such module "Lang".
Script error: No such module "IPA".
Script error: No such module "Lang". Script error: No such module "Lang".
Script error: No such module "IPA".
Script error: No such module "Lang".
Script error: No such module "IPA".

At the beginning or middle of a word, a voiced aspirate indicates a low tone on the following vowel. Examples:

  • Script error: No such module "Lang". Script error: No such module "IPA". is actually pronounced Script error: No such module "IPA".
  • Script error: No such module "Lang". Script error: No such module "IPA". is actually pronounced Script error: No such module "IPA".
  • Script error: No such module "Lang". Script error: No such module "IPA". is actually pronounced Script error: No such module "IPA".

At the end of the word (stem-final), the voiced aspirates indicate a high tone on the preceding vowel. Examples:

  • Script error: No such module "Lang". Script error: No such module "IPA". is actually pronounced Script error: No such module "IPA".

Ha

At the beginning of a word, Script error: No such module "Lang". indicates Script error: No such module "IPA"..

In the middle or at the end of a word, ha indicates a high tone on the preceding vowel. Examples:

  • Script error: No such module "Lang". Script error: No such module "IPA". is actually pronounced Script error: No such module "IPA".

Subscript ha also indicates a high tone on the preceding vowel. Examples:

  • Script error: No such module "Lang". Script error: No such module "IPA". is actually pronounced Script error: No such module "IPA".

The following conventions apply apart from at the beginning of a word:

  • Script error: No such module "Lang". converts into a high tone Script error: No such module "Lang". (e.g. Script error: No such module "Lang". is pronounced Script error: No such module "Lang". Script error: No such module "IPA".).
  • Script error: No such module "Lang". converts into a high tone Script error: No such module "Lang". (e.g. Script error: No such module "Lang". is pronounced Script error: No such module "Lang". Script error: No such module "IPA".).
  • Script error: No such module "Lang". converts into a high tone Script error: No such module "Lang". (e.g. Script error: No such module "Lang". is pronounced Script error: No such module "Lang". Script error: No such module "IPA".).
  • Script error: No such module "Lang". converts into a high tone Script error: No such module "Lang". (e.g. Script error: No such module "Lang". is pronounced Script error: No such module "Lang". Script error: No such module "IPA".).

References

Nasalisation

ISO 15919 IPA Devanagari Bengali Gurmukhi Gujarati Oriya Tamil Telugu Kannada Malayalam Sinhala
[30] Script error: No such module "IPA". [31]
[32] Script error: No such module "IPA". - - - - - - - - -
[33] Script error: No such module "IPA". - - - - -
Script error: No such module "IPA". - - - - - - - - -
  • <templatestyles src="Citation/styles.css"/>^ The signs ṁ and ṃ are essentially identical. However, Gurmukhi has two separate nasal characters and if this distinction is to be retained separate identifiers must be used.
  • <templatestyles src="Citation/styles.css"/>^ For Malayalam, it is transliterated as 'm' at the end of a word. There is no actual phonemic nasalisation in Malayalam. This symbol only indicates nasalisation when Malayalam script is being used to write Sanskrit. Otherwise, it represents either consonantal /m/ (without the inherent vowel) or consonantal Script error: No such module "IPA". (without the inherent vowel), mostly in borrowed Sanskrit words that originally had nasalisation. Some of these borrowed words are pronounced with /m/ and others with Script error: No such module "IPA"., and, because of analogy, this symbol has come to represent these phonemes (when the vowels are suppressed - otherwise the normal letters would be used) in native words as well.
  • <templatestyles src="Citation/styles.css"/>^ When applied to a semivowel (y, r, l, ḷ or v), in contrast to its application to a vowel, candrabindu is placed before the semivowel. For example, Script error: No such module "Lang". is written sa:m̐yyantā and not saym̐yantā.

The standard nasal signs (ṁ and ṃ) are only to be used at the end of words OR when it is crucial to keep the distinction between Bindi and Tippi use in Gurmukhi. Otherwise, the following rules should be enforced:

When followed by ISO 15919 IPA
k, kh, g, gh or ṅ
q, ḵẖ, or ġ
ŋ
c, ch, j, jh or ñ
z
ñ ɲ
ṭ, ṭh, ḍ, ḍh, or ṇ ɳ
t, th, d dh, or n n n
p, ph, b bh, or m
f
m m
y, r, l, v, ś, ṣ, s, h
n n


References

Script specific resources