Urdu alphabet

From Wikipedia, the free encyclopedia
(Redirected from Urdu script)
Jump to navigation Jump to search

Template:Short description Template:Use dmy dates Script error: No such module "Infobox".Template:Template otherScript error: No such module "Check for unknown parameters". Template:Contains special characters Template:Arabic script sidebar Template:Writing systems worldwideThe Urdu alphabet (Template:Langx) is the right-to-left alphabet used for writing Urdu. It is a modification of the Persian alphabet, which itself is derived from the Arabic script. It has co-official status in the republics of Pakistan, India and South Africa. The Urdu alphabet has up to 39[1] or 40[2] distinct letters with no distinct letter cases and is typically written in the calligraphic Nastaʿlīq script, whereas Arabic is more commonly written in the Naskh style.

Usually, bare transliterations of Urdu into the Latin alphabet (called Roman Urdu) omit many phonemic elements that have no equivalent in English or other languages commonly written in the Latin script.

History

The standard Urdu script is a modified version of the Perso-Arabic script and has its origins in the 13th century Iran. It is also related to Shahmukhi, used for the Punjabi language varieties in Punjab, Pakistan. It is closely related to the development of the Nastaʻliq style of Perso-Arabic script.

Despite the invention of the Urdu typewriter in 1911, Urdu newspapers continued to publish prints of handwritten scripts by calligraphers known as katibs or khush-navees until the late 1980s. The Pakistani national newspaper Daily Jang was the first Urdu newspaper to use Nastaʿlīq computer-based composition. There are efforts under way to develop more sophisticated and user-friendly Urdu support on computers and the internet. Nowadays, nearly all Urdu newspapers, magazines, journals, and periodicals are composed on computers with Urdu software programs.

Other than the Indian subcontinent, the Urdu script is also used by Pakistan's large diaspora, including in the United Kingdom, the United Arab Emirates, the United States, Canada, Saudi Arabia and other places.[2]

Nastaliq

File:Persian Nastaʿlīq's proportions.jpg
Example showing Nastaliq's (Persian) proportion rules.Script error: No such module "Unsubst".

Script error: No such module "Labelled list hatnote".

Urdu is written in the Nastaliq style (Template:Langx Nastaʿlīq). The Nastaliq calligraphic writing style began as a Persian mixture of the Naskh and Ta'liq scripts. After the Muslim conquest of the Indian subcontinent, Nastaʻliq became the preferred writing style for Urdu. It is the dominant style in Pakistan and many Urdu writers elsewhere in the world use it. Nastaʿlīq is more cursive and flowing than its Naskh counterpart.

In the Arabic alphabet, and many others derived from it, letters are regarded as having two or three general forms each, based on their position in the word (though Arabic calligraphy can add a great deal of complexity). But the Nastaliq style in which Urdu is written uses more than three general forms for many letters, even in simple non-decorative documents.[3]

Alphabet

The Urdu script is an abjad script derived from the modern Persian script, which is itself a derivative of the Arabic script. As an abjad, the Urdu script only shows consonants and long vowels; short vowels can only be inferred by the consonants' relation to each other. While this type of script is convenient in Semitic languages like Arabic and Hebrew, whose consonant roots are the key of the sentence, Urdu is an Indo-European language, which requires more precision in vowel sound pronunciation, hence necessitating more memorisation. The number of letters in the Urdu alphabet is somewhat ambiguous and debated.[4]

Letter names and phonemes

NameTemplate:Sfn Forms IPA Romanization Unicode Order
Urdu
Roman Urdu
Isolated Final Medial Initial ALA-LC[5] Hunterian[6] Template:Efn-ua [7] Template:Efn-ua
Script error: No such module "Lang".
alif
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA".Template:Efn-ua ā, – ā, – U+0627 1 1 1
Script error: No such module "Lang".
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". b b U+0628 2 2 2
Script error: No such module "Lang".
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". p p U+067E 3 3 3
Script error: No such module "Lang".
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". t t U+062A 4 4 4
Script error: No such module "Lang".
ṭē
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". t U+0679 5 5 5
Script error: No such module "Lang".
s̱ē
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". s U+062B 6 6 6
Script error: No such module "Lang".
jīm
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". j j U+062C 7 7 7
Script error: No such module "Lang".
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". c ch U+0686 8 8 8
Script error: No such module "Lang".
baṛī ḥē
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". h U+062D 9 9 9
Script error: No such module "Lang".
ḥā'e huttī
Script error: No such module "Lang".
ḥā'e muhmala
Script error: No such module "Lang".
k͟hē
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". k͟h kh U+062E 10 10 10
Script error: No such module "Lang".
dāl
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". d d U+062F 11 11 11
Script error: No such module "Lang".
ḍāl
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". d U+0688 12 12 12
Script error: No such module "Lang".
ẕāl
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". z U+0630 13 13 13
Script error: No such module "Lang".
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". r r U+0631 14 14 14
Script error: No such module "Lang".
ṛē
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA".
Template:Efn-ua
r U+0691 15 15 15
Script error: No such module "Lang".
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". z z U+0632 16 16 16
Script error: No such module "Lang".
zhē
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA".
Template:Efn-ua
zh zh U+0698 17 17 17
Script error: No such module "Lang".
sīn
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". s s U+0633 18 18 18
Script error: No such module "Lang".
shīn
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". sh sh U+0634 19 19 19
Script error: No such module "Lang".
ṣwād
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". s U+0635 20 20 20
Script error: No such module "Lang".
ẓwād
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". z U+0636 21 21 21
Script error: No such module "Lang".
t̤oTemplate:Hamzaē
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". t U+0637 22 22 22
Script error: No such module "Lang".
z̤oTemplate:Hamzaē
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". z U+0638 23 23 23
Script error: No such module "Lang".
ʻain
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". ʻ ʻ
Script error: No such module "Unsubst".
U+0639 24 24 24
Script error: No such module "Lang".
g͟hain
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". g͟h gh U+063A 25 25 25
Script error: No such module "Lang".
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". f f U+0641 26 26 26
Script error: No such module "Lang".
qāf
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". q q U+0642 27 27 27
Script error: No such module "Lang".
kāf
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". k k U+06A9 28 28 28
Script error: No such module "Lang".
gāf
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". g g U+06AF 29 29 29
Script error: No such module "Lang".
lām
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". l l U+0644 30 30 30
Script error: No such module "Lang".
mīm
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". m m U+0645 31 31 31
Script error: No such module "Lang".
nūn
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". n n U+0646 32 32 32
Script error: No such module "Lang".
nūn g͟hunnā
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA".
Template:Efn-ua
n U+06BA
U+0658
Template:Efn-ua
Template:Efn-ua 32a 33
Script error: No such module "Lang".
Template:Hamzao
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". v,
ū, u, o, au
w,
ū, u, o, au
U+0648 33 33 34
Script error: No such module "Lang".
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". h, ā, e h, ā, e U+06C1
Template:Efn-ua
34 34 35
Script error: No such module "Lang".
choṭī hē
34a
Script error: No such module "Lang".
do-cashmī hē
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". or Script error: No such module "IPA".
Template:Efn-ua
h h U+06BE 35 34b 36
Script error: No such module "Lang".
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". y, ī, á y, ī, á U+06CC 36 35 38
Script error: No such module "Lang".
baṛī yē
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA".
Template:Efn-ua
ai, e ai, e U+06D2 37 35b 39
Script error: No such module "Lang".
hamzah
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "IPA". or silent
Template:Efn-ua
Template:Hamza, –, yi Template:Hamza, –, yi U+0626 35a 37
Template:Efn-ua
Script error: No such module "Lang". U+0621 0

Footnotes: Template:Notelist-ua

Additional characters and variations

Arabic Tāʼ marbūṭah

Script error: No such module "anchor". Tāʼ marbūṭah is also sometimes considered the 40th letter of the Urdu alphabet, though it is rarely used except for in certain loan words from Arabic. Tāʼ marbūṭah is regarded as a form of tā, the Arabic version of Urdu tē, but it is not pronounced as such, and when replaced with an Urdu letter in naturalised loan words it is usually replaced with Gol hē.

Table

Table of additional characters and variations
Group Letter Template:Efn-ua Name (see: Glossary of key words) Unicode [8][9]
Nastaliq
Template:Efn-ua
Naskh with
diacritics
Roman Urdu or English[1][7]
Alif Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang".
[7]
alif maddah
[7]Template:Efn-ua
U+0622
alef with madda above [9]
Hamza Template:Efn-ua Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang".
[7]
hamzah U+0621
hamza [9]
Script error: No such module "Lang". Script error: No such module "Lang". hamza on the line
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". hamza diacritic
Template:Efn-uaTemplate:Efn-ua
U+0654
Hamza Above
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang".
[7]
hamzah U+0626
yeh with hamza above [9]
Script error: No such module "Lang". Script error: No such module "Lang". yē hamza / alif hamza
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". baṛī yē hamza U+06D3
yeh barree with hamza above [8]
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang".
[7]
vāv-e mahmūz
[7]
U+0624
waw with hamza above [9]
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". U+06C2
heh goal with hamza above [8]
or U+06C1 + U+0654
Arabic Template:Efn-ua Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". Template:Langx Template:Langx tāʼ marbūṭah
"bound ta"
U+06C3
teh marbuta goal [8]
Script error: No such module "Lang". Script error: No such module "Lang". Script error: No such module "Lang". U+0629
teh marbuta [9]
Script error: No such module "Lang". Script error: No such module "Lang". Template:Langx Template:Langx tāʼ maftūḥah
"open ta"
U+062A
Teh

Footnotes: Template:Notelist-ua

Hamza in Nastaliq

Script error: No such module "anchor". Hamza can be difficult to recognise in Urdu handwriting and fonts designed to replicate it, closely resembling two dots above as featured in Script error: No such module "Lang". Té and Script error: No such module "Lang". Qaf, whereas in Arabic and Geometric fonts it is more distinct and closely resembles the western form of the numeral 2 (two).

Digraphs

The digraphs of aspirated consonants are as follows.
Digraph[5] Transcription[5] IPA Examples
بھ bh Script error: No such module "IPA". بھاری
پھ ph Script error: No such module "IPA". پھول
تھ th Script error: No such module "IPA". تھیلا
ٹھ ṭh Script error: No such module "IPA". ٹھنڈا
جھ jh Script error: No such module "IPA". جھاڑی
چھ chh Script error: No such module "IPA". چھتری
دھ dh Script error: No such module "IPA". دھوبی
ڈھ ḍh Script error: No such module "IPA". ڈھول
رھ rh Script error: No such module "IPA". تیرھواں
ڑھ ṛh Script error: No such module "IPA". اڑھائی
کھ kh Script error: No such module "IPA". کھانسی
گھ gh Script error: No such module "IPA". گھوڑا
لھ lh Script error: No such module "IPA". دولھا (alternative of دُلہا)
مھ mh Script error: No such module "IPA". تمھیں
نھ nh Script error: No such module "IPA". ننھا

A separate do-chashmi-he letter, Script error: No such module "Lang"., exists to denote a /ʰ/ or a /ʱ/. This letter is mainly used as part of the multitude of digraphs, detailed in above.

Differences from the Persian alphabet

Urdu has more letters added to the Perso-Arabic base to represent sounds not present in Persian, which already has additional letters added to the Arabic base itself to represent sounds not present in Arabic. The letters added are shown in the table below:

Template:Static row numbers

Letter IPA
Script error: No such module "Lang". /ʈ/
Script error: No such module "Lang". /ɖ/
Script error: No such module "Lang". /ɽ/
Script error: No such module "Lang". /◌̃/
Script error: No such module "Lang". /ɛ:/ or /e:/.

Retroflex letters

File:Hindustani Urdu retroflex letter T.svg

Old Hindustani used four dots Script error: No such module "Lang". over three Arabic letters Script error: No such module "Lang". to represent retroflex consonants.[10] In handwriting those dots were often written as a small vertical line attached to a small triangle. Subsequently, this shape became identical to a small letter Script error: No such module "Lang". t̤oʼē.[11] It is commonly and erroneously assumed that ṭāʾ itself was used to indicate retroflex consonants because of it being an emphatic alveolar consonant that Arabic scribes thought approximated the Hindustani retroflexes.Script error: No such module "Unsubst". In modern Urdu, called to'e is always pronounced as a dental, not a retroflex. Script error: No such module "Unsubst".

Vowels

The Urdu language has ten vowels and ten nasalized vowels. Each vowel has four forms depending on its position: initial, middle, final and isolated. Like in its parent Arabic alphabet, Urdu vowels are represented using a combination of digraphs and diacritics. Alif, Waw, Ye, He and their variants are used to represent vowels.

Vowel chart

Urdu does not have standalone vowel letters. Short vowels (a, i, u) are represented by optional diacritics (zabar, zer, pesh) upon the preceding consonant or a placeholder consonant (alif, ain, or hamzah) if the syllable begins with the vowel, and long vowels by consonants alif, ain, ye, and wa'o as matres lectionis, with disambiguating diacritics, some of which are optional (zabar, zer, pesh), whereas some are not (madd, hamzah). Urdu does not have short vowels at the end of words. This is a table of Urdu vowels:

Romanization Pronunciation Final Middle Initial
a Script error: No such module "IPA". N/A ـَ اَ
ā Script error: No such module "IPA". ـَا، ـَی، ـَہ ـَا آ
i Script error: No such module "IPA". N/A ـِ اِ
ī Script error: No such module "IPA". ـِى ـِیـ اِیـ
e Script error: No such module "IPA". ـےTemplate:Popdf ـیـ ایـ
ai Script error: No such module "IPA". ـَےTemplate:Popdf ـَیـ اَیـ
u Script error: No such module "IPA". N/A ـُ اُ
ū Script error: No such module "IPA". ـُو اُو
o Script error: No such module "IPA". ـو او
au Script error: No such module "IPA". ـَو اَو

Alif

Alif is the first letter of the Urdu alphabet, and it is used exclusively as a vowel. At the beginning of a word, alif can be used to represent any of the short vowels: اب ab, اسم ism, اردو Urdū. For long ā at the beginning of words alif-mad is used: آپ āp, but a plain alif in the middle and at the end: بھاگنا bhāgnā.

Wāʾo

Wāʾo is used to render the vowels "ū", "o", "u" and "au" ([uː], [oː], [ʊ] and [ɔː] respectively), and it is also used to render the labiodental approximant, [ʋ]. Only when preceded by the consonant k͟hē (خ), can wāʾo render the "u" ([ʊ]) sound (such as in خود, "k͟hud" - myself), or not pronounced at all (such as in خواب, "k͟haab" - dream). This is known as the silent wāʾo, and is only present in words loaned from Persian.[12]

Ye

Ye is divided into two variants: choṭī ye ("little ye") and baṛī ye ("big ye").

Choṭī ye (ی) is written in all forms exactly as in Persian. It is used for the long vowel "ī" and the consonant "y".

Baṛī ye (ے) is used to render the vowels "e" and "ai" (Script error: No such module "IPA". and Script error: No such module "IPA". respectively). Baṛī ye is distinguishable in writing from choṭī ye only when it comes at the end of a word/ligature. Additionally, Baṛī ye is never used to begin a word/ligature, unlike choṭī ye.

Letter's name Final Form Middle Form Initial Form Isolated Form
چھوٹی يے
Choṭī ye
ـی ـیـ یـ ی
بڑی يے
Baṛī ye
ـے ے

The 2 he's

He is divided into two variants: gol he ("round he") and do-cašmi he ("two-eyed he").

Gol he (ہ) is written round and zigzagged, and can impart the "h" (Script error: No such module "IPA".) sound anywhere in a word. Additionally, at the end of a word, it can be used to render the long "a" or the "e" vowels (Script error: No such module "IPA". or Script error: No such module "IPA".), which also alters its form slightly (on modern digital writing systems, this final form is achieved by writing two he's consecutively).

Do-cašmi he (ھ) is written as in Arabic Naskh style (as a loop), in order to create the aspirate consonants and write Arabic words.

Letter's name Final Form Middle Form Initial Form Isolated Form
گول ہے
Gol he
ـہ ـہـ ہـ ہ
دو چشمی ہے
Do-cašmi he
ـھ ـھـ ھـ ھ

Ayn

Ayn in its initial and final position is silent in pronunciation and is replaced by the sound of its preceding or succeeding vowel.

Nun Ghunnah

Vowel nasalization is represented by nun ghunna written after their non-nasalized versions, for example: ہَے when nasalized would become ہَیں. In middle form nun ghunna is written just like nun and is differentiated by a diacritic called Script error: No such module "Lang". or ulta jazm which is a superscript V symbol above the ن٘.

Examples:

Form Urdu Transcription
Orthography Script error: No such module "Lang". Script error: No such module "lang".
End form Script error: No such module "Lang". Script error: No such module "lang".
Middle form Script error: No such module "Lang". Script error: No such module "lang".

Diacritics

Urdu uses the same subset of diacritics used in Arabic based on Persian conventions. Urdu also uses Persian names of the diacritics instead of Arabic names. Commonly used diacritics are zabar (Arabic fatḥah), zer (Arabic kasrah), pesh (Arabic dammah) which are used to clarify the pronunciation of vowels, as shown above. Jazam (ـْـ, Arabic sukun) is used to indicate a consonant cluster and tashdid (ـّـ, Arabic shaddah) is used to indicate a gemination, although it is never used for verbs, which require double consonants to be spelled out separately. Other diacritics include khari zabar (Arabic dagger alif), do zabar (Arabic fathatan) which are found in some common Arabic loan words. Other Arabic diacritics are also sometimes used though very rarely in loan words from Arabic. Zer-e-izafat and hamzah-e-izafat are described in the next section.

Other than common diacritics, Urdu also has special diacritics, which are often found only in dictionaries for the clarification of irregular pronunciation. These diacritics include kasrah-e-majhool, fathah-e-majhool, dammah-e-majhool, Script error: No such module "Lang"., ulta jazam, alif-e-wavi and some other very rare diacritics. Among these, only Script error: No such module "Lang". is used commonly in dictionaries and has a Unicode representation at U+0658. Other diacritics are only rarely written in printed form, mainly in some advanced dictionaries.[13]

Iẓāfat

Iẓāfat is a syntactical construction of two nouns, where the first component is a determined noun, and the second is a determiner. This construction was borrowed from Persian. A short vowel "i" is used to connect these two words, and when pronouncing the newly formed word the short vowel is connected to the first word. If the first word ends in a consonant or an ʿain (ع), it may be written as zer (  ِ ) at the end of the first word, but usually is not written at all. If the first word ends in choṭī he (ہ) or ye (ی or ے) then hamzā (ء) is used above the last letter (ۂ or ئ or ۓ). If the first word ends in a long vowel (ا or و), then a different variation of baṛī ye (ے) with hamzā on top (ئے, obtained by adding ے to ئ) is added at the end of the first word.Template:Sfn

Forms Example Transliteration Meaning
ـ◌ِ شیرِ پنجاب sher-e-Panjāb the lion of Punjab
ۂ ملکۂ دنیا malikā-e-dunyā the queen of the world
ئ ولئ کامل walī-e-kāmil perfect saint
ـئے مئے عشق mai-e-ishq the wine of love
ئے روئے زمین rū-'e-zamīn the surface of the Earth
صدائے بلند sadā-'e-buland a high voice

Computers and the Urdu alphabet

In the early days of computers, Urdu was not properly represented on any code page. One of the earliest code pages to represent Urdu was IBM Code Page 868 which dates back to 1990.[14] Other early code pages which represented Urdu alphabets were Windows-1256 and MacArabic encoding both of which date back to the mid-1990s. In Unicode, Urdu is represented inside the Arabic block. Another code page for Urdu, which is used in India, is Perso-Arabic Script Code for Information Interchange. In Pakistan, the 8-bit code page which is developed by National Language Authority is called Urdu Zabta Takhti (اردو ضابطہ تختی) (UZT)[15] which represents Urdu in its most complete form including some of its specialized diacritics, though UZT is not designed to coexist with the Latin alphabet.

Encoding Urdu in Unicode

Script error: No such module "anchor".

Confusable glyphs in Urdu and Arabic script
Characters
in Urdu
Characters
in Arabic
Template:Nq (U+06C1)
Template:Nq (U+06BE)
Script error: No such module "Lang". (U+0647)
Template:Nq (U+06CC) Template:Nq (U+0649)
Script error: No such module "Lang". (U+064A)
Template:Nq (U+06A9) Script error: No such module "Lang". (U+0643)

Like other writing systems derived from the Arabic script, Urdu uses the 0600–06FF Unicode range.[16] Certain glyphs in this range appear visually similar (or identical when presented using particular fonts) even though the underlying encoding is different. This presents problems for information storage and retrieval. For example, the University of Chicago's electronic copy of John Shakespear's "A Dictionary, Hindustani, and English"[17] includes the word 'Script error: No such module "Lang".' (bhārat "India"). Searching for the string "Script error: No such module "Lang"." returns no results, whereas querying with the (identical-looking in many fonts) string "Script error: No such module "Lang"." returns the correct entry.[18] This is because the medial form of the Urdu letter do chashmi he (U+06BE)—used to form aspirate digraphs in Urdu—is visually identical in its medial form to the Arabic letter hāʾ (U+0647; phonetic value Script error: No such module "IPA".). In Urdu, the Script error: No such module "IPA". phoneme is represented by the character U+06C1, called gol he (round he), or chhoti he (small he).

In 2003, the Center for Research in Urdu Language Processing (CRULP)[19]—a research organisation affiliated with Pakistan's National University of Computer and Emerging Sciences—produced a proposal for mapping from the 1-byte UZT encoding of Urdu characters to the Unicode standard.[20] This proposal suggests a preferred Unicode glyph for each character in the Urdu alphabet.

Software

The Daily Jang was the first Urdu newspaper to be typeset digitally in Nastaʻliq by computer. There are efforts underway to develop more sophisticated and user-friendly Urdu support on computers and on the Internet. Nowadays, nearly all Urdu newspapers, magazines, journals and periodicals are composed on computers via various Urdu software programmes, the most widespread of which is InPage Desktop Publishing package. Microsoft has included Urdu language support in all new versions of Windows and both Windows Vista and Microsoft Office 2007 are available in Urdu through Language Interface Pack[21] support. Most Linux Desktop distributions allow the easy installation of Urdu support and translations as well.[22] Apple implemented the Urdu language keyboard across Mobile devices in its iOS 8 update in September 2014.[23]

Romanization standards and systems

Script error: No such module "Labelled list hatnote". There are several romanization standards for writing Urdu with the Latin alphabet, though they are not very popular because most fall short of representing the Urdu language properly. Instead of standard romanization schemes, people on Internet, mobile phones and media often use a non-standard form of romanization which tries to mimic English orthography. The problem with this kind of romanization is that it can only be read by native speakers, and even for them with great difficulty. Among standardized romanization schemes, the most accurate is ALA-LC romanization, which is also supported by National Language Authority. Other romanization schemes are often rejected because either they are unable to represent sounds in Urdu properly, or they often do not take regard of Urdu orthography, and favor pronunciation over orthography.[24]

The National Language Authority of Pakistan has developed a number of systems with specific notations to signify non-English sounds, but these can only be properly read by someone already familiar with the loan letters.Script error: No such module "Unsubst".

Roman Urdu also holds significance among the Christians of Pakistan and North India. Urdu was the dominant native language among Christians of Karachi and Lahore in present-day Pakistan and Madhya Pradesh, Uttar Pradesh Rajasthan in India, during the early part of the 19th and 20th century, and is still used by Christians in these places. Pakistani and Indian Christians often used the Roman script for writing Urdu. Thus Roman Urdu was a common way of writing among Pakistani and Indian Christians in these areas up to the 1960s. The Bible Society of India publishes Roman Urdū Bibles that enjoyed sale late into the 1960s (though they are still published today). Church songbooks are also common in Roman Urdu. However, the usage of Roman Urdu is declining with the wider use of Hindi and English in these states.

Glossary of key words from letter names

Script error: No such module "anchor".

Translations and other uses of key words from Urdu letter names
Letter name(s) Urdu word Examples of other uses
Isolated
form
Urdu
name
Roman Urdu Urdu IPA Roman Urdu
name
English Translation Urdu Roman Urdu or IPA Translation
Script error: No such module "Lang". Script error: No such module "Lang". baṛī ħē Script error: No such module "Lang". Template:Ipa[25] baṛī /
bari
big / elder[25] Script error: No such module "Lang". Baṛi ant large intestine
Script error: No such module "Lang". Script error: No such module "Lang". baṛī yē Script error: No such module "Lang". Ant intestine
Script error: No such module "Lang". Script error: No such module "Lang". čhōṭī yē Script error: No such module "Lang". Template:Ipa[25] choti small / minor / junior[25]
Script error: No such module "Lang". Script error: No such module "Lang". čhōṭī hē Script error: No such module "Lang". small intestine
Script error: No such module "Lang". gōl hē Script error: No such module "Lang". Template:Ipa[25] gōl round / spherical / vague / silly / obese[26] Script error: No such module "Lang". gol gappay panipuri
Script error: No such module "Lang". Script error: No such module "Lang". dō-čašmī hē Script error: No such module "Lang". Template:Ipa do-cashmī two-eyed
Script error: No such module "Unsubst".
Script error: No such module "Lang". do-cashmi

dorabīn

binoculars
Script error: No such module "Lang". dorabīn telescope
Script error: No such module "Lang". do 2 / two Script error: No such module "Lang". do ayvanīt bicameralism
Script error: No such module "Lang". Template:Ipa[25] chashm the eye / hope / expectation[26] Script error: No such module "Lang". cashm eye
Script error: No such module "Lang". Script error: No such module "Lang". nūn-e ğunnah Script error: No such module "Lang". Template:Ipa[25] ğunnahScript error: No such module "String"./ g͟hunnah nasal sound or twang[25] Script error: No such module "Unsubst".
Script error: No such module "Lang". Script error: No such module "Lang". alif maddah Script error: No such module "Lang". maddah Arabic: Script error: No such module "Unsubst".
Script error: No such module "Lang". Script error: No such module "Lang". vāv-e mahmūz Script error: No such module "Lang". Template:Ipa[25] mahmūz defective / improper[25] Script error: No such module "Unsubst".
Template:Uninastaliq Script error: No such module "Lang".
[27]
harūf tahajī (alphabet) Script error: No such module "Lang". Template:Ipa tahajī sequence
Script error: No such module "Unsubst".
Script error: No such module "Unsubst".
Script error: No such module "Lang". Template:Ipa[25] harūf letters (plural)[25]
(often referred to as "alphabets" in informal Pakistani English)
Script error: No such module "Unsubst".
Script error: No such module "Lang". Template:Ipa[25] harf "letter of the alphabet" / handwriting / statement / blame / stigma[25] Script error: No such module "Unsubst".

See also

References

<templatestyles src="Reflist/styles.css" />

  1. a b Script error: No such module "citation/CS1".
  2. a b Script error: No such module "citation/CS1".
  3. Script error: No such module "citation/CS1".
  4. Script error: No such module "citation/CS1".
  5. a b c Script error: No such module "citation/CS1".
  6. Geographical Names Romanization in Pakistan. UNGEGN, 18th Session. Geneva, 12–23 August 1996. Working Papers No. 85 and No. 85 Add. 1.
  7. a b c d e f g h Script error: No such module "citation/CS1".
  8. a b c d Script error: No such module "citation/CS1".
  9. a b c d e f Script error: No such module "citation/CS1".
  10. Script error: No such module "citation/CS1".
  11. Script error: No such module "citation/CS1".
  12. Script error: No such module "citation/CS1".
  13. Script error: No such module "citation/CS1".
  14. "IBM 868 code page"
  15. Script error: No such module "citation/CS1".
  16. Script error: No such module "citation/CS1".
  17. Script error: No such module "citation/CS1".
  18. Script error: No such module "citation/CS1".
  19. Script error: No such module "citation/CS1".
  20. Template:Webarchive
  21. Script error: No such module "citation/CS1".
  22. Script error: No such module "citation/CS1".
  23. Script error: No such module "citation/CS1".
  24. Script error: No such module "citation/CS1".
  25. a b c d e f g h i j k l m n Script error: No such module "citation/CS1".
  26. a b Script error: No such module "citation/CS1".Script error: No such module "Unsubst".Template:Cbignore
  27. Script error: No such module "citation/CS1".

Script error: No such module "Check for unknown parameters".

Sources

  • Script error: No such module "citation/CS1".
  • Script error: No such module "citation/CS1".
  • Script error: No such module "citation/CS1".
  • Script error: No such module "citation/CS1".

External links

Template:Sister project

Script error: No such module "Navbox". Template:Arabic alphabets Template:Languages of Pakistan