Arabic script in Unicode

From Wikipedia, the free encyclopedia

Many scripts in Unicode, such as Arabic, have special orthographic rules that require certain combinations of letterforms to be joined into special ligature forms. In English, the common ampersand (&) developed from a ligature in which the handwritten Latin letters e and t (spelling et, Latin for and) were combined.[1] The rules governing ligature formation in Arabic can be quite complex, requiring special script-shaping technologies such as the Arabic Calligraphic Engine by Thomas Milo's DecoType.[2]

The Arabic script is contained in several Unicode blocks, described below.[3]

The basic Arabic range encodes the standard letters, the most common diacritics, and the Arabic-Indic digits, but does not encode contextual forms; U+0621–U+0652 is based directly on ISO 8859-6. The Arabic Supplement range encodes letter variants mostly used for writing African (non-Arabic) languages. The Arabic Extended-B and Arabic Extended-A ranges encode additional Qur'anic annotations and letter variants used for various non-Arabic languages. The Arabic Presentation Forms-A range encodes contextual forms and ligatures of letter variants needed for Persian, Urdu, Sindhi and Central Asian languages. The Arabic Presentation Forms-B range encodes spacing forms of Arabic diacritics and more contextual letter forms. The presentation forms are present only for compatibility with older standards and are not currently needed for coding text.[4] The Arabic Mathematical Alphabetic Symbols block encodes characters used in Arabic mathematical expressions. The Indic Siyaq Numbers block contains a specialized subset of Arabic script that was used for accounting in India under the Mughal Empire by the 17th century and remained in use through the middle of the 20th century.[5][6] The Ottoman Siyaq Numbers block contains a specialized subset of Arabic script, also known as Siyakat numbers, used for accounting in Ottoman Turkish documents.[6]
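As an illustration of how the blocks partition the code space, the following minimal Python sketch looks up the block of a character. The ranges are typed in by hand from the Unicode block list and cover only the blocks named above; the names `ARABIC_BLOCKS` and `arabic_block` are invented for this example.

```python
# Block ranges copied by hand from the Unicode block list; only the
# blocks named in the paragraph above are included (an assumption of
# this sketch, not a complete list of Arabic-script blocks).
ARABIC_BLOCKS = [
    (0x0600, 0x06FF, "Arabic"),
    (0x0750, 0x077F, "Arabic Supplement"),
    (0x0870, 0x089F, "Arabic Extended-B"),
    (0x08A0, 0x08FF, "Arabic Extended-A"),
    (0xFB50, 0xFDFF, "Arabic Presentation Forms-A"),
    (0xFE70, 0xFEFF, "Arabic Presentation Forms-B"),
    (0x1EC70, 0x1ECBF, "Indic Siyaq Numbers"),
    (0x1ED00, 0x1ED4F, "Ottoman Siyaq Numbers"),
    (0x1EE00, 0x1EEFF, "Arabic Mathematical Alphabetic Symbols"),
]


def arabic_block(ch):
    """Return the name of the listed block containing ch, or None."""
    cp = ord(ch)
    for start, end, name in ARABIC_BLOCKS:
        if start <= cp <= end:
            return name
    return None


if __name__ == "__main__":
    for ch in ("\u0628", "\u0750", "\ufe91", "A"):
        print(f"U+{ord(ch):04X}", arabic_block(ch))
```

A linear scan is enough for nine ranges; a larger table would more likely use a sorted list with binary search.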

Contextual forms

The table below covers the basic alphabet used in Modern Standard Arabic and shows the form each letter is expected to take in different contexts. Code points listed as contextual forms "should not be used in general interchange". Unicode has other methods of encoding the difference if necessary, such as the zero-width joiner.

General Unicode  Isolated  Final (End)  Medial (Middle)  Initial (Beginning)  Name
0627  FE8D  FE8E  -  -  Alef
0628  FE8F  FE90  FE92  FE91  Beh
062A  FE95  FE96  FE98  FE97  Teh
062B  FE99  FE9A  FE9C  FE9B  Theh
062C  FE9D  FE9E  FEA0  FE9F  Jeem
062D  FEA1  FEA2  FEA4  FEA3  Hah
062E  FEA5  FEA6  FEA8  FEA7  Khah
062F  FEA9  FEAA  -  -  Dal
0630  FEAB  FEAC  -  -  Thal
0631  FEAD  FEAE  -  -  Reh
0632  FEAF  FEB0  -  -  Zain
0633  FEB1  FEB2  FEB4  FEB3  Seen
0634  FEB5  FEB6  FEB8  FEB7  Sheen
0635  FEB9  FEBA  FEBC  FEBB  Sad
0636  FEBD  FEBE  FEC0  FEBF  Dad
0637  FEC1  FEC2  FEC4  FEC3  Tah
0638  FEC5  FEC6  FEC8  FEC7  Zah
0639  FEC9  FECA  FECC  FECB  Ain
063A  FECD  FECE  FED0  FECF  Ghain
0641  FED1  FED2  FED4  FED3  Feh
0642  FED5  FED6  FED8  FED7  Qaf
0643  FED9  FEDA  FEDC  FEDB  Kaf
0644  FEDD  FEDE  FEE0  FEDF  Lam
0645  FEE1  FEE2  FEE4  FEE3  Meem
0646  FEE5  FEE6  FEE8  FEE7  Noon
0647  FEE9  FEEA  FEEC  FEEB  Heh
0648  FEED  FEEE  -  -  Waw
064A  FEF1  FEF2  FEF4  FEF3  Yeh
0622  FE81  FE82  -  -  Alef With Madda Above
0629  FE93  FE94  -  -  Teh Marbuta
0649  FEEF  FEF0  -  -  Alef Maksura
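The link between a general code point and its contextual forms is recorded in the Unicode Character Database as a tagged compatibility decomposition. The following minimal Python sketch reads that tag with the standard `unicodedata` module; the helper name `contextual_form_info` is invented for this example.

```python
import unicodedata


def contextual_form_info(ch):
    """Return (form, base) for a tagged compatibility character, else None.

    For Arabic presentation forms the tag is one of 'isolated', 'final',
    'medial' or 'initial', and base is the general letter (or letters).
    """
    decomp = unicodedata.decomposition(ch)  # e.g. '<initial> 0628'
    if not decomp.startswith("<"):
        return None  # no decomposition, or a canonical (untagged) one
    tag, *code_points = decomp.split()
    base = "".join(chr(int(cp, 16)) for cp in code_points)
    return tag.strip("<>"), base


if __name__ == "__main__":
    for cp in (0xFE8F, 0xFE90, 0xFE92, 0xFE91):
        form, base = contextual_form_info(chr(cp))
        print(f"U+{cp:04X} is the {form} form of U+{ord(base):04X}")
```

Text stored with the general code points leaves the choice among these forms to the rendering engine, which is why the presentation forms are not needed in general interchange.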

Punctuation and ornaments

Only the Arabic question mark ⟨؟⟩ and the Arabic comma ⟨،⟩ are used in regular Arabic script typing, and the Arabic comma is often replaced by the Latin-script comma ⟨,⟩, which is also used as the decimal separator when the Eastern Arabic numerals are used (e.g. ⟨100.6⟩ compared to ⟨١٠٠,٦⟩).
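A minimal Python sketch of reading such a number follows. It assumes that a Latin comma, a full stop, or U+066B always marks the decimal point and that U+066C is a thousands separator to be skipped; the function name `parse_number` is invented for this example.

```python
import unicodedata

# Assumption of this sketch: these characters always mark the decimal point.
DECIMAL_SEPARATORS = {",", ".", "\u066b"}  # U+066B ARABIC DECIMAL SEPARATOR
# Assumption of this sketch: these characters only group digits.
THOUSANDS_SEPARATORS = {"\u066c"}          # U+066C ARABIC THOUSANDS SEPARATOR


def parse_number(text):
    """Parse a number written with ASCII, Arabic-Indic or Extended Arabic-Indic digits."""
    parts = []
    for ch in text:
        if ch in DECIMAL_SEPARATORS:
            parts.append(".")
        elif ch in THOUSANDS_SEPARATORS:
            continue
        else:
            # unicodedata.decimal maps any decimal digit to 0-9 and
            # raises ValueError for every other character.
            parts.append(str(unicodedata.decimal(ch)))
    return float("".join(parts))


if __name__ == "__main__":
    print(parse_number("\u0661\u0660\u0660,\u0666"))  # Arabic-Indic digits, Latin comma
```

Treating every comma as a decimal point is only safe for input known to follow the convention described above; mixed-convention data would need locale information.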

Word ligatures

Arabic Presentation Forms-A has a few characters defined as "word ligatures" for terms frequently used in formulaic expressions in Arabic. They are rarely used outside professional liturgical typesetting; likewise, the rial sign is normally written out in full rather than with the ligature.
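Where a word ligature has a compatibility decomposition, NFKC normalization expands it into the ordinary letters. The following minimal Python sketch uses the standard `unicodedata` module; the helper name `describe_word_ligature` is invented for this example, and not every word ligature is guaranteed to expand.

```python
import unicodedata


def describe_word_ligature(ch):
    """Return the character name and the code points of its NFKC expansion."""
    name = unicodedata.name(ch)
    expansion = unicodedata.normalize("NFKC", ch)
    return name, [f"U+{ord(c):04X}" for c in expansion]


if __name__ == "__main__":
    for cp in (0xFDF2, 0xFDFC):
        name, letters = describe_word_ligature(chr(cp))
        print(f"U+{cp:04X} {name}: {' '.join(letters)}")
```

A character without a decomposition comes back unchanged, so the returned list then holds only its own code point.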

Code blocks


Arabic

Character table

Code Result Unicode name
U+0600 ؀ Arabic Number Sign
U+0601 ؁ Arabic Sign Sanah
U+0602 ؂ Arabic Footnote Marker
U+0603 ؃ Arabic Sign Safha
U+0604 ؄ Arabic Sign Samvat

used for writing Samvat era dates in Urdu

U+0605 ؅ Arabic Number Mark Above

may be used with Coptic Epact numbers

U+0606 ؆ Arabic-Indic Cube Root

→ U+221B ∛ Cube Root

U+0607 ؇ Arabic-Indic Fourth Root

→ U+221C ∜ Fourth Root

U+0608 ؈ Arabic Ray
U+0609 ؉ Arabic-Indic Per Mille Sign

→ U+2030 ‰ Per Mille Sign

U+060A ؊ Arabic-Indic Per Ten Thousand Sign

→ U+2031 ‱ Per Ten Thousand Sign

U+060B ؋ Afghani Sign
U+060C ، Arabic Comma

also used with Thaana and Syriac in modern text

→ U+002C , Comma

→ U+2E32 ⸲ Turned Comma

→ U+2E41 ⹁ Reversed Comma

U+060D ؍ Arabic Date Separator
U+060E ؎ Arabic Poetic Verse Sign
U+060F ؏ Arabic Sign Misra
U+0610 ؐ Arabic Sign Sallallahou Alayhe Wassallam

represents sallallahu alayhe wasallam "may God's peace and blessings be upon him"

U+0611 ؑ Arabic Sign Alayhe Assallam

represents alayhe assalam "upon him be peace"

U+0612 ؒ Arabic Sign Rahmatullah Alayhe

represents rahmatullah alayhe "may God have mercy upon him"

U+0613 ؓ Arabic Sign Radi Allahou Anhu

represents radi allahu 'anhu "may God be pleased with him"

U+0614 ؔ Arabic Sign Takhallus

sign placed over the name or nom-de-plume of a poet, or in some writings used to mark all proper names

U+0615 ؕ Arabic Small High Tah

marks a recommended pause position in some Qurans published in Iran and Pakistan

should not be confused with the small TAH sign used as a diacritic for some letters such as 0679

U+0616 ؖ Arabic Small High Ligature Alef With Lam With Yeh

early Persian

= Arabic Small High Ligature Alef With Yeh Barree

U+0617 ؗ Arabic Small High Zain
U+0618 ؘ Arabic Small Fatha

should not be confused with 064E Fatha

U+0619 ؙ Arabic Small Damma

should not be confused with 064F Damma

U+061A ؚ Arabic Small Kasra

should not be confused with 0650 Kasra

U+061B ؛ Arabic Semicolon

also used with Thaana and Syriac in modern text

→ U+003B ; Semicolon

→ U+204F ⁏ Reversed Semicolon

→ U+2E35 ⸵ Turned Semicolon

U+061C ALM Arabic Letter Mark (ALM)
U+061D ؝ Arabic End Of Text Mark
U+061E ؞ Arabic Triple Dot Punctuation Mark
U+061F ؟ Arabic Question Mark

also used with Thaana and Syriac in modern text

→ U+003F ? Question Mark

→ U+2E2E ⸮ Reversed Question Mark

U+0620 ؠ Arabic Letter Kashmiri Yeh
U+0621 ء Arabic Letter Hamza

→ U+02BE ʾ Modifier Letter Right Half Ring

U+0622 آ Arabic Letter Alef With Madda Above

≡ آ U+0627 U+0653

U+0623 أ Arabic Letter Alef With Hamza Above

≡ أ U+0627 U+0654

U+0624 ؤ Arabic Letter Waw With Hamza Above

≡ ؤ U+0648 U+0654

U+0625 إ Arabic Letter Alef With Hamza Below

≡ إ U+0627 U+0655

U+0626 ئ Arabic Letter Yeh With Hamza Above

in Kyrgyz the hamza is consistently positioned to the top right in isolate and final forms

≡ ئ U+064A U+0654

U+0627 ا Arabic Letter Alef
U+0628 ب Arabic Letter Beh
U+0629 ة Arabic Letter Teh Marbuta
U+062A ت Arabic Letter Teh
U+062B ث Arabic Letter Theh
U+062C ج Arabic Letter Jeem
U+062D ح Arabic Letter Hah
U+062E خ Arabic Letter Khah
U+062F د Arabic Letter Dal
U+0630 ذ Arabic Letter Thal
U+0631 ر Arabic Letter Reh
U+0632 ز Arabic Letter Zain
U+0633 س Arabic Letter Seen
U+0634 ش Arabic Letter Sheen
U+0635 ص Arabic Letter Sad
U+0636 ض Arabic Letter Dad
U+0637 ط Arabic Letter Tah
U+0638 ظ Arabic Letter Zah
U+0639 ع Arabic Letter Ain

→ U+01B9 ƹ Latin Small Letter Ezh Reversed

→ U+02BF ʿ Modifier Letter Left Half Ring

U+063A غ Arabic Letter Ghain
U+063B ػ Arabic Letter Keheh With Two Dots Above
U+063C ؼ Arabic Letter Keheh With Three Dots Below
U+063D ؽ Arabic Letter Farsi Yeh With Inverted V

Azerbaijani

U+063E ؾ Arabic Letter Farsi Yeh With Two Dots Above
U+063F ؿ Arabic Letter Farsi Yeh With Three Dots Above
U+0640 ـ Arabic Tatweel

= kashida

inserted to stretch characters or to carry tashkil with no base letter

also used with Adlam, Hanifi Rohingya, Mandaic, Manichaean, Psalter Pahlavi, Sogdian, and Syriac

U+0641 ف Arabic Letter Feh
U+0642 ق Arabic Letter Qaf
U+0643 ك Arabic Letter Kaf
U+0644 ل Arabic Letter Lam
U+0645 م Arabic Letter Meem

Sindhi uses a shape with a short tail

U+0646 ن Arabic Letter Noon
U+0647 ه Arabic Letter Heh
U+0648 و Arabic Letter Waw
U+0649 ى Arabic Letter Alef Maksura

represents YEH-shaped dual-joining letter with no dots in any positional form

not intended for use in combination with 0654

→ U+0626 ئ Arabic Letter Yeh With Hamza Above

U+064A ي Arabic Letter Yeh

loses its dots when used in combination with 0654

retains its dots when used in combination with other combining marks

→ U+08A8 ࢨ Arabic Letter Yeh With Two Dots Below And Hamza Above

U+064B ً Arabic Fathatan
U+064C ٌ Arabic Dammatan

a common alternative form is written as two intertwined dammas, one of which is turned 180 degrees

U+064D ٍ Arabic Kasratan
U+064E َ Arabic Fatha
U+064F ُ Arabic Damma
U+0650 ِ Arabic Kasra
U+0651 ّ Arabic Shadda
U+0652 ْ Arabic Sukun

marks absence of a vowel after the base consonant

used in some Qurans to mark a long vowel as ignored

can have a variety of shapes, including a circular one and a shape that looks like '06E1'

→ U+06E1 ۡ Arabic Small High Dotless Head Of Khah

U+0653 ٓ Arabic Maddah Above

used for madd jaa'iz in South Asian and Indonesian orthographies

→ U+089C ࢜ Arabic Madda Waajib

→ U+089E ࢞ Arabic Doubled Madda

→ U+089F ࢟ Arabic Half Madda Over Madda

U+0654 ٔ Arabic Hamza Above

restricted to hamza and ezafe semantics

is not used as a diacritic to form new letters

U+0655 ٕ Arabic Hamza Below
U+0656 ٖ Arabic Subscript Alef
U+0657 ٗ Arabic Inverted Damma

Kashmiri, Urdu, Swahili, Somali

U+0658 ٘ Arabic Mark Noon Ghunna

Baluchi

indicates nasalization in Urdu

U+0659 ٙ Arabic Zwarakay

Pashto

U+065A ٚ Arabic Vowel Sign Small V Above

African languages

U+065B ٛ Arabic Vowel Sign Inverted Small V Above

African languages

U+065C ٜ Arabic Vowel Sign Dot Below

African languages

also used in Quranic text in African and other orthographies

U+065D ٝ Arabic Reversed Damma

African languages

U+065E ٞ Arabic Fatha With Two Dots

Kalami

U+065F ٟ Arabic Wavy Hamza Below

Kashmiri

U+0660 ٠ Arabic-Indic Digit Zero
U+0661 ١ Arabic-Indic Digit One
U+0662 ٢ Arabic-Indic Digit Two
U+0663 ٣ Arabic-Indic Digit Three
U+0664 ٤ Arabic-Indic Digit Four
U+0665 ٥ Arabic-Indic Digit Five
U+0666 ٦ Arabic-Indic Digit Six
U+0667 ٧ Arabic-Indic Digit Seven
U+0668 ٨ Arabic-Indic Digit Eight
U+0669 ٩ Arabic-Indic Digit Nine
U+066A ٪ Arabic Percent Sign

→ U+0025 % Percent Sign

U+066B ٫ Arabic Decimal Separator

the ordinary comma is most commonly used instead

→ U+002C , Comma

U+066C ٬ Arabic Thousands Separator

the Arabic comma is most commonly used instead

→ U+060C ، Arabic Comma

→ U+0027 ' Apostrophe

→ U+2019 ’ Right Single Quotation Mark

U+066D ٭ Arabic Five Pointed Star

appearance rather variable

→ U+002A * Asterisk

U+066E ٮ Arabic Letter Dotless Beh
U+066F ٯ Arabic Letter Dotless Qaf
U+0670 ٰ Arabic Letter Superscript Alef
U+0671 ٱ Arabic Letter Alef Wasla

Quranic Arabic

U+0672 ٲ Arabic Letter Alef With Wavy Hamza Above

Baluchi, Kashmiri

U+0673 ٳ Arabic Letter Alef With Wavy Hamza Below (deprecated)[7]

Kashmiri

This character is deprecated and its use is strongly discouraged; the sequence 0627 065F is the preferred way of encoding this character.

U+0674 ٴ Arabic Letter High Hamza

Kazakh, Jawi

forms digraphs

U+0675 ٵ Arabic Letter High Hamza Alef

preferred spelling is U+0674 U+0627

U+0676 ٶ Arabic Letter High Hamza Waw

preferred spelling is U+0674 U+0648

U+0677 ٷ Arabic Letter U With Hamza Above

preferred spelling is U+0674 U+06C7

U+0678 ٸ Arabic Letter High Hamza Yeh

preferred spelling is U+0674 U+06CC

U+0679 ٹ Arabic Letter Tteh

Urdu

U+067A ٺ Arabic Letter Tteheh

Sindhi

U+067B ٻ Arabic Letter Beeh

Sindhi

U+067C ټ Arabic Letter Teh With Ring

Pashto

U+067D ٽ Arabic Letter Teh With Three Dots Above Downwards

Sindhi

U+067E پ Arabic Letter Peh

Persian, Urdu, ...

U+067F ٿ Arabic Letter Teheh

Sindhi

U+0680 ڀ Arabic Letter Beheh

Sindhi

U+0681 ځ Arabic Letter Hah With Hamza Above

Pashto, Sarikoli

represents the phoneme /dz/

U+0682 ڂ Arabic Letter Hah With Two Dots Vertical Above

not used in modern Pashto

U+0683 ڃ Arabic Letter Nyeh

Sindhi

U+0684 ڄ Arabic Letter Dyeh

Sindhi, historically Bosnian

U+0685 څ Arabic Letter Hah With Three Dots Above

Pashto, Khwarazmian, Sarikoli

represents the phoneme /ts/ in Pashto

U+0686 چ Arabic Letter Tcheh

Persian, Urdu, ...

U+0687 ڇ Arabic Letter Tcheheh

Sindhi

U+0688 ڈ Arabic Letter Ddal

Urdu

U+0689 ډ Arabic Letter Dal With Ring

Pashto

U+068A ڊ Arabic Letter Dal With Dot Below

Sindhi, early Persian, Pegon, Malagasy

U+068B ڋ Arabic Letter Dal With Dot Below And Small Tah

Lahnda

U+068C ڌ Arabic Letter Dahal

Sindhi

U+068D ڍ Arabic Letter Ddahal

Sindhi

U+068E ڎ Arabic Letter Dul

older shape for DUL, now obsolete in Sindhi

Burushaski

U+068F ڏ Arabic Letter Dal With Three Dots Above Downwards

Sindhi

current shape used for DUL

U+0690 ڐ Arabic Letter Dal With Four Dots Above

Old Urdu, not in current use

U+0691 ڑ Arabic Letter Rreh

Urdu

U+0692 ڒ Arabic Letter Reh With Small V

Kurdish

U+0693 ړ Arabic Letter Reh With Ring

Pashto

U+0694 ڔ Arabic Letter Reh With Dot Below

Kurdish, early Persian

U+0695 ڕ Arabic Letter Reh With Small V Below

Kurdish

U+0696 ږ Arabic Letter Reh With Dot Below And Dot Above

Pashto

U+0697 ڗ Arabic Letter Reh With Two Dots Above

Dargwa

U+0698 ژ Arabic Letter Jeh

Persian, Urdu, ...

U+0699 ڙ Arabic Letter Reh With Four Dots Above

Sindhi

U+069A ښ Arabic Letter Seen With Dot Below And Dot Above

Pashto

U+069B ڛ Arabic Letter Seen With Three Dots Below

early Persian

U+069C ڜ Arabic Letter Seen With Three Dots Below And Three Dots Above

Moroccan Arabic

U+069D ڝ Arabic Letter Sad With Two Dots Below

Turkic

U+069E ڞ Arabic Letter Sad With Three Dots Above

Berber, Burushaski

U+069F ڟ Arabic Letter Tah With Three Dots Above

Old Hausa

U+06A0 ڠ Arabic Letter Ain With Three Dots Above

Jawi

U+06A1 ڡ Arabic Letter Dotless Feh

Adighe

U+06A2 ڢ Arabic Letter Feh With Dot Moved Below

Maghrib Arabic

U+06A3 ڣ Arabic Letter Feh With Dot Below

Ingush

U+06A4 ڤ Arabic Letter Veh

Middle Eastern Arabic for foreign words

Kurdish, Khwarazmian, early Persian, Jawi

U+06A5 ڥ Arabic Letter Feh With Three Dots Below

North African Arabic for foreign words

U+06A6 ڦ Arabic Letter Peheh

Sindhi

U+06A7 ڧ Arabic Letter Qaf With Dot Above

Maghrib Arabic, Uyghur

U+06A8 ڨ Arabic Letter Qaf With Three Dots Above

Tunisian and Algerian Arabic

U+06A9 ک Arabic Letter Keheh

= kaf mashkula

Persian, Urdu, Sindhi, ...

U+06AA ڪ Arabic Letter Swash Kaf

represents a letter distinct from Arabic KAF (0643) in Sindhi

U+06AB ګ Arabic Letter Kaf With Ring

Pashto

may appear like an Arabic KAF (0643) with a ring below the base

U+06AC ڬ Arabic Letter Kaf With Dot Above

use for the Jawi gaf is not recommended, although it may be found in some existing text data; recommended character for Jawi gaf is 0762

→ U+0762 ݢ Arabic Letter Keheh With Dot Above

U+06AD ڭ Arabic Letter Ng

Uyghur, Kazakh, Moroccan Arabic, early Jawi, early Persian, ...

U+06AE ڮ Arabic Letter Kaf With Three Dots Below

Berber, early Persian

Pegon alternative for 08B4

U+06AF گ Arabic Letter Gaf

Persian, Urdu, ...

U+06B0 ڰ Arabic Letter Gaf With Ring

Lahnda

U+06B1 ڱ Arabic Letter Ngoeh

Sindhi

U+06B2 ڲ Arabic Letter Gaf With Two Dots Below

not used in Sindhi

U+06B3 ڳ Arabic Letter Gueh

Sindhi, Saraiki

U+06B4 ڴ Arabic Letter Gaf With Three Dots Above

not used in Sindhi, Karakalpak

U+06B5 ڵ Arabic Letter Lam With Small V

Kurdish, historically Bosnian

U+06B6 ڶ Arabic Letter Lam With Dot Above

Kurdish

U+06B7 ڷ Arabic Letter Lam With Three Dots Above

Kurdish

U+06B8 ڸ Arabic Letter Lam With Three Dots Below

Avar, Soqotri

U+06B9 ڹ Arabic Letter Noon With Dot Below
U+06BA ں Arabic Letter Noon Ghunna

Urdu, archaic Arabic

dotless in all four contextual forms

U+06BB ڻ Arabic Letter Rnoon

dotless in all four contextual forms

Sindhi

U+06BC ڼ Arabic Letter Noon With Ring

Pashto

U+06BD ڽ Arabic Letter Noon With Three Dots Above

Jawi

U+06BE ھ Arabic Letter Heh Doachashmee

forms aspirate digraphs in Urdu and other languages of South Asia

represents the glottal fricative /h/ in Uyghur

U+06BF ڿ Arabic Letter Tcheh With Dot Above
U+06C0 ۀ Arabic Letter Heh With Yeh Above

for ezafe, use 0654 over the language-appropriate base letter

actually a ligature, not an independent letter

= Arabic letter hamzah on ha (1.0)

≡ ۀ U+06D5 U+0654

U+06C1 ہ Arabic Letter Heh Goal

Urdu

U+06C2 ۂ Arabic Letter Heh Goal With Hamza Above

Urdu

actually a ligature, not an independent letter

≡ ۂ U+06C1 U+0654

U+06C3 ۃ Arabic Letter Teh Marbuta Goal

Urdu

U+06C4 ۄ Arabic Letter Waw With Ring

Kashmiri

U+06C5 ۅ Arabic Letter Kirghiz Oe

Kyrgyz

a glyph variant occurs which replaces the looped tail with a horizontal bar through the tail

U+06C6 ۆ Arabic Letter Oe

Uyghur, Kurdish, Kazakh, Azerbaijani, historically Bosnian

U+06C7 ۇ Arabic Letter U

Azerbaijani, Kazakh, Kyrgyz, Uyghur

U+06C8 ۈ Arabic Letter Yu

Uyghur

U+06C9 ۉ Arabic Letter Kirghiz Yu

Kazakh, Kyrgyz, historically Bosnian

U+06CA ۊ Arabic Letter Waw With Two Dots Above

Kurdish

U+06CB ۋ Arabic Letter Ve

Uyghur, Kazakh

U+06CC ی Arabic Letter Farsi Yeh

Arabic, Persian, Urdu, Kashmiri, ...

initial and medial forms of this letter have dots

→ U+0649 ى Arabic Letter Alef Maksura

→ U+064A ي Arabic Letter Yeh

U+06CD ۍ Arabic Letter Yeh With Tail

Pashto, Sindhi

U+06CE ێ Arabic Letter Yeh With Small V

Kurdish

U+06CF ۏ Arabic Letter Waw With Dot Above

Jawi

U+06D0 ې Arabic Letter E

Pashto, Uyghur

used as the letter bbeh in Sindhi

U+06D1 ۑ Arabic Letter Yeh With Three Dots Below

Mende languages, Hausa

U+06D2 ے Arabic Letter Yeh Barree

Urdu

U+06D3 ۓ Arabic Letter Yeh Barree With Hamza Above

Urdu

U+06D4 ۔ Arabic Full Stop

Urdu

U+06D5 ە Arabic Letter Ae

Uyghur, Kazakh, Kyrgyz

U+06D6 ۖ Arabic Small High Ligature Sad With Lam With Alef Maksura
U+06D7 ۗ Arabic Small High Ligature Qaf With Lam With Alef Maksura
U+06D8 ۘ Arabic Small High Meem Initial Form
U+06D9 ۙ Arabic Small High Lam Alef
U+06DA ۚ Arabic Small High Jeem
U+06DB ۛ Arabic Small High Three Dots
U+06DC ۜ Arabic Small High Seen
U+06DD ۝ Arabic End Of Ayah
U+06DE ۞ Arabic Star Of Rub El Hizb
U+06DF ۟ Arabic Small High Rounded Zero

smaller than the typical circular shape used for 0652

U+06E0 ۠ Arabic Small High Upright Rectangular Zero

the term "rectangular zero" is a translation of the Arabic name of this sign

U+06E1 ۡ Arabic Small High Dotless Head Of Khah

= Arabic jazm

presentation form of 0652, using font technology to select the variant is preferred

used in some Qurans to mark absence of a vowel

→ U+0652 ْ Arabic Sukun

U+06E2 ۢ Arabic Small High Meem Isolated Form
U+06E3 ۣ Arabic Small Low Seen
U+06E4 ۤ Arabic Small High Madda

typically used with 06E5, 06E6, 06E7, and 08F3

U+06E5 ۥ Arabic Small Waw

→ U+08D3 ࣓ Arabic Small Low Waw

→ U+08F3 ࣳ Arabic Small High Waw

U+06E6 ۦ Arabic Small Yeh
U+06E7 ۧ Arabic Small High Yeh
U+06E8 ۨ Arabic Small High Noon
U+06E9 ۩ Arabic Place Of Sajdah

there is a range of acceptable glyphs for this character

U+06EA ۪ Arabic Empty Centre Low Stop
U+06EB ۫ Arabic Empty Centre High Stop
U+06EC ۬ Arabic Rounded High Stop With Filled Centre

also used in Quranic text in African and other orthographies to represent wasla, ikhtilas, etc.

U+06ED ۭ Arabic Small Low Meem
U+06EE ۮ Arabic Letter Dal With Inverted V
U+06EF ۯ Arabic Letter Reh With Inverted V

also used in early Persian

U+06F0 ۰ Extended Arabic-Indic Digit Zero
U+06F1 ۱ Extended Arabic-Indic Digit One
U+06F2 ۲ Extended Arabic-Indic Digit Two
U+06F3 ۳ Extended Arabic-Indic Digit Three
U+06F4 ۴ Extended Arabic-Indic Digit Four

Persian has a different glyph than Sindhi and Urdu

U+06F5 ۵ Extended Arabic-Indic Digit Five

Persian, Sindhi, and Urdu share a glyph different from Arabic

U+06F6 ۶ Extended Arabic-Indic Digit Six

Persian, Sindhi, and Urdu have glyphs different from Arabic

U+06F7 ۷ Extended Arabic-Indic Digit Seven

Urdu and Sindhi have glyphs different from Arabic

U+06F8 ۸ Extended Arabic-Indic Digit Eight
U+06F9 ۹ Extended Arabic-Indic Digit Nine
U+06FA ۺ Arabic Letter Sheen With Dot Below
U+06FB ۻ Arabic Letter Dad With Dot Below
U+06FC ۼ Arabic Letter Ghain With Dot Below
U+06FD ۽ Arabic Sign Sindhi Ampersand
U+06FE ۾ Arabic Sign Sindhi Postposition Men
U+06FF ۿ Arabic Letter Heh With Inverted V
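
The Extended Arabic-Indic digits U+06F0–U+06F9 listed above have the same numeric values as the Arabic-Indic digits U+0660–U+0669; only some glyph shapes differ. As a minimal sketch, assuming Python's standard `unicodedata` module (the helper name `to_ascii_digits` is hypothetical), text containing either set might be converted to ASCII digits like this:

```python
import unicodedata

def to_ascii_digits(text: str) -> str:
    """Replace every Unicode decimal digit with its ASCII equivalent."""
    out = []
    for ch in text:
        # unicodedata.decimal returns the supplied default (None) for non-digits.
        value = unicodedata.decimal(ch, None)
        out.append(str(value) if value is not None else ch)
    return "".join(out)

extended = "\u06F1\u06F2\u06F3"      # Extended Arabic-Indic digits one, two, three
arabic_indic = "\u0661\u0662\u0663"  # Arabic-Indic digits one, two, three

print(to_ascii_digits(extended))
print(to_ascii_digits(arabic_indic))
print(int(extended))  # int() also accepts Unicode decimal digits directly
```

Both digit sets map to the same values, so the choice between them affects only how the text is displayed, not how it is parsed.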

Compact table

Arabic[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+060x  ؀   ؁   ؂   ؃   ؄   ؅  ؆ ؇ ؈ ؉ ؊ ؋ ، ؍ ؎ ؏
U+061x ؐ ؑ ؒ ؓ ؔ ؕ ؖ ؗ ؘ ؙ ؚ ؛  ALM  ؝ ؞ ؟
U+062x ؠ ء آ أ ؤ إ ئ ا ب ة ت ث ج ح خ د
U+063x ذ ر ز س ش ص ض ط ظ ع غ ػ ؼ ؽ ؾ ؿ
U+064x ـ ف ق ك ل م ن ه و ى ي ً ٌ ٍ َ ُ
U+065x ِ ّ ْ ٓ ٔ ٕ ٖ ٗ ٘ ٙ ٚ ٛ ٜ ٝ ٞ ٟ
U+066x ٠ ١ ٢ ٣ ٤ ٥ ٦ ٧ ٨ ٩ ٪ ٫ ٬ ٭ ٮ ٯ
U+067x ٰ ٱ ٲ ٳ ٴ ٵ ٶ ٷ ٸ ٹ ٺ ٻ ټ ٽ پ ٿ
U+068x ڀ ځ ڂ ڃ ڄ څ چ ڇ ڈ ډ ڊ ڋ ڌ ڍ ڎ ڏ
U+069x ڐ ڑ ڒ ړ ڔ ڕ ږ ڗ ژ ڙ ښ ڛ ڜ ڝ ڞ ڟ
U+06Ax ڠ ڡ ڢ ڣ ڤ ڥ ڦ ڧ ڨ ک ڪ ګ ڬ ڭ ڮ گ
U+06Bx ڰ ڱ ڲ ڳ ڴ ڵ ڶ ڷ ڸ ڹ ں ڻ ڼ ڽ ھ ڿ
U+06Cx ۀ ہ ۂ ۃ ۄ ۅ ۆ ۇ ۈ ۉ ۊ ۋ ی ۍ ێ ۏ
U+06Dx ې ۑ ے ۓ ۔ ە ۖ ۗ ۘ ۙ ۚ ۛ ۜ  ۝  ۞ ۟
U+06Ex ۠ ۡ ۢ ۣ ۤ ۥ ۦ ۧ ۨ ۩ ۪ ۫ ۬ ۭ ۮ ۯ
U+06Fx ۰ ۱ ۲ ۳ ۴ ۵ ۶ ۷ ۸ ۹ ۺ ۻ ۼ ۽ ۾ ۿ
Notes
1. ^ Template:Unicode version
2. ^ Unicode code point U+0673 is deprecated as of Unicode version 6.0
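
The deprecation of U+0673 can be inspected programmatically. The sketch below, assuming Python's standard `unicodedata` module, shows that U+0673 has no decomposition mapping, so normalization does not convert it to the recommended sequence <U+0627, U+065F>; any replacement has to be made explicitly. The helper name `replace_deprecated_alef` is hypothetical.

```python
import unicodedata

DEPRECATED = "\u0673"         # ARABIC LETTER ALEF WITH WAVY HAMZA BELOW
RECOMMENDED = "\u0627\u065F"  # ARABIC LETTER ALEF + ARABIC WAVY HAMZA BELOW

def replace_deprecated_alef(text: str) -> str:
    """Explicitly substitute the recommended sequence for U+0673."""
    return text.replace(DEPRECATED, RECOMMENDED)

# An empty decomposition mapping means the two spellings are not
# canonically equivalent, so NFD leaves U+0673 unchanged.
print(repr(unicodedata.decomposition(DEPRECATED)))
print(unicodedata.normalize("NFD", DEPRECATED) == DEPRECATED)
print(replace_deprecated_alef(DEPRECATED) == RECOMMENDED)
```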

Arabic Supplement

Arabic Supplement[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+075x ݐ ݑ ݒ ݓ ݔ ݕ ݖ ݗ ݘ ݙ ݚ ݛ ݜ ݝ ݞ ݟ
U+076x ݠ ݡ ݢ ݣ ݤ ݥ ݦ ݧ ݨ ݩ ݪ ݫ ݬ ݭ ݮ ݯ
U+077x ݰ ݱ ݲ ݳ ݴ ݵ ݶ ݷ ݸ ݹ ݺ ݻ ݼ ݽ ݾ ݿ
Notes
1. ^ Template:Unicode version
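
The compact chart layout, with one row per 16 code points and the column giving the final hexadecimal digit, can be generated from a block's range. A minimal Python sketch follows, using the Arabic Supplement range U+0750–U+077F from the chart above; `chart_rows` is a hypothetical helper name, and the sketch does not mark unassigned code points.

```python
def chart_rows(first: int, last: int) -> list[str]:
    """Return one text row per 16 code points, labelled like 'U+075x'."""
    rows = []
    # Round the start down to a multiple of 16 so every row is complete.
    for base in range(first & ~0xF, last + 1, 16):
        label = f"U+{base >> 4:03X}x"
        cells = [chr(cp) for cp in range(base, base + 16)]
        rows.append(label + " " + " ".join(cells))
    return rows

for row in chart_rows(0x0750, 0x077F):
    print(row)
```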

Arabic Extended-B

Template:Unicode chart Arabic Extended-B

Arabic Extended-A

Template:Unicode chart Arabic Extended-A

Arabic Presentation Forms-A

The characters in this block are mostly ligatures that can be composed from characters in the preceding charts; the exceptions are the bracket-like ornate parentheses (U+FD3E ﴾ and U+FD3F ﴿). Some of the ligatures represent common liturgical phrases.
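
Because most characters in this block are compatibility ligatures, compatibility normalization (NFKC or NFKD) maps them back to sequences of basic letters. A small Python sketch, assuming only the standard `unicodedata` module, illustrates this with U+FB50, with U+FDF2 (the ligature for the word Allah), and with the ornate parenthesis U+FD3E, one of the bracket-like characters, which has no decomposition:

```python
import unicodedata

samples = ["\uFB50", "\uFDF2", "\uFD3E"]

for ch in samples:
    # NFKC replaces a compatibility character with its base sequence;
    # characters without a decomposition are returned unchanged.
    normalized = unicodedata.normalize("NFKC", ch)
    codes = " ".join(f"U+{ord(c):04X}" for c in normalized)
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)} -> {codes}")
```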

Arabic Presentation Forms-A[1][2][3]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+FB5x
U+FB6x
U+FB7x ﭿ
U+FB8x
U+FB9x
U+FBAx
U+FBBx ﮿
U+FBCx
U+FBDx
U+FBEx
U+FBFx ﯿ
U+FC0x
U+FC1x
U+FC2x
U+FC3x ﰿ
U+FC4x
U+FC5x
U+FC6x
U+FC7x ﱿ
U+FC8x
U+FC9x
U+FCAx
U+FCBx ﲿ
U+FCCx
U+FCDx
U+FCEx
U+FCFx ﳿ
U+FD0x
U+FD1x
U+FD2x
U+FD3x ﴿
U+FD4x
U+FD5x
U+FD6x
U+FD7x ﵿ
U+FD8x
U+FD9x
U+FDAx
U+FDBx ﶿ
U+FDCx
U+FDDx
U+FDEx
U+FDFx ﷿
Notes
1. ^ Template:Unicode version
2. ^ Grey areas indicate non-assigned code points
3. ^ Black areas indicate noncharacters (code points that are guaranteed never to be assigned as encoded characters in the Unicode Standard)
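
The noncharacters in this block occupy U+FDD0–U+FDEF; the Unicode Standard additionally designates the last two code points of every plane (those ending in FFFE or FFFF) as noncharacters. A minimal sketch of a check in Python, with `is_noncharacter` as a hypothetical helper name:

```python
def is_noncharacter(cp: int) -> bool:
    """Return True if the code point is a Unicode noncharacter."""
    if not 0 <= cp <= 0x10FFFF:
        return False
    # The contiguous range inside Arabic Presentation Forms-A.
    if 0xFDD0 <= cp <= 0xFDEF:
        return True
    # The last two code points of each of the 17 planes.
    return (cp & 0xFFFE) == 0xFFFE

for cp in (0xFDCF, 0xFDD0, 0xFDEF, 0xFDF0, 0xFFFE, 0x10FFFF):
    print(f"U+{cp:04X}", is_noncharacter(cp))
```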

Arabic Presentation Forms-B

These can all be created from the basic chart's characters.
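
The letter forms in this block carry a compatibility decomposition that names the positional form and the basic letter it derives from. The following Python sketch, which assumes only the standard `unicodedata` module, prints that mapping for the four forms of BEH (U+FE8F–U+FE92) and for the LAM WITH ALEF ligature U+FEFB:

```python
import unicodedata

for cp in (0xFE8F, 0xFE90, 0xFE91, 0xFE92, 0xFEFB):
    ch = chr(cp)
    # The mapping is a tag such as <initial> followed by base code points.
    print(f"U+{cp:04X} {unicodedata.name(ch)}: {unicodedata.decomposition(ch)}")
```

A rendering engine normally selects these positional shapes itself from the basic letters, which is why the presentation forms are not needed when encoding text.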

Arabic Presentation Forms-B[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+FE7x ﹿ
U+FE8x
U+FE9x
U+FEAx
U+FEBx ﺿ
U+FECx
U+FEDx
U+FEEx
U+FEFx ZWNBSP
Notes
1. ^ Template:Unicode version
2. ^ Grey areas indicate non-assigned code points
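
Whether a code point is assigned can be checked against the Unicode Character Database, where unassigned code points have the general category Cn (noncharacters report Cn as well). A short Python sketch follows; its result depends on the Unicode version bundled with the interpreter, reported by `unicodedata.unidata_version`, and `unassigned_in_range` is a hypothetical helper name.

```python
import unicodedata

def unassigned_in_range(first: int, last: int) -> list[int]:
    """Return the code points in [first, last] whose category is Cn."""
    return [cp for cp in range(first, last + 1)
            if unicodedata.category(chr(cp)) == "Cn"]

print("Unicode data version:", unicodedata.unidata_version)
# Arabic Presentation Forms-B spans U+FE70-U+FEFF.
print([f"U+{cp:04X}" for cp in unassigned_in_range(0xFE70, 0xFEFF)])
```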

Rumi Numeral Symbols

Template:Unicode chart Rumi Numeral Symbols

Arabic Extended-C

Template:Unicode chart Arabic Extended-C

Indic Siyaq Numbers

Template:Unicode chart Indic Siyaq Numbers

Ottoman Siyaq Numbers

Template:Unicode chart Ottoman Siyaq Numbers

Arabic Mathematical Alphabetic Symbols

Template:Unicode chart Arabic Mathematical Alphabetic Symbols

References

  1. "What is the origin of the ampersand (&)?"
  2. unicode.org Biography: Thomas Milo - DecoType
  3. Script error: No such module "citation/CS1".
  4. Script error: No such module "citation/CS1".
  5. Script error: No such module "citation/CS1".
  6. a b Script error: No such module "citation/CS1".
  7. Deprecated as of Unicode version 6.0. UCD Change History: "The particular combination of an alef with this vowel mark should be written with the sequence <U+0627 ARABIC LETTER ALEF, U+065F ARABIC WAVY HAMZA BELOW>, rather than with the character U+0673 ARABIC LETTER ALEF WITH WAVY HAMZA BELOW, which has been deprecated and which is not canonically equivalent."
