Wiki143:Manual of Style/Arabic

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Script error: No such module "Labelled list hatnote". Template:MoS-guideline Template:Main other

This page proposes a guideline regarding the use of Arabic words on the English Wikipedia.

On the English Wikipedia, Arabic is rendered into Latin script according to one of four methods in order of decreasing preference:

  1. Common English translation
  2. Common transcription
  3. Basic transcription
  4. Strict transliteration

The transliteration of Arabic used by Wikipedia is based on the ALA-LC romanization method, with a few simple changes that make it easier to read and manage in compliance with the main Manual of Style. The strict transliteration uses accents, underscores, and underdots, and is only used for etymology, usually alongside the original Arabic. All other cases of Arabic script romanization will use the same standard, but without accents, underscores, and underdots. Some exceptions to this rule may apply.

Definitions

Arabic

In general, as specified on WP:English, a common English translation takes precedence over other methods to represent Arabic. This convention deals with the cases in which no common English translation is available. For the purposes of this convention, an Arabic word is defined as a name or phrase that is most commonly originally rendered in the Arabic script, and that in English is not usually translated into a common English word. These could be in any language that uses this script, such as Arabic, Persian, or Ottoman Turkish.

Examples of Arabic script rendered into Latin:

Examples of titles not transliterated from Arabic script:

Common transcription

A word or name has a common transcriptionTemplate:Efn (anglicization) if a large majority of references in English use the same transcription or if a reliable source shows that an individual self-identifies with a particular transcription. Non-printable characters (including underscores) should be avoided.

Examples of references include the Oxford Dictionary, the FBI, the NY Times, CNN, the Washington Post, Al Jazeera, Encarta, Britannica, the Library of Congress, and other academic sources. Examples of self-identification include a driver's license or passport in which the individual personally chose a particular form of transcription.

Google searches can be useful in determining the most common usage, but should not be heavily relied upon. The content of large searches may not be relevant to the subject being discussed or may misrepresent the figures due to the use in languages other than English. For example, the ISO transliteration (ISO 233) of Script error: No such module "Lang". is "[[Qa'im (disambiguation)|Template:Transliteration]]", but the transcription "al-Qaim" receives five times as many hits. This word is used in the names of three historical Caliphs and a town in Iraq, and is also another name for the Mahdi in Shia Islam. Since Google searches do not discriminate between them, other sources must be used to determine if a common transcription exists for any particular usage. Google search counts are also biased toward syndicated news articles: a single syndicated reference may generate hundreds or thousands of hits, amplifying the weight of whatever spelling happens to be used by that one reference.

If there is no common transcription, a basic transcription is used (see below).

Examples:

  • There is no single most popular transcription for the name of the prophet of Islam. "Mohammed", "Mohammad", "Muhammad", and "Mohamed" are all commonly used. The basic transcription "Muhammad" is used.
  • The capital of Egypt is most widely known as Cairo. The basic transcription of "al-Qahira" is not used.
  • The common transcription of the leader of al-Qaeda (itself a common transcription of the strict transliteration al-Qāʿida) is "Osama bin Laden". The basic transcription of Usama ibn Ladin is not used.

Note: the Arabic word Script error: No such module "Lang". (Template:Langx) should be transcribed ibn unless a common transcription requires the colloquial bin.

Basic transcription

The basic transcriptionTemplate:Efn uses a systematic convention of rendering Arabic scripts. The basic transcription from Arabic to Roman letters is found below.

The basic transcription does not carry enough information to accurately write or pronounce the original Arabic script. For example, it does not differentiate between certain pairs of similar letters (e.g. Script error: No such module "Lang". Template:Transliteration vs. Script error: No such module "Lang". Template:Transliteration), or between long and short vowels. It does, however, increase the readability of the article to those not familiar with Arabic transliteration, and avoids characters that may be unreadable to browsers. This transcription method can be seen as a compromise between strict transliteration and Wikipedia conventions.

Strict transliteration

A strict transliteration is completely reversible, allowing the original writing to be faithfully restored. A strict transliteration need not be a 1:1 mapping of characters as long as there are clear rules for choosing one character over another. A source character may be mapped (1:n) into a sequence of several target characters without losing sequential reversibility.

A strict transliteration uses a system of accents, underscores, and underdots to render the original Arabic in a form that preserves all the information in the original Arabic.

ALA-LC romanization is most commonly used for this purpose; other common transliteration standards include ISO 233 and DIN 31635.

Note that several letters proposed in the strict transliteration system below do not render correctly for some widespread software configurations (e.g. ḥ, ṣ, ḍ, ṭ, ṛ, and ẓ). Using the Template:Tl template to enclose transliterations allows CSS classes to address these issues.

Examples

Arabic Common Basic Strict
Script error: No such module "Lang". Cairo al-Qahira Template:Transliteration
Script error: No such module "Lang". Salaf as-Salaf as-Salih Template:Transliteration
Script error: No such module "Lang". Baibars al-Zahir Baybars Template:Transliteration
Script error: No such module "Lang". Abbasid al-Abbasiyyun Template:Transliteration
Script error: No such module "Lang". Karbala Karbala' Template:Transliteration
Script error: No such module "Lang". Muhammad Template:Transliteration
Script error: No such module "Lang". al-Qaeda al-Qa'ida Template:Transliteration

Article titles and redirects

Article titles

Article titles should conform to WP:CRITERIA. Rules of thumb that will work in most cases:

  1. Use the translation or transcription that is most often used in English-language reliable sources (WP:COMMONNAME principle).
    Example: Henna
  2. When there are several forms that occur often in English-language reliable sources, and for those that are used most often it is unclear which one outdoes the others in usage, choose among these the one that is closest to the basic transcription.
    Example: Jinn (not Djinn nor Genies)
  3. In all other cases use the basic transcription.
    Example: Jabir ibn Aflah
  4. Stay within the constraints of WP:TITLESPECIALCHARACTERS.
    Example: Na'im ibn Musa (not Na‘im ibn Musa)

Choosing an article title that diverts from the above rules of thumb can only be done with a consensus that the alternative article title conforms better to WP:CRITERIA, and when all applicable redirects are in place.

Example: Thābit ibn Qurra

Redirects

All frequently occurring name variants, including transcriptions and transliterations, should redirect to the article. There will often be many redirects, but this is intentional and does not represent a problem.

Article text

Lead paragraph

All articles with Arabic titles should have a lead paragraph which includes the article title, along with the original Arabic script and the strict transliteration in parentheses, preferably in the lead sentence. This is in accordance with the official Wikipedia policy at WP:ENGLISH. Many articles that are missing this information are listed at Category:Articles needing Arabic script or text. Arabic script is used in combination with the Template:Tl, while the strict transliteration is written using Template:Tl. A combination of the Template:Tlx and Template:Tl templates can also be represented by Template:Tlx:

  • Template:Tlx: will mark the text as Arabic. In some browsers, this may trigger a more legible font.
  • Template:Tlx: provides a mouseover note indicating that the inserted text is transliterated from Arabic. The transliteration has to be italicised manually.
  • Template:Tlx: provides a combination of a link to Arabic language, the original Arabic term, its transliteration and a literal translation.

The standard format, with, pursuant to Template:Tl, the transliteration system indicated, is given in the following examples:

Some cases will require variations on this format. If the name is extremely long, the first appearance of the name is suitable to provide the strict transliteration. Likewise, if a strict transliteration appears overly repetitious, it should be in place of the page title in the lead paragraph.

Example:

Main text and general usage

As with the convention for titles, common English translations should be used as much as possible. Likewise, if these are not available, one should first try a common transcription before resorting to the basic transcription. Strict transliterations in the main text should only be used out of necessity, e.g. explanations in linguistic texts or articles about transliterating.

Clash with wiki markup

Words ending with Template:Transliteration or a Template:Transliteration are transcribed with an apostrophe at the end. This can cause a problem if the word is at the end of an italicized or bold text. In order to prevent the final apostrophe from being interpreted as wiki markup '' and ''', use Template:Tl2.

Example: ''Karbala[[:Template:((]]`[[:Template:))]]'' for KarbalaTemplate:`.

Collation in alphabetical order

Script error: No such module "Labelled list hatnote".

  • Index by family name in modern cases where there is one, otherwise by the first component in the commonly used name.
  • For indexing of persons, the definite article "al-" and its variants (ash-, ad-, etc.) should be omitted when they form part of a modern family name.
  • However, for organisational names, where a common transcription is established by usage, the al- or el- part is often treated as a full part of the word.
    • Example: Al-Qaeda should be indexed as "Al-Qaeda", not "Qaeda".
  • Include particles such as Abu, Abd, Abdel, Abdul, ben, bin and bint as part of the name. When found in modern surnames, such names are considered compound names and the particles are integral to the name.
  • For indexing, the apostrophe (representing hamza and ‘ayn) should be ignored, and letters with diacritics should be indexed as if they did not have their diacritics.

Transliteration

Template:See The strict transliteration presented below is based on the ALA-LC Romanization method (1997), and standards from the United Nations Group of Experts on Geographical Names. It also includes some alternative symbols adopted in ISO 233 and DIN 31635, which are used by such sources as the Encyclopedia of Islam, and are available in the Arabic tab of the default Wikipedia editor.

The basic transcription is a simplified version.[discuss]

Consonants

Arabic Name Basic
transcr.
Strict
translit.
Notes
Script error: No such module "Lang". bā’ b
Script error: No such module "Lang". tā’ t
Script error: No such module "Lang". thā’ th the sequence Script error: No such module "Lang". is optionally written Template:Angle bracket in ALA-LC Arabic romanization
Script error: No such module "Lang". jīm j/g g is usually in contemporary articles pertaining to Egypt or Egyptian Arabic or when a word is spelled with Script error: No such module "Lang". but pronounced Template:IPAslink as advised by romanization schemes (ALA-LC, DIN, and UN).
Script error: No such module "Lang". ḥā’ h
Script error: No such module "Lang". khā’ kh the sequence Script error: No such module "Lang". is optionally written Template:Angle bracket in ALA-LC Arabic romanization
Script error: No such module "Lang". dāl d
Script error: No such module "Lang". dhāl dh the sequence Script error: No such module "Lang". is optionally written Template:Angle bracket in ALA-LC Arabic romanization
Script error: No such module "Lang". rā’ r
Script error: No such module "Lang". zāy z
Script error: No such module "Lang". sīn s
Script error: No such module "Lang". shīn sh the sequence Script error: No such module "Lang". is optionally written Template:Angle bracket in ALA-LC Arabic romanization
Script error: No such module "Lang". ṣād s
Script error: No such module "Lang". ḍād d
Script error: No such module "Lang". ṭā’ t
Script error: No such module "Lang". ẓā’ z
Script error: No such module "Lang". ‘ayn
  1. REDIRECT Template:Large

Template:Redirect category shell||

  1. REDIRECT Template:Large

Template:Redirect category shell or

  1. REDIRECT Template:Large

Template:Redirect category shell||When using basic transcription, it is omitted in the initial position.Template:Efn-lr

Script error: No such module "Lang". ghayn gh
Script error: No such module "Lang". fā’ f
Script error: No such module "Lang". qāf q
Script error: No such module "Lang". kāf k
Script error: No such module "Lang". lām l
Script error: No such module "Lang". mīm m
Script error: No such module "Lang". nūn n
Script error: No such module "Lang". hā’ h
Script error: No such module "Lang". hamza
  1. REDIRECT Template:Large

Template:Redirect category shell||

  1. REDIRECT Template:Large

Template:Redirect category shell or

  1. REDIRECT Template:Large

Template:Redirect category shell||It is omitted in the initial position both when using basic transcription and when using strict transliteration.Template:Efn-lr

Script error: No such module "Lang". tā’ marbūṭa a or ah or at a or ah or at usually as a or ah (ALA-LC), but sometimes as at (in construct case).Template:Efn-lr
Script error: No such module "Lang". wāw w See also long vowels
Script error: No such module "Lang". ya’ y See also long vowels
Script error: No such module "Lang". (yā’) i or iyy ī, īy or iyy romanized īy (ALA-LC) or iyy except in final positionTemplate:Efn-lr
Script error: No such module "Lang". alif madda a, 'aTemplate:Efn-lr ’ā, ā, or ʾā Initially ā, medially; ’ā (ALA-LC) or ʾā (depending on which one is used for hamza)
Notes from the ALA-LC specifications

Template:Notelist-lr

Vowels

Arabic Name Basic
transcr.
Strict
translit.
064EScript error: No such module "Check for unknown parameters".
Template:Resize
Template:Transliteration Template:Transliteration
064FScript error: No such module "Check for unknown parameters".
Template:Resize
Template:Transliteration Template:Transliteration
0650Script error: No such module "Check for unknown parameters".
Template:Resize
Template:Transliteration Template:Transliteration
064E 0627Script error: No such module "Check for unknown parameters".
Template:Resize
Template:Transliteration a ā
064E 0649Script error: No such module "Check for unknown parameters".
Template:Resize
Template:Transliteration a Template:Transliteration (DIN)Script error: No such module "Check for unknown parameters". or Template:Transliteration (ALA-LC)Script error: No such module "Check for unknown parameters".
064F 0648Script error: No such module "Check for unknown parameters".
Template:Resize
Template:Transliteration u ū
0650 064AScript error: No such module "Check for unknown parameters".
Template:Resize
Template:Transliteration i ī

Definite article

Romanizing the Arabic definite article is usually preferred unassimilated.

Solar
letters
Basic
transcr.
Strict
translit.
Script error: No such module "Lang". t
Script error: No such module "Lang". th
Script error: No such module "Lang". d
Script error: No such module "Lang". dh
Script error: No such module "Lang". r
Script error: No such module "Lang". z
Script error: No such module "Lang". s
Script error: No such module "Lang". sh
Script error: No such module "Lang". s
Script error: No such module "Lang". d
Script error: No such module "Lang". t
Script error: No such module "Lang". z
Script error: No such module "Lang". l
Script error: No such module "Lang". n

Arabic has only one definite article (Script error: No such module "Lang". al-). However, if it is followed by a solar letter (listed in the table right), the "L" is assimilated in pronunciation with this solar letter and the solar letter is doubled.

Examples

Both the non-assimilated (al-) or the assimilated (ad-) form appear in various standards of transliteration. Choose one and use it consistently throughout the article.

"Al-" and its variants (ash-, ad-, ar-, etc.) are always written in lower case, also when forming part of proper nouns, except when beginning a sentence. It is always separated from the following word (which takes the upper case when it is a proper noun) by a hyphen.

Examples
  • "He was a member of al-Qaeda."
  • "Al-Qaeda has been designated as a terrorist group."

Dynastic Al

Script error: No such module "Labelled list hatnote".

Some people, especially in the region of Arabia, when they are descended from a famous ancestor, start their last name with Script error: No such module "Lang". Template:Transliteration Script error: No such module "IPA"., a noun meaning "family" or "clan", like the dynasty Al Saud (family of Saud) or Al ash-Sheikh (family of the Sheikh). Script error: No such module "Lang". Template:Transliteration Script error: No such module "IPA". is distinct from the definite article Script error: No such module "Lang". Template:Transliteration Script error: No such module "IPA"..

Arabic meaning transcription IPA example
Script error: No such module "Lang". the Template:Transliteration Script error: No such module "IPA". Maytham al-Tammar
Script error: No such module "Lang". family/clan of Template:Transliteration Script error: No such module "IPA". Bandar bin Abdulaziz Al Saud
Script error: No such module "Lang". tribe/people of Template:Transliteration Script error: No such module "IPA". Ahl al-Bayt

Capitalization

Rules for the capitalization of English should be followed, except for the definite article, as explained above.

Names

Script error: No such module "Labelled list hatnote". The basic transcription of Arabic names comprises a variation on the following structure:

  • the given name (ism)
  • multiple patronymics (nasab), as appropriate, each preceded by the particle ibn (son) or bint (daughter).
Note: the Arabic particle Script error: No such module "Lang". (Template:Langx) should be transcribed ibn unless a common transcription requires the colloquial form bin (e.g. Osama bin Laden)

If Abū is preceded by ibn, the correct grammatical format is ibn Abī, not ibn Abū.

Persian

Script error: No such module "Labelled list hatnote". When the Arabic script was adopted for the Persian language, there were letters pronounced in Persian which did not have a representation in the Arabic alphabet, and vice versa. The Persian alphabet adds letters to the Arabic alphabet, and changes the pronunciation of some Arabic letters. In addition, Persian does not use a definite article (al-).

Urdu

Script error: No such module "Shortcut". Urdu adds additional letters, and some existing letters are transliterated differently. The strict transliteration is based on the ALA-LC Romanization method for Urdu (2012). The basic transcription is the same for the additional letters, but without accents, underscores and underdots. All letters in common with Arabic should likewise follow the Arabic transcription and/or translation conventions.

Consonants

Urdu Basic
transcr.
Strict
translit.
Notes
Script error: No such module "Lang". b b
Script error: No such module "Lang". p p
Script error: No such module "Lang". t t
Script error: No such module "Lang". t
Script error: No such module "Lang". s "s", combining macron below: s̱
Script error: No such module "Lang". j j
Script error: No such module "Lang". ch c
Script error: No such module "Lang". h
Script error: No such module "Lang". kh k͟h "k", combining double macron below, "h": k͟h
Script error: No such module "Lang". d d
Script error: No such module "Lang". d
Script error: No such module "Lang". z
Script error: No such module "Lang". r r
Script error: No such module "Lang". r
Script error: No such module "Lang". z z
Script error: No such module "Lang". zh zh
Script error: No such module "Lang". s s
Script error: No such module "Lang". sh sh
Script error: No such module "Lang". s
Script error: No such module "Lang". z
Script error: No such module "Lang". t "t", combining diaeresis below: t̤
Script error: No such module "Lang". z "z", combining diaeresis below: z̤
Script error: No such module "Lang".
  1. REDIRECT Template:Large

Template:Redirect category shell or

  1. REDIRECT Template:Large

Template:Redirect category shell or

  1. REDIRECT Template:Large

Template:Redirect category shell||

  1. REDIRECT Template:Large

Template:Redirect category shell or

  1. REDIRECT Template:Large

Template:Redirect category shell||The apostrophe should only be used if it appears in a common transcription; it is omitted in the initial position.

Script error: No such module "Lang". gh g͟h "g", combining double macron below, "h": g͟h
Script error: No such module "Lang". f f
Script error: No such module "Lang". q q
Script error: No such module "Lang". k k
Script error: No such module "Lang". g g
Script error: No such module "Lang". l l
Script error: No such module "Lang". m m
Script error: No such module "Lang". n n
Script error: No such module "Lang". n
Script error: No such module "Lang". w or v w or v
Script error: No such module "Lang". h h
Script error: No such module "Lang". t t
Script error: No such module "Lang".
  1. REDIRECT Template:Large

Template:Redirect category shell or

  1. REDIRECT Template:Large

Template:Redirect category shell||

  1. REDIRECT Template:Large

Template:Redirect category shell||

Script error: No such module "Lang". y y

Aspirates

Urdu Basic
transcr.
Strict
translit.
Script error: No such module "Lang". bh bh
Script error: No such module "Lang". ph ph
Script error: No such module "Lang". th th
Script error: No such module "Lang". th ṭh
Script error: No such module "Lang". jh jh
Script error: No such module "Lang". chh ch
Script error: No such module "Lang". dh dh
Script error: No such module "Lang". dh ḍh
Script error: No such module "Lang". rh ṛh
Script error: No such module "Lang". kh kh
Script error: No such module "Lang". gh gh

Vowels

Vowels Basic Trans. Strict Trans.
Script error: No such module "Lang". a a
Script error: No such module "Lang". i i
Script error: No such module "Lang". u u
Script error: No such module "Lang". a ā
Script error: No such module "Lang".Script error: No such module "Lang". a á
Script error: No such module "Lang". i ī
Script error: No such module "Lang". u ū
Script error: No such module "Lang". o o
Script error: No such module "Lang".Script error: No such module "Lang". e e
Script error: No such module "Lang". au au
Script error: No such module "Lang". ai ai

Ottoman Turkish

The Ottoman Turkish language differs from the above languages in that, since 1928, words that were once written with a Persian-influenced version of the Arabic abjad have been written using the Latin alphabet. As such, there is a long established set of standards for writing the language in a basic transcription; however, in a strict transliteration, the language adheres closely to the standards for strict transliteration described above.

Guidelines for writing Ottoman Turkish words according to the basic transcription can be found at the website of the Turkish Language Association (Türk Dil Kurumu): here for the majority of words, and here for names of people.

In the following table, only those letters which differ in either their strict transliteration or their basic transcription from the Arabic-oriented table above are shown; all others are transliterated according to that table.

Script Basic transcr. Strict translit. IPA Notes
Script error: No such module "Lang". a, â, e ā, e [ɑ:], [e] This represents a, â, or e in initial position, and â in medial or final position.
Script error: No such module "Lang". a, â ā [ɑ:] This is only written in initial position.
Script error: No such module "Lang". s [s]
Script error: No such module "Lang". c, ç c [dʒ], [tʃ] When choosing between c and ç in the basic transcription, modern Turkish orthography should be followed.
Script error: No such module "Lang". ç ç [tʃ]
Script error: No such module "Lang". h [h]
Script error: No such module "Lang". z [z]
Script error: No such module "Lang". j j [ʒ]
Script error: No such module "Lang". ş ş [ʃ]
Script error: No such module "Lang". z, d ż, [z], [d] When choosing between ż and in the strict transliteration, and z and d in the basic transcription, modern Turkish orthography should be followed.
Script error: No such module "Lang". a, 'a,
  1. REDIRECT Template:Large

Template:Redirect category shell, â||‘a, ‘ā,

  1. REDIRECT Template:Large

Template:Redirect category shell||[ɑ], [ɑ:], ø||

Script error: No such module "Lang". g, ğ ġ [ɣ], [g], [k], [h] When choosing between g and ğ in the basic transcription, modern Turkish orthography should be followed.
Script error: No such module "Lang". k [k]
Script error: No such module "Lang". k, g, ğ, n k, g, ñ [k], [n], [ɲ], [ŋ] When choosing between k, g, ğ, and n in the basic transcription, modern Turkish orthography should be followed.
Script error: No such module "Lang". g, ğ g [g], [k] When choosing between g and ğ in the basic transcription, modern Turkish orthography should be followed.
Script error: No such module "Lang". n ñ [n], [ɲ], [ŋ]
Script error: No such module "Lang". h, e, a, i h, e, a, i [h], [ɑ], [e], [i] When choosing between e and a in the transliteration, the Turkish rules of vowel harmony should be followed. This is only transliterated as h at the end of a word in proper nouns.
Script error: No such module "Lang".
  1. REDIRECT Template:Large

Template:Redirect category shell, ø||

  1. REDIRECT Template:Large

Template:Redirect category shell||ø||

Script error: No such module "Lang". v, o, ö, u, ü v, o, ō, ö, u, ū, ü [v], [o], [o:], [œ], [u], [u:], [y] When making the transliteration, modern Turkish orthography should be followed.
Script error: No such module "Lang". y, i, ı, a y, i, ī, ı, ā [j], [i], [i:], [ɯ], [ej], [ɑ:] When making the transliteration, modern Turkish orthography should be followed.
Script error: No such module "Lang". la, lâ [lɑ:]
Script error: No such module "Lang". et et [et]

Definite article

In words that use the Arabic definite article Script error: No such module "Lang"., the article always follows the assimilation of solar letters. However, the vowel Script error: No such module "Lang". can be transliterated in a number of ways.

  1. For a definite article in initial position, the definite article is written as Script error: No such module "Lang". in both the basic and the strict renderings; e.g. Script error: No such module "Lang". Script error: No such module "Lang"., Script error: No such module "Lang". Script error: No such module "Lang"..
  2. For a definite article in medial position, such as is found in many names of Arabic origin, the vowel in the strict transliteration can be written in a variety of ways; e.g. Script error: No such module "Lang"., Script error: No such module "Lang"., Script error: No such module "Lang"., Script error: No such module "Lang"., etc. In such cases, the diacritic representing the hamza or ‘ayin (i.e. Script error: No such module "Lang". or Script error: No such module "Lang".) is always used, and the choice of vowel should follow modern Turkish orthography; e.g. Script error: No such module "Lang". Script error: No such module "Lang"., Script error: No such module "Lang". Script error: No such module "Lang"., Script error: No such module "Lang". Script error: No such module "Lang"..
  3. For a definite article in medial position in the basic transcription, Script error: No such module "Lang". is not used, and the choice of vowel and spelling should follow modern Turkish orthography; e.g. Script error: No such module "Lang". Script error: No such module "Lang"., Script error: No such module "Lang". Script error: No such module "Lang"., Script error: No such module "Lang". Script error: No such module "Lang"..

Notes

Template:Notelist

External links