Lām 'alif
   HOME

TheInfoList



OR:

The Arabic alphabet, or the Arabic abjad, is the
Arabic script The Arabic script is the writing system used for Arabic (Arabic alphabet) and several other languages of Asia and Africa. It is the second-most widely used alphabetic writing system in the world (after the Latin script), the second-most widel ...
as specifically codified for writing the
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
language. It is a unicameral script written from right-to-left in a
cursive Cursive (also known as joined-up writing) is any style of penmanship in which characters are written joined in a flowing manner, generally for the purpose of making writing faster, in contrast to block letters. It varies in functionality and m ...
style, and includes 28 letters, of which most have contextual letterforms. Unlike the modern
Latin alphabet The Latin alphabet, also known as the Roman alphabet, is the collection of letters originally used by the Ancient Rome, ancient Romans to write the Latin language. Largely unaltered except several letters splitting—i.e. from , and from ...
, the script has no concept of
letter case Letter case is the distinction between the letters that are in larger uppercase or capitals (more formally ''majuscule'') and smaller lowercase (more formally '' minuscule'') in the written representation of certain languages. The writing system ...
. The Arabic alphabet is an
abjad An abjad ( or abgad) is a writing system in which only consonants are represented, leaving the vowel sounds to be inferred by the reader. This contrasts with alphabets, which provide graphemes for both consonants and vowels. The term was introd ...
, with only
consonant In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract, except for the h sound, which is pronounced without any stricture in the vocal tract. Examples are and pronou ...
s required to be written (though the long vowels – ''ā ī ū'' – are also written, with letters used for consonants); due to its optional use of diacritics to notate vowels, it is considered an impure abjad.


Letters

The basic Arabic alphabet contains 28
letters Letter, letters, or literature may refer to: Characters typeface * Letter (alphabet), a character representing one or more of the sounds used in speech or none in the case of a silent letter; any of the symbols of an alphabet * Letterform, the g ...
. Forms using the Arabic script to write other languages added and removed letters: for example ⟨پ⟩ is often used to represent in adaptations of the Arabic script. Unlike
Greek Greek may refer to: Anything of, from, or related to Greece, a country in Southern Europe: *Greeks, an ethnic group *Greek language, a branch of the Indo-European language family **Proto-Greek language, the assumed last common ancestor of all kno ...
-derived alphabets, Arabic has no distinct upper and lower case letterforms. Many letters look similar but are distinguished from one another by dots () above or below their central part (). These dots are an integral part of a letter, since they distinguish between letters that represent different sounds. For example, the Arabic letters , , and have the same basic shape, but with one dot added below, two dots added above, and three dots added above respectively. The letter also has the same form in initial and medial forms, with one dot added above, though it is somewhat different in its isolated and final forms. Historically, they were often omitted, in a writing style called
rasm ''Rasm'' ( ) is an Arabic writing script often used in the early centuries of Classical Arabic literature (7th centuryearly 11th century AD). It is the same as today's Arabic script except for the difference that the Arabic diacritics are omi ...
. Both printed and written Arabic are
cursive Cursive (also known as joined-up writing) is any style of penmanship in which characters are written joined in a flowing manner, generally for the purpose of making writing faster, in contrast to block letters. It varies in functionality and m ...
, with most letters within a word directly joined to adjacent letters.


Alphabetical order

There are two main collating sequences ('alphabetical orderings') for the Arabic alphabet: , and . The Hija'i order ( ) is the more common order and it is used when sorting lists of words and names, such as in phonebooks, classroom lists, and dictionaries. The original order ( ) derives from that used by the
Phoenician alphabet The Phoenician alphabet is an abjad (consonantal alphabet) used across the Mediterranean civilization of Phoenicia for most of the 1st millennium BC. It was one of the first alphabets, attested in Canaanite and Aramaic inscriptions fo ...
and therefore resembles the sequence of letters in
Hebrew Hebrew (; ''ʿÎbrit'') is a Northwest Semitic languages, Northwest Semitic language within the Afroasiatic languages, Afroasiatic language family. A regional dialect of the Canaanite languages, it was natively spoken by the Israelites and ...
and
Greek Greek may refer to: Anything of, from, or related to Greece, a country in Southern Europe: *Greeks, an ethnic group *Greek language, a branch of the Indo-European language family **Proto-Greek language, the assumed last common ancestor of all kno ...
. Letters are also assigned numerical values (
abjad numerals The Abjad numerals, also called Hisab al-Jummal (, ), are a decimal alphabetic numeral system/alphanumeric code, in which the 28 letters of the Arabic alphabet are assigned numerical values. They have been used in the Arab world, Arabic-speaking ...
) for purposes of
numerology Numerology (known before the 20th century as arithmancy) is the belief in an occult, divine or mystical relationship between a number and one or more coinciding events. It is also the study of the numerical value, via an alphanumeric system, ...
, as is done in Hebrew
gematria In numerology, gematria (; or , plural or ) is the practice of assigning a numerical value to a name, word, or phrase by reading it as a number, or sometimes by using an alphanumeric cipher. The letters of the alphabets involved have standar ...
and Greek
isopsephy In numerology, isopsephy (stressed on the ''I'' and the ''E''; , ) or isopsephism is the practice of adding up the Greek numerals, number values of the letters in a word to form a single number. The total number is then used as a metaphorical brid ...
. Letters in the Hija'i order are not considered to have numerical values.


Hijaʼi

Modern dictionaries and reference books use the alphabetical order instead of the Abjadi alphabetical order, in which letters are arranged mainly by similarity of shape. The ''hijaʼi'' order is never used for numerals. A different ''hijaʼi'' order was used in the
Maghreb The Maghreb (; ), also known as the Arab Maghreb () and Northwest Africa, is the western part of the Arab world. The region comprises western and central North Africa, including Algeria, Libya, Mauritania, Morocco, and Tunisia. The Maghreb al ...
but is now considered obsolete. The sequence is: The ''al-iklīl'' order, now obsolete, also arranged letters mainly by shape. It was first used in the 10th-century work ''Kitāb al-Iklīl''. The sequence is:


Abjadi

The abjadi order is the usual Arabic order in dictionaries and reference books of the late 1st millennium to the early 2nd millennium. However, this Arabic adjadi order is not a simple correspondence with the earlier north Semitic alphabetic order, as the latter has a position corresponding to the Aramaic letter ''
samek Samekh or samech is the fifteenth letter of the Semitic abjads, including Phoenician ''sāmek'' 𐤎, Hebrew ''sāmeḵ'' , Aramaic ''samek'' 𐡎, and Syriac ''semkaṯ'' ܣ. Arabic sīn س represents samekh, but there is no surviving descenda ...
'' , which has no cognate letter in the Arabic alphabet historically because Proto-Semitic fricatives ''*š'' (represented by ''šin'' in Aramaic) and ''*s'' (represented by ''samek'' in Aramaic) had merged into Arabic ''s''. The loss of was compensated for: * In the
Mashriq The Mashriq (; ), also known as the Arab Mashriq (), sometimes spelled Mashreq or Mashrek, is a term used by Arabs to refer to the eastern part of the Arab world, as opposed to the Maghreb (western) region, and located in West Asia and easter ...
i abjad sequence, by splitting the letter '' šīn'' into two independent Arabic letters: and , with the latter taking the place of ; * And in the
Maghreb The Maghreb (; ), also known as the Arab Maghreb () and Northwest Africa, is the western part of the Arab world. The region comprises western and central North Africa, including Algeria, Libya, Mauritania, Morocco, and Tunisia. The Maghreb al ...
i abjad sequence, by splitting the letter ''ṣāḏē'' into two independent Arabic letters: and , with the latter taking the place of . The six other letters that do not correspond to any north Semitic letter are placed at the end. This is commonly vocalized as follows: : . Another vocalization is: : This can be vocalized as: : Notes:


Letter forms

The Arabic alphabet is always cursive and letters vary in shape depending on their position within a word. Letters can exhibit up to four distinct forms corresponding to an initial, medial (middle), final, or isolated position ( IMFI). While some letters show considerable variations, others remain almost identical across all four positions. Generally, letters in the same word are linked together on both sides by short horizontal lines, but six letters () can only be linked to their preceding letter. In addition, some letter combinations are written as ligatures (special shapes), notably , which is the only mandatory ligature (the unligated combination is considered difficult to read).


Table of basic letters

, , 26 , - , data-sort-value="06." , 6 , data-sort-value="06." , 6 , , , , , , , colspan="2" , , colspan="2" , , 27 , - , 10 , 10 , , , , , , , , , , , 28 , - ! colspan="11" , , - , - , - , ʾ / , , , , colspan="4" , (used in medial and final positions as an unlinked letter) , - Notes * See the article ''
Romanization of Arabic The romanization of Arabic is the systematic rendering of Modern Standard Arabic, written and varieties of Arabic, spoken Arabic language, Arabic in the Latin script. Romanized Arabic is used for various purposes, among them transcription of na ...
'' for details on various transliteration schemes. Arabic language speakers may usually not follow a standardized scheme when transcribing words or names. Some Arabic letters which do not have an equivalent in English (such as ط) are often spelled as numbers when Romanized. Also names are regularly transcribed as pronounced locally, not as pronounced in Literary Arabic (if they were of Arabic origin). * Regarding pronunciation, the phonemic values given are those of Modern Standard Arabic, which is taught in schools and universities. In practice, pronunciation may vary considerably from region to region. For more details concerning the pronunciation of Arabic, consult the articles ''
Arabic phonology While many languages have numerous dialects that differ in phonology, contemporary spoken Arabic is more properly described as a varieties of Arabic, continuum of varieties. This article deals primarily with Modern Standard Arabic (MSA), which ...
'' and ''
varieties of Arabic Varieties of Arabic (or dialects or vernaculars) are the linguistic systems that Arabic speakers speak natively. Arabic is a Semitic languages, Semitic language within the Afroasiatic languages, Afroasiatic family that originated in the Arabian P ...
''. * The names of the Arabic letters can be thought of as abstractions of an older version where they were meaningful words in the
Proto-Semitic Proto-Semitic is the reconstructed common ancestor of the Semitic languages. There is no consensus regarding the location of the linguistic homeland for Proto-Semitic: scholars hypothesize that it may have originated in the Levant, the Sahara, ...
language. * Six letters () do not have a distinct medial form and have to be written with their final form without being connected to the next letter. Their initial form matches the isolated form. The following letter is written in its initial form, or isolated form if it is the final letter in the word. * The letter originated in the Phoenician alphabet as a consonant-sign indicating a glottal stop. Today it has lost its function as a consonant, and, together with and , is a ''
mater lectionis A ''mater lectionis'' ( , ; , ''matres lectionis'' ; original ) is any consonant letter that is used to indicate a vowel, primarily in the writing of Semitic languages such as Arabic, Hebrew and Syriac. The letters that do this in Hebrew are ...
'', a consonant sign standing in for a long vowel (see below), or as support for certain diacritics ( and '). * Arabic currently uses a
punctuation mark Punctuation marks are marks indicating how a piece of written text should be read (silently or aloud) and, consequently, understood. The oldest known examples of punctuation marks were found in the Mesha Stele from the 9th century BC, consisti ...
called the () to denote the
glottal stop The glottal stop or glottal plosive is a type of consonantal sound used in many Speech communication, spoken languages, produced by obstructing airflow in the vocal tract or, more precisely, the glottis. The symbol in the International Phonetic ...
, written alone or with a carrier: ** alone: ** with a carrier: (above or under an ), (above a ), (above a dotless or ). :In academic work, the hamza is transliterated with the
modifier letter right half ring The modifier letter right half ring () is a character found in Unicode in the Spacing Modifier Letters range (although it is not a modifier, but a standalone grapheme). It is used in romanization to transliterate the Semitic abjad letter aleph a ...
(ʾ) or () on
Wiktionary Wiktionary (, ; , ; rhyming with "dictionary") is a multilingual, web-based project to create a free content dictionary of terms (including words, phrases, proverbs, linguistic reconstructions, etc.) in all natural languages and in a number o ...
, while the
modifier letter left half ring The modifier letter left half ring ⟨⟩ is a character found in Unicode in the Spacing Modifier Letters range (although it is not a modifier, but a standalone grapheme). It is used in romanization to transliterate the Semitic abjads, Semitic abj ...
(ʿ) or () on Wiktionary, transliterates the letter '' '' (), which represents a different sound, not found in English. :The hamza has a single form, since it is never linked to a preceding or following letter. However, it is sometimes combined with a , , or , and in that case the carrier behaves like an ordinary , , or , check the table below:


Hamza forms

The Hamza (glottal stop) can be written either alone, as if it were a letter, or with a carrier, when it becomes a
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
. () indicates a long + sound as in "sorry", while the other Hamzas indicate the glottal stop in different positions of the word as in and , the writing of the Hamza is based on a set of rules, For the writing rule of each form, see .


Modified letters

The following are not individual letters, but rather different contextual variants of some of the Arabic letters.


Long vowels

In the fully vocalized Arabic text found in texts such as the Quran, a long following a consonant other than a '' '' is written with a short sign () on the consonant plus an after it; long is written as a sign for short () plus a ; and long as a sign for short () plus a . Briefly, = ; = ; and = . Long following a may be represented by an or by a free followed by an (two consecutive s are never allowed in Arabic). The table below shows vowels placed above or below a dotted circle replacing a primary consonant letter or a '' '' sign. For clarity in the table, the primary letters on the left used to mark these long vowels are shown only in their isolated form. Most consonants do connect to the left with , and written then with their medial or final form. Additionally, the letter in the last row may connect to the letter on its left, and then will use a medial or initial form. Use the table of primary letters to look at their actual glyph and joining types. In unvocalized text (one in which the short vowels are not marked), the long vowels are represented by the vowel in question: , , or . Long vowels written in the middle of a word of unvocalized text are treated like consonants with a (see below) in a text that has full diacritics. Here also, the table shows long vowel letters only in isolated form for clarity. Combinations and are always pronounced and respectively. The exception is the suffix in verb endings where is silent, resulting in or . In addition, when transliterating names and loanwords, Arabic language speakers write out most or all the vowels as long ( with , and with , and and with ), meaning it approaches a true alphabet.


Diphthongs

The
diphthongs A diphthong ( ), also known as a gliding vowel or a vowel glide, is a combination of two adjacent vowel sounds within the same syllable. Technically, a diphthong is a vowel with two different targets: that is, the tongue (and/or other parts of ...
and are represented in vocalized text as follows: A final ''is usually'' written at the end of words for '' nisba'' ( ') which is a common suffix to form adjectives of relation or pertinence. The suffix is ' for masculine ( ' for feminine); for example ' "socialist", it is also used for a singulative ending that applies to human or other
sentient Sentience is the ability to experience feelings and sensations. It may not necessarily imply higher cognitive functions such as awareness, reasoning, or complex thought processes. Some writers define sentience exclusively as the capacity for ''v ...
beings as in ''jundiyy'' "a soldier". However nowadays this final is mostly pronounced with a long () ' as in ' instead of ' . A similar mistake happens at the end of some third person plural verbs as in ' "they ran" which is pronounced nowadays as ' .


Ligatures

The use of ligature in Arabic is common. There is one compulsory ligature, that for ل + ا, which exists in two forms. All other ligatures, of which there are many, are optional. A more complex ligature that combines as many as seven distinct components is commonly used to represent the word . The only ligature within the primary range of
Arabic script in Unicode Many scripts in Unicode, such as Arabic, have special orthographic rules that require certain combinations of letterforms to be combined into special ligature (writing), ligature forms. In English, the common ampersand (&) developed from a ligature ...
(U+06xx) is + . This is the only one compulsory for fonts and word-processing. Other ranges are for compatibility to older standards and contain other ligatures, which are optional. Note:
Unicode Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
also has in its Presentation Form B FExx range a code for this ligature. If your browser and font are configured correctly for Arabic, the ligature displayed above should be identical to this one, U+FEFB


Diacritics

Users of Arabic usually write
long vowels In linguistics, vowel length is the perceived or actual duration of a vowel sound when pronounced. Vowels perceived as shorter are often called short vowels and those perceived as longer called long vowels. On one hand, many languages do not d ...
but omit short ones, so readers must utilize their knowledge of the language in order to supply the missing vowels. However, in the education system and particularly in classes on Arabic grammar these vowels are used since they are crucial to the grammar. An Arabic sentence can have a completely different meaning by a subtle change of the vowels. This is why in an important text such as the the three basic vowel signs are mandated, like the Arabic diacritics and other types of marks, like the cantillation signs.


Short vowels

In the Arabic handwriting of everyday use, in general publications, and on street signs, short vowels are typically not written. On the other hand, copies of the cannot be endorsed by the religious institutes that review them unless the diacritics are included. Children's books, elementary school texts, and Arabic-language grammars in general will include diacritics to some degree. These are known as " vocalized" texts. Short vowels may be written with diacritics placed above or below the consonant that precedes them in the syllable, called . All Arabic vowels, long and short, follow a consonant; in Arabic, words like "Ali" or "alif", for example, start with a consonant: , .


Nunation

Nunation Nunation (, '), in some Semitic languages such as Arabic, is the addition of one of three vowel diacritics (''ḥarakāt'') to a noun or adjective. This is used to indicate the word ends in an alveolar nasal without the addition of the letter ''n ...
( ) is the addition of a final   to a
noun In grammar, a noun is a word that represents a concrete or abstract thing, like living creatures, places, actions, qualities, states of existence, and ideas. A noun may serve as an Object (grammar), object or Subject (grammar), subject within a p ...
or
adjective An adjective (abbreviations, abbreviated ) is a word that describes or defines a noun or noun phrase. Its semantic role is to change information given by the noun. Traditionally, adjectives are considered one of the main part of speech, parts of ...
. The vowel before it indicates
grammatical case A grammatical case is a category of nouns and noun modifiers (determiners, adjectives, participles, and Numeral (linguistics), numerals) that corresponds to one or more potential grammatical functions for a Nominal group (functional grammar), n ...
. In written Arabic, nunation is indicated by doubling the vowel diacritic at the end of the word, e.g. .


Gemination

Gemination In phonetics and phonology, gemination (; from Latin 'doubling', itself from '' gemini'' 'twins'), or consonant lengthening, is an articulation of a consonant for a longer period of time than that of a singleton consonant. It is distinct from ...
is the doubling of a consonant. Instead of writing the letter twice, Arabic places a ''W''-shaped sign called above it.


Vowel omission

An Arabic
syllable A syllable is a basic unit of organization within a sequence of speech sounds, such as within a word, typically defined by linguists as a ''nucleus'' (most often a vowel) with optional sounds before or after that nucleus (''margins'', which are ...
can be open (ending with a vowel) or closed (ending with a consonant): * open: CV onsonant-vowel(long or short vowel) * closed: CVC (short vowel only) A normal text is composed only of a series of consonants plus vowel-lengthening letters; thus, the word ''qalb'', "heart", is written ''qlb'', and the word ''qalaba'' "he turned around", is also written ''qlb''. To write ''qalaba'' without this ambiguity, we could indicate that the ''l'' is followed by a short ''a'' by writing a ''fatḥah'' above it. To write ''qalb'', we would instead indicate that the ''l'' is followed by no vowel by marking it with a
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
called ''sukūn'' (), like this: . This is one step down from full vocalization, where the vowel after the ''q'' would also be indicated by a ''fatḥah'': . The ''
Qurʾān The Quran, also romanized Qur'an or Koran, is the central religious text of Islam, believed by Muslims to be a revelation directly from God (''Allāh''). It is organized in 114 chapters (, ) which consist of individual verses ('). Besides i ...
'' is traditionally written in full vocalization. The long ''i'' sound in some editions of the ''Qur’ān'' is written with a ''kasrah'' followed by a diacritic-less ''y'', and long ''u'' by a ''ḍammah'' followed by a bare ''w''. In others, these ''y'' and ''w'' carry a ''sukūn''. Outside of the ''Qur’ān'', the latter convention is extremely rare, to the point that ''y'' with ''sukūn'' will be unambiguously read as the
diphthong A diphthong ( ), also known as a gliding vowel or a vowel glide, is a combination of two adjacent vowel sounds within the same syllable. Technically, a diphthong is a vowel with two different targets: that is, the tongue (and/or other parts of ...
, and ''w'' with ''sukūn'' will be read . For example, the letters can be read like English ''meel'' or ''mail'', or (theoretically) also like ''mayyal'' or ''mayil''. But if a ''sukūn'' is added on the ''y'' then the ''m'' cannot have a ''sukūn'' (because two letters in a row cannot be ''sukūn''ated), cannot have a ''ḍammah'' (because there is never an ''uy'' sound in Arabic unless there is another vowel after the ''y''), and cannot have a ''kasrah'' (because ''kasrah'' before ''sukūn''ated ''y'' is never found outside the ''Qur’ān''), so it ''must'' have a ''fatḥah'' and the only possible pronunciation is (meaning mile, or even e-mail). By the same token, m-y-t with a ''sukūn'' over the ''y'' can be ''mayt'' but not ''mayyit'' or ''meet'', and m-w-t with a ''sukūn'' on the ''w'' can only be ''mawt'', not ''moot'' (''iw'' is impossible when the ''w'' closes the syllable). Vowel marks are always written as if the '' i‘rāb'' vowels were in fact pronounced, even when they must be skipped in actual pronunciation. So, when writing the name ''Aḥmad'', it is optional to place a ''sukūn'' on the ''ḥ'', but a ''sukūn'' is forbidden on the ''d'', because it would carry a ''ḍammah'' if any other word followed, as in ''Aḥmadu zawjī'' "Ahmad is my husband". Another example: the sentence that in correct literary Arabic must be pronounced ''Aḥmadu zawjun shirrīr'' "Ahmad is a wicked husband", is usually pronounced (due to influence from vernacular Arabic varieties) as ''Aḥmad zawj shirrīr''. Yet, for the purposes of Arabic grammar and orthography, is treated as if it were not mispronounced and as if yet another word followed it, i.e., if adding any vowel marks, they must be added as if the pronunciation were ''Aḥmadu zawjun sharrīrun'' with a ''tanwīn'' 'un' at the end. So, it is correct to add an ''un'' ''tanwīn'' sign on the final ''r'', but actually pronouncing it would be a hypercorrection. Also, it is never correct to write a ''sukūn'' on that ''r'', even though in actual pronunciation it is (and in correct Arabic MUST be) ''sukūn''ed. Of course, if the correct ''i‘rāb'' is a ''sukūn'', it may be optionally written. The ''sukūn'' is also used for transliterating words into the Arabic script. The English name "Mark" is written , for example, might be written with a ''sukūn'' above the to signify that there is no vowel sound between that letter and the .


Additional diacritics

These diacritics are uncommon in modern publications but are often used in Quran and some manuscripts. ٰThe alif khanjariyyah (, 'dagger ’alif') is written as short vertical stroke on top of a letter. It indicates a long sound for which '' '' is normally not written. For example: (') or ('). The
Wasla The ''waṣla'' () or (, 'hamza of connection') is a variant of the letter ''hamza The hamza ( ') () is an Arabic script character that, in the Arabic alphabet, denotes a glottal stop and, in non-Arabic languages, indicates a diphthong, vow ...
or (, '
hamza The hamza ( ') () is an Arabic script character that, in the Arabic alphabet, denotes a glottal stop and, in non-Arabic languages, indicates a diphthong, vowel, or other features, depending on the language. Derived from the letter '' ʿayn'' ( ...
of connection') is a variant of the letter ''
hamza The hamza ( ') () is an Arabic script character that, in the Arabic alphabet, denotes a glottal stop and, in non-Arabic languages, indicates a diphthong, vowel, or other features, depending on the language. Derived from the letter '' ʿayn'' ( ...
'' () resembling part of the letter () that is rarely placed over the letter ( ''ʾalif al-waṣl'' ()) to form () at the beginning of the word (). It indicates that the is not pronounced as a glottal stop (written as the ''hamza''), but that the word is connected to the previous word (like
liaison Liaison or Liaisons may refer to: General usage * Affair, an unfaithful sexual relationship * Collaboration * Co-operation * Liaison, an egg-based thickening used in cooking Arts and entertainment * Liaisons (''Desperate Housewives''), a 2007 ...
in French). Outside of vocalised liturgical texts, the is usually not written. e.g. Abdullah can be written with hamzat al-wasl on the first letter of the word but it is mostly written without it .


Additional letters


Regional variations

Some letters take a traditionally different form in specific regions:


Non-standard letters

Some modified letters are used to represent non-native sounds to Modern Standard Arabic. These letters are used as an optional alternative in transliterated names, loanwords and dialectal words. The usage of these letters depends on the writer and their country of origin and their usage is not mandatory. The phoneme (considered a standard pronunciation of in Egypt, Oman, and coastal Yemen) has the highest number of variations when writing loanwords or foreign proper nouns in Literary Arabic, and it can be written with either the standard letters , , , and or with the non-standard letters (used only in Tunisia and Algeria), (used only in Morocco), and (used mainly in Iraq) for example "
Golf Golf is a club-and-ball sport in which players use various Golf club, clubs to hit a Golf ball, ball into a series of holes on a golf course, course in as few strokes as possible. Golf, unlike most ball games, cannot and does not use a standa ...
" pronounced can be written , , , , , or depending on the writer and their country of origin. On the other hand, is considered a native phoneme in most Arabic dialects, either as a reflex of as in lower Egypt, parts of Oman and parts of Yemen (e.g. ) or as a reflex of as in most of the Arabian peninsula, Iraq, Sudan, and parts of Egypt, Levant and North Africa (e.g. ). Note: The sounds and are non-native to most Arabic dialects (excl. Anatolian Arabic where "Wolf" is pronounced vīp instead of Standard Arabic ), while , and appear as a native phoneme or allophone in many dialects.


Used in languages other than Arabic


Numerals

There are two main kinds of numerals used along with Arabic text;
Western Arabic numerals The ten Arabic numerals (0, 1, 2, 3, 4, 5, 6, 7, 8, and 9) are the most commonly used symbols for writing numbers. The term often also implies a positional notation number with a decimal base, in particular when contrasted with Roman numerals. ...
and
Eastern Arabic numerals The Eastern Arabic numerals, also called Indo-Arabic numerals or Arabic-Indic numerals as known by Unicode, are the symbols used to represent numerical digits in conjunction with the Arabic alphabet in the countries of the Mashriq (the east o ...
. In most of present-day North Africa, the usual Western Arabic numerals are used. Like Western Arabic numerals, in Eastern Arabic numerals, the units are always right-most, and the highest value left-most. Eastern Arabic numbers are written from left to right.


Letters as numerals

In addition, the Arabic alphabet can be used to represent numbers (
Abjad numerals The Abjad numerals, also called Hisab al-Jummal (, ), are a decimal alphabetic numeral system/alphanumeric code, in which the 28 letters of the Arabic alphabet are assigned numerical values. They have been used in the Arab world, Arabic-speaking ...
). This usage is based on the order of the alphabet. is 1, is 2, is 3, and so on until = 10, = 20, = 30, ..., = 200, ..., = 1000. This is sometimes used to produce
chronogram A chronogram is a sentence or inscription in which specific letters, interpreted as numerals (such as Roman numerals), stand for a particular date when rearranged. The word, meaning "time writing", derives from the Greek words ''chronos'' ( ...
s.


History

The Arabic alphabet can be traced back to the
Nabataean script The Nabataean script is an abjad (consonantal alphabet) that was used to write Nabataean Aramaic and Nabataean Arabic from the second century BC onwards.Nabataean Aramaic Nabataean Aramaic is the extinct Aramaic variety used in inscriptions by the Nabataeans of the Transjordan_(region), East Bank of the Jordan River, the Negev, and the Sinai Peninsula. Compared with other varieties of Aramaic, it is notable for ...
. A transitional phase, between the Nabataean Aramaic script and a subsequent, recognizably Arabic script, is known as
Nabataean Arabic Nabataean Arabic was the dialect of Arabic spoken by the Nabataeans in antiquity. In the first century AD, the Nabataeans wrote their inscriptions, such as the legal texts carved on the façades of the monumental tombs at Mada'in Salih, ancient ...
. The pre-Islamic phase of the script as it existed in the fifth and sixth centuries, once it had become recognizably similar to the script as it came to be known in the Islamic era, is known as
Paleo-Arabic Paleo-Arabic (or Palaeo-Arabic, previously called pre-Islamic Arabic or Old Arabic) is a pre-Islamic script used to write Arabic. It began to be used in the fifth century, when it succeeded the earlier Nabataeo-Arabic script, and it was used unti ...
. The first known text in the Arabic alphabet is a late fourth-century inscription from 50 km east of in
Jordan Jordan, officially the Hashemite Kingdom of Jordan, is a country in the Southern Levant region of West Asia. Jordan is bordered by Syria to the north, Iraq to the east, Saudi Arabia to the south, and Israel and the occupied Palestinian ter ...
, but the Zabad trilingual inscription is the earliest dated Arabic text from 512, and was discovered in
Syria Syria, officially the Syrian Arab Republic, is a country in West Asia located in the Eastern Mediterranean and the Levant. It borders the Mediterranean Sea to the west, Turkey to Syria–Turkey border, the north, Iraq to Iraq–Syria border, t ...
. Nevertheless, the
epigraphic Epigraphy () is the study of inscriptions, or epigraphs, as writing; it is the science of identifying graphemes, clarifying their meanings, classifying their uses according to dates and cultural contexts, and drawing conclusions about the wr ...
record is extremely sparse. Later, dots were added above and below the letters to differentiate them. (The Aramaic language had fewer phonemes than the Arabic, and some originally distinct Aramaic letters had become indistinguishable in shape, so that in the early writings 14 distinct letter-shapes had to do duty for 28 sounds; cf. the similarly ambiguous
Book Pahlavi Book Pahlavi is the cursive variant of the Pahlavi scripts, Pahlavi script, which was derived from the Aramaic alphabet, Aramaic script during the Sasanian Empire, Sassanid period to write the Middle Persian, Middle Persian language. Book Pahlav ...
.) The first surviving document that definitely uses these dots is also the first surviving Arabic
papyrus Papyrus ( ) is a material similar to thick paper that was used in ancient times as a writing surface. It was made from the pith of the papyrus plant, ''Cyperus papyrus'', a wetland sedge. ''Papyrus'' (plural: ''papyri'' or ''papyruses'') can a ...
( PERF 558), dated April 643, although they did not become obligatory until much later. Important texts were and still are frequently memorized, especially in Qurʾan memorization. Later still, vowel marks and the hamza were introduced, beginning some time in the latter half of the 7th century, preceding the first invention of Syriac and
Tiberian vocalization The Tiberian vocalization, Tiberian pointing, or Tiberian niqqud () is a system of diacritics (''niqqud'') devised by the Masoretes of Tiberias to add to the consonantal text of the Hebrew Bible to produce the Masoretic Text. The system soon beca ...
s. Initially, this was done by a system of red dots, said to have been commissioned in the
Umayyad The Umayyad Caliphate or Umayyad Empire (, ; ) was the second caliphate established after the death of the Islamic prophet Muhammad and was ruled by the Umayyad dynasty. Uthman ibn Affan, the third of the Rashidun caliphs, was also a membe ...
era by
Abu al-Aswad al-Du'ali Abu al-Aswad ad-Duʾali (, '; -16 BH/603 – 69 AH/688/89), whose full name was ʾAbū al-Aswad Ẓālim ibn ʿAmr ibn Sufyān ibn Jandal ibn Yamār ibn Hīls ibn Nufātha ibn al-ʿĀdi ibn ad-Dīl ibn Bakr, surnamed ad-Dīlī, or ad-Duwalī, was ...
, a dot above = , a dot below = , a dot on the line = , and doubled dots indicated
nunation Nunation (, '), in some Semitic languages such as Arabic, is the addition of one of three vowel diacritics (''ḥarakāt'') to a noun or adjective. This is used to indicate the word ends in an alveolar nasal without the addition of the letter ''n ...
. However, this was cumbersome and easily confusable with the letter-distinguishing dots, so about 100 years later, the modern system was adopted. The system was finalized around 786 by
al-Khalil ibn Ahmad al-Farahidi Abu ‘Abd ar-Raḥmān al-Khalīl ibn Aḥmad ibn ‘Amr ibn Tammām al-Farāhīdī al-Azdī al-Yaḥmadī (; 718 – 786 CE), known as al-Farāhīdī, or al-Khalīl, was an Arab philologist, lexicographer and leading grammarian of Basra in ...
.


Other tributes and alphabets written in Arabic dialects

Arabic dialects were written in different alphabets before the spread of the Arabic alphabet currently in use. The most important of these alphabets and inscriptions are the
Safaitic Safaitic ( ''Al-Ṣafāʾiyyah'') is a variety of the South Semitic scripts used by the Arabs in southern Syria and northern Jordan in the Harrat al-Sham, Ḥarrah region, to carve rock inscriptions in various dialects of Old Arabic and Ancient N ...
inscriptions, amounting to 30,000 inscriptions discovered in the Levant desert. There are about 3,700 inscriptions in
Hismaic Hismaic () is a variety of the Ancient North Arabian script and the language most commonly expressed in it. The Hismaic script may have been used to write Safaitic dialects of Old Arabic, but the language of most inscriptions differs from Safaiti ...
in central Jordan and northwest of the Arabian Peninsula, and Nabataean inscriptions, the most important of which are the Umm al-Jimal I inscription and the Numara inscription.


Arabic printing

Medieval Arabic blockprinting flourished from the 10th century until the 14th. It was devoted to tiny texts, which were usually used in
amulet An amulet, also known as a good luck charm or phylactery, is an object believed to confer protection upon its possessor. The word "amulet" comes from the Latin word , which Pliny's ''Natural History'' describes as "an object that protects a perso ...
s. In 1514, following
Johannes Gutenberg Johannes Gensfleisch zur Laden zum Gutenberg ( – 3 February 1468) was a German inventor and Artisan, craftsman who invented the movable type, movable-type printing press. Though movable type was already in use in East Asia, Gutenberg's inven ...
's invention of the printing press in 1450, Gregorio de Gregorii, a Venetian, published an entire
book of hours A book is a structured presentation of recorded information, primarily verbal and graphical, through a medium. Originally physical, electronic books and audiobooks are now existent. Physical books are objects that contain printed material, ...
in Arabic script; it was entitled '' Kitab Salat al-Sawa'i'' and was intended for eastern Christian communities. Between 1580 and 1586, type designer
Robert Granjon Robert Granjon (Paris, c. 1513 - Rome, 1590) was a French punchcutter, a designer and creator of metal type, and printer. He worked in Paris, Lyon, Antwerp, and Rome. He is best known for having introduced the typeface style Civilité, for his ...
designed Arabic typefaces for Cardinal Ferdinando de' Medici, and the Medici Oriental Press published many Christian prayer and scholarly Arabic texts in the late 16th century.
Maronite Maronites (; ) are a Syriac Christianity, Syriac Christian ethnoreligious group native to the Eastern Mediterranean and the Levant (particularly Lebanon) whose members belong to the Maronite Church. The largest concentration has traditionally re ...
monks at
Monastery of Qozhaya Monastery of Saint Anthony of Qozhaya, or Monastery of Qozhaya (also Qazahya; , ) is a large and important Maronite monastery. It is located in the Zgharta District in the North Governorate of Lebanon. The Monastery of Qozhaya holds significant ...
on
Mount Lebanon Mount Lebanon (, ; , ; ) is a mountain range in Lebanon. It is about long and averages above in elevation, with its peak at . The range provides a typical alpine climate year-round. Mount Lebanon is well-known for its snow-covered mountains, ...
published the first Arabic books to use movable type in the Middle East. The monks employed
Garshuni Garshuni or Karshuni (Syriac alphabet: , Arabic alphabet: ) are Arabic writings using the Syriac alphabet. The word "Garshuni", derived from the word "grasha" which literally translates as "pulling", was used by George Kiraz to coin the term " gar ...
, the practice of writing Arabic using the
Syriac script The Syriac alphabet ( ) is a writing system primarily used to write the Syriac language since the 1st century. It is one of the Semitic languages, Semitic abjads descending from the Aramaic alphabet through the Palmyrene alphabet, and shares sim ...
, usually by Christians. Although
Napoleon Napoleon Bonaparte (born Napoleone di Buonaparte; 15 August 1769 – 5 May 1821), later known by his regnal name Napoleon I, was a French general and statesman who rose to prominence during the French Revolution and led Military career ...
generally receives credit for introducing the
printing press A printing press is a mechanical device for applying pressure to an inked surface resting upon a printing, print medium (such as paper or cloth), thereby transferring the ink. It marked a dramatic improvement on earlier printing methods in whi ...
to Egypt during his invasion of the country in 1798, and though he did indeed bring printing presses and Arabic presses to print the French occupation's official newspaper ''Al-Tanbiyyah'' "The Courier," printing in the Arabic language had started several centuries earlier. A goldsmith (like Gutenberg) designed and implemented an Arabic script
movable type Movable type (US English; moveable type in British English) is the system and technology of printing and typography that uses movable Sort (typesetting), components to reproduce the elements of a document (usually individual alphanumeric charac ...
printing press A printing press is a mechanical device for applying pressure to an inked surface resting upon a printing, print medium (such as paper or cloth), thereby transferring the ink. It marked a dramatic improvement on earlier printing methods in whi ...
in the Middle East. The Lebanese Melkite monk
Abdallah Zakher Abdallah Zakher (; ) (1684–1748) was a Syrian-born (in Aleppo) typographer and Catholic deacon who set up the first printing press in the Middle East. His printing press used Arabic movable type and was installed in 1733 in the motherhouse of th ...
set up an Arabic printing press using movable type at the monastery of Saint John at the town of Dhour El Shuwayr in Mount Lebanon, the first homemade press in Lebanon using Arabic script. He cut the type molds and founded the typeface. The first book came off his press in 1734; this press continued in use until 1899.


Computers

The Arabic alphabet can be encoded using several
character set Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using computers. The numerical values that make up a c ...
s, including
ISO-8859-6 ISO/IEC 8859-6:1999, ''Information technology — 8-bit single-byte coded graphic character sets — Part 6: Latin/Arabic alphabet'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1987. ...
, Windows-1256 and
Unicode Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
, the latter of which contains the "Arabic segment", entries U+0600 to U+06FF. However, none of the sets indicates the form that each character should take in context. It is left to the
rendering engine Rendering is the process of generating a photorealistic or non-photorealistic image from input data such as 3D models. The word "rendering" (in one of its senses) originally meant the task performed by an artist when depicting a real or imag ...
to select the proper
glyph A glyph ( ) is any kind of purposeful mark. In typography, a glyph is "the specific shape, design, or representation of a character". It is a particular graphical representation, in a particular typeface, of an element of written language. A ...
to display for each character. Each letter has a position-independent encoding in Unicode, and the rendering software can infer the correct glyph form (initial, medial, final or isolated) from its joining context. That is the current recommendation. However, for compatibility with previous standards, the initial, medial, final and isolated forms can also be encoded separately.


Unicode

As of Unicode , the Arabic script is contained in the following blocks: *
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
(0600–06FF, 256 characters) * Arabic Supplement (0750–077F, 48 characters) * Arabic Extended-A (08A0–08FF, 96 characters) * Arabic Extended-B (0870–089F, 42 characters) * Arabic Extended-C (10EC0–10EFF, 7 characters) *
Arabic Presentation Forms-A Arabic Presentation Forms-A is a Unicode block encoding contextual forms and ligatures of letter variants needed for Persian, Urdu, Sindhi and Central Asian languages. This block also allocates 32 noncharacters in Unicode, designed specifically ...
(FB50–FDFF, 631 characters) *
Arabic Presentation Forms-B Arabic Presentation Forms-B is a Unicode block encoding spacing forms of Arabic diacritics, and contextual letter forms. The special codepoint ZWNBSP (''zero width no-break space'') is also here, which is only meant for a byte order mark (that ma ...
(FE70–FEFF, 141 characters) * Rumi Numeral Symbols (10E60–10E7F, 31 characters) * Indic Siyaq Numbers (1EC70–1ECBF, 68 characters) * Ottoman Siyaq Numbers (1ED00–1ED4F, 61 characters) *
Arabic Mathematical Alphabetic Symbols Arabic Mathematical Alphabetic Symbols is a Unicode block A Unicode block is one of several contiguous ranges of numeric character codes (code points) of the Unicode character set that are defined by the Unicode Consortium for administrative ...
(1EE00—1EEFF, 143 characters) The basic Arabic range encodes the standard letters and diacritics but does not encode contextual forms (U+0621-U+0652 being directly based on
ISO 8859-6 ISO/IEC 8859-6:1999, ''Information technology — 8-bit single-byte coded graphic character sets — Part 6: Latin/Arabic alphabet'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1987. ...
). It also includes the most common diacritics and Arabic-Indic digits. U+06D6 to U+06ED encode Qur'anic annotation signs such as "end of '' ayah''" ۝ۖ and "start of '' rub el hizb''" ۞. The Arabic supplement range encodes letter variants mostly used for writing African (non-Arabic) languages. The Arabic Extended-A range encodes additional Qur'anic annotations and letter variants used for various non-Arabic languages. The Arabic Presentation Forms-A range encodes contextual forms and ligatures of letter variants needed for Persian,
Urdu Urdu (; , , ) is an Indo-Aryan languages, Indo-Aryan language spoken chiefly in South Asia. It is the Languages of Pakistan, national language and ''lingua franca'' of Pakistan. In India, it is an Eighth Schedule to the Constitution of Indi ...
, Sindhi and Central Asian languages. The Arabic Presentation Forms-B range encodes spacing forms of Arabic diacritics, and more contextual letter forms. The Arabic Mathematical Alphabetical Symbols block encodes characters used in Arabic mathematical expressions. See also the notes of the section on modified letters.


Keyboards

Keyboards designed for different nations have different layouts, so proficiency in one style of keyboard, such as Iraq's, does not transfer to proficiency in another, such as Saudi Arabia's. Differences can include the location of non-alphabetic characters. All Arabic keyboards allow typing Roman characters, e.g., for the URL in a
web browser A web browser, often shortened to browser, is an application for accessing websites. When a user requests a web page from a particular website, the browser retrieves its files from a web server and then displays the page on the user's scr ...
. Thus, each Arabic keyboard has both Arabic and Roman characters marked on the keys. Usually, the Roman characters of an Arabic keyboard conform to the
QWERTY QWERTY ( ) is a keyboard layout for Latin-script alphabets. The name comes from the order of the first six Computer keyboard keys#Types, keys on the top letter row of the keyboard: . The QWERTY design is based on a layout included in the Sh ...
layout, but in
North Africa North Africa (sometimes Northern Africa) is a region encompassing the northern portion of the African continent. There is no singularly accepted scope for the region. However, it is sometimes defined as stretching from the Atlantic shores of t ...
, where French is the most common language typed using the Roman characters, the Arabic keyboards are
AZERTY AZERTY ( ) is a specific layout for the characters of the Latin alphabet on typewriter keys and computer keyboards. The layout takes its name from the first six letters to appear on the first row of alphabetical keys; that is, ( ). Like oth ...
. To encode a particular written form of a character, there are extra code points provided in Unicode which can be used to express the exact written form desired. The range ''Arabic presentation forms A'' (U+FB50 to U+FDFF) contain ligatures while the range ''Arabic presentation forms B'' (U+FE70 to U+FEFF) contains the positional variants. These effects are better achieved in Unicode by using the ''
zero-width joiner The zero-width joiner (ZWJ, ; rendered: ; HTML entity: or ) is a non-printing character used in the computerized typesetting of writing systems in which the shape or positioning of a grapheme depends on its relation to other graphemes (complex ...
'' and ''
zero-width non-joiner The zero-width non-joiner (ZWNJ, ; rendered: ; HTML entity: or ) is a non-printing character used in the computerization of writing systems that make use of Typographic ligature, ligatures. For example, in writing systems that feature initial, ...
'', as these presentation forms are deprecated in Unicode and should generally only be used within the internals of text-rendering software; when using Unicode as an intermediate form for conversion between character encodings; or for backwards compatibility with implementations that rely on the hard-coding of glyph forms. Finally, the Unicode encoding of Arabic is in ''logical order'', that is, the characters are entered, and stored in computer memory, in the order that they are written and pronounced without worrying about the direction in which they will be displayed on paper or on the screen. Again, it is left to the rendering engine to present the characters in the correct direction, using Unicode's
bi-directional text A bidirectional text contains two text directionalities, right-to-left (RTL) and left-to-right (LTR). It generally involves text containing different types of alphabets, but may also refer to boustrophedon, which is changing text direction in ea ...
features. In this regard, if the Arabic words on this page are written left to right, it is an indication that the Unicode rendering engine used to display them is out of date. There are competing online tools, e.g. Yamli editor, which allow entry of Arabic letters without having Arabic support installed on a PC, and without knowledge of the layout of the Arabic keyboard.


Variations


See also

*
Ancient South Arabian script The Ancient South Arabian script (Old South Arabian: ; modern ) branched from the Proto-Sinaitic script in about the late 2nd millennium BCE, and remained in use through the late sixth century CE. It is an abjad, a writing system where only con ...
*
Algerian braille Algerian Braille was a braille alphabet used to write the Arabic language in Algeria. It is apparently obsolete.Arabic braille *
Arabic calligraphy Arabic calligraphy is the artistic practice of penmanship, handwriting and calligraphy based on the Arabic alphabet. It is known in Arabic language, Arabic as ''khatt'' (), derived from the words 'line', 'design', or 'construction'. Kufic is the ...
*
Arabic chat alphabet The Arabic chat alphabet, also known as ''Arabizi'', ''Arabeezi'', ''Arabish'', Franco-Arabic or simply Franco (from ) refer to the romanized alphabets for informal Arabic dialects in which Arabic script is transcribed or encoded into a combinati ...
* Arabic letter frequency *
Arabic numerals The ten Arabic numerals (0, 1, 2, 3, 4, 5, 6, 7, 8, and 9) are the most commonly used symbols for writing numbers. The term often also implies a positional notation number with a decimal base, in particular when contrasted with Roman numera ...
* ArabTeX – provides Arabic support for
TeX Tex, TeX, TEX, may refer to: People and fictional characters * Tex (nickname), a list of people and fictional characters with the nickname * Tex Earnhardt (1930–2020), U.S. businessman * Joe Tex (1933–1982), stage name of American soul singer ...
and
LaTeX Latex is an emulsion (stable dispersion) of polymer microparticles in water. Latices are found in nature, but synthetic latices are common as well. In nature, latex is found as a wikt:milky, milky fluid, which is present in 10% of all floweri ...
*
History of the Arabic alphabet The Arabic alphabet is thought to be traced back to a Nabataean variation of the Aramaic alphabet, known as Nabataean Aramaic. This script itself descends from the Phoenician alphabet, an ancestral alphabet that additionally gave rise to the H ...
* Modern Arabic mathematical notation *
Romanization of Arabic The romanization of Arabic is the systematic rendering of Modern Standard Arabic, written and varieties of Arabic, spoken Arabic language, Arabic in the Latin script. Romanized Arabic is used for various purposes, among them transcription of na ...


Notes


References


Sources

* *


External links

* {{DEFAULTSORT:Arabic Alphabet Arabic orthography