The Info List - Tashkil


The Arabic script has numerous diacritics, including i‘jām ⟨إِعْجَام⟩ (consonant pointing) and tashkīl ⟨تَشْكِيل⟩ (supplementary diacritics). The latter include the ḥarakāt ⟨حَرَكَات⟩ (vowel marks; singular: ḥarakah ⟨حَرَكَة⟩). The Arabic script is an impure abjad: short consonants and long vowels are represented by letters, but short vowels and consonant length are not generally indicated in writing. Tashkīl can optionally be added to represent the missing vowels and consonant length. Modern Arabic is always written with the i‘jām, but only religious texts, children's books and works for learners are written with the full tashkīl.


1 Tashkil (marks used as phonetic guides)
  1.1 Harakat (short vowel marks)
    1.1.1 Fatḥah
    1.1.2 Kasrah
    1.1.3 Ḍammah
    1.1.4 Alif Khanjariyah
  1.2 Maddah
  1.3 Alif waslah
  1.4 Sukun
  1.5 Tanwin (final postnasalized or long vowels)
  1.6 Shaddah (consonant gemination mark)
2 I‘jām (phonetic distinctions of consonants)
3 Hamza (glottal stop semi-consonant)
4 History
  4.1 Abu al-Aswad's system
  4.2 Al Farahidi's system
5 See also
6 References
7 External links

Tashkil (marks used as phonetic guides)

The literal meaning of tashkīl is 'forming'. Because ordinary Arabic text does not provide enough information about the correct pronunciation, the main purpose of tashkīl (and ḥarakāt) is to serve as a phonetic guide, i.e. to show the correct pronunciation. It serves the same purpose as furigana (also called "ruby") in Japanese, or pinyin or zhuyin in Mandarin Chinese, for children who are learning to read and for foreign learners.

The bulk of Arabic script is written without ḥarakāt (or short vowels). However, they are commonly used in texts that demand strict adherence to exact wording. This is true primarily of the Qur'an ⟨الْقُرْآن⟩ (al-Qur’ān) and of poetry. It is also quite common to add ḥarakāt to hadiths ⟨الْحَدِيث⟩ (al-ḥadīth; plural: aḥādīth) and the Bible. Another use is in children's literature. Moreover, ḥarakāt are added to individual words in ordinary texts when an ambiguity of pronunciation cannot easily be resolved from context alone. Arabic dictionaries with vowel marks provide information about the correct pronunciation to both native and foreign Arabic speakers. In art and calligraphy, ḥarakāt might be used simply because their writing is considered aesthetically pleasing.

An example of fully vocalised (vowelised or vowelled) Arabic, from the Basmala:

بِسْمِ ٱللهِ ٱلرَّحْمٰنِ ٱلرَّحِيمِ
Bismi Llāhi r-Raḥmāni r-Raḥīmi
In the Name of God, the Most Gracious, the Most Merciful...
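In digital text, the marks of a vocalised phrase such as this one are stored as separate combining characters after each base letter, so an unvocalised spelling can be recovered by deleting them. The following is a minimal Python sketch, not a definitive implementation. It assumes that the marks of interest are the Unicode code points U+064B to U+0652 plus the dagger alif U+0670, and the function name `strip_tashkil` is hypothetical. The sample word is bismi alone rather than the full phrase.

```python
# Combining marks treated as tashkil in this sketch (an assumption):
# U+064B..U+0652 (tanwin, fathah, dammah, kasrah, shaddah, sukun)
# plus U+0670 (dagger alif).
TASHKIL = {chr(cp) for cp in range(0x064B, 0x0653)} | {"\u0670"}


def strip_tashkil(text: str) -> str:
    """Return text with the tashkil code points removed (hypothetical helper)."""
    return "".join(ch for ch in text if ch not in TASHKIL)


# bismi: ba + kasrah, sin + sukun, mim + kasrah
bismi = "\u0628\u0650\u0633\u0652\u0645\u0650"
print(strip_tashkil(bismi))  # prints the three bare letters ba, sin, mim
```

Letters such as the alif waṣlah ⟨ٱ⟩ are encoded as letters in their own right and are left untouched by this sketch.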

Some Arabic textbooks for foreigners now use ḥarakāt as a phonetic guide to make learning to read Arabic easier. The other method used in textbooks is phonetic romanisation of unvocalised texts. Fully vocalised Arabic texts (i.e. Arabic texts with ḥarakāt/diacritics) are sought after by learners of Arabic. Some online bilingual dictionaries also provide ḥarakāt as a phonetic guide, much as English dictionaries provide a transcription.

Harakat (short vowel marks)

The ḥarakāt, a word that literally means 'motions', are the short vowel marks. There is some ambiguity as to which tashkīl are also ḥarakāt; the tanwīn, for example, are markers for both vowels and consonants.

Fatḥah

ـَ

The fatḥah ⟨فَتْحَة⟩ is a small diagonal line placed above a letter, and represents a short /a/ (like the initial sound in the English word "up"). The word fatḥah itself means 'opening' and refers to the opening of the mouth when producing an /a/. For example, with dāl (henceforth the base consonant in the following examples): ⟨دَ⟩ /da/.

When a fatḥah is placed before the letter ⟨ا⟩ (alif), it represents a long /aː/ (as in the English word "father"). For example: ⟨دَا⟩ /daː/. The fatḥah is not usually written in such cases. When a fatḥah is placed before the letter ⟨ﻱ⟩ (yā’), it creates an /aj/ (as in "lie"); and when placed before the letter ⟨و⟩ (wāw), it creates an /aw/ (as in "cow").

Kasrah

ـِ

A similar diagonal line below a letter is called a kasrah ⟨كَسْرَة⟩ and designates a short /i/ (as in "Tim"). For example: ⟨دِ⟩ /di/.[1] The word kasrah means 'breaking'.

When a kasrah is placed before the letter ⟨ﻱ⟩ (yā’), it represents a long /iː/ (as in the English word "steed"). For example: ⟨دِي⟩ /diː/. The kasrah is usually not written in such cases, but if yā’ is pronounced as a diphthong /aj/, a fatḥah should be written on the preceding consonant to avoid mispronunciation.

Ḍammah

ـُ

The ḍammah ⟨ضَمَّة⟩ is a small curl-like diacritic placed above a letter to represent a short /u/ (like the 'oo' sound in the English word "took"). For example: ⟨دُ⟩ /du/.[1]

When a ḍammah is placed before the letter ⟨و⟩ (wāw), it represents a long /uː/ (like the 'oo' sound in the English word "swoop"). For example: ⟨دُو⟩ /duː/. The ḍammah is usually not written in such cases, but if wāw is pronounced as a diphthong /aw/, a fatḥah should be written on the preceding consonant to avoid mispronunciation.
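The three short vowel marks each have their own Unicode code point, which can be inspected with Python's standard `unicodedata` module. The sketch below rebuilds the dāl examples /da/, /di/ and /du/ and prints the code point and Unicode name of each mark. The dictionary name `HARAKAT` is an illustrative choice, not an established API.

```python
import unicodedata

DAL = "\u062F"  # the base consonant used in the examples

# Short vowel marks, keyed by the vowel they add (illustrative table).
HARAKAT = {"a": "\u064E", "i": "\u0650", "u": "\u064F"}

for vowel, mark in HARAKAT.items():
    syllable = DAL + mark  # the mark is stored after its base letter
    print(f"/d{vowel}/", syllable, f"U+{ord(mark):04X}", unicodedata.name(mark))
```

Note that a mark is stored after the letter it sits on, even though it is drawn above or below that letter.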

Alif Khanjariyah

ــٰ

The superscript (or dagger) alif ⟨أَلِف خَنْجَرِيَّة⟩ (alif khanjarīyah) is written as a short vertical stroke on top of a consonant. It indicates a long /aː/ sound for which alif is normally not written. For example: ⟨هٰذَا⟩ (hādhā) or ⟨رَحْمٰن⟩ (raḥmān). The dagger alif occurs in only a few words, but they include some common ones; it is seldom written, however, even in fully vocalised texts. Most keyboards do not have a dagger alif. The word Allah ⟨الله⟩ (Allāh) is usually produced automatically by entering alif lām lām hāʾ. The word consists of alif plus a ligature of doubled lām with a shaddah and a dagger alif above the lām.

Maddah

ـٓ

The maddah ⟨مَدَّة⟩ is a tilde-like diacritic that appears mostly on top of an alif and indicates a glottal stop /ʔ/ followed by a long /aː/. In theory, the same sequence /ʔaː/ could also be represented by two alifs, as in *⟨أَا⟩, where a hamza above the first alif represents the /ʔ/ while the second alif represents the /aː/. However, consecutive alifs are never used in Arabic orthography. Instead, this sequence must always be written as a single alif with a maddah above it, a combination known as an alif maddah. For example: ⟨قُرْآن⟩ /qurˈʔaːn/.

Alif waslah

ٱ

The waṣlah ⟨وَصْلَة⟩, alif waṣlah ⟨أَلِف وَصْلَة⟩ or hamzat waṣl ⟨هَمْزَة وَصْل⟩ looks like a small letter ṣād on top of an alif ⟨ٱ⟩ (it is also indicated by an alif ⟨ا⟩ without a hamzah). It means that the alif is not pronounced. For example: ⟨بِٱسْمِ⟩ (bismi). It occurs only at the beginning of words, but it can occur after prepositions and the definite article. It is commonly found in imperative verbs, in the perfective aspect of verb stems VII to X, and in their verbal nouns (maṣdar). The alif of the definite article is considered a waṣlah.

It occurs in phrases and sentences (connected speech, not isolated or dictionary forms):

To replace the elided hamza whose alif-seat has assimilated to the previous vowel. For example: فِي ٱلْيَمَن or في اليمن (fi l-Yaman) ‘in Yemen’.
In hamza-initial imperative forms following a vowel, especially following the conjunction و (wa-) ‘and’. For example: قُمْ وَٱشْرَبِ ٱلْمَاءَ (qum wa-shrab-i l-mā’) ‘rise and then drink the water’.

Sukun

ـْـ

The sukūn ⟨سُكُون⟩ is a circle-shaped diacritic placed above a letter. It indicates that the consonant to which it is attached is not followed by a vowel. It is a necessary symbol for writing consonant-vowel-consonant syllables, which are very common in Arabic. For example: ⟨دَدْ⟩ (dad).

The sukūn may also be used to help represent a diphthong. A fatḥah followed by the letter ⟨ﻱ⟩ (yā’) with a sukūn over it (ـَيْ) indicates the diphthong ay (IPA /aj/). A fatḥah followed by the letter ⟨ﻭ⟩ (wāw) with a sukūn (ـَوْ) indicates /aw/.

ـۡـ

The sukūn may also have an alternative form, the small high dotless head of khāʾ (U+06E1 ۡ ), particularly in some Qurans. Other shapes may exist as well (for example, like a small comma above ⟨ʼ⟩ or like a circumflex ⟨ˆ⟩ in nastaʿlīq).[2]

Tanwin (final postnasalized or long vowels)

ـٌ  ـٍ  ـً

The three vowel diacritics may be doubled at the end of a word to indicate that the vowel is followed by the consonant n. They may or may not be considered ḥarakāt and are known as tanwīn ⟨تَنْوِين⟩, or nunation. The signs indicate, from right to left, -un, -in, -an.

These endings are used as non-pausal grammatical indefinite case endings in literary or classical Arabic (triptotes only). In a vocalised text, they may be written even if they are not pronounced (see pausa). See i‘rāb for more details. In many spoken Arabic dialects, the endings are absent. Many Arabic textbooks introduce standard Arabic without these endings. The grammatical endings may not be written in some vocalised Arabic texts, as knowledge of i‘rāb varies from country to country, and there is a trend towards simplifying Arabic grammar.

The sign ⟨ـً⟩ is most commonly written in combination with ⟨ـًا⟩ (alif), ⟨ةً⟩ (tā’ marbūṭah) or stand-alone ⟨ءً⟩ (hamzah). The alif should always be written (except for words ending in tā’ marbūṭah, hamzah or diptotes) even if the -an is not.

Grammatical cases and tanwīn endings in indefinite triptote forms:

-un: nominative case; -an: accusative case, also serves as an adverbial marker; -in: genitive case.
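The correspondence between tanwīn signs and case endings listed above can be sketched as a small lookup table. This is only an illustration: the function name `tanwin_ending` is hypothetical, and the sample words kitābun and kitāban ('a book') are illustrative choices spelled out as explicit code points.

```python
# Tanwin marks mapped to the ending they spell and the case they mark.
TANWIN = {
    "\u064C": ("un", "nominative"),  # dammatan
    "\u064B": ("an", "accusative"),  # fathatan
    "\u064D": ("in", "genitive"),    # kasratan
}


def tanwin_ending(word: str):
    """Return (ending, case) for the tanwin mark nearest the end of word.

    Scanning from the end also finds a fathatan that is followed by its
    supporting alif. Returns None when the word has no tanwin mark.
    """
    for ch in reversed(word):
        if ch in TANWIN:
            return TANWIN[ch]
    return None


kitabun = "\u0643\u0650\u062A\u064E\u0627\u0628\u064C"        # ends in dammatan
kitaban = "\u0643\u0650\u062A\u064E\u0627\u0628\u064B\u0627"  # fathatan + alif
print(tanwin_ending(kitabun))
print(tanwin_ending(kitaban))
```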

Shaddah (consonant gemination mark)

ـّـ

The shadda or shaddah ⟨شَدَّة⟩, also called tashdid ⟨تَشْدِيد⟩ (tashdīd), is a diacritic shaped like a small written Latin "w". It is used to indicate gemination (consonant doubling or extra length), which is phonemic in Arabic. It is written above the consonant that is to be doubled. It is the only ḥarakah that is sometimes used in ordinary spelling to avoid ambiguity. For example: ⟨دّ⟩ /dd/; madrasah ⟨مَدْرَسَة⟩ ('school') vs. mudarrisah ⟨مُدَرِّسَة⟩ ('teacher', female).
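Because the shaddah is a separate combining character (U+0651), geminated consonants can be found by looking for the base letter that precedes it. The following sketch applies this to the madrasah and mudarrisah pair. The function name `geminated_letters` is hypothetical, and the range used for "marks" (U+064B to U+0652) is an assumption of this example.

```python
SHADDAH = "\u0651"
# Combining marks that can sit between a base letter and its shaddah.
MARKS = {chr(cp) for cp in range(0x064B, 0x0653)}


def geminated_letters(word: str) -> list:
    """Return the base letters in word that carry a shaddah."""
    result = []
    base = None
    for ch in word:
        if ch == SHADDAH and base is not None:
            result.append(base)
        elif ch not in MARKS:
            base = ch  # remember the most recent base letter
    return result


madrasah = "\u0645\u064E\u062F\u0652\u0631\u064E\u0633\u064E\u0629"
mudarrisah = "\u0645\u064F\u062F\u064E\u0631\u0651\u0650\u0633\u064E\u0629"
print(geminated_letters(madrasah))    # no shaddah, so an empty list
print(geminated_letters(mudarrisah))  # the letter ra
```

The vowel mark and the shaddah on one letter may be stored in either order, so the sketch keeps the current base letter until the next non-mark character appears.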

I‘jām (phonetic distinctions of consonants)

7th-century kufic script without either ḥarakāt or i‘jām.

The i‘jām ⟨إِعْجَام⟩ are the diacritic points that distinguish various consonants that have the same form (rasm), such as ⟨ـبـ⟩ /b/, ⟨ـتـ⟩ /t/, ⟨ـثـ⟩ /θ/, ⟨ـنـ⟩ /n/, and ⟨ـيـ⟩ /j/. Typically i‘jām are not considered diacritics but part of the letter.

Early manuscripts of the Qur’ān did not use diacritics either for vowels or to distinguish the different values of the rasm. Vowel pointing was introduced first, as a red dot placed above, below, or beside the rasm, and consonant pointing was introduced later, as thin, short black single or multiple dashes placed above or below the rasm. These i‘jām became black dots at about the same time as the ḥarakāt became small black letters or strokes.

Typically, Egyptians do not use dots under final yā’ ⟨ي⟩, which then looks exactly like alif maqṣūrah ⟨ى⟩ in handwriting and in print. This practice is also used in copies of the muṣḥaf (Qurʾān) scribed by ‘Uthman Ṭāhā. The same unification of yā’ and alif maqṣūrah has happened in Persian, resulting in what the Unicode Standard calls "arabic letter farsi yeh", which looks exactly the same as yā’ in initial and medial forms, but exactly the same as alif maqṣūrah in final and isolated forms ⟨یـ  ـیـ  ـی⟩.

سـۡ سـۜ سۣـ سـٚ سٜـ ڛـ

On Tunisian license plates and banknotes, sīn appears as ⟨سۡ⟩ to avoid confusion with shīn ⟨ش⟩.

At the time when the i‘jām was optional, letters deliberately lacking the points of i‘jām (⟨ح⟩ /ħ/, ⟨د⟩ /d/, ⟨ر⟩ /r/, ⟨س⟩ /s/, ⟨ص⟩ /sˤ/, ⟨ط⟩ /tˤ/, ⟨ع⟩ /ʕ/, ⟨ل⟩ /l/, ⟨ه⟩ /h/) could be marked with a small v-shaped sign above or below the letter, or a semicircle, or a miniature of the letter itself (e.g. a small س to indicate that the letter in question is س and not ش), or one or several subscript dots, or a superscript hamza, or a superscript stroke.[3] These signs, collectively known as ‘alāmātu-l-ihmāl, are still occasionally used in modern Arabic calligraphy, either for their original purpose (i.e. marking letters without i‘jām) or, often, as purely decorative space-fillers. The small ک above the kāf in its final and isolated forms ⟨ك  ـك⟩ was originally an ‘alāmatu-l-ihmāl, but became a permanent part of the letter. Previously this sign could also appear above the medial form of kāf, instead of the stroke on its ascender.[4]

Hamza (glottal stop semi-consonant)

ئ  ؤ  إ  أ  ء

Although a diacritic is often not considered a letter of the alphabet, the hamza ⟨هَمْزَة⟩ (hamzah, glottal stop) often stands as a separate letter in writing, is written in unpointed texts, and is not considered a tashkīl. It may appear as a letter by itself or as a diacritic over or under an alif, wāw, or yā’. Which letter is used to support the hamzah depends on the quality of the adjacent vowels:

If the syllable occurs at the beginning of the word, the glottal stop is always indicated by a hamza on an alif.
If the syllable occurs in the middle of the word, alif is used only if it is not preceded or followed by /i/ or /u/.
If /i(ː)/ is before or after the glottal stop, a yā’ with a hamzah is used (the two dots that are usually beneath the yā’ disappear in this case): ⟨ئ⟩.
If /u(ː)/ is before or after the glottal stop, a wāw with a hamzah is used: ⟨ؤ⟩.

Consider the following words: ⟨أَخ⟩ /ʔax/ ("brother"), ⟨إِسماعيل⟩ /ʔismaːʕiːl/ ("Ismael"), ⟨أُمّ⟩ /ʔumm/ ("mother"). All three of the above words "begin" with a vowel opening the syllable, and in each case alif is used to designate the initial glottal stop (the actual beginning). The situation is different, as noted above, for middle syllables "beginning" with a vowel: ⟨نَشْأَة⟩ /naʃʔa/ ("origin"), ⟨أَفْئِدَة⟩ /ʔafʔida/ ("hearts"; notice the /ʔi/ syllable; singular ⟨فُؤَاد⟩ /fuʔaːd/), ⟨رُؤُوس⟩ /ruʔuːs/ ("heads"; singular ⟨رَأْس⟩ /raʔs/). See the comprehensive article on hamzah for more details.
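The seat rules stated above can be sketched as a small decision function. This is a simplified illustration covering only the rules given in this section, not the full orthography of hamzah. The function name `hamza_seat` and its parameters are hypothetical, and writing the hamza below the alif for an initial /i/ follows the spelling of "Ismael" in the examples.

```python
# Hamza seats as precomposed Unicode letters.
ALIF_HAMZA_ABOVE = "\u0623"
ALIF_HAMZA_BELOW = "\u0625"
WAW_HAMZA = "\u0624"
YA_HAMZA = "\u0626"


def hamza_seat(initial: bool, before: str = "", after: str = "") -> str:
    """Pick a hamza seat from the vowels adjacent to the glottal stop.

    before and after are "a", "i", "u", or "" when there is no vowel.
    Long and short vowels are treated alike in this sketch.
    """
    if initial:
        # Word-initial hamza always sits on an alif.
        return ALIF_HAMZA_BELOW if after == "i" else ALIF_HAMZA_ABOVE
    if "i" in (before, after):
        return YA_HAMZA
    if "u" in (before, after):
        return WAW_HAMZA
    return ALIF_HAMZA_ABOVE


print(hamza_seat(True, after="a"))               # as in the word for "brother"
print(hamza_seat(False, before="", after="i"))   # as in the word for "hearts"
print(hamza_seat(False, before="u", after="u"))  # as in the word for "heads"
```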


History

Evolution of early Arabic calligraphy (9th–11th century), with the Basmala taken as an example from kufic Qur’ān manuscripts: (1) early 9th century, script with no dots or diacritic marks (see the early Basmala image); (2) and (3) 9th–10th century under the Abbasid dynasty, Abu al-Aswad’s system established red dots, with each arrangement or position indicating a different short vowel; later, a second, black-dot system was used to differentiate between letters like fā’ and qāf (see the middle Kufic image); (4) 11th century, in al-Farāhīdī’s system (the system we know today), dots were changed into shapes resembling the letters to transcribe the corresponding long vowels (see the modern Kufic in Qur'an image).

According to tradition, the first to commission a system of ḥarakāt was Ali, who appointed Abu al-Aswad al-Du'ali for the task. Abu al-Aswad devised a system of dots to signal the three short vowels (along with their respective allophones) of Arabic. This system of dots predates the i‘jām, the dots used to distinguish between different consonants.

Images: Early Basmala; Middle Kufic; Modern Kufic in Qur'an.

Abu al-Aswad's system

Abu al-Aswad's system of ḥarakāt was different from the system we know today. It used red dots, with each arrangement or position indicating a different short vowel. A dot above a letter indicated the vowel a, a dot below indicated the vowel i, a dot on the side of a letter stood for the vowel u, and two dots stood for the tanwīn. However, the early manuscripts of the Qur'an did not use the vowel signs for every letter requiring them, but only for letters where they were necessary for a correct reading.

Al Farahidi's system

This is the precursor to the system we know today. Al-Farāhīdī found that the task of writing using two different colours was tedious and impractical. Another complication was that the i‘jām had been introduced by then; although they were short strokes rather than the round dots seen today, this meant that without a colour distinction the two could become confused. Accordingly, he replaced the ḥarakāt with small superscript letters: a small alif, yā’ and wāw for the short vowels corresponding to the long vowels written with those letters, a small s(h)īn for shaddah (geminate), and a small khā’ for khafīf (short consonant; no longer used). His system is essentially the one we know today.[5]

See also


I‘rāb (إِﻋْﺮَﺍﺏ‎), the case system of Arabic
Rasm (رَسْم‎), the basic system of Arabic consonants
Tajwīd (تَجْوِيد‎), the phonetic rules of recitation of the Qur'an in Arabic
Niqqud, the Hebrew equivalent of ḥarakāt
Dagesh, the Hebrew diacritic similar to Arabic i‘jām and shaddah


References

[1] "Introduction to Written Arabic". University of Victoria, Canada.
[2] "Arabic character notes". r12a.
[3] Gacek, Adam (2009). "Unpointed letters". Arabic Manuscripts: A Vademecum for Readers. BRILL. p. 286. ISBN 90-04-17036-7.
[4] Gacek, Adam (1989). "Technical Practices and Recommendations Recorded by Classical and Post-Classical Arabic Scholars Concerning the Copying and Correction of Manuscripts" (PDF). In François Déroche. Les manuscrits du Moyen-Orient: essais de codicologie et de paléographie. Actes du colloque d'Istanbul (Istanbul 26–29 mai 1986). p. 57 (§8. Diacritical marks and vowelisation).
[5] Versteegh, C. H. M. (1997). The Arabic Language. Columbia University Press. pp. 56ff. ISBN 978-0-231-11152-2.

External links

Online Arabic Tool by Multillect
