Comorian (''Shikomori'', or ''Shimasiwa'', the "language of islands") is the name given to a group of four
Bantu language
The Bantu languages (English: , Proto-Bantu language, Proto-Bantu: *bantʊ̀), or Ntu languages are a language family of about 600 languages of Central Africa, Central, Southern Africa, Southern, East Africa, Eastern and Southeast Africa, South ...
s spoken in the
Comoro Islands
The Comoro Islands are a group of volcanic islands in the Mozambique Channel, an arm of the Indian Ocean lying between Madagascar and the African mainland. Three of the islands form the Union of the Comoros, a sovereign nation, while Mayotte bel ...
, an
archipelago
An archipelago ( ), sometimes called an island group or island chain, is a chain, cluster, or collection of islands. An archipelago may be in an ocean, a sea, or a smaller body of water. Example archipelagos include the Aegean Islands (the o ...
in the southwestern
Indian Ocean
The Indian Ocean is the third-largest of the world's five oceanic divisions, covering or approximately 20% of the water area of Earth#Surface, Earth's surface. It is bounded by Asia to the north, Africa to the west and Australia (continent), ...
between
Mozambique
Mozambique, officially the Republic of Mozambique, is a country located in Southeast Africa bordered by the Indian Ocean to the east, Tanzania to the north, Malawi and Zambia to the northwest, Zimbabwe to the west, and Eswatini and South Afr ...
and
Madagascar
Madagascar, officially the Republic of Madagascar, is an island country that includes the island of Madagascar and numerous smaller peripheral islands. Lying off the southeastern coast of Africa, it is the world's List of islands by area, f ...
. It is named as one of the official languages of the
Union of the Comoros in the Comorian constitution. Shimaore, one of the languages, is spoken on the disputed island of
Mayotte
Mayotte ( ; , ; , ; , ), officially the Department of Mayotte (), is an Overseas France, overseas Overseas departments and regions of France, department and region and single territorial collectivity of France. It is one of the Overseas departm ...
, a
French department
In the administrative divisions of France, the department (, ) is one of the three levels of government under the national level (" territorial collectivities"), between the administrative regions and the communes. There are a total of 101 ...
claimed by Comoros.
Like
Swahili, the Comorian languages are
Sabaki languages, part of the Bantu language family. Each island has its own language, and the four are conventionally divided into two groups: the eastern group is composed of ''
Shindzuani'' (spoken on
Ndzuani) and ''
Shimaore'' (Mayotte), while the western group is composed of ''
Shimwali'' (
Mwali) and ''
Shingazija
Comorian (''Shikomori'', or ''Shimasiwa'', the "language of islands") is the name given to a group of four Bantu languages spoken in the Comoro Islands, an archipelago in the southwestern Indian Ocean between Mozambique and Madagascar. It is name ...
'' (
Ngazidja). Although the languages of different groups are not usually
mutually intelligible
In linguistics, mutual intelligibility is a relationship between different but related language varieties in which speakers of the different varieties can readily understand each other without prior familiarity or special effort. Mutual intellig ...
, only sharing about 80% of their lexicon, there is mutual intelligibility between the languages within each group, suggesting that Shikomori should be considered as two language groups, each including two languages, rather than four distinct languages.
Historically, the language was written in the Arabic-based
Ajami script
Ajami (, ) or Ajamiyya (, ), which comes from the Arabic root for 'foreign' or 'stranger', is an Arabic script, Arabic-derived script used for writing Languages of Africa, African languages, particularly Songhai languages, Songhai, Mandé languages ...
. The
French colonial administration introduced the
Latin script
The Latin script, also known as the Roman script, is a writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae in Magna Graecia. The Gree ...
. In 2009 the current independent government decreed a modified version of the Latin script for official use.
Many Comorians now use the Latin script when writing the Comorian language although the Ajami script is still widely used, especially by women. Recently, some scholars have suggested that the language may be on its way to endangerment, citing the unstable code-switching and numerous French words used in daily speech.
It is the language of ''
Umodja wa Masiwa,'' the national anthem.
History and classification
The first Bantu speakers arrived at the Comoros sometime between the 5th and 10th centuries, before the Shirazi Arabs.
Shimwali
The Shimwali dialect was possibly one of the earliest Bantu languages to be recorded by a European. On July 3, 1613, Walter Payton claimed to have recorded 14 words on the island of
Moheli, stating "They speak a kind of
Morisco
''Moriscos'' (, ; ; "Moorish") were former Muslims and their descendants whom the Catholic Church and Habsburg Spain commanded to forcibly convert to Christianity or face compulsory exile after Spain outlawed Islam. Spain had a sizeable Mus ...
language."
Sir Thomas Roe and Thomas Herbert also claimed to have recorded vocabulary.
Until the 1970s, it was considered a dialect or archaic form of Swahili. This was first proposed in 1871, when Kersten suggested it might be a mixture of Shingazija, Swahili, and Malagasy. In 1919 Johnston, referring to it as 'Komoro Islands Swahili - the dialect of 'Mohila' and 'the 'Mohella' language', suggested that, taken together with the other two dialects in the Comoros, it might be an ancient and corrupt form of Swahili. However, Ottenheimer et al. (1976) found this to not be the case. Instead, they classify Shimwali, as well as the other Comorian languages, as a separate language group from Swahili.
Shinzwani
Shinzwani was first noted by a South African missionary Reverend William Elliott in 1821 and 1822. During a 13-months' mission stay on the island of
Anjouan
Anjouan () is an autonomous volcanic island in the Comoro Islands in the southwestern Indian Ocean, part of the Comoros, Union of the Comoros. It is known in Comorian languages, Shikomori as Ndzuani, Ndzuwani or Nzwani, and, until the early twentie ...
he compiled a vocabulary and grammar of the language. Elliott included a 900-word vocabulary and provided 98 sample sentences in Shinzwani. He does not appear to have recognized noun- classes (of which there are at least six in Shinzwani) nor does he appear to have considered Shinzwani a Bantu language, only making a superficial connection to Swahili.
The dialect was noted again in 1841 by Casalis, who placed it within Bantu, and by Peters, who collected a short word list. In 1875 Hildebrandt published a Shinzwani vocabulary and suggested in 1876 that Shinzwani was an older form of Swahili.
The idea of the distinctness of Shingazija and Shinzwani from Swahili finally gained prominence during the latter part of the 19th century and the early 20th century. In 1883, an analysis by Gust distinguished Shinzwani from Swahili. He discusses Shinzwani and Swahili as two separate languages which had contributed to the port-language which he referred to as Barracoon.
In 1909 two publications reaffirmed and clarified the distinctiveness of Shinzwani, Shingazija and Swahili. Struck published a word list which appeared to have been recorded by a Frenchman in Anjouan in 1856, identified the words as belonging to Shinzwani and noted some influence from Swahili.
In his Swahili Grammar, Sacleux cautioned that although Swahili was spoken in the Comoros it must not be confused with the native languages of the Comoros, Shinzwani and Shingazija. He said that while Swahili was mostly spoken in cities, the Comorian languages were widely spoken in the countryside.
Shingazija
Shingazija was not documented until 1869 when Bishop Edward Steere collected a word list and commented that he did not know which language family it belonged to. In 1870 Gevrey characterized both Shingazija and Shinzwani as the 'Souaheli des Comores' (Swahili of the Comoros) which was only a 'patois de celui de Zanzibar'. However, Kersten noted in 1871 that Shingazija was not at all like Swahili but was a separate Bantu language.
Torrend was the first to identify the difference between Shingazija and Shinzwani in 1891. He attempted to account for Shingazija by suggesting that it was a mixture of Shinzwani and Swahili.
Phonology
The consonants and vowels in the Comorian languages:
Vowels
Consonants
The consonants ''mb'', ''nd'', ''b'', ''d'' are phonemically
implosives
Implosive consonants are a group of stop consonants (and possibly also some affricates) with a mixed glottalic ingressive and pulmonic egressive airstream mechanism. That is, the airstream is controlled by moving the glottis downward in additio ...
, but may also be phonetically recognized as ranging from implosives to voiced stops as , , , . A glottal stop can also be heard when in between vowels.
In the Shimaore dialect, if when inserting a prefix the leading consonant becomes intervocalic, becomes , becomes , becomes , becomes , and is deleted.
There is a preference for multi-syllable words and a CV syllable structures. Vowels are frequently deleted and inserted to better fit the CV structure. There is also an alternate strategy of ''h''-insertion in scenarios which would otherwise results in VV.
There is a strong preference for penultimate stress. There was previously a tone system in the language, but it has been mostly phased out and no longer plays an active role in the majority of cases.
Orthography
Comorian is most commonly written in Latin alphabet today. Traditionally and historically, Arabic alphabet is used as well but to a lesser extent. Arabic alphabet has been universally known in Comoros, due to the fact that there was a near universal attendance at Quranic schools on the islands, whereas knowledge and literacy in French was lacking. Since independence from
France
France, officially the French Republic, is a country located primarily in Western Europe. Overseas France, Its overseas regions and territories include French Guiana in South America, Saint Pierre and Miquelon in the Atlantic Ocean#North Atlan ...
, the situation has changed, with improvements to infrastructure of secular education, in which French is the language of instruction.
Latin alphabet
Note: In
Shimaore, the digraphs " vh " and " bv " are used for representing the phoneme .
Arabic alphabet
Comoros
The Comoros, officially the Union of the Comoros, is an archipelagic country made up of three islands in Southeastern Africa, located at the northern end of the Mozambique Channel in the Indian Ocean. Its capital and largest city is Moroni, ...
being located near the East African coast, the archipelago being connected by deep trade links to the mainland, and Comorian being a
Bantu language
The Bantu languages (English: , Proto-Bantu language, Proto-Bantu: *bantʊ̀), or Ntu languages are a language family of about 600 languages of Central Africa, Central, Southern Africa, Southern, East Africa, Eastern and Southeast Africa, South ...
much like
Swahili language
Swahili, also known as as it is referred to endonym and exonym, in the Swahili language, is a Bantu languages, Bantu language originally spoken by the Swahili people, who are found primarily in Tanzania, Kenya, and Mozambique (along the East Af ...
, means that historically, the Arabic orthography of Comorian followed the
Swahili suit in being part of the tradition of the African
Ajami script
Ajami (, ) or Ajamiyya (, ), which comes from the Arabic root for 'foreign' or 'stranger', is an Arabic script, Arabic-derived script used for writing Languages of Africa, African languages, particularly Songhai languages, Songhai, Mandé languages ...
. Key components of the Ajami tradition are mainly that vowels were always represented with
diacritics
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
(thus differing from
Persian conventions). The letters ''alif'' , ''wāw'' , and ''yāʼ'' were used for indicating stressed syllable or long vowels. Furthermore, whereas Bantu languages have 5 vowels, while Arabic has 3 vowels and 3 diacritics; until recently, specifically until the early 20th century, there hasn't been an agreed upon way of writing the vowels and . Furthermore, sounds unique to Bantu languages were generally shown with the closest matching letter in the Arabic alphabet, avoiding as much as possible the creation of new letters in order not to deviate from the authentic 28-letter base. In addition,
prenasalized consonants were shown using digraphs.
Archive
.
The 20th century marked the start of a process of orthographic reform and standardization across the
Muslim world
The terms Islamic world and Muslim world commonly refer to the Islamic community, which is also known as the Ummah. This consists of all those who adhere to the religious beliefs, politics, and laws of Islam or to societies in which Islam is ...
. This process included standardizing, unifying, and clarifying the Arabic script in most places, ditching the Arabic script in favour of Latin or Cyrillic in others in places such as
Soviet Turkistan and
Soviet Caucasus, to
Turkey
Turkey, officially the Republic of Türkiye, is a country mainly located in Anatolia in West Asia, with a relatively small part called East Thrace in Southeast Europe. It borders the Black Sea to the north; Georgia (country), Georgia, Armen ...
and
Kurdistan
Kurdistan (, ; ), or Greater Kurdistan, is a roughly defined geo- cultural region in West Asia wherein the Kurds form a prominent majority population and the Kurdish culture, languages, and national identity have historically been based. G ...
, to the Eastern African coast (
Swahili Ajami) and Comoros.
The mantle of standardization and improvement of Arabic-based orthography in Comoros was carried by the literaturist ''Said Kamar-Eddine (1890-1974)'' in 1960. Only two decades before, in 1930s and 1940s, Swahili literaturists such as ''Sheikh el Amin'' and ''Sheikh Yahya Ali Omar'' had developed the Swahili Arabic alphabet as well.
In
Swahili, two new diacritics were added to the 3 original
diacritics
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
, namely to represent the phoneme , and to represent the phoneme . Furthermore, the usage of the 3
mater lectionis
A ''mater lectionis'' ( , ; , ''matres lectionis'' ; original ) is any consonant letter that is used to indicate a vowel, primarily in the writing of Semitic languages such as Arabic, Hebrew and Syriac. The letters that do this in Hebrew are ...
(or vowel carrier letters) followed the following convention too: Vowels in stressed (
second-to-last) syllable of the word are marked with diacritic as well as a carrier letter, namely ''alif'' for vowel , ''yāʼ'' for vowels and , and ''wāw'' for vowels and .
But, in the proposal by ''Said Kamar-Eddine'' for Comorian, there was a departure from the
Ajami tradition and a divergence from what was done by Swahili literaturists. ''Kamar-Eddine'' had an eye on
Iraqi and
Iranian
Iranian () may refer to:
* Something of, from, or related to Iran
** Iranian diaspora, Iranians living outside Iran
** Iranian architecture, architecture of Iran and parts of the rest of West Asia
** Iranian cuisine, cooking traditions and practic ...
Kurdistan
Kurdistan (, ; ), or Greater Kurdistan, is a roughly defined geo- cultural region in West Asia wherein the Kurds form a prominent majority population and the Kurdish culture, languages, and national identity have historically been based. G ...
, and the orthographic reforms implemented there. In Kurdish, the direction of the reforms of the alphabet favoured elimination of all diacriticts and designating specific letters to each and every vowel sound, thus creating a full ''
alphabet
An alphabet is a standard set of letter (alphabet), letters written to represent particular sounds in a spoken language. Specifically, letters largely correspond to phonemes as the smallest sound segments that can distinguish one word from a ...
''. Kurdish orthography wasn't unique in this regard. A similar direction was pursued in various Turkic languages such as
Uzbek,
Azerbaijani,
Uyghur
Uyghur may refer to:
* Uyghurs, a Turkic ethnic group living in Eastern and Central Asia (West China)
** Uyghur language, a Turkic language spoken primarily by the Uyghurs
*** Old Uyghur language, a different Turkic language spoken in the Uyghur K ...
, and
Kazakh, as well as languages of the Caucasus such as
Western
Western may refer to:
Places
*Western, Nebraska, a village in the US
*Western, New York, a town in the US
*Western Creek, Tasmania, a locality in Australia
*Western Junction, Tasmania, a locality in Australia
*Western world, countries that id ...
and
Eastern Circassian languages and
Chechen language
Chechen ( , ; , , ) is a Northeast Caucasian languages, Northeast Caucasian language spoken by approximately 1.8 million people, mostly in the Chechnya, Chechen Republic and by Chechens, members of the Chechen diaspora throughout Russia and the ...
. This makes ''Said Kamar-Eddine'' orthography for Comorian, a unique case for Sub-saharan African languages that have been written with the Arabic script.
In the initial position, the vowels are written as a single letter. No preceding ''alif'' or ''hamza'' is required. (This is similar to the convention of
Kazakh Arabic alphabet)
In
Kurdish, new vowel letters were created by adding accents on existing letters. The phonemes and are written with and respectively. In Comorian, new independent letters were assigned instead. The letter ''hāʾ'' in two of its variants are used for both aforementioned phonemes. A standard Arabic ''hāʾ'', in all its 4 positional shapes () is used for the vowel . This is a unique innovation exclusive to this orthography. The letter ''hāʾ'' in these shapes is not used as vowel in any other Arabic orthography. A letter ''hāʾ'', in a fixed ''medial'' zigzag shape (medial form of what's known in Urdu as
gol he) () is used for the vowel . The usage of this variant of the letter ''hāʾ'' as a vowel is not unique to Comorian. In the early 20th century,
West
West is one of the four cardinal directions or points of the compass. It is the opposite direction from east and is the direction in which the Sun sets on the Earth.
Etymology
The word "west" is a Germanic word passed into some Romance langu ...
and
East
East is one of the four cardinal directions or points of the compass. It is the opposite direction from west and is the direction from which the Sun rises on the Earth.
Etymology
As in other languages, the word is formed from the fact that ea ...
Circassian Arabic orthography also used this variant of the letter ''hāʾ'' to represent the vowel (written as ''ы'' in Cyrillic).
Letters representing consonant phonemes that are not present in Arabic have been formed in either of the two following methods. First method is similar to
Persian and
Kurdish, where new letters are created by adding or modifying of dots. The second method is to use the Arabic gemination diacritic ''
Shaddah'' on letters that are most similar to the missing consonant phoneme. This is similar to the tradition of
Sorabe (Arabo-Malagasy) orthograhpy, where a geminated ''r'' () is meant to represent or , and where a geminated ''f'' () is meant to represent or .
There are two types of
vowel sequencees in Comorian, a glide or a vowel hiatus. Latin letters ''w'' and ''y'', represented by and , are considered semivowels. When these letters follow another vowel, they are written sequentially.
Other succession of vowels are treated as ''vowel hiatus''. In these instances, a ''hamza'' () is written in between.
Prenasalized consonant
Prenasalized consonants are phonetic sequences of a nasal and an obstruent (or occasionally a non-nasal sonorant) that behave phonologically like single consonants. The primary reason for considering them to be single consonants, rather than ...
s are written as digraphs, with either ''m'' () or ''n'' ().
Sample text
''Comorian Latin Alphabet'':
* Ha mwakinisho ukaya ho ukubali ye sheo shaho wo ubinadamu piya pvamwedja ne ze haki za wadjibu zaho usawa, zahao, uwo ndo mshindzi waho uhuria, no mlidzanyiso haki, ne amani yahe duniya kamili.
''Comorian Arabic (Kamar-Eddine's) Alphabet'':
*
Grammar
Noun class
As in other Bantu languages, Shikomor displays a noun class/gender system in which classes share a prefix. Classes 1 through 10 generally have singular/plural pairings.
Classes 9 & 10 consists mainly of borrowed words, such as ''dipe'' (from French ''du pain'' 'some bread') and do not take prefixes. Class 7 & 8 and class 9 & 10 take on the same agreements in adjectives and verbs. Class 10a contains a very small amount of words, generally plurals of Class 11. Class 15 consists of verbal infinitives, much like English gerunds.
Class 16 contains only two words, ''vahana'' and ''vahali'', both meaning 'place'. It was probably borrowed from Swahili ''pahali'', which was borrowed from Arabic ''mahal''. Class 17 consists of locatives with the prefix ''ha-'', and Class 18 consists of locatives with the prefix ''mwa-.''
Numerals
Numerals in Comorian follow the noun. If the number is 1 through 5 or 8, it must agree with the class of its noun.
Demonstratives
There are three demonstratives: One that refers to a proximate object, a non-proximate object, and an object that was previously mentioned in the conversation.
Possessives
The possessive element -''a'' agrees with the possessed noun. The general order of a possessive construction is possessed-''Ca''-possessor.
Verbs
Comorian languages exhibit a typical Bantu verb structure.
Although there is only one form of the subject marker for personal plural subjects and for subjects belonging to the classes 3-18.
In Proto-Sabaki, the 2sg and 2sg subject markers were ''*ku'' and *''ka'', respectively. However, the ''*k'' was weakened to ''h'' in Shingazija and further to ''Ø'' in all other dialects.
Verbs can be negated by adding the prefix ''ka-''. However, occasionally other morphemes of the verb may take on different meanings when the negative prefix is added, such as in the following example, where the suffix ''-i'', usually the past tense, takes on the present habitual meaning when it is in a negative construction.
The present progressive uses the prefix si-/su-, the future tense uses tso-, and the conditional uses a-tso-.There are two past tense constructions in Comorian.
The first of these is the simple past tense, which uses the structure SM-Root-Suffix 1.
The second is the compound past, using the structure SM-''ka'' SM-Root-Suffix 1.
References
Works cited
*
* 2 vols.
*
*
Further reading
*Ahmed-Chamanga, Mohamed. (1997) ''Dictionnaire français-comorien (dialecte Shindzuani)''. Paris: L'Harmattan.
*Djohar, Abdou. (2014) ''Approche contrastive Franco-comorienne: les séquences figées à caractère adjectival''. Université Paris-Nord.
*Johansen, Aimee. ''A History of Comorian Linguistics''. in John M. Mugane (ed.), ''Linguistic Typology and Representation of African Languages''. Africa World Press. Trenton, New Jersey.
*Rey, Veronique. (1994) ''Première approche du mwali''. Africana Linguistica XI. Tervuren: MRAC.
External links
Shingazidja
{{Authority control
Languages of the Comoros
Swahili language
Culture of Mayotte
Comoros archipelago
Languages of Madagascar
Bantu languages
East Bantu languages
Northeast Coast Bantu languages
Northeast Bantu languages