HOME

TheInfoList



OR:

Albanian (
endonym An endonym (also known as autonym ) is a common, name for a group of people, individual person, geographical place, language, or dialect, meaning that it is used inside a particular group or linguistic community to identify or designate them ...
: , , or ) is an
Indo-European The Indo-European languages are a language family native to the northern Indian subcontinent, most of Europe, and the Iranian plateau with additional native branches found in regions such as Sri Lanka, the Maldives, parts of Central Asia (e. ...
language and the only surviving representative of the Albanoid branch, which belongs to the Paleo-Balkan group. It is the native language of the Albanian people. Standard Albanian is the
official language An official language is defined by the Cambridge English Dictionary as, "the language or one of the languages that is accepted by a country's government, is taught in schools, used in the courts of law, etc." Depending on the decree, establishmen ...
of
Albania Albania ( ; or ), officially the Republic of Albania (), is a country in Southeast Europe. It is located in the Balkans, on the Adriatic Sea, Adriatic and Ionian Seas within the Mediterranean Sea, and shares land borders with Montenegro to ...
and
Kosovo Kosovo, officially the Republic of Kosovo, is a landlocked country in Southeast Europe with International recognition of Kosovo, partial diplomatic recognition. It is bordered by Albania to the southwest, Montenegro to the west, Serbia to the ...
, and a co-official language in
North Macedonia North Macedonia, officially the Republic of North Macedonia, is a landlocked country in Southeast Europe. It shares land borders with Greece to the south, Albania to the west, Bulgaria to the east, Kosovo to the northwest and Serbia to the n ...
and
Montenegro , image_flag = Flag of Montenegro.svg , image_coat = Coat of arms of Montenegro.svg , coa_size = 80 , national_motto = , national_anthem = () , image_map = Europe-Mont ...
, where it is the primary language of significant Albanian minority communities. Albanian is recognized as a minority language in
Italy Italy, officially the Italian Republic, is a country in Southern Europe, Southern and Western Europe, Western Europe. It consists of Italian Peninsula, a peninsula that extends into the Mediterranean Sea, with the Alps on its northern land b ...
,
Croatia Croatia, officially the Republic of Croatia, is a country in Central Europe, Central and Southeast Europe, on the coast of the Adriatic Sea. It borders Slovenia to the northwest, Hungary to the northeast, Serbia to the east, Bosnia and Herze ...
,
Romania Romania is a country located at the crossroads of Central Europe, Central, Eastern Europe, Eastern and Southeast Europe. It borders Ukraine to the north and east, Hungary to the west, Serbia to the southwest, Bulgaria to the south, Moldova to ...
, and
Serbia , image_flag = Flag of Serbia.svg , national_motto = , image_coat = Coat of arms of Serbia.svg , national_anthem = () , image_map = , map_caption = Location of Serbia (gree ...
. It is also spoken in
Greece Greece, officially the Hellenic Republic, is a country in Southeast Europe. Located on the southern tip of the Balkan peninsula, it shares land borders with Albania to the northwest, North Macedonia and Bulgaria to the north, and Turkey to th ...
and by the Albanian diaspora, which is generally concentrated in the
Americas The Americas, sometimes collectively called America, are a landmass comprising the totality of North America and South America.''Webster's New World College Dictionary'', 2010 by Wiley Publishing, Inc., Cleveland, Ohio. When viewed as a sing ...
,
Europe Europe is a continent located entirely in the Northern Hemisphere and mostly in the Eastern Hemisphere. It is bordered by the Arctic Ocean to the north, the Atlantic Ocean to the west, the Mediterranean Sea to the south, and Asia to the east ...
and
Oceania Oceania ( , ) is a region, geographical region including Australasia, Melanesia, Micronesia, and Polynesia. Outside of the English-speaking world, Oceania is generally considered a continent, while Mainland Australia is regarded as its co ...
. Albanian is estimated to have as many as 7.5 million native speakers. Albanian and other
Paleo-Balkan languages The Paleo-Balkan languages are a geographical grouping of various Indo-European languages that were spoken in the Balkans and surrounding areas in ancient times. In antiquity, Dacian, Greek, Illyrian, Messapic, Paeonian, Phrygian and Thracian wer ...
had their formative core in the
Balkans The Balkans ( , ), corresponding partially with the Balkan Peninsula, is a geographical area in southeastern Europe with various geographical and historical definitions. The region takes its name from the Balkan Mountains that stretch throug ...
after the
Indo-European migrations The Indo-European migrations are hypothesized migrations of Proto-Indo-Europeans, peoples who spoke Proto-Indo-European language, Proto-Indo-European (PIE) and the derived Indo-European languages, which took place from around 4000 to 1000 BCE, ...
in the region. Albanian in antiquity is often thought to have been an
Illyrian language The Illyrian language () was an Indo-European languages, Indo-European language or group of languages spoken by the Illyrians in Southeast Europe during antiquity. The language is unattested with the exception of personal names and placenames. ...
for obvious geographic and historical reasons,: "It is often thought (for obvious geographic reasons) that Albanian descends from ancient Illyrian (see above), but this cannot be ascertained as we know next to nothing about Illyrian itself." or otherwise an unmentioned Balkan Indo-European language that was closely related to Illyrian and Messapic. The Indo-European subfamily that gave rise to Albanian is called
Albanoid Albanoid or Albanic is a branch or subfamily of the Indo-European languages, Indo-European (IE) languages, of which Albanian language varieties are the only surviving representatives. In current classifications of the IE language family, Albania ...
in reference to a specific ethnolinguistically pertinent and historically compact language group. Whether descendants or sisters of what was called ' Illyrian' by classical sources, Albanian and Messapic, on the basis of shared features and innovations, are grouped together in a common branch in the current phylogenetic classification of the Indo-European language family. The first written mention of Albanian was in 1284 in a witness testimony from the
Republic of Ragusa The Republic of Ragusa, or the Republic of Dubrovnik, was an maritime republics, aristocratic maritime republic centered on the city of Dubrovnik (''Ragusa'' in Italian and Latin; ''Raguxa'' in Venetian) in South Dalmatia (today in southernmost ...
, while a letter written by Dominican Friar Gulielmus Adea in 1332 mentions the Albanians using the Latin alphabet in their writings. The oldest surviving attestation of modern Albanian is from 1462. The two main Albanian dialect groups (or varieties), Gheg and Tosk, are primarily distinguished by phonological differences and are mutually intelligible in their standard varieties, with Gheg spoken to the north and Tosk spoken to the south of the Shkumbin river. Their characteristics in the treatment of both native words and loanwords provide evidence that the split into the northern and the southern dialects occurred after Christianisation of the region (4th century AD),: "The dialectal split into Geg and Tosk happened sometime after the region became Christianized in the fourth century AD: Christian Latin loanwords show Tosk rhotacism, such as Tosk ''murgu'' 'monk' (Geg ''mungu'') from Lat. ''monachus''." and most likely not later than the 6th century AD, hence possibly occupying roughly their present area divided by the Shkumbin river since the Post-Roman and Pre-Slavic period, straddling the Jireček Line. Centuries-old communities speaking Albanian dialects can be found scattered in
Greece Greece, officially the Hellenic Republic, is a country in Southeast Europe. Located on the southern tip of the Balkan peninsula, it shares land borders with Albania to the northwest, North Macedonia and Bulgaria to the north, and Turkey to th ...
(the Arvanites and some communities in
Epirus Epirus () is a Region#Geographical regions, geographical and historical region, historical region in southeastern Europe, now shared between Greece and Albania. It lies between the Pindus Mountains and the Ionian Sea, stretching from the Bay ...
,
Western Macedonia Western Macedonia (, ) is one of the thirteen Regions of Greece, administrative regions of Greece, consisting of the western part of Macedonia (Greece), Macedonia. Located in north-western Greece, it is divided into the regional units of Greece ...
and Western Thrace),
Croatia Croatia, officially the Republic of Croatia, is a country in Central Europe, Central and Southeast Europe, on the coast of the Adriatic Sea. It borders Slovenia to the northwest, Hungary to the northeast, Serbia to the east, Bosnia and Herze ...
(the Arbanasi),
Italy Italy, officially the Italian Republic, is a country in Southern Europe, Southern and Western Europe, Western Europe. It consists of Italian Peninsula, a peninsula that extends into the Mediterranean Sea, with the Alps on its northern land b ...
(the Arbëreshë) as well as in
Romania Romania is a country located at the crossroads of Central Europe, Central, Eastern Europe, Eastern and Southeast Europe. It borders Ukraine to the north and east, Hungary to the west, Serbia to the southwest, Bulgaria to the south, Moldova to ...
,
Turkey Turkey, officially the Republic of Türkiye, is a country mainly located in Anatolia in West Asia, with a relatively small part called East Thrace in Southeast Europe. It borders the Black Sea to the north; Georgia (country), Georgia, Armen ...
and
Ukraine Ukraine is a country in Eastern Europe. It is the List of European countries by area, second-largest country in Europe after Russia, which Russia–Ukraine border, borders it to the east and northeast. Ukraine also borders Belarus to the nor ...
. The Malsia e Madhe Gheg Albanian and two varieties of the Tosk dialect, Arvanitika in Greece and Arbëresh in southern Italy, have preserved archaic elements of the language. Ethnic Albanians constitute a large
diaspora A diaspora ( ) is a population that is scattered across regions which are separate from its geographic place of birth, place of origin. The word is used in reference to people who identify with a specific geographic location, but currently resi ...
, with many having long assimilated in different cultures and communities. Consequently, Albanian-speakers do not correspond to the total ethnic Albanian population, as many ethnic Albanians may identify as Albanian but are unable to speak the language. Standard Albanian is a standardised form of spoken Albanian based on Tosk.


Geographic distribution

The language is spoken by approximately 6 million people in the Balkans, primarily in Albania,
Kosovo Kosovo, officially the Republic of Kosovo, is a landlocked country in Southeast Europe with International recognition of Kosovo, partial diplomatic recognition. It is bordered by Albania to the southwest, Montenegro to the west, Serbia to the ...
,
North Macedonia North Macedonia, officially the Republic of North Macedonia, is a landlocked country in Southeast Europe. It shares land borders with Greece to the south, Albania to the west, Bulgaria to the east, Kosovo to the northwest and Serbia to the n ...
,
Serbia , image_flag = Flag of Serbia.svg , national_motto = , image_coat = Coat of arms of Serbia.svg , national_anthem = () , image_map = , map_caption = Location of Serbia (gree ...
,
Montenegro , image_flag = Flag of Montenegro.svg , image_coat = Coat of arms of Montenegro.svg , coa_size = 80 , national_motto = , national_anthem = () , image_map = Europe-Mont ...
and
Greece Greece, officially the Hellenic Republic, is a country in Southeast Europe. Located on the southern tip of the Balkan peninsula, it shares land borders with Albania to the northwest, North Macedonia and Bulgaria to the north, and Turkey to th ...
. However, due to old communities in
Italy Italy, officially the Italian Republic, is a country in Southern Europe, Southern and Western Europe, Western Europe. It consists of Italian Peninsula, a peninsula that extends into the Mediterranean Sea, with the Alps on its northern land b ...
and the large Albanian diaspora, the worldwide total of speakers is much higher than in Southern Europe and numbers approximately 7.5 million.


Europe

The Albanian language is the official language of
Albania Albania ( ; or ), officially the Republic of Albania (), is a country in Southeast Europe. It is located in the Balkans, on the Adriatic Sea, Adriatic and Ionian Seas within the Mediterranean Sea, and shares land borders with Montenegro to ...
and
Kosovo Kosovo, officially the Republic of Kosovo, is a landlocked country in Southeast Europe with International recognition of Kosovo, partial diplomatic recognition. It is bordered by Albania to the southwest, Montenegro to the west, Serbia to the ...
and a co-official language in
North Macedonia North Macedonia, officially the Republic of North Macedonia, is a landlocked country in Southeast Europe. It shares land borders with Greece to the south, Albania to the west, Bulgaria to the east, Kosovo to the northwest and Serbia to the n ...
and
Montenegro , image_flag = Flag of Montenegro.svg , image_coat = Coat of arms of Montenegro.svg , coa_size = 80 , national_motto = , national_anthem = () , image_map = Europe-Mont ...
. Albanian is a recognised minority language in
Croatia Croatia, officially the Republic of Croatia, is a country in Central Europe, Central and Southeast Europe, on the coast of the Adriatic Sea. It borders Slovenia to the northwest, Hungary to the northeast, Serbia to the east, Bosnia and Herze ...
,
Italy Italy, officially the Italian Republic, is a country in Southern Europe, Southern and Western Europe, Western Europe. It consists of Italian Peninsula, a peninsula that extends into the Mediterranean Sea, with the Alps on its northern land b ...
,
Romania Romania is a country located at the crossroads of Central Europe, Central, Eastern Europe, Eastern and Southeast Europe. It borders Ukraine to the north and east, Hungary to the west, Serbia to the southwest, Bulgaria to the south, Moldova to ...
and in
Serbia , image_flag = Flag of Serbia.svg , national_motto = , image_coat = Coat of arms of Serbia.svg , national_anthem = () , image_map = , map_caption = Location of Serbia (gree ...
. Albanian is also spoken by a minority in Greece, specifically in the Thesprotia and
Preveza Preveza (, ) is a city in the region of Epirus (region), Epirus, northwestern Greece, located on the northern peninsula of the mouth of the Ambracian Gulf. It is the capital of the Preveza (regional unit), regional unit of Preveza, which is the s ...
regional units and in a few villages in Ioannina and Florina regional units in Greece. It is also spoken by 450,000 Albanian immigrants in Greece, making it one of the commonly spoken languages in the country after Greek. Albanian is the third most common mother tongue among foreign residents in
Italy Italy, officially the Italian Republic, is a country in Southern Europe, Southern and Western Europe, Western Europe. It consists of Italian Peninsula, a peninsula that extends into the Mediterranean Sea, with the Alps on its northern land b ...
. This is due to a substantial Albanian immigration to Italy. Italy has a historical Albanian minority of about 500,000, scattered across southern Italy, known as Arbëreshë. Approximately 1 million Albanians from Kosovo are dispersed throughout
Germany Germany, officially the Federal Republic of Germany, is a country in Central Europe. It lies between the Baltic Sea and the North Sea to the north and the Alps to the south. Its sixteen States of Germany, constituent states have a total popu ...
,
Switzerland Switzerland, officially the Swiss Confederation, is a landlocked country located in west-central Europe. It is bordered by Italy to the south, France to the west, Germany to the north, and Austria and Liechtenstein to the east. Switzerland ...
and
Austria Austria, formally the Republic of Austria, is a landlocked country in Central Europe, lying in the Eastern Alps. It is a federation of nine Federal states of Austria, states, of which the capital Vienna is the List of largest cities in Aust ...
. These are mainly immigrants from Kosovo who migrated during the 1990s. In
Switzerland Switzerland, officially the Swiss Confederation, is a landlocked country located in west-central Europe. It is bordered by Italy to the south, France to the west, Germany to the north, and Austria and Liechtenstein to the east. Switzerland ...
, the Albanian language is the sixth most spoken language with 176,293 native speakers. Albanian became an official language in North Macedonia on 15 January 2019.


Americas

There are large numbers of Albanian speakers in the United States, Argentina, Chile, Uruguay, and
Canada Canada is a country in North America. Its Provinces and territories of Canada, ten provinces and three territories extend from the Atlantic Ocean to the Pacific Ocean and northward into the Arctic Ocean, making it the world's List of coun ...
. Some of the first ethnic Albanians to arrive in the United States were the Arbëreshë. The Arbëreshë have a strong sense of identity and are unique in that they speak an archaic dialect of Tosk Albanian called Arbëresh. In the United States and Canada, there are approximately 250,000 Albanian speakers. It is primarily spoken on the East Coast of the United States, in cities like New York City, Boston, Chicago, Philadelphia, and Detroit, as well as in parts of the states of New Jersey, Ohio, and Connecticut. In Argentina, there are nearly 40,000 Albanian speakers, mostly in Buenos Aires.


Asia and Africa

Approximately 1.3 million people of Albanian ancestry live in
Turkey Turkey, officially the Republic of Türkiye, is a country mainly located in Anatolia in West Asia, with a relatively small part called East Thrace in Southeast Europe. It borders the Black Sea to the north; Georgia (country), Georgia, Armen ...
, with more than 500,000 recognizing their ancestry, language and
culture Culture ( ) is a concept that encompasses the social behavior, institutions, and Social norm, norms found in human societies, as well as the knowledge, beliefs, arts, laws, Social norm, customs, capabilities, Attitude (psychology), attitudes ...
. There are other estimates, however, that place the number of people in Turkey with Albanian ancestry and or background upward to 5 million. However, the vast majority of this population is assimilated and no longer possesses fluency in the Albanian language, though a vibrant Albanian community maintains its distinct identity in
Istanbul Istanbul is the List of largest cities and towns in Turkey, largest city in Turkey, constituting the country's economic, cultural, and historical heart. With Demographics of Istanbul, a population over , it is home to 18% of the Demographics ...
to this day.
Egypt Egypt ( , ), officially the Arab Republic of Egypt, is a country spanning the Northeast Africa, northeast corner of Africa and Western Asia, southwest corner of Asia via the Sinai Peninsula. It is bordered by the Mediterranean Sea to northe ...
also lays claim to about 18,000 Albanians, mostly Tosk speakers. Many are descendants of the Janissary of Muhammad Ali Pasha, an Albanian who became Wāli, and self-declared
Khedive Khedive ( ; ; ) was an honorific title of Classical Persian origin used for the sultans and grand viziers of the Ottoman Empire, but most famously for the Khedive of Egypt, viceroy of Egypt from 1805 to 1914.Adam Mestyan"Khedive" ''Encyclopaedi ...
of Egypt and
Sudan Sudan, officially the Republic of the Sudan, is a country in Northeast Africa. It borders the Central African Republic to the southwest, Chad to the west, Libya to the northwest, Egypt to the north, the Red Sea to the east, Eritrea and Ethiopi ...
. In addition to the
dynasty A dynasty is a sequence of rulers from the same family, usually in the context of a monarchy, monarchical system, but sometimes also appearing in republics. A dynasty may also be referred to as a "house", "family" or "clan", among others. H ...
that he established, a large part of the former Egyptian and Sudanese
aristocracy Aristocracy (; ) is a form of government that places power in the hands of a small, privileged ruling class, the aristocracy (class), aristocrats. Across Europe, the aristocracy exercised immense Economy, economic, Politics, political, and soc ...
was of Albanian origin. In addition to the recent emigrants, there are older diasporic communities around the world.


Oceania

Albanian is also spoken by Albanian diaspora communities residing in
Australia Australia, officially the Commonwealth of Australia, is a country comprising mainland Australia, the mainland of the Australia (continent), Australian continent, the island of Tasmania and list of islands of Australia, numerous smaller isl ...
and
New Zealand New Zealand () is an island country in the southwestern Pacific Ocean. It consists of two main landmasses—the North Island () and the South Island ()—and List of islands of New Zealand, over 600 smaller islands. It is the List of isla ...
.


Number of native speakers

The following table provides data on the number of native Albanian speakers by country, according to the most recent publicly available sources. The number of speakers is presented as an absolute value for the total population, obtained by projecting the valid answers to the total population (thus not taking into account unanswered questionnaires, missing data or excluded age ranges).


Dialects

* Albanian ** Tosk *** Northern Tosk *** Labërisht *** Cham *** Arvanitika *** Arbëresh ** Gheg *** Northwest Gheg *** Northeast Gheg *** Central Gheg *** Southern Gheg The Albanian language has two distinct dialects, Tosk which is spoken in the south, and Gheg spoken in the north. Standard Albanian is based on the Tosk dialect. The Shkumbin River is the rough dividing line between the two dialects.''The river Shkumbin in central Albania historically forms the boundary between those two dialects, with the population on the north speaking varieties of Geg and the population on the south varieties of Tosk.'' (page 23
Concise Encyclopedia of Languages of the World
By Keith Brown, Sarah Ogilvie Contributor Keith Brown, Sarah Ogilvie Edition: illustrated Published by Elsevier, 2008 ,
Gheg is divided into four sub-dialects: Northwest Gheg, Northeast Gheg, Central Gheg and Southern Gheg. It is primarily spoken in northern Albania,
Kosovo Kosovo, officially the Republic of Kosovo, is a landlocked country in Southeast Europe with International recognition of Kosovo, partial diplomatic recognition. It is bordered by Albania to the southwest, Montenegro to the west, Serbia to the ...
, and throughout
Montenegro , image_flag = Flag of Montenegro.svg , image_coat = Coat of arms of Montenegro.svg , coa_size = 80 , national_motto = , national_anthem = () , image_map = Europe-Mont ...
and northwestern
North Macedonia North Macedonia, officially the Republic of North Macedonia, is a landlocked country in Southeast Europe. It shares land borders with Greece to the south, Albania to the west, Bulgaria to the east, Kosovo to the northwest and Serbia to the n ...
. One fairly divergent dialect is the Upper Reka dialect, which is however classified as Central Gheg. There is also a diaspora dialect in
Croatia Croatia, officially the Republic of Croatia, is a country in Central Europe, Central and Southeast Europe, on the coast of the Adriatic Sea. It borders Slovenia to the northwest, Hungary to the northeast, Serbia to the east, Bosnia and Herze ...
, the Arbanasi dialect. Tosk is divided into five sub-dialects, including Northern Tosk (the most numerous in speakers), Labërisht, Cham, Arvanitika, and Arbëresh. Tosk is spoken in southern Albania, southwestern North Macedonia and northern and southern Greece. Cham Albanian is spoken in North-western Greece, while Arvanitika is spoken by the Arvanites in southern Greece. In addition, Arbëresh is spoken by the Arbëreshë people, descendants of 15th and 16th century migrants who settled in southeastern Italy, in small communities in the regions of
Sicily Sicily (Italian language, Italian and ), officially the Sicilian Region (), is an island in the central Mediterranean Sea, south of the Italian Peninsula in continental Europe and is one of the 20 regions of Italy, regions of Italy. With 4. ...
and
Calabria Calabria is a Regions of Italy, region in Southern Italy. It is a peninsula bordered by the region Basilicata to the north, the Ionian Sea to the east, the Strait of Messina to the southwest, which separates it from Sicily, and the Tyrrhenian S ...
. These settlements originated from the (Arvanites) communities probably of Peloponnese known as Morea in the Middle Ages. Among them the Arvanites call themselves Arbëror and sometime Arbëresh. The Arbëresh dialect is closely related to the Arvanites dialect with more Italian vocabulary absorbed during different periods of time.


Orthography

The Albanian language has been written using many alphabets since the earliest records from the 15th century. The history of Albanian language orthography is closely related to the cultural orientation and knowledge of certain foreign languages among Albanian writers. The earliest written Albanian records come from the Gheg area in makeshift spellings based on Italian or Greek. Originally, the Tosk dialect was written in the
Greek alphabet The Greek alphabet has been used to write the Greek language since the late 9th or early 8th century BC. It was derived from the earlier Phoenician alphabet, and is the earliest known alphabetic script to systematically write vowels as wel ...
and the Gheg dialect was written in the
Latin script The Latin script, also known as the Roman script, is a writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae in Magna Graecia. The Gree ...
. Both dialects had also been written in the Ottoman Turkish version of the
Arabic script The Arabic script is the writing system used for Arabic (Arabic alphabet) and several other languages of Asia and Africa. It is the second-most widely used alphabetic writing system in the world (after the Latin script), the second-most widel ...
,
Cyrillic The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Ea ...
, and some local alphabets ( Elbasan, Vithkuqi, Todhri, Veso Bey, Jan Vellara and others, see original Albanian alphabets). More specifically, the writers from northern Albania and under the influence of the Catholic Church used Latin letters, those in southern Albania and under the influence of the Greek Orthodox church used Greek letters, while others throughout Albania and under the influence of Islam used Arabic letters. There were initial attempts to create an original Albanian alphabet during the 1750–1850 period. These attempts intensified after the League of Prizren and culminated with the Congress of Manastir held by Albanian intellectuals from 14 to 22 November 1908, in Manastir (present day Bitola), which decided on which alphabet to use, and what the standardised spelling would be for standard Albanian. This is how the literary language remains. The alphabet is the Latin alphabet with the addition of the letters , , and ten digraphs: , , , , , , , , and . According to Robert Elsie:Elsie, Robert. (2017). ''Albanian Alphabets: Borrowed and Invented''. London, UK: CreateSpace Independent Publishing Platform. .


Classification

Albanian constitutes one of the eleven major branches of the Indo-European language family, within which it occupies an independent position. In 1854, Albanian was demonstrated to be an
Indo-European language The Indo-European languages are a language family native to the northern Indian subcontinent, most of Europe, and the Iranian plateau with additional native branches found in regions such as Sri Lanka, the Maldives, parts of Central Asia ( ...
by the
philologist Philology () is the study of language in oral and written historical sources. It is the intersection of textual criticism, literary criticism, history, and linguistics with strong ties to etymology. Philology is also defined as the study of ...
Franz Bopp. Albanian was formerly compared by a few Indo-European linguists with Germanic and Balto-Slavic, all of which share a number of
isogloss An isogloss, also called a heterogloss, is the geographic boundary of a certain linguistics, linguistic feature, such as the pronunciation of a vowel, the meaning of a word, or the use of some morphological or syntactic feature. Isoglosses are a ...
es with Albanian. Other linguists linked the Albanian language with
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
, Greek and
Armenian Armenian may refer to: * Something of, from, or related to Armenia, a country in the South Caucasus region of Eurasia * Armenians, the national people of Armenia, or people of Armenian descent ** Armenian diaspora, Armenian communities around the ...
, while placing Germanic and Balto-Slavic in another branch of Indo-European.Google Books
Mallory, J. P. and Adams, D. Q.: The Oxford Introduction to Proto-Indo-European and the Proto-Indo-European World
JHholm.de
, Holm, Hans J.: The Distribution of Data in Word Lists and its Impact on the Subgrouping of Languages. In: Christine Preisach, Hans Burkhardt, Lars Schmidt-Thieme, Reinhold Decker (eds.): Data Analysis, Machine Learning, and Applications. Proc. of the 31st Annual Conference of the German Classification Society (GfKl), University of Freiburg, 7–9 March 2007. Springer-Verlag, Heidelberg-Berlin
HJholm.de
A possible Homeland of the Indo-European Languages And their Migrations in the Light of the Separation Level Recovery (SLRD) Method – Hans J. Holm
In current scholarship there is evidence that Albanian is closely related to Greek and Armenian, while the fact that it is a satem language is less significant. Messapic is considered the closest language to Albanian, grouped in a common branch titled ''Illyric'' in Hyllested & Joseph (2022). Hyllested & Joseph (2022) in agreement with recent bibliography identify Greco-Phrygian as the IE branch closest to the Albanian-Messapic one. These two branches form an areal grouping – which is often called "Balkan IE" – with Armenian. The hypothesis of the " Balkan Indo-European" continuum posits a common period of prehistoric coexistence of several Indo-European dialects in the
Balkans The Balkans ( , ), corresponding partially with the Balkan Peninsula, is a geographical area in southeastern Europe with various geographical and historical definitions. The region takes its name from the Balkan Mountains that stretch throug ...
prior to 2000 BC. To this group would belong Albanian,
Ancient Greek Ancient Greek (, ; ) includes the forms of the Greek language used in ancient Greece and the classical antiquity, ancient world from around 1500 BC to 300 BC. It is often roughly divided into the following periods: Mycenaean Greek (), Greek ...
,
Armenian Armenian may refer to: * Something of, from, or related to Armenia, a country in the South Caucasus region of Eurasia * Armenians, the national people of Armenia, or people of Armenian descent ** Armenian diaspora, Armenian communities around the ...
, Phrygian, fragmentary attested languages such as Macedonian, Thracian, or Illyrian, and the relatively well-attested Messapic in Southern Italy. The common features of this group appear at the phonological, morphological, and lexical levels, presumably resulting from the contact between the various languages. The concept of this linguistic group is explained as a kind of language league of the
Bronze Age The Bronze Age () was a historical period characterised principally by the use of bronze tools and the development of complex urban societies, as well as the adoption of writing in some areas. The Bronze Age is the middle principal period of ...
(a specific areal-linguistics phenomenon), although it also consisted of languages that were related to each other. A common prestage posterior to PIE comprising Albanian, Greek, and Armenian, is considered as a possible scenario. In this light, due to the larger number of possible shared innovations between Greek and Armenian, it appears reasonable to assume, at least tentatively, that Albanian was the first Balkan IE language to branch off. This split and the following ones were perhaps very close in time, allowing only a narrow time frame for shared innovations. Albanian represents one of the core languages of the Balkan ''Sprachbund''. Glottolog and Ethnologue recognize four Albanian languages. They are classified as follows: * Indo-European ** Albanian *** Tosk **** Northern Tosk Albanian **** Southern Tosk ***** Arbëreshë Albanian ***** Arvanitika Albanian *** Gheg Albanian


History


Historical documentation

The first attested written mention of the Albanian language was on 14 July 1284 in Ragusa in modern
Croatia Croatia, officially the Republic of Croatia, is a country in Central Europe, Central and Southeast Europe, on the coast of the Adriatic Sea. It borders Slovenia to the northwest, Hungary to the northeast, Serbia to the east, Bosnia and Herze ...
(
Dubrovnik Dubrovnik, historically known as Ragusa, is a city in southern Dalmatia, Croatia, by the Adriatic Sea. It is one of the most prominent tourist destinations in the Mediterranean Sea, Mediterranean, a Port, seaport and the centre of the Dubrovni ...
) when a crime witness named Matthew testified: "I heard a voice crying on the mountain in the Albanian language" (). The Albanian language is also mentioned in the '' Descriptio Europae Orientalis'' dated in 1308:
' (Namely, the above-mentioned Albanians have a language that is different from the languages of Latins, Greeks and Slavs, so that they do not understand each other at all.)
The oldest attested document written in Albanian dates to 1462, while the first audio recording in the language was made by Norbert Jokl on 4 April 1914 in
Vienna Vienna ( ; ; ) is the capital city, capital, List of largest cities in Austria, most populous city, and one of Federal states of Austria, nine federal states of Austria. It is Austria's primate city, with just over two million inhabitants. ...
. However, as Fortson notes, Albanian written works existed before this point; they have simply been lost. The existence of written Albanian is explicitly mentioned in a letter attested from 1332, and the first preserved books, including both those in Gheg and in Tosk, share orthographic features that indicate that some form of common literary language had developed. By the
Late Middle Ages The late Middle Ages or late medieval period was the Periodization, period of History of Europe, European history lasting from 1300 to 1500 AD. The late Middle Ages followed the High Middle Ages and preceded the onset of the early modern period ( ...
, during the period of
Humanism Humanism is a philosophy, philosophical stance that emphasizes the individual and social potential, and Agency (philosophy), agency of human beings, whom it considers the starting point for serious moral and philosophical inquiry. The me ...
and the European Renaissance, the term was preferred in the intellectual, literary, and clerical circles of the time, and used as a synonym for the Albanian language. Published in Rome in 1635, by the Albanian bishop and writer Frang Bardhi, the first dictionary of the Albanian language was titled . During the five-century period of the Ottoman presence in
Albania Albania ( ; or ), officially the Republic of Albania (), is a country in Southeast Europe. It is located in the Balkans, on the Adriatic Sea, Adriatic and Ionian Seas within the Mediterranean Sea, and shares land borders with Montenegro to ...
, the language was not officially recognised until 1909, when the Congress of Dibra decided that Albanian schools would finally be allowed.


Linguistic affinities

Albanian is an isolate within the Indo-European language family; no other language has been conclusively linked to its branch. The only other languages that are the sole surviving members of a branch of Indo-European are
Armenian Armenian may refer to: * Something of, from, or related to Armenia, a country in the South Caucasus region of Eurasia * Armenians, the national people of Armenia, or people of Armenian descent ** Armenian diaspora, Armenian communities around the ...
and Greek. The Albanian language is part of the
Indo-European The Indo-European languages are a language family native to the northern Indian subcontinent, most of Europe, and the Iranian plateau with additional native branches found in regions such as Sri Lanka, the Maldives, parts of Central Asia (e. ...
language family A language family is a group of languages related through descent from a common ancestor, called the proto-language of that family. The term ''family'' is a metaphor borrowed from biology, with the tree model used in historical linguistics ...
and the only surviving representative of its own branch, which belongs to the Paleo-Balkan group. However, it is still uncertain exactly which ancient language of the Balkans is the ancestor of Albanian or where in the region its speakers originally lived. In general, there is insufficient evidence to connect Albanian with one of those languages, whether Illyrian, Thracian, or Dacian.Among these possibilities, Illyrian is the most probable. Although Albanian shares lexical
isogloss An isogloss, also called a heterogloss, is the geographic boundary of a certain linguistics, linguistic feature, such as the pronunciation of a vowel, the meaning of a word, or the use of some morphological or syntactic feature. Isoglosses are a ...
es with Greek, Germanic, and to a lesser extent Balto-Slavic, the vocabulary of Albanian is quite distinct. In 1995, Taylor, Ringe, and Warnow used quantitative linguistic techniques that appeared to obtain an Albanian subgrouping with Germanic, a result which the authors had already reasonably downplayed. Indeed, the Albanian and Germanic branches share a relatively moderate number of lexical cognates. Many shared grammatical elements or features of these two branches do not corroborate the lexical isoglosses. Albanian also shares lexical linguistic affinity with Latin and Romance languages. Sharing linguistic features unique to the languages of the Balkans, Albanian also forms a part of the Balkan linguistic area or ''sprachbund''.


Historical presence and location

The place and the time that the Albanian language was formed are uncertain. The American linguist Eric Hamp has said that during an unknown chronological period a pre-Albanian population (termed as "Albanoid" by Hamp) inhabited areas stretching from
Poland Poland, officially the Republic of Poland, is a country in Central Europe. It extends from the Baltic Sea in the north to the Sudetes and Carpathian Mountains in the south, bordered by Lithuania and Russia to the northeast, Belarus and Ukrai ...
to the southwestern Balkans. Further analysis has suggested that it was in a mountainous region rather than on a plain or seacoast. The words for plants and animals characteristic of mountainous regions are entirely original, but the names for fish and for agricultural activities (such as
plough A plough or ( US) plow (both pronounced ) is a farm tool for loosening or turning the soil before sowing seed or planting. Ploughs were traditionally drawn by oxen and horses but modern ploughs are drawn by tractors. A plough may have a wooden ...
ing) are borrowed from other languages.Tripod.co
Eric Hamp, "The position of Albanian, Ancient IE dialects, Proceedings of the Conference on IE linguistics held at the University of California, Los Angeles, April 25–27, 1963, ed. By Henrik Birnbaum and Jaan Puhvel. "It is clear that in the Middle Ages the Albanians extended farther north (Jokl, Albaner §2); that there are persuasive arguments which have been advanced against their having extended as far as the Adriatic coast—the fact that Scodra 'Scutari' (Shkodër) shows un-Albanian development (see §6 below), that there is no demonstrated old maritime vocabulary (see above), and that there are few ancient Greek loans (Jokl, Albaner §5; but see §5 below)
A deeper analysis of the vocabulary, however, shows that could be a consequence of a prolonged
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
domination of the coastal and plain areas of the country, rather than evidence of the original environment in which the Albanian language was formed. For example, the word for 'fish' is borrowed from Latin, but not the word for 'gills' which is native. Indigenous are also the words for 'ship', 'raft', 'navigation', 'sea shelves' and a few names of fish kinds, but not the words for 'sail', 'row' and 'harbor'; objects pertaining to navigation itself and a large part of sea fauna. This rather shows that Proto-Albanians were pushed away from coastal areas in early times (probably after the Latin conquest of the region) and thus lost a large amount (or the majority) of their sea environment lexicon. A similar phenomenon could be observed with agricultural terms. While the words for 'arable land', 'wheat', 'cereals', 'vineyard', 'yoke', 'harvesting', 'cattle breeding', etc. are native, the words for 'ploughing', 'farm' and 'farmer', agricultural practices, and some harvesting tools are foreign. This, again, points to intense contact with other languages and people, rather than providing evidence of a possible linguistic homeland (also known as a ''Urheimat''). The centre of Albanian settlement remained the Mat River. In 1079, the Albanians were recorded farther south in the valley of the Shkumbin River. The Shkumbin, a 181 km long river that lies near the old Via Egnatia, is approximately the boundary of the primary dialect division for Albanian, Tosk and Gheg. The characteristics of Tosk and Gheg in the treatment of the native words and loanwords from other languages are evidence that the dialectal split preceded the Slavic migrations to the Balkans, which means that in that period (the 5th to 6th centuries AD), Albanians were occupying nearly the same area around the Shkumbin river, which straddled the Jireček Line. References to the existence of Albanian as a distinct language survive from the 14th century, but they failed to cite specific words. The oldest surviving documents written in Albanian are the "" (Baptismal formula), '. ("I baptize thee in the name of the Father, and the Son, and the Holy Spirit") recorded by Pal Engjelli, Bishop of Durrës in 1462 in the Gheg dialect, and some
New Testament The New Testament (NT) is the second division of the Christian biblical canon. It discusses the teachings and person of Jesus in Christianity, Jesus, as well as events relating to Christianity in the 1st century, first-century Christianit ...
verses from that period. The linguists Stefan Schumacher and Joachim Matzinger (University of Vienna) assert that the first literary records of Albanian date from the 16th century. The oldest known Albanian printed book, '' Meshari'', or "missal", was written in 1555 by Gjon Buzuku, a
Roman Catholic The Catholic Church (), also known as the Roman Catholic Church, is the largest Christian church, with 1.27 to 1.41 billion baptized Catholics worldwide as of 2025. It is among the world's oldest and largest international institut ...
cleric. In 1635, Frang Bardhi wrote the first Latin–Albanian dictionary. The first Albanian school is believed to have been opened by
Franciscans The Franciscans are a group of related organizations in the Catholic Church, founded or inspired by the Italian saint Francis of Assisi. They include three independent religious orders for men (the Order of Friars Minor being the largest conte ...
in 1638 in . One of the earliest Albanian dictionaries was written in 1693; it was the Italian manuscript ''Pratichae Schrivaneschae'' authored by the Montenegrin sea captain Julije Balović and includes a multilingual dictionary of hundreds of the most frequently used words in everyday life in Italian, Slavic, Greek, Albanian, and Turkish.


Pre-Indo-European substratum

Pre-Indo-European (PreIE) sites are found throughout the territory of Albania. Such PreIE sites existed in Maliq, Vashtëmi, Burimas, Barç, Dërsnik in the Korçë District, Kamnik in Kolonja, Kolsh in the Kukës District, Rashtan in Librazhd, and Nezir in the Mat District. As in other parts of Europe, these PreIE people joined the migratory Indo-European tribes that entered the Balkans and contributed to the formation of the historical Paleo-Balkan tribes. In terms of linguistics, the pre-Indo-European substrate language spoken in the southern Balkans probably influenced pre-Proto-Albanian, the ancestor idiom of Albanian. The extent of this linguistic impact cannot be determined with precision due to the uncertain position of Albanian among
Paleo-Balkan languages The Paleo-Balkan languages are a geographical grouping of various Indo-European languages that were spoken in the Balkans and surrounding areas in ancient times. In antiquity, Dacian, Greek, Illyrian, Messapic, Paeonian, Phrygian and Thracian wer ...
and their scarce attestation. Some loanwords, however, have been proposed, such as ' pomegranate' or ' orach'; compare Pre-Greek , ''lápathon'' ' monk's rhubarb'.


Literary tradition


Earliest undisputed texts

The earliest known texts in Albanian: * the (Baptismal Formula), which dates back to 1462 and was authored by Pal Engjëlli (or Paulus Angelus) ( – 1470), Archbishop of Durrës. Engjëlli was a close friend and counsellor of Skanderbeg. It was written in a pastoral letter for a synod at the Holy Trinity in Mat and read in Latin characters as follows: (standard Albanian: ; English: "I baptise you in the name of the Father and the Son and the Holy Spirit"). It was discovered and published in 1915 by Nicolae Iorga. * the ( Arnold Ritter von Harff's lexicon), a short list of Albanian phrases with German glosses, dated 1496. * a song, recorded in the Greek alphabet, retrieved from an old
codex The codex (: codices ) was the historical ancestor format of the modern book. Technically, the vast majority of modern books use the codex format of a stack of pages bound at one edge, along the side of the text. But the term ''codex'' is now r ...
that was written in Greek. The document is also called or ("The Song of the Easter Gospel, or "The Song of Saint Matthew's Gospel"). Although the codex is dated to during the 14th century, the song, written in Albanian by an anonymous writer, seems to be a 16th-century writing. The document was found by Arbëreshë people who had emigrated to Italy in the 15th century. * The first book in Albanian is the Meshari ("The Missal"), written by Gjon Buzuku between 20 March 1554 and 5 January 1555. The book was written in the Gheg dialect in the
Latin script The Latin script, also known as the Roman script, is a writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae in Magna Graecia. The Gree ...
with some Slavic letters adapted for Albanian vowels. The book was discovered in 1740 by Gjon Nikollë Kazazi, the Albanian
archbishop In Christian denominations, an archbishop is a bishop of higher rank or office. In most cases, such as the Catholic Church, there are many archbishops who either have jurisdiction over an ecclesiastical province in addition to their own archdi ...
of
Skopje Skopje ( , ; ; , sq-definite, Shkupi) is the capital and largest city of North Macedonia. It lies in the northern part of the country, in the Skopje Basin, Skopje Valley along the Vardar River, and is the political, economic, and cultura ...
. It contains the liturgies of the main
holiday A holiday is a day or other period of time set aside for festivals or recreation. ''Public holidays'' are set by public authorities and vary by state or region. Religious holidays are set by religious organisations for their members and are often ...
s. There are also texts of prayers and rituals and catechetical texts. The grammar and the vocabulary are more archaic than those in the Gheg texts from the 17th century. The 188 pages of the book comprise about 154,000 words with a total vocabulary of c. 1,500 different words. The text is archaic yet easily interpreted because it is mainly a translation of known texts, in particular portions of the
Bible The Bible is a collection of religious texts that are central to Christianity and Judaism, and esteemed in other Abrahamic religions such as Islam. The Bible is an anthology (a compilation of texts of a variety of forms) originally writt ...
. The book also contains passages from the Psalms, the Book of Isaiah, the
Book of Jeremiah The Book of Jeremiah () is the second of the Latter Prophets in the Hebrew Bible, and the second of the Prophets in the Christian Old Testament. The superscription at chapter Jeremiah 1#Superscription, Jeremiah 1:1–3 identifies the book as "th ...
, the Letters to the Corinthians, and many illustrations. The uniformity of spelling seems to indicate an earlier tradition of writing. The only known copy of the Meshari is held by the Apostolic Library. In 1968 the book was published with transliterations and comments by linguists. * The first printed work in Tosk Albanian is the ''Mbsuame e krështerë'' (in Italian: ''Dottrina cristiana'') by Lekë Matrënga or (in Italian) Luca Matranga. It was published in 1592 and is written in an early form of the Arbëresh language (also known as Italo-Albanian). Albanian scripts were produced earlier than the first attested document, , but none yet have been discovered. We know of their existence by earlier references. For example, a French monk signed as "Broccardus" notes, in 1332, that "Although the Albanians have another language totally different from Latin, ''they still use Latin letters in all their books''".


Disputed earlier texts

In 1967 two scholars claimed to have found a Letter text in Albanian inserted into the Bellifortis text, a book written in Latin dating to 1402–1405. Robert Elsie, a specialist in Albanian studies, considers that "The Todericiu/Polena Romanian translation of the non-Latin lines, although it may offer some clues if the text is indeed Albanian, is fanciful and based, among other things, on a false reading of the manuscript, including the exclusion of a whole line."


Ottoman period

In 1635, Frang Bardhi (1606–1643) published in
Rome Rome (Italian language, Italian and , ) is the capital city and most populated (municipality) of Italy. It is also the administrative centre of the Lazio Regions of Italy, region and of the Metropolitan City of Rome. A special named with 2, ...
his , the first known Latin-Albanian dictionary. Other scholars who studied the language during the 17th century include Andrea Bogdani (1600–1685), author of the first Latin-Albanian grammar book, Nilo Katalanos (1637–1694) and others.


Indo-European features


Indo-European vocabulary


PIE phonological correspondences

Phonologically, Albanian is not so conservative. Like many IE stocks, it has merged the two series of voiced stops (e.g. both PIE and became ). In addition, voiced stops tend to disappear in between vowels. There is almost complete loss of final syllables and very widespread loss of other unstressed syllables (e.g. 'friend' from Lat. ). PIE appears as (also as if a high front vowel follows), while PIE and become , and PIE appears as . The palatals, velars, and labiovelars show distinct developments, with Albanian showing the three-way distinction also found in Luwian. Labiovelars are for the most part differentiated from all other Indo-European velar series before front vowels, but they merge with the "pure" (back) velars elsewhere. The palatal velar series, consisting of Proto-Indo-European and the merged and , usually developed into and , but were depalatalised to merge with the back velars when in contact with sonorants. Because the original Proto-Indo-European tripartite distinction between dorsals is preserved in such reflexes, Albanian is therefore neither centum nor satem, despite having a "satem-like" realization of the palatal dorsals in most cases. Thus PIE , , and become , , and , respectively (before back vowels PIE becomes , while and merge as ). A minority of scholars reconstruct a fourth laryngeal allegedly surfacing as Alb. word-initially, e.g. Alb. 'testicles' presumably from PIE , (rather than the usual reconstruction ), but this is generally not followed elsewhere, as has arisen elsewhere idiosyncratically (for example Alb. < Lat. ).


Standard Albanian

Since
World War II World War II or the Second World War (1 September 1939 – 2 September 1945) was a World war, global conflict between two coalitions: the Allies of World War II, Allies and the Axis powers. World War II by country, Nearly all of the wo ...
, standard Albanian used in Albania has been based on the Tosk dialect. Kosovo and other areas where Albanian is official adopted the Tosk standard in 1969.


Elbasan-based standard

Until the early 20th century, Albanian writing developed in three main literary traditions: Gheg, Tosk, and Arbëreshë. Throughout this time, a Gheg subdialect spoken around Elbasan served as ''lingua franca'' among the Albanians, but was less prevalent in writing. The Congress of Manastir of Albanian writers held in 1908 recommended the use of the Elbasan subdialect for literary purposes and as a basis of a unified national language. While technically classified as a southern Gheg variety, the Elbasan speech is closer to Tosk in phonology and practically a hybrid between other Gheg subdialects and literary Tosk. Between 1916 and 1918, the Albanian Literary Commission met in Shkodër under the leadership of
Luigj Gurakuqi Luigj Gurakuqi (19 February 1879 – 2 March 1925), also called Louis Gurakuchi, was an Albanians, Albanian writer and politician. He was an important figure of the Albanian National Awakening and was honored with the People's Hero of Albania ...
with the purpose of establishing a unified orthography for the language. The commission, made up of representatives from the north and south of Albania, reaffirmed the Elbasan subdialect as the basis of a national tongue. The rules published in 1917 defined spelling for the Elbasan variety for official purposes. The commission did not, however, discourage publications in one of the dialects, but rather laid a foundation for Gheg and Tosk to gradually converge into one. When the Congress of Lushnje met in the aftermath of World War I to form a new Albanian government, the 1917 decisions of the Literary Commission were upheld. The Elbasan subdialect remained in use for administrative purposes and many new writers embraced it for creative writing. Gheg and Tosk continued to develop freely and interaction between the two dialects increased.


Tosk standard

At the end of World War II, however, the new communist regime radically imposed the use of the Tosk dialect in all facets of life in Albania: administration, education, and literature. Most Communist leaders were Tosks from the south. Standardisation was directed by the Albanian Institute of Linguistics and Literature of the Academy of Sciences of Albania. Two dictionaries were published in 1954: an Albanian language dictionary and a Russian–Albanian dictionary. New orthography rules were eventually published in 1967 and in 1973 with the ''Drejtshkrimi i gjuhës shqipe'' (Orthography of the Albanian Language). Until 1968, Kosovo and other Albanian-speaking areas in Yugoslavia followed the 1917 standard based on the Elbasan dialect, though it was gradually infused with Gheg elements in an effort to develop a Kosovan language separate from communist Albania's Tosk-based standard. Albanian intellectuals in the former Yugoslavia consolidated the 1917 standard twice in the 1950s, culminating with a thorough codification of orthographic rules in 1964. The rules already provided for a balanced variety that accounted for both Gheg and Tosk dialects, but only lasted through 1968. Viewing divergences with Albania as a threat to their identity, Kosovars arbitrarily adopted the Tosk project that Tirana had published the year before. Although it was never intended to serve outside of Albania, the project became the "unified literary language" in 1972, when approved by a rubberstamp Orthography Congress. Only about 1 in 9 participants were from Kosovo. The Congress, held at Tirana, authorized the orthography rules that came out the following year, in 1973. More recent dictionaries from the Albanian government are (1976) (Orthographic Dictionary of the Albanian Language) and ''Dictionary of Today's Albanian language'' () (1980). Prior to World War II, dictionaries consulted by developers of the standard have included (Albanian: ( Kostandin Kristoforidhi, 1904), (1908), and (1941).


Calls for reform

Since the fall of the communist regime, Albanian orthography has stirred heated debate among scholars, writers, and public opinion in Albania and Kosovo, with hardliners opposed to any changes in the orthography, moderates supporting varying degrees of reform, and radicals calling for a return to the Elbasan dialect. Criticism of Standard Albanian has centred on the exclusion of the 'me + participle' infinitive and the Gheg lexicon. Critics say that Standard Albanian disenfranchises and stigmatises Gheg speakers, affecting the quality of writing and impairing effective public communication. Supporters of the Tosk standard view the 1972 Congress as a milestone achievement in Albanian history and dismiss calls for reform as efforts to "divide the nation" or "create two languages." Moderates, who are especially prevalent in Kosovo, generally stress the need for a unified Albanian language, but believe that the 'me + participle' infinitive and Gheg words should be included. Proponents of the Elbasan dialect have been vocal, but have gathered little support in the public opinion. In general, those involved in the language debate come from diverse backgrounds and there is no significant correlation between one's political views, geographic origin, and position on Standard Albanian. Many writers continue to write in the Elbasan dialect but other Gheg variants have found much more limited use in literature. Most publications adhere to a strict policy of not accepting submissions that are not written in Tosk. Some print media even translate direct speech, replacing the 'me + participle' infinitive with other verb forms and making other changes in grammar and word choice. Even authors who have published in the Elbasan dialect will frequently write in the Tosk standard. In 2013, a group of academics for Albania and Kosovo proposed minor changes to the orthography. Hardline academics boycotted the initiative, while other reformers have viewed it as well-intentioned but flawed and superficial.


Education

Albanian is the medium of instruction in most Albanian schools. The literacy rate in Albania for the total population, age 9 or older, is about 99%. Elementary education is compulsory (grades 1–9), but most students continue at least until a secondary education. Students must pass graduation exams at the end of the 9th grade and at the end of the 12th grade in order to continue their education.


Phonology

Standard Albanian has seven
vowel A vowel is a speech sound pronounced without any stricture in the vocal tract, forming the nucleus of a syllable. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness a ...
s and 29
consonant In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract, except for the h sound, which is pronounced without any stricture in the vocal tract. Examples are and pronou ...
s. Like English, Albanian has dental fricatives (like the ''th'' in ''thin'') and (like the ''th'' in ''this''), written as and , which are rare cross-linguistically. Gheg uses long and
nasal vowel A nasal vowel is a vowel that is produced with a lowering of the soft palate (or velum) so that the air flow escapes through the nose and the mouth simultaneously, as in the French vowel /ɑ̃/ () or Amoy []. By contrast, oral vowels are p ...
s, which are absent in Tosk, and the mid-central vowel is lost at the end of the word. The stress is fixed mainly on the last syllable. Gheg (: compare English ''feminine'') changes to by rhotacism in Tosk ().


Consonants

Notes: * The contrast between flapped and trilled is the almost the same as in Spanish or
Armenian Armenian may refer to: * Something of, from, or related to Armenia, a country in the South Caucasus region of Eurasia * Armenians, the national people of Armenia, or people of Armenian descent ** Armenian diaspora, Armenian communities around the ...
. However, in most of the dialects, as also in standard Albanian, the single changes from an alveolar flap to an alveolar approximant . * The palatal nasal corresponds to the Spanish and the French and Italian ''gn''. It is pronounced as one sound, not a nasal plus a glide. * The sound is a velarised lateral, close to English dark ''l''. * The letter is sometimes written ''ch'' due to technical limitations, in analogy to the other digraphs , , and . Usually it is written simply or more rarely with context resolving any ambiguities. * The sounds spelled with and show variation. They may range between occurring as palatal affricates or as palatal stops among dialects. Some speakers merge them into the palatoalveolar sounds and . This is especially common in Northern Gheg, but is increasingly the case in Tosk as well. Other speakers reduced them into in consonant clusters, such as in the word , which before standardisation was written as ( < Medieval Greek ). * The can be pronounced as in final position, otherwise it is an allophone of before and . * Before and , is always pronounced but this is not reflected in the orthography.


Vowels


Notes

* can also range to an open-mid sound in the Northern Tosk dialect. * Mid sounds can also be heard as more open-mid sounds , in free variation.


Schwa

The schwa in Albanian has a great degree of variability from extreme back to extreme front articulation. Although the Indo-European schwa ( or ) was preserved in Albanian, in some cases it was lost, possibly when a stressed syllable preceded it. Until the standardisation of the modern Albanian alphabet, in which the schwa is spelled as , as in the work of Gjon Buzuku in the 16th century, various vowel letters and digraphs were employed, including by Lekë Matrënga and by Pjetër Bogdani in the late 16th and early 17th century. Within the borders of Albania, the phoneme is pronounced about the same in both the Tosk and the Gheg dialect due to the influence of standard Albanian. However, in the Gheg dialects spoken in the neighbouring Albanian-speaking areas of Kosovo and North Macedonia, the phoneme is still pronounced as back and rounded.


Grammar

Albanian has a canonical word order of SVO (subject–verb–object) like English and many other Indo-European languages. Albanian
noun In grammar, a noun is a word that represents a concrete or abstract thing, like living creatures, places, actions, qualities, states of existence, and ideas. A noun may serve as an Object (grammar), object or Subject (grammar), subject within a p ...
s are categorised by
gender Gender is the range of social, psychological, cultural, and behavioral aspects of being a man (or boy), woman (or girl), or third gender. Although gender often corresponds to sex, a transgender person may identify with a gender other tha ...
(masculine, feminine and neuter) and
inflected In linguistic Morphology (linguistics), morphology, inflection (less commonly, inflexion) is a process of word formation in which a word is modified to express different grammatical category, grammatical categories such as grammatical tense, ...
for
number A number is a mathematical object used to count, measure, and label. The most basic examples are the natural numbers 1, 2, 3, 4, and so forth. Numbers can be represented in language with number words. More universally, individual numbers can ...
(singular and plural) and case. There are five
declension In linguistics, declension (verb: ''to decline'') is the changing of the form of a word, generally to express its syntactic function in the sentence by way of an inflection. Declension may apply to nouns, pronouns, adjectives, adverbs, and det ...
s and six cases (
nominative In grammar, the nominative case ( abbreviated ), subjective case, straight case, or upright case is one of the grammatical cases of a noun or other part of speech, which generally marks the subject of a verb, or (in Latin and formal variants of E ...
, accusative, genitive, dative, ablative, and vocative), although the vocative only occurs with a limited number of words (such as '' ("son"), vocative case: , ("bird") vocative case: Breu, W. (2021) Italo-Albanian: Balkan Inheritance and Romance Influence p. 154), and the forms of the genitive and dative are identical (a genitive construction employs the prepositions alongside dative morphemes). Some dialects also retain a locative case, which is not present in standard Albanian (e.g. "" loc.sg.def). The cases apply to both definite and indefinite nouns, and there are numerous cases of
syncretism Syncretism () is the practice of combining different beliefs and various school of thought, schools of thought. Syncretism involves the merging or religious assimilation, assimilation of several originally discrete traditions, especially in the ...
. The following shows the declension of (mountain), a noun in the masculine class which takes "i" in the definite singular: The following shows the declension of the noun (bird), a noun in the masculine class which takes "u" in the definite singular: The following table shows the declension of the noun (girl) in the feminine class: The definite article is placed after the noun as in many other Balkan languages, like in Romanian, Macedonian and Bulgarian. * The definite article can be in the form of noun suffixes, which vary with gender and case. ** For example, in singular nominative, masculine nouns add , or those ending in // take (to avoid palatalization): *** (mountain) / (the mountain); *** (book) / (the book); *** (bird) / (the bird). ** Nouns in the feminine class take the suffix : *** (car) / (the car); *** (house) / (the house); *** (flower) / (the flower). * Nouns in the neuter class take . Albanian has developed an analytical
verb A verb is a word that generally conveys an action (''bring'', ''read'', ''walk'', ''run'', ''learn''), an occurrence (''happen'', ''become''), or a state of being (''be'', ''exist'', ''stand''). In the usual description of English, the basic f ...
al structure in place of the earlier synthetic system, inherited from
Proto-Indo-European Proto-Indo-European (PIE) is the reconstructed common ancestor of the Indo-European language family. No direct record of Proto-Indo-European exists; its proposed features have been derived by linguistic reconstruction from documented Indo-Euro ...
. Its complex system of moods (six types) and tenses (three simple and five complex constructions) is distinctive among Balkan languages. There are two general types of conjugations. Albanian has a series of
verb form In linguistics, conjugation ( ) is the creation of derived forms of a verb from its principal parts by inflection (alteration of form according to rules of grammar). For instance, the verb ''break'' can be conjugated to form the words ''break'' ...
s called miratives or admiratives. These may express surprise on the part of the speaker, but may also have other functions, such as expressing
irony Irony, in its broadest sense, is the juxtaposition of what, on the surface, appears to be the case with what is actually or expected to be the case. Originally a rhetorical device and literary technique, in modernity, modern times irony has a ...
, doubt, or reportedness. The Albanian use of admirative forms is unique in the Balkan context. In English, the expression of surprise can be rendered by 'oh, look!' or 'lookee there!'; the expression of doubt can be rendered by 'indeed!'; the expression of neutral reportedness can be rendered by 'apparently'. * "You speak Albanian." (indicative) * "You (surprisingly) speak Albanian!" (admirative) * "The street is closed." (indicative) * "(Apparently,) The street is closed." (admirative) For more information on verb conjugation and on inflection of other parts of speech, see Albanian morphology.


Word order

Albanian word order is relatively free. To say '' in Albanian, one may use any of the following orders, with slight pragmatic differences: * SVO: * SOV: * OVS: * OSV: * VSO: * VOS: However, the most common order is subject–verb–object. The verb can optionally occur in sentence-initial position, especially with verbs in the passive form ('): * "An interruption is anticipated".


Negation

Verbal negation in Albanian is mood-dependent, a trait shared with some fellow
Indo-European languages The Indo-European languages are a language family native to the northern Indian subcontinent, most of Europe, and the Iranian plateau with additional native branches found in regions such as Sri Lanka, the Maldives, parts of Central Asia (e. ...
such as Greek. In indicative, conditional, or admirative sentences, negation is expressed by the particles or ''s in front of the verb, for example: * ' "Tony does not speak English"; * ' "Tony doesn't speak English"; * ' "I do not know"; * ' "I don't know". Subjunctive, imperative, optative, or non-finite forms of verbs are negated with the particle ''mos'': * ' "Do not forget!".


Numerals


Notes

* In certain dialects, numerals with an extra syllable may undergo metrical syncope. For example, becomes .


Vigesimal system

Beside the Indo-European decimal numeration, there are also remnants of the vigesimal system, as and . The Arbëreshë in Italy and Arvanites in Greece may still use and . Albanian is the only Balkan language that has preserved the Pre-Indo-European vigesimal system.


Lexicon

Albanian is known within historical linguistics as a case of a language which, although surviving through many periods of foreign rule and multilingualism, saw a "disproportionately high" influx of loans from other languages augmenting and replacing much of its original
vocabulary A vocabulary (also known as a lexicon) is a set of words, typically the set in a language or the set known to an individual. The word ''vocabulary'' originated from the Latin , meaning "a word, name". It forms an essential component of languag ...
. Of all the foreign influences in Albanian, the deepest reaching and most impactful was the absorption of loans from Latin in the Classical period and its Romance successors afterward. Scholars have estimated a great number of Latin loanwords in Albanian, some even claiming 60% of the Albanian vocabulary. Sawicka, Irena
"A Crossroad Between West, East and Orient–The Case of Albanian Culture."
Colloquia Humanistica. No. 2. Instytut Slawistyki Polskiej Akademii Nauk, 2013. Page 97: "Even according to Albanian linguists, Albanian vocabulary is composed in 60 percent of Latin words from different periods... When albanological studies were just emerging, it happened that Albanian was classified as a Romance language. Already there exists the idea of a common origin of both Albanian and Rumanian languages. The Rumanian grammar is almost identical to that of Albanian, but it may be as well the effect of later convergence within the Balkan Sprachbund.."
Major work in reconstructing Proto-Albanian has been done with the help of knowledge of the original forms of loans from Ancient Greek, Latin and Slavic, while Ancient Greek loanwords are scarce the Latin loanwords are of extreme importance in
phonology Phonology (formerly also phonemics or phonematics: "phonemics ''n.'' 'obsolescent''1. Any procedure for identifying the phonemes of a language from a corpus of data. 2. (formerly also phonematics) A former synonym for phonology, often pre ...
. The presence of loanwords from more well-studied languages from time periods before Albanian was attested, reaching deep back into the Classical Era, has been of great use in phonological reconstructions for earlier ancient and medieval forms of Albanian.Matasovic, Ranko (2018)
''A Grammatical Sketch of Albanian for students of Indo-European''
p. 6.
Some words in the core vocabulary of Albanian have no known etymology linking them to Proto-Indo-European or any known source language, and as of 2018 are thus tentatively attributed to an unknown, unattested, pre-Indo-European substrate language; some words among these include (heart) and (iron). Some among these putative pre-IE words are thought to be related to putative pre-IE substrate words in neighboring Indo-European languages, such as (flower), which has been tentatively linked to Latin and Greek . Lexical distance of Albanian to other languages in a lexicostatistical analysis by Ukrainian linguist Tyshchenko shows the following results (the lower figure, the higher similarity): 49% Slovenian, 53% Romanian, 56% Greek, 82% French, 86% Macedonian, 86% Bulgarian.


Cognates with Illyrian


Early linguistic influences

The earliest
loanword A loanword (also a loan word, loan-word) is a word at least partly assimilated from one language (the donor language) into another language (the recipient or target language), through the process of borrowing. Borrowing is a metaphorical term t ...
s attested in Albanian come from
Doric Greek Doric or Dorian (), also known as West Greek, was a group of Ancient Greek dialects; its Variety (linguistics), varieties are divided into the Doric proper and Northwest Doric subgroups. Doric was spoken in a vast area, including northern Greec ...
, whereas the strongest influence came from
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
. Some scholars argue that Albanian originated from an area located east of its present geographic spread due to the several common lexical items found between the Albanian and Romanian languages. However it does not necessarily define the genealogical history of Albanian language, and it does not exclude the possibility of Proto-Albanian presence in both Illyrian and Thracian territory. The period during which Proto-Albanian and Latin interacted was protracted, lasting from the 2nd century BC to the 5th century AD. Over this period, the lexical borrowings can be roughly divided into three layers, the second of which is the largest. The first and smallest occurred at the time of less significant interaction. The final period, probably preceding the Slavic or Germanic invasions, also has a notably smaller number of borrowings. Each layer is characterised by a different treatment of most vowels: the first layer follows the evolution of Early Proto-Albanian into Albanian; while later layers reflect vowel changes endemic to Late Latin (and presumably Proto-Romance). Other formative changes include the syncretism of several noun case endings, especially in the plural, as well as a large-scale palatalisation. A brief period followed, between the 7th and the 9th centuries, that was marked by heavy borrowings from South Slavic, some of which predate the "o-a" shift common to the modern forms of this language group.


Early Greek loans

There are some 30
Ancient Greek Ancient Greek (, ; ) includes the forms of the Greek language used in ancient Greece and the classical antiquity, ancient world from around 1500 BC to 300 BC. It is often roughly divided into the following periods: Mycenaean Greek (), Greek ...
loanwords in Proto-Albanian. Many of these reflect a dialect which voiced its aspirants, as did the Macedonian dialect. Other loanwords are Doric; these words mainly refer to commodity items and trade goods and probably came through trade with a now-extinct intermediary. * ; "sickle" < ( Northwest Greek) Ancient Indo-European dialects: proceedings, Volume 1963 Ancient Indo-European Dialects: Proceedings, University of California, Los Angeles. Center for Research in Languages and Linguistics Authors Henrik Birnbaum, Jaan Puhvel, University of California, Los Angeles. Center for Research in Languages and Linguistics Editors Henrik Birnbaum, Jaan Puhvel Publisher University of California Press, 1966 p.102 * ; "hive, bee" < Attic "bee" (vs. Ionic ). * ; "plum" < * ; "cabbage, green vegetables" < "green; vegetable" * ; "orach, dock" < * ; "to smear, to oil"< Proto-Albanian *elaiwanja < *elaiwa (olive oil) < Greek elaion * ; "millstone" < (Northwest) "device, instrument"The Field of Linguistics, Volume 2 Volume 1 of World of linguistics Authors Bernd Kortmann, Johan Van Der Auwera Editors Bernd Kortmann, Johan Van Der Auwera Publisher Walter de Gruyter, 2010 , p.412 * ; "apple" < "fruit" * ; "palm of the hand" < * ; "melon" < * ; "leek" < * ; "thyme" < (Northwest) , * ; "pond, pool" < "high sea" According to Huld (1986), the following come from a Greek dialect without any significant attestation called "Makedonian" because it was akin to the native idiom of the Greek-speaking population in the Argead kingdom: * ; "elbow" < * * ; "tamarisk" < * * ; 'mallow' < * (with the reflex of /ɡ/ for Greek <χ> indicating a dialectal voicing of the what came as an aspirate stop from Greek) * "fennel" < * (cf Romanian , Ionic ; with the Albanian simplification of -dri̯- to -j- reflecting that of earlier * to "water")


Latin influence

Scholars have estimated a great number of Latin loanwords in Albanian, some even claiming 60% of the Albanian vocabulary. They include many frequently used core vocabulary items, including ("very", from Latin ), ("few", Latin ), ("narrow", Latin ), ("tree", Latin ), ("to come", Latin ), ("sand", Latin ), ("straight", Latin ), ("beast", Latin , meaning "thing"), and ("far away", Latin ). Jernej Kopitar (1780–1844) was the first to note Latin's influence on Albanian and claimed "the Latin loanwords in the Albanian language had the pronunciation of the time of Emperor Augustus". Kopitar gave examples such as Albanian 'chickpea' from Latin , 'city, town' from , 'fish' from , and 'arrow' from . The hard pronunciations of Latin and are retained as palatal and velar stops in the Albanian loanwords. Gustav Meyer (1888) and Wilhelm Meyer-Lübke (1914) later corroborated this. Meyer noted the similarity between the Albanian verbs "to speak clearly, enunciate" and "to pronounce, articulate" and the Latin word (meaning "to welcome"). Therefore, he believed that the word ''Shqiptar'' "Albanian person" was derived from , which in turn was derived from the Latin word . Johann Georg von Hahn, an Austrian linguist, had proposed the same hypothesis in 1854. Eqrem Çabej also noticed, among other things, the archaic Latin elements in Albanian: # Latin /au/ becomes Albanian /a/ in the earliest loanwords: → 'gold'; → 'joy'; → 'laurel'. Latin /au/ is retained in later loans, but is altered in a way similar to Greek: 'thing' → 'thing; beast, brute'; → . # Latin /oː/ becomes Albanian /e/ in the oldest Latin loans: → 'fruit tree'; → 'time, instance'. An analogous mutation occurred from Proto-Indo-European to Albanian; PIE became Albanian 'we', PIE + suffix -ti- became Albanian 'eight', etc. # Latin unstressed internal and initial syllables become lost in Albanian: → 'elbow'; → 'physician'; 'swamp' →
Vulgar Latin Vulgar Latin, also known as Colloquial, Popular, Spoken or Vernacular Latin, is the range of non-formal Register (sociolinguistics), registers of Latin spoken from the Crisis of the Roman Republic, Late Roman Republic onward. ''Vulgar Latin'' a ...
→ 'forest'. An analogous mutation occurred from Proto-Indo-European to Albanian. In contrast, in later Latin loanwords, the internal syllable is retained: → ; → 'wound', etc. # Latin /tj/, /dj/, /kj/ palatalized to Albanian /s/, /z/, /c/: → 'vice; worries'; → 'reason'; → 'ray; spoke'; → 'face, cheek'; → 'mate, comrade', 'husband', etc. In turn, Latin /s/ was altered to /ʃ/ in Albanian. Haralambie Mihăescu demonstrated that: * Some 85 Latin words have survived in Albanian but not (as inherited) in any
Romance language The Romance languages, also known as the Latin or Neo-Latin languages, are the languages that are Language family, directly descended from Vulgar Latin. They are the only extant subgroup of the Italic languages, Italic branch of the Indo-E ...
. A few examples include Late Latin → dial. → 'hydra', → 'winter pasture', 'used for packing, loading' → 'forked peg, grapnel, forked hanger', 'nightshade', lit. 'sun plant' → ' 'sunny place out of the wind, sunbathed area', → 'spleen', → 'pitchfork'. * 151 Albanian words of Latin origin were not inherited in Romanian. A few examples include Latin → Albanian 'friend', → 'foe, enemy', → , → , 'ploughman, herdsman' → , 'peasant', → 'drinking glass', → 'castle', → 'hundred', → 'rooster', → 'limb; joint', → 'doctor', → 'net', → dial. ', 'to hope', 'to await', () → 'will; volunteer'. * Some Albanian church terminology has phonetic features which demonstrate their very early borrowing from Latin. A few examples include Albanian 'to bless' from , 'angel' from , 'church' from , ' 'Christian' from , 'cross' from (), (obsolete) 'altar' from Latin , 'to curse' from , 'mass' from , 'monk' from , 'bishop' from , and 'gospel' from . Other authors have detected Latin loanwords in Albanian that could come from Latin before the palatalization of velar consonants, as early as the 2nd or 3rd century. For example, Albanian ' 'saddle girth; dwarf elder' from Latin and Albanian ' 'old, aged; former' from but influenced by Latin . The
Romance languages The Romance languages, also known as the Latin or Neo-Latin languages, are the languages that are Language family, directly descended from Vulgar Latin. They are the only extant subgroup of the Italic languages, Italic branch of the Indo-E ...
inherited these words from Vulgar Latin: became (via *''clinga'') Romanian 'girdle; saddle girth', and '' veterānus'' became Romanian 'old'. Albanian,
Basque Basque may refer to: * Basques, an ethnic group of Spain and France * Basque language, their language Places * Basque Country (greater region), the homeland of the Basque people with parts in both Spain and France * Basque Country (autonomous co ...
, and the surviving
Celtic languages The Celtic languages ( ) are a branch of the Indo-European language family, descended from the hypothetical Proto-Celtic language. The term "Celtic" was first used to describe this language group by Edward Lhuyd in 1707, following Paul-Yve ...
such as Breton and Welsh are the non-Romance languages today that have this sort of extensive Latin element dating from ancient Roman times, which has undergone the sound changes associated with the languages. Other languages in or near the former Roman area either came on the scene later (Turkish, the Slavic languages, Arabic) or borrowed little from Latin despite coexisting with it (Greek, German), although German does have a few such ancient Latin loanwords ( 'window', 'cheese'). Romanian scholars such as Vatasescu and Mihaescu, using lexical analysis of the Albanian language, have concluded that Albanian was heavily influenced by an extinct Romance language that was distinct from both Romanian and Dalmatian. Because the Latin words common to only Romanian and Albanian are significantly fewer in number than those that are common to only Albanian and Western Romance, Mihaescu argues that the Albanian language evolved in a region with much greater contact with Western Romance regions than with Romanian-speaking regions, and located this region in present-day Albania, Kosovo and Western Macedonia, spanning east to Bitola and
Pristina Pristina or Prishtina ( , ), . is the capital and largest city of Kosovo. It is the administrative center of the eponymous municipality and District of Pristina, district. In antiquity, the area of Pristina was part of the Dardanian Kingdo ...
.


Slavic influence

After the
Slavs The Slavs or Slavic people are groups of people who speak Slavic languages. Slavs are geographically distributed throughout the northern parts of Eurasia; they predominantly inhabit Central Europe, Eastern Europe, Southeastern Europe, and ...
arrived in the Balkans, the
Slavic languages The Slavic languages, also known as the Slavonic languages, are Indo-European languages spoken primarily by the Slavs, Slavic peoples and their descendants. They are thought to descend from a proto-language called Proto-Slavic language, Proto- ...
became an additional source of loanwords. Contact between Albanian with the Slavic languages lasted very intensively for almost four centuries, and continued even in the late Middle Ages. Slavic loanwords in Albanian constitute a less studied area in literature. Per Vladimir Orel (1998), there are about 556 Slavic loanwords in Albanian.


Turkish influence

The rise of the
Ottoman Empire The Ottoman Empire (), also called the Turkish Empire, was an empire, imperial realm that controlled much of Southeast Europe, West Asia, and North Africa from the 14th to early 20th centuries; it also controlled parts of southeastern Centr ...
meant an influx of Turkish words; this also entailed the borrowing of Persian and Arabic words through Turkish. Some Turkish personal names, such as ''Altin'', are common. There are some loanwords from Modern Greek, especially in the south of Albania. Many borrowed words have been replaced by words with Albanian roots or modern Latinised (international) words. According to calculations mentioned by Emanuele Banfi (1985), the total number of Turkish loanwords in Albanian is about two thousand. However, when taking into account obsolete and rare words, and restricted dialectalisms, their number is considerably larger.


Gothic

Albanian is also known to possess a small set of loans from Gothic, with early inquiry into the matter done by Norbert Jokl and Sigmund Feist, though such loans had been claimed earlier in the 19th century by early linguists such as Gustav Meyer. Many words claimed as Gothic have now been attributed to other origins by later linguists of Albanian ( and , though used for major claims by Huld in 1994, are now attributed to Latin, for example), or may instead be native to Albanian, inherited from Proto-Indo-European.Matasovic, Ranko (2019).
A Grammatical Sketch of Albanian for Students of Indo-European
'. Zagreb. Page 39
Today, it is accepted that there are a few words from Gothic in Albanian, but for the most part they are scanty because the Goths had few contacts with Balkan peoples.Curtis, M.C., 2012
Slavic-Albanian language contact, convergence, and coexistence
. Page 19
Martin HuldHuld, M.E., 1994
Albanian zverk – Gothic *swairhs
''Historische Sprachforschung/Historical Linguistics'', 107(1. H), pp.165–171. Pages 167–8.
defends the significance of the admittedly sparse Gothic loans for Albanian studies, however, arguing that Gothic is the only clearly post-Roman and "pre-Ottoman" language after Latin with a notable influence on the Albanian lexicon (the influence of Slavic languages is both pre-Ottoman and Ottoman). He argues that Gothic words in Albanian are attributable to the late fourth and early fifth centuries during the invasions of various Gothic speaking groups of the Balkans under Alaric, Odoacer, and
Theodoric Theodoric is a Germanic given name. First attested as a Gothic name in the 5th century, it became widespread in the Germanic-speaking world, not least due to its most famous bearer, Theodoric the Great, king of the Ostrogoths. Overview The name w ...
. He argues that Albanian Gothicisms bear evidence for the ordering of developments within Proto-Albanian at this time: for example, he argues Proto-Albanian at this stage had already shifted to as Gothic words with reflect with in Albanian, not as seen in most Latin and ancient Greek loans, but had not yet experienced the shift of to , since loans from Gothic words with replace with or another close sound. Notable words that continue to be attributed to Gothic in Albanian by multiple modern sources include: * "felt gaiters, white felt" (cf Romanian "top of boot") < Gothic or , cf Old High German "gaiters" * "foam" < Gothic , perhaps via an intermediary in a Romance (cf. Romanian ) * "fence, garden" is either considered a native Albanian word that was loaned into Romanian as * "nape, back of neck" < Gothic ;Huld, M.E., 1994
Albanian zverk – Gothic *swairhs
''Historische Sprachforschung/Historical Linguistics'', 107(1. H), pp.165–171.
the "difficult" word having various otherwise been attributed (with phonological issues) to Celtic, Greek or native development. * "villain, scoundrel" and "whore" < Gothic "adulterer, cf Old Norse "whore" * "purse", diminutive of < Gothic "purse" (cf. Romanian )


Patterns in loaning

Although Albanian is characterised by the absorption of many loans, even, in the case of Latin, reaching deep into the core vocabulary, certain
semantic field In linguistics, a semantic field is a related set of words grouped semantically (by meaning) that refers to a specific subject.Howard Jackson, Etienne Zé Amvela, ''Words, Meaning, and Vocabulary'', Continuum, 2000, p14. The term is also used in ...
s nevertheless remained more resistant. Terms pertaining to social organisation are often preserved, though not those pertaining to political organisation, while those pertaining to trade are all loaned or innovated. Hydronyms present a complicated picture; the term for "sea" () is native and an "Albano-Germanic" innovation referring to the concept of depth, but a large amount of maritime vocabulary is loaned. Words referring to large streams and their banks tend to be loans, but ("river") is native, as is (the flow of water). Words for smaller streams and stagnant pools of water are more often native, but the word for "pond", is in fact a semantically shifted descendant of the old Greek word for "high sea", suggesting a change in location after Greek contact. Albanian has maintained since Proto-Indo-European a specific term referring to a riverside forest (), as well as its words for marshes. Albanian has maintained native terms for "whirlpool", "water pit" and (aquatic) "deep place", leading Orel to speculate that the Albanian Urheimat likely had an excess of dangerous whirlpools and depths. Regarding forests, words for most conifers and shrubs are native, as are the terms for "alder", "elm", "oak", "beech", and "linden", while "ash", "chestnut", "birch", "maple", "poplar", and "willow" are loans. The original kinship terminology of Indo-European was radically reshaped; changes included a shift from "mother" to "sister", and were so thorough that only three terms retained their original function, the words for "son-in-law", "mother-in-law" and "father-in-law". All the words for second-degree blood kinship, including "aunt", "uncle", "nephew", "niece", and terms for grandchildren, are ancient loans from Latin. The Proto-Albanians appear to have been cattle breeders given the vastness of preserved native vocabulary pertaining to cow breeding, milking and so forth, while words pertaining to dogs tend to be loaned. Many words concerning horses are preserved, but the word for horse itself is a Latin loan.


See also

* Abetare * Arbëresh language * Arvanitika * Gheg Albanian *
Illyrian language The Illyrian language () was an Indo-European languages, Indo-European language or group of languages spoken by the Illyrians in Southeast Europe during antiquity. The language is unattested with the exception of personal names and placenames. ...
* IPA/Albanian *
Messapic language Messapic (; also known as Messapian; or as Iapygian) is an extinct Indo-European Paleo-Balkanic language of the southeastern Italian Peninsula, once spoken in Salento by the Iapygian peoples of the region: the Calabri and Salentini (known colle ...
* Thraco-Illyrian * Tosk Albanian


Notes


References


Bibliography

* * (reprint in ''Gjurmime albanologjike – Seria e shkencave filologjike II – 1972''. Pristina: 1974, pp. 33–44). * * * * * * * * * * * (In German ''Studia Albanica'' 1 (1964)) * ** * Çabej, Eqrem. "Rreth disa çështjeve të historisë së gjuhës shqipe", ''Buletin i Universitetit Shtetëror të Tiranës. Seria e Shkencave Shoqërore'' 3 (1963): 69–101. (In Romanian ''Studii și cercetări lingvistiche'' 4 (1954)) * Çabej, Eqrem. "Mbi disa rregulla të fonetikës historike të shqipes", ''Studime Filologjike'' 2 (1970): 77–95 (In German "Über einige Lautregeln des Albanischen", ''Die Sprache'' 18 (1972): 132–54) * * * Çabej, Eqrem. ''Studime etimologjike në fushë të shqipes''. 7 vols. Tirana: Akademia et Shkencave e Republikës Popullore të Shqipërisë, Instituti i Gjuhësisë dhe i Letërsisë, 1976–2014. * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * Lambertz, Maximilian. ''Lehrgang des Albanischen''. 3 vols., vol. 1: ''Albanisch-deutsches Wörterbuch''; vol. 2: ''Albanische Chrestomathie''; vol. 3: ''Grammatik der albanischen Sprache''. Berlin: Deutscher Verlag der Wissenschaften 1954; Berlin 1955; Halle an der Saale 1959. * * * * Matzinger, Joachim. "Der lateinisch-albanische Sprachkontakt und seine Implikationen für Vorgeschichte des Albanischen und der Albaner", in ''Südosteuropäische Romania: Siedlungs-/Migrationsgeschichte und Sprachtypologie''. Edited by Wolfgang Dahmen et al. Tübingen: Narr Verlag, 2012, pp. 75–103. * * Mayer, Anton. ''Die Sprache der alten Illyrier''. 2 vols. Vienna: Österreichische Akademie der Wissenschaften, 1957/1959. * * Meyer, Gustav.
Albanesische Studien I. Die Pluralbildungen der albanesischen Nomina
, in ''Sitzungsberichte der philosophisch-historischen Classe der Kaiserlichen Akademie der Wissenschaften'' 104 (1883): 257–362. * Miklosich, Franz. ''Albanische Forschungen''. 2 vols., vol. 1: ''Die slavischen Elemente im Albanischen''; vol. 2: ''Die romanischen Elemente im Albanischen''. Vienna: Karl Gerold's Sohn, 1870. * * * * * * * * Erlangen (1909) * * Pellegrini, Giovan Battista. "Disa vëzhgime mbi elementin Latin të shqipes" ome observations on the Latin element of the Albanian language ''Studime Filologjike'' 3 (1982); (in Italian) "Alcune osservazioni sull'elemento latino dell'albanese", ''Studia Albanica'' 1983: 63–83. * Pellegrini, Giovan Battista. ''Avviamento alla linguistica albanese''. Edizione rinnovata. Rende: Università degli studi della Calabria, Centro editoriale e librario, 1997. * reprint in ''Saggi di linguistica storica: Scritti scelti''. Torino: Rosenberg & Sellier, 1959, pp. 96–114. * * * * * * * * * * * * * von Hahn, Johann Georg. ''Albanesische Studien''. 3 vols. Jena: F. Mauko, 1854. * Watkins, Calvert. "Proto-Indo-European: Comparison and Reconstruction", in ''The Indo-European Languages''. Edited by Anna Giacalone Ramat & Paolo Ramat. London-NY: Routledge, 1998, pp. 25–73. * Ylli, Xhelal. ''Das slawische Lehngut im Albanischen''. 2 vols., vol. 1: ''Lehnwörter''; vol. 2: ''Ortsnamen''. Munich: Verlag Otto Sagner, 1997/2000. *


External links


Albanian Online
by Brian Joseph, Angelo Costanzo, and Jonathan Slocum, free online lessons at th
Linguistics Research Center
at the
University of Texas at Austin The University of Texas at Austin (UT Austin, UT, or Texas) is a public university, public research university in Austin, Texas, United States. Founded in 1883, it is the flagship institution of the University of Texas System. With 53,082 stud ...

glottothèque – Ancient Indo-European Grammars online
an online collection of introductory videos to Ancient Indo-European languages produced by the University of Göttingen {{Authority control Languages attested from the 15th century Indo-European languages Illyrian languages Languages of Albania Languages of Greece Languages of Italy Languages of Kosovo Languages of North Macedonia Languages of Romania Languages of Montenegro Languages of Serbia Languages of Sicily Languages of Turkey Subject–verb–object languages