Haplogroup F, also known as F-M89 and previously as Haplogroup FT, is a very common Y-chromosome haplogroup. The clade and its subclades constitute over 90% of paternal lineages outside of Africa.
The vast majority of individual males with F-M89 fall into its direct descendant Haplogroup GHIJK, defined by the SNPs M3658, F1329, PF2622, and YSC0001299 (ISOGG, 2015, ''Y-DNA Haplogroup F and its Subclades - 2015'', 8 September 2015). In addition to GHIJK, haplogroup F has three other immediate descendant subclades: F1 (P91/P104), F2 (M427/M428), and F3 (M481). These three, with F* (M89*), constitute the paragroup F(xGHIJK). They are primarily found throughout South Asia, Southeast Asia and parts of East Asia.
Haplogroup GHIJK subsequently splits into two direct descendants: G (M201/PF2957) and HIJK (F929/M578/PF3494/S6397). HIJK in turn splits into H (L901/M2939) and IJK (F-L15). The descendants of haplogroup IJK include the clades I, J and K and, ultimately, several major haplogroups descended from Haplogroup K, namely haplogroups M, N, O, P, Q, R, S, L, and T.
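The branching described above, and the meaning of a paragroup label such as F(xGHIJK), can be sketched as a small parent-to-children mapping. This is only an illustration: the dictionary layout and the helper names `descendants` and `paragroup` are assumptions made for this sketch, not part of any published nomenclature tool, and only the splits named in the text are included.

```python
# Child branches of each clade, limited to the splits named in the text.
# The entry "F" itself stands in for basal F* (M89*) when listing a paragroup.
CHILDREN = {
    "F": ["F1", "F2", "F3", "GHIJK"],
    "GHIJK": ["G", "HIJK"],
    "HIJK": ["H", "IJK"],
}


def descendants(clade):
    """Return the clade plus every branch below it in CHILDREN."""
    found = [clade]
    for child in CHILDREN.get(clade, []):
        found.extend(descendants(child))
    return found


def paragroup(clade, excluded):
    """Branches of `clade` that do not fall inside `excluded`, e.g. F(xGHIJK)."""
    outside = set(descendants(excluded))
    return [name for name in descendants(clade) if name not in outside]


print(paragroup("F", "GHIJK"))  # ['F', 'F1', 'F2', 'F3']
```

Read this way, F(xGHIJK) is simply everything under F once the GHIJK branch and all of its descendants are set aside.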
Origins
It is estimated that the SNP M89 appeared 38,700–55,700 years ago, and most likely originated in South Asia or Southeast Asia. It has also been suggested by previous research that F-M89 most likely first appeared in the Arabian Peninsula, Levant or North Africa, about 43,800–56,800 years ago. It has also been speculated that the possible location of this lineage's first expansion and rise to prevalence appears to have been in the Indian Subcontinent, or somewhere close to it, and most of the descendant subclades and haplogroups appear to have radiated outward from South Asia and/or neighbouring parts of the Middle East and Southeast Asia.
Some lineages derived from Haplogroup F-M89 appear to have back-migrated into Africa from South Asia during prehistory. For example, subclades of F-M89 were discovered in ancient DNA samples from Sudan, which were associated with both Meroitic and Post-Meroitic burials.
Distribution
The vast majority of living individuals carrying F-M89 belong to subclades of GHIJK. By comparison, cases of the paragroup F(xG,H,I,J,K) – that is, either basal F* (M89) or the primary subclades F1 (P91; P104), F2 (M427; M428) and F3 (M481) – are relatively rare worldwide.
F(xG,H,I,J,K)
A lack of precise, high-resolution testing in the past makes it difficult to discuss F*, F1, F2* and F3* separately. ISOGG states that F(xG,H,I,J,K) has not been well studied, occurs "infrequently" in modern populations and peaks in South Asia, especially Sri Lanka. It also appears to have long been present in South East Asia. However, the possibility of misidentification is considered to be relatively high and some cases may in fact belong to misidentified subclades of Haplogroup GHIJK. This was, for instance, the case with the subclade Haplogroup H2 (P96), which was originally named "F3", a name that has since been reassigned to F-M481.
F(xF1,F2,F3) has been reported among 10% of males in Sri Lanka, 5.2% of males across India (including up to 10% of males in South India), 5% in Pakistan, as well as lower levels among the Tamang people (Nepal), and in Iran.
Men originating in Indonesia have also been reported to carry F(xG,H,I,J,K) – especially F-M89* – at relatively significant levels. It has been reported at rates of 4–5% in Sulawesi and Lembata. One study, which did not comprehensively screen for other subclades of F-M89 (including some subclades of GHIJK), found that Indonesian men with the SNP P14/PF2704 (which is equivalent to M89) comprise 1.8% of men in West Timor, 1.5% in Flores, 5.4% in Lembata, 2.3% in Sulawesi and 0.2% in Sumatra. F1 (P91), F2 (M427) and F3 (M481; previously F5) are all highly rare and virtually exclusive to regions/ethnic minorities in Sri Lanka, India, Nepal, South China, Thailand, Burma, and Vietnam.
In Central Asia, examples of F(xG,H,I,J,K) have been reported in individuals from Turkmenistan and Uzbekistan.
Kutanan ''et al.'' (2020) have found F*-M89 in 50.0% (8/16) of a sample of Red Lahu, 47.1% (8/17) of a sample of Black Lahu, and 6.7% (1/15) of a sample of Lisu in Mae Hong Son Province of Thailand. All these Loloish-speaking members of F*-M89 in northwestern Thailand have been found to be quite closely related in the paternal line, with the TMRCA of their Y-DNA estimated to be 584 years before present. However, the aforementioned Y-chromosomes are only distantly related to instances of F*-M89 observed in samples of other populations of Thailand, including 5.6% (1/18) of a sample of Phuan from Central Thailand, as well as individuals sampled in Northeast Thailand. The TMRCA of the Loloish cluster from North Thailand and the Y-DNA of the Phuan individual from Central Thailand has been estimated to be 12,675 years before present. The TMRCA of the F*-M89 cluster from Northeast Thailand has been estimated to be 6,492 years before present. The TMRCA of all these F*-M89 individuals from Thailand has been estimated to be 16,006 years before present. (Wibhu Kutanan, Rasmi Shoocongdej, Metawee Srikummool, ''et al.'' (2020), "Cultural variation impacts paternal and maternal genetic lineages of the Hmong-Mien and Sino-Tibetan groups from Thailand." ''European Journal of Human Genetics''. https://doi.org/10.1038/s41431-020-0693-x)
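The percentages quoted from Kutanan ''et al.'' (2020) follow directly from the carrier counts and sample sizes given in parentheses. A minimal sketch of that arithmetic, assuming a hypothetical helper named `frequency_percent` and using only the counts stated in the text:

```python
# (carriers of F*-M89, sample size) as quoted in the text.
SAMPLES = {
    "Red Lahu": (8, 16),
    "Black Lahu": (8, 17),
    "Lisu": (1, 15),
    "Phuan": (1, 18),
}


def frequency_percent(carriers, sample_size):
    """Share of carriers in a sample, as a percentage rounded to one decimal."""
    return round(100 * carriers / sample_size, 1)


for population, (carriers, sample_size) in SAMPLES.items():
    share = frequency_percent(carriers, sample_size)
    print(f"{population}: {share}% ({carriers}/{sample_size})")
```

With samples this small, a single additional carrier shifts the figure by several percentage points, so the percentages are best read alongside the raw counts.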
There is also evidence of westward Paleolithic back-migration of F(xG,H,I,J,K) from South Asia, to Iran, Arabia and North East Africa, as well as subclades of haplogroup K to South-East Europe.
Neolithic migration into Europe from
Southwest Asia
Neolithic
remains, dating from circa 4000 BCE. These remains, according to Herrerra et al. (2012), showed a "greater genetic similarity" to "individuals from the modern Near East" than to modern Europeans. F(xG,H,I,J,K) ''may'' have been found in Bronze Age remains from Europe, namely the individuals known as ''DEB 20'' and ''DEB 38'', who lived about 7,000–7,210 BP, and were found at the Derenburg Meerenstieg II site in Germany.
Three less certain cases, which have not been tested for all subclades of GHIJK, have been found among Neolithic remains in Europe. ''I0411'' (''Troc 4''), who lived 7,195–7,080 years BP, was found in the Els Trocs cave, near Bisaurri (modern Spain) – while haplogroups G, I1 (I-M450; I-S247), J, L1b2, Q1b1, Q1a2a, R1a1a1 (R-L449), R1b1a2b1a (R-M35) and T were ruled out, I2a1b1 and R1b1a2 were found in other remains from the same site (''Troc 5'' and ''Troc 2''). Similarly, three sets of remains from Hungary were not tested for all subclades of GHIJK: ''BAM 17'', ''BAM 26'' (both from Alsónyék Bátaszék, circa 7,850–7,675 years BP) and ''TOLM 3'' (7,030–7,230 BP, found in Tolna-Mözs) (Jean Manco, 2016, ''DNA from the European Neolithic'', 1 March 2016). (An individual known to scholars as "Oase 1", who lived circa 37,800 years BP in Eastern Europe, was initially classified as belonging either to paragroup F(xGHIJK) or within K. However, subsequent research has revealed that Oase 1 belonged to K2a*.)
Some cases reported amongst modern populations of Europeans, Native Americans and Pacific Islanders may be due to migration and admixture of F(xG,H,I,J,K), as a result of contact with South and/or South East Asia, during the early modern era (16th–19th Century).
Such examples include:
* low levels in Polynesia;
* some individuals among Seminole and Boruca Native Americans;
* rare cases in the Netherlands;
* two cases in Portugal.
F* (M89*)
Basal F-M89* has been reported among 5.2% of males in India. A regional breakdown was provided by Chiaroni et al. (2009): 10% in South India; 8% in Central India; about 1.0% in North India and Western India, as well as 5% in Pakistan; 10% in Sri Lanka; 4% among the Tamang people of Nepal; 2% in Borneo and Java; 4–5% in Sulawesi and Lembata in Southeast Asia.
In
Iran
Xi'an
(1/34, ),
Haplogroup F-M89 has also been observed in Northeast Africa among two Christian period individuals, who were excavated on the Nile's Fourth Cataract.
F1 (P91)
This subclade is defined by the SNP P91. It is most common in Sri Lanka.
F2 (M427)
F2 Y-chromosomes have been reported among minorities from the borderlands of South China (Yunnan and Guizhou), Thailand, Burma, and Vietnam, namely the Yi and Kucong or Lahu Shi ("Yellow Lahu"), a subgroup of the Lahu.
F3 (M481)
The newly defined and rare subclade F3 (M481; previously F5) has been found in India and Nepal, among the Tharu people and in Andhra Pradesh. F-M481 should not be confused with Haplogroup H2 (L279, L281, L284, L285, L286, M282, P96), which was previously misclassified under F-M89, as "F3".
Haplogroup GHIJK
Basal GHIJK has never been found, either in living males or ancient remains.
Subclades – including some major haplogroups – are widespread in modern populations of the Caucasus, Middle East, South Asia, Europe, South East Asia, and the Pacific Islands.
Subclades are the branches of haplogroups. These subclades are also defined by single nucleotide polymorphisms (SNPs) or unique event polymorphisms (UEPs).
Phylogenetic trees
There are several confirmed and proposed phylogenetic trees available for haplogroup F-M89. The scientifically accepted one is the Y-Chromosome Consortium (YCC) tree published in Karafet 2008 and subsequently updated. A draft tree that shows emerging science is provided by Thomas Krahn at the Genomic Research Center in Houston, Texas. The International Society of Genetic Genealogy (ISOGG) also provides an amateur tree.
The Genomic Research Center draft tree
The Genomic Research Center's draft tree for haplogroup F-M89 is as follows. (Only the first three levels of subclades are shown.)
*F-M89 P14, M89, M213, P133, P134, P135, P136, P138, P139, P140, P141, P142, P145, P146, P148, P149, P151, P157, P158, P159, P160, P161, P163, P166, P187, P316, L132.1, L313, L498
**F-P91 P91, P104
**''F-M427'' M427, M428
**F-P96 P96, M282, L279, L281, L284, L285, L286
***F-L280 L280
**G-M201 M201, P257, L116, L154, L204, L240, L269, L402, L605, L769, L770, L836, L837, L1258, U2, U3, U6, U7, U12, U17, U20, U21, U23, U33
**H-M69 M69, M370, PAGES00049
**IJK L15, L16
YCC/ISOGG tree
This is the official scientific tree produced by the Y-Chromosome Consortium (YCC). The last major update was in 2008. Subsequent updates have been quarterly and biannual. The current version is a revision of the 2010 update.
*CF
**F (L132.1, M89/PF2746)
***F1 (P91, P104)
***F2 (M427, M428)
***F3 (M481)
***Macrohaplogroup GHIJK (F1329/M3658/PF2622/YSC0001299)
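In the asterisk-indented listing above, the number of leading asterisks gives each branch's depth in the tree. One possible way to turn such a listing into parent links is sketched below; the function name `parse_tree` and the shortened labels (SNP names omitted) are illustrative assumptions, not an ISOGG or YCC tool.

```python
# Shortened copy of the listing: one branch per line, depth = asterisk count.
LISTING = """\
*CF
**F
***F1
***F2
***F3
***GHIJK
"""


def parse_tree(listing):
    """Map each label to its parent, using the asterisk count as depth."""
    parents = {}
    stack = []  # stack[d - 1] holds the most recent label seen at depth d
    for line in listing.splitlines():
        line = line.strip()
        depth = len(line) - len(line.lstrip("*"))
        if depth == 0:
            continue  # blank line or a line that is not part of the tree
        label = line.lstrip("*").strip()
        del stack[depth - 1:]  # drop labels at this depth or deeper
        parents[label] = stack[-1] if stack else None
        stack.append(label)
    return parents


print(parse_tree(LISTING))
```

The resulting mapping places F under CF, and F1, F2, F3 and GHIJK under F, matching the indentation of the listing.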
Phylogenetic history
Prior to 2002, there were in academic literature at least seven naming systems for the Y-Chromosome Phylogenetic tree. This led to considerable confusion. In 2002, the major research groups came together and formed the Y-Chromosome Consortium (YCC). They published a joint paper that created a single new tree that all agreed to use. Later, a group of citizen scientists with an interest in population genetics and genetic genealogy formed a working group to create an amateur tree aiming at being above all timely. The table below brings together all of these works at the point of the landmark 2002 YCC Tree. This allows a researcher reviewing older published literature to quickly move between nomenclatures.