NBPF Repeat

The Olduvai domain, known until 2018 as DUF1220 (domain of unknown function 1220) and the NBPF repeat, is a protein domain that shows a striking human lineage-specific (HLS) increase in copy number and appears to be involved in human brain evolution. The protein domain has also been linked to several neurogenetic disorders such as schizophrenia (in reduced copies) and increased severity of autism (in increased copies). In 2018, it was named by its discoverers after Olduvai Gorge in Tanzania, one of the most important archaeological sites for early humans, to reflect data indicating its role in human brain size and evolution.

Olduvai domains form the core of ''NBPF'' genes, which first appeared in placental mammals and experienced a rapid expansion in monkeys (simians) through duplication to reach over 20 genes in humans. In humans, Olduvai domains are often repeated dozens of times within these genes. The only other gene in which an Olduvai domain has been found is mammalian myomegalin, believed to be the origin of the ''NBPF'' genes via duplication. Myomegalin itself arose from a duplication of ''CDK5RAP2'', and all of these genes have been implicated in the development of neurons.

Olduvai copy number is highest in humans (~289, with person-to-person variation), reduced in other great apes (~125 copies in chimpanzees, ~99 in gorillas, ~92 in orangutans), further reduced in Old World monkeys (~35), single- or low-copy in non-primate mammals and absent in non-mammals. Consequently, the Olduvai domain shows the largest HLS increase in copy number of any protein-coding region relative to any other living species, with ~160 more copies in humans than in chimpanzees. The increase in Olduvai copy number appears to correlate directly with several brain phenotypes, including the increase in brain size seen over the course of evolution.

In the human genome, DUF1220 sequences are located primarily on chromosome 1 in region 1q21.1-q21.2, with several copies also found at 1p36, 1p13.3, and 1p12. They are approximately 65 amino acids in length and are encoded by a two-exon doublet. Sequences encoding DUF1220 domains show rhythmicity, resonance and signs of positive selection, especially in primates, and are expressed in several human tissues including brain, where their expression is restricted to neurons. Nuclear magnetic resonance backbone chemical shift analyses suggest that the various HLS domains do not interact with one another.
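
As a rough illustration of the copy-number comparison above, the following Python sketch tabulates the approximate figures quoted in the text and computes the human excess over each other group. The names `OLDUVAI_COPIES`, `excess_copies` and `fold_change` are hypothetical and introduced only for illustration; the counts are the rounded estimates given above, not a dataset.

```python
# Approximate Olduvai copy numbers as quoted in the text (rounded estimates).
OLDUVAI_COPIES = {
    "human": 289,
    "chimpanzee": 125,
    "gorilla": 99,
    "orangutan": 92,
    "Old World monkeys": 35,
}


def excess_copies(other, reference="human", table=OLDUVAI_COPIES):
    """Return how many more copies `reference` has than `other`."""
    return table[reference] - table[other]


def fold_change(other, reference="human", table=OLDUVAI_COPIES):
    """Return the ratio of `reference` copy number to `other` copy number."""
    return table[reference] / table[other]


if __name__ == "__main__":
    for group in OLDUVAI_COPIES:
        if group == "human":
            continue
        print(f"{group}: +{excess_copies(group)} copies, "
              f"{fold_change(group):.2f}x")
```

With these rounded inputs the human excess over chimpanzees comes to 164 copies, which is consistent with the figure of ~160 quoted above.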


Function

Research has found that the Olduvai domain has a role in the development of neurons. Specifically, it appears to increase the number of neural stem cells by prolonging the developmental period of neurons. When Olduvai copy number is reduced, neurons appear to mature faster and divide less. Conversely, when Olduvai copy number is increased, neurons appear to mature for longer and divide in higher numbers.


Clinical significance


Autism

DUF1220 copy number variation has been investigated in autism, a disorder associated with deletions and duplications of 1q21, although the causative loci within such regions had not previously been identified. This research found that copy number of DUF1220 subtype CON1 is linearly associated with increasing severity of social impairment in autism. This evidence is relevant to current theories proposing that autism and psychosis are fundamentally related. The precise nature of this relationship is under debate, with alternative lines of argument suggesting that the two are diametrically opposed diseases, exist on a continuum, or exhibit a more nuanced relationship.


Schizophrenia

Schizophrenia is a neurological condition in which there are issues in brain development. In contrast with autism, copy number increase of DUF1220 subtypes CON1 and HLS1 is associated with reduced severity of positive symptoms in schizophrenia.


Cognitive brain function and brain size

The dosage of the Olduvai protein domain increases along with brain size, as seen in the evolution from other primates to humans. A targeted 1q21 array CGH investigation of the potential association between DUF1220 and brain size found that DUF1220 copy number decrease is associated with microcephaly in individuals with 1q21 CNVs. Of all 1q21 sequences tested, DUF1220 sequences were the only ones to show consistent correlation between copy number and brain size in both disease (micro/macrocephaly) and non-disease populations. In addition, in primates there is a significant correlation between DUF1220 copy number and both brain size and brain cortical neuron number.

A 2015 study found that Olduvai copy number is linearly correlated with increased cognitive function, as measured by total IQ and mathematical aptitude scores, a finding replicated in two independent groups from different countries. The study specifically examined the Olduvai variants CON1 and CON2, noting that measurement of the very high copy number HLS1–3 variants had been challenging with the technologies then available. It found that those with a higher number of copies of CON2 had higher scores on the WISC IQ test and the Progressive Achievement Mathematics test. The strength of the association between CON2 and IQ was reported to be greater than that of any other single genetic candidate reported in any previous study. This effect was significantly more pronounced in males. The CON2 copy number of most of the males ranged from 26 to 33, with a mean of 29, and each additional copy was associated with an average IQ score increase of 3.3. CON1 number, on the other hand, was not found to have a significant association with IQ scores.

Brain region associations were also studied. CON1 and CON2 copy number were found to be associated with greater volumes and areas of all four bilateral lobes of the brain studied. Most notably, right frontal lobe surface area showed the strongest association with both CON1 and CON2 copy number; this association was slightly stronger for CON2. There were no CON1 or CON2 associations with white matter volume or gyrification index. CON1 and CON2 number had previously been found to correlate with grey matter volume in another study.

These volume and area increases in the grey matter of all cerebral lobes were found to significantly correlate with higher IQ scores. Notably, bilateral temporal surface area appeared to correlate with a ''progressive'' increase in IQ, with left temporal surface area being slightly more important. However, CON2's effects on IQ remained substantial even after eliminating bilateral temporal surface area, right frontal lobe surface area and total grey matter volume as factors. A portion of CON2's association with IQ was nevertheless mediated by its effects on bilateral temporal surface area. Notably, this contribution to IQ was larger than that of its effects on right frontal lobe surface area, despite the fact that it increased this area the most.

It was concluded that the Olduvai domain appears to have a role in neural stem cell proliferation, since this proliferation seems to be the major contributor to lobe surface area while also explaining the effects of Olduvai dosage that could not be explained by brain region measurements. Corroborating this, stem cell cultures have also shown Olduvai's proliferative effects on neural stem cells. However, Olduvai also had effects on cortical thickness that appeared to be the result of mature neuron cell divisions, corroborated by higher neuron numbers in primates being associated with Olduvai copy number. Additionally, studies have shown that cerebral size in primates is almost exclusively correlated with a linear addition of neurons, rather than neuronal size or density.

CON2's effects on IQ were found to be strongly dependent on sex: no significant association was found in females. Additionally, males with higher CON2 numbers appeared to have the largest increases in IQ over other males of the same age at a mean age of 11 years. The correlation then appeared to decrease with age. A proportional advantage was also present in younger individuals. This corroborated studies showing that brain growth in the brightest children, and children with autism, increases after birth and peaks at around age 11 or 12 before slowing down in adulthood. In the second group, birth head circumference was not found to significantly affect IQ, further corroborating these studies. A previous genetic analysis of the second cohort had ruled out any effect on IQ of the other genome-wide copy number variations they carried, further suggesting a critical period of activity of CON1 and CON2.

This association has important implications for understanding the interplay between cognitive function and autism phenotypes. These findings also provide additional support for the involvement of Olduvai in a genomic trade-off model involving the human brain: the same key genes that have been major contributors to the evolutionary expansion of the human brain and human cognitive capacity may also, in different combinations, underlie psychiatric disorders such as autism and schizophrenia.


1q21.1 deletion and duplication syndromes

Olduvai domains are one of the many genetic elements located in the 1q21.1 region, which has a high number of repeated elements and therefore a high tendency towards deletions and duplications. This has led to the identification of several conditions that involve this region, including TAR syndrome and the more general classifications of 1q21.1 deletion syndrome and 1q21.1 duplication syndrome. Studies of deletions and duplications in the 1q21.1 region have consistently revealed microcephaly in association with deletions and macrocephaly in association with duplications.


Evolution

Genome sequences indicate that the Olduvai protein domain first appears as part of the myomegalin gene (''PDE4DIP'') on chromosome 1q36 in mammals at least 200 million years ago. Myomegalin is a paralog (duplicated relative) of CDK5RAP2, a centrosomal protein involved in the cell cycle, of neurons especially, that lacks Olduvai sequences but, when mutated, has been implicated in microcephaly. Orthologs of myomegalin can be seen in vertebrates as far back as bony fish, around 450 million years ago; however, the Olduvai domain is not clearly seen until the emergence of mammals.

The first Olduvai domain located outside of myomegalin is seen approximately 100–150 million years ago, when the domain was included in a duplication and transposition event that created a new gene, ''NBPF1'', which would later give rise to a family of duplicated ''NBPF'' genes. At least one ''NBPF'' gene has been found in Laurasiatherians, Euarchontoglires and elephants (but not other Afrotherians), but not in Xenarthrans (which include sloths). It was also found that several rodents, bats and eulipotyphlans (which include hedgehogs) had lost the gene.

It was found in 2012 that the exceptional increase in human Olduvai copy number was a result of multiple duplications within the ''NBPF'' genes, primarily involving a sequential series of three variants of the domain. These three variants were also found in gorilla and chimpanzee genomes but are not repeated in triplet form and are only present in around five copies overall. Based on this, the variants were given the names HLS1, HLS2 and HLS3, for ''human lineage-specific'', and together they were named the ''HLS DUF1220 triplet''. Hyper-amplification of the triplet resulted in the addition of ~149 copies of Olduvai specifically to the human lineage since its divergence from the genus ''Pan'' (chimpanzees and bonobos) approximately 6 million years ago.


Evolutionary adaptation in humans

In 2009, it was proposed that the larger brain size conferred by a high number of Olduvai domain copies in humans carried an evolutionary advantage which led to the persistence and maintenance of Olduvai copies within this high range. At the same time, the Olduvai domains, like many other repetitive genetic elements, are highly susceptible to increases and decreases in copy number through duplications or deletions. The researchers referenced various studies from 2005 to 2009 that found that a higher number of copies contributed to autism severity while a lower number contributed to schizophrenia severity. Since these disorders are fairly common among humans, it was proposed that this explained their prevalence. This model was elaborated in more detail in a 2018 article that included one of the original authors, in light of new evidence from the intervening years.

In 2012, a genetic explanation for the high instability and persistence of the Olduvai-containing regions was put forward: it was found that the HLS Olduvai domains had been affected by a known pericentric inversion (in which the region around a chromosome's centromere inverts) that occurred between 1p11.2 and 1q21.2 in the human lineage after the separation from chimpanzees. This was theorised to have contributed to their hyper-amplification specifically in humans, because pairs of chromosomes in which one contains a pericentric inversion and the other does not (a form of heterozygosity) have difficulties in recombination, which can lead to non-allelic homologous recombination, in which deletions and duplications are much more likely to occur. This, combined with the possibility that a higher number of Olduvai copies carried an evolutionary advantage, could have resulted in the rapid duplication and persistence of Olduvai domains in humans.


Relation to NOTCH2NL genes in brain development

There are four human-specific ''NOTCH2NL'' genes: ''NOTCH2NLA'', ''NOTCH2NLB'' and ''NOTCH2NLC'', located on 1q21.1, and ''NOTCH2NLR'', located on 1p11.2. While chimpanzee and gorilla have copies of ''NOTCH2NL'', none are functional. Immediately adjacent to, and downstream of, each of these four ''NOTCH'' paralogs is an ''NBPF'' gene with its DUF1220 domains in the same orientation as its ''NOTCH2NL'' partner. This striking genomic arrangement suggests that each of the additional copies of ''NOTCH2NL'' that appeared in the human genome did not duplicate as a single gene, but rather did so as a two-gene module, composed of one ''NOTCH2NL'' gene and one ''NBPF'' gene. While the ''NOTCH2NL'' paralogs (and their ''NBPF'' partners) went from one gene to four in humans, DUF1220 copies encoded by these ''NBPF'' genes underwent human-specific hyper-amplification, increasing from 13 copies (encoded by ''NBPF26'') to 132 (i.e., adding 119 DUF1220 copies encoded by ''NBPF10'', ''NBPF14'' and ''NBPF19'').


History

The Olduvai domain was first identified in 2004 in a study of copy number differences between human and great ape species using genome-wide array comparative genomic hybridization (CGH), which labels DNA from each source with a different fluorescent dye, hybridizes both to the same array, and reads differences in colour as differences in copy number. The study found 134 genes that showed human lineage-specific increases in copy number, one of which, ''NBPF15'' (then known as MGC8902, cDNA IMAGE:843276), contained six Olduvai domains. The domain was unnamed at the time and was given a Pfam placeholder name for domains of unknown function
when entered into its database. The ''NBPF'' (neuroblastoma breakpoint family) gene family, which contains all the known Olduvai domains except the one found in myomegalin, was independently identified by Vandepoele et al. in 2005, when a gene (subsequently named ''NBPF1'') was found to be disrupted by a chromosomal translocation at 1p36 (i.e. it lay at the breakpoint) in a boy with neuroblastoma reported by G. Laureys et al. in 1990. The researchers noticed that a novel protein domain that seemed to match the DUF1220 Pfam entry was present in multiple copies in this gene and in several other places on chromosome 1, which led them to define 22 ''NBPF'' genes, and they named the domain the ''NBPF repeat''. In 2018, DUF1220 was renamed by its discoverers after Olduvai Gorge in Tanzania, one of the most important archaeological sites for early humans, to reflect data indicating its role in human brain size and evolution.

