An Alu element is a short stretch of
DNA originally characterized by the action of the ''
Arthrobacter luteus
''Arthrobacter luteus'' (ALU) is a species of gram-positive bacteria in the genus '' Arthrobacter''. ''A. luteus'' is facultatively anaerobic, pleomorphic, branching, non-motile, non-sporulating, non-acid-fast, catalase-positive, and rod-sh ...
(Alu)''
restriction endonuclease
A restriction enzyme, restriction endonuclease, REase, ENase or'' restrictase '' is an enzyme that cleaves DNA into fragments at or near specific recognition sites within molecules known as restriction sites. Restriction enzymes are one class ...
.
''Alu'' elements are the most abundant
transposable element
A transposable element (TE, transposon, or jumping gene) is a nucleic acid sequence in DNA that can change its position within a genome, sometimes creating or reversing mutations and altering the cell's genetic identity and genome size. Trans ...
s, containing over one million copies dispersed throughout the
human genome
The human genome is a complete set of nucleic acid sequences for humans, encoded as DNA within the 23 chromosome pairs in cell nuclei and in a small DNA molecule found within individual mitochondria. These are usually treated separately as the ...
.
''Alu'' elements were thought to be selfish or parasitic DNA, because their sole known function is self reproduction. However, they are likely to play a role in evolution and have been used as
genetic marker A genetic marker is a gene or DNA sequence with a known location on a chromosome that can be used to identify individuals or species. It can be described as a variation (which may arise due to mutation or alteration in the genomic loci) that can be ...
s.
They are derived from the small cytoplasmic
7SL RNA, a component of the
signal recognition particle
The signal recognition particle (SRP) is an abundant, cytosolic, universally conserved ribonucleoprotein (protein-RNA complex) that recognizes and targets specific proteins to the endoplasmic reticulum in eukaryotes and the plasma membra ...
. ''Alu'' elements are highly conserved within primate
genome
In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding ...
s and originated in the genome of an ancestor of
Supraprimates
Euarchontoglires (synonymous with Supraprimates) is a clade and a superorder of mammals, the living members of which belong to one of the five following groups: rodents, lagomorphs, treeshrews, colugos, and primates.
Evolutionary affinities w ...
.
''Alu'' insertions have been implicated in several inherited human diseases and in various forms of cancer.
The study of Alu elements has also been important in elucidating human
population genetics
Population genetics is a subfield of genetics that deals with genetic differences within and between populations, and is a part of evolutionary biology. Studies in this branch of biology examine such phenomena as adaptation, speciation, and pop ...
and the
evolution
Evolution is change in the heritable characteristics of biological populations over successive generations. These characteristics are the expressions of genes, which are passed on from parent to offspring during reproduction. Variation ...
of
primate
Primates are a diverse order (biology), order of mammals. They are divided into the Strepsirrhini, strepsirrhines, which include the lemurs, galagos, and lorisids, and the Haplorhini, haplorhines, which include the Tarsiiformes, tarsiers and ...
s, including the
evolution of humans
Human evolution is the evolutionary process within the history of primates that led to the emergence of '' Homo sapiens'' as a distinct species of the hominid family, which includes the great apes. This process involved the gradual development ...
.
Alu family
The Alu family is a family of repetitive elements in
primate
Primates are a diverse order (biology), order of mammals. They are divided into the Strepsirrhini, strepsirrhines, which include the lemurs, galagos, and lorisids, and the Haplorhini, haplorhines, which include the Tarsiiformes, tarsiers and ...
genomes, including the
human
Humans (''Homo sapiens'') are the most abundant and widespread species of primate, characterized by bipedalism and exceptional cognitive skills due to a large and complex brain. This has enabled the development of advanced tools, culture, ...
genome
In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding ...
. Modern ''Alu'' elements are about 300
base pairs long and are therefore classified as
short interspersed nuclear element
Short interspersed nuclear elements (SINEs) are non-autonomous, non-coding transposable elements (TEs) that are about 100 to 700 base pairs in length. They are a class of retrotransposons, DNA elements that amplify themselves throughout eukaryo ...
s (SINEs) among the class of repetitive DNA elements. The typical structure is 5' - Part A - A5TACA6 - Part B - PolyA Tail - 3', where Part A and Part B (also known as "left arm" and "right arm") are similar nucleotide sequences. Expressed another way, it is believed modern ''Alu'' elements emerged from a head to tail fusion of two distinct FAMs (fossil antique monomers) over 100 million years ago, hence its dimeric structure of two similar, but distinct monomers (left and right arms) joined by an A-rich linker. Both monomers are thought to have evolved from 7SL, also known as
SRP RNA.
The length of the polyA tail varies between ''Alu'' families.
There are over one million ''Alu'' elements interspersed throughout the human genome, and it is estimated that about 10.7% of the human genome consists of ''Alu'' sequences. However, less than 0.5% are
polymorphic (i.e., occurring in more than one form or morph).
In 1988,
Jerzy Jurka
Jerzy Władysław Jurka (June 4, 1950 – July 19, 2014) was a Polish-American computational and molecular biologist. He served as the assistant director of research at the Linus Pauling Institute prior to founding the Genetic Information Res ...
and
Temple Smith discovered that ''Alu'' elements were split in two major subfamilies known as AluJ (named after Jurka) and AluS (named after Smith), and other Alu subfamilies were also independently discovered by several groups.
Later on, a sub-subfamily of AluS which included active Alu elements was given the separate name AluY. Dating back 65 million years, the AluJ lineage is the oldest and least active in the human genome. The younger AluS lineage is about 30 million years old and still contains some active elements. Finally, the AluY elements are the youngest of the three and have the greatest disposition to move along the human genome.
The discovery of ''Alu'' subfamilies led to the hypothesis of master/source genes, and provided the definitive link between transposable elements (active elements) and interspersed repetitive DNA (mutated copies of active elements).
Related elements
B1 elements in rats and mice are similar to Alus in that they also evolved from 7SL RNA, but they only have one left monomer arm. 95% percent of human Alus are also found in chimpanzees, and 50% of B elements in mice are also found in rats. These elements are mostly found in introns and upstream regulatory elements of genes.
The ancestral form of Alu and B1 is the fossil Alu monomer (FAM). Free-floating forms of the left and right arms exist, termed free left Alu monomers (FLAMs) and free right Alu monomers (FRAMs) respectively. A notable FLAM in primates is the
BC200 lncRNA
Brain cytoplasmic 200 long-noncoding RNA (or BC200 lncRNA) is a 200 nucleotide RNA transcript found predominantly in the brain with a primary function of regulating translation by inhibiting its initiation. As a long non-coding RNA, it belongs to ...
.
Sequence features
![LINE1s and SINEs](https://upload.wikimedia.org/wikipedia/commons/2/2f/LINE1s_and_SINEs.png)
Two main promoter "boxes" are found in Alu: a 5' A box with the consensus , and a 3' B box with the consensus (IUPAC
nucleic acid notation).
tRNA
Transfer RNA (abbreviated tRNA and formerly referred to as sRNA, for soluble RNA) is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes), that serves as the physical link between the mRNA and the amino a ...
s, which are transcribed by
RNA polymerase III
In eukaryote cells, RNA polymerase III (also called Pol III) is a protein that transcribes DNA to synthesize ribosomal 5S rRNA, tRNA and other small RNAs.
The genes transcribed by RNA Pol III fall in the category of "housekeeping" genes whose ...
, have a similar but stronger promoter structure. Both boxes are located in the left arm.
[
Alu elements contain four or fewer ]retinoic acid
Retinoic acid (used simplified here for all-''trans''-retinoic acid) is a metabolite of vitamin A1 (all-''trans''- retinol) that mediates the functions of vitamin A1 required for growth and development. All-''trans''-retinoic acid is required in ...
response element hexamer sites in its internal promoter, with the last one overlapping with the "B box". In this 7SL ( SRP) RNA example below, functional hexamers are underlined using a solid line, with the non-functional third hexamer denoted using a dotted line:
.
The recognition sequence of the '' Alu I'' endonuclease is 5' ag/ct 3'; that is, the enzyme cuts the DNA segment between the guanine
Guanine () (symbol G or Gua) is one of the four main nucleobases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, and thymine ( uracil in RNA). In DNA, guanine is paired with cytosine. The guanine nucleoside is c ...
and cytosine
Cytosine () (symbol C or Cyt) is one of the four nucleobases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attached ...
residues (in lowercase above).
Alu elements
''Alu'' elements are responsible for regulation of tissue-specific genes. They are also involved in the transcription of nearby genes and can sometimes change the way a gene is expressed.
''Alu'' elements are retrotransposon
Retrotransposons (also called Class I transposable elements or transposons via RNA intermediates) are a type of genetic component that copy and paste themselves into different genomic locations (transposon) by converting RNA back into DNA through ...
s and look like DNA copies made from RNA polymerase III
In eukaryote cells, RNA polymerase III (also called Pol III) is a protein that transcribes DNA to synthesize ribosomal 5S rRNA, tRNA and other small RNAs.
The genes transcribed by RNA Pol III fall in the category of "housekeeping" genes whose ...
-encoded RNAs. ''Alu'' elements do not encode for protein products. They are replicated as any other DNA sequence, but depend on LINE
Line most often refers to:
* Line (geometry), object with zero thickness and curvature that stretches to infinity
* Telephone line, a single-user circuit on a telephone communication system
Line, lines, The Line, or LINE may also refer to:
Art ...
retrotransposons for generation of new elements.
''Alu'' element replication and mobilization begins by interactions with signal recognition particle
The signal recognition particle (SRP) is an abundant, cytosolic, universally conserved ribonucleoprotein (protein-RNA complex) that recognizes and targets specific proteins to the endoplasmic reticulum in eukaryotes and the plasma membra ...
s (SRPs), which aid newly translated proteins to reach their final destinations. ''Alu'' RNA forms a specific RNA:protein complex with a protein heterodimer consisting of SRP9 and SRP14. SRP9/14 facilitates ''Alu'''s attachment to ribosomes that capture nascent L1 proteins. Thus, an ''Alu'' element can take control of the L1 protein's reverse transcriptase
A reverse transcriptase (RT) is an enzyme used to generate complementary DNA (cDNA) from an RNA template, a process termed reverse transcription. Reverse transcriptases are used by viruses such as HIV and hepatitis B to replicate their genomes, ...
, ensuring that the ''Alu'''s RNA sequence gets copied into the genome rather than the L1's mRNA.
''Alu'' elements in primates form a fossil record that is relatively easy to decipher because ''Alu'' element insertion events have a characteristic signature that is both easy to read and faithfully recorded in the genome from generation to generation. The study of ''Alu Y'' elements (the more recently evolved) thus reveals details of ancestry because individuals will most likely only share a particular ''Alu'' element insertion if they have a common ancestor. This is because insertion of an Alu element occurs only 100 - 200 times per million years, and no known mechanism of deletion of one has been found. Therefore, individuals with an element likely descended from an ancestor with one—and vice versa, for those without. In genetics, the presence or lack thereof of a recently inserted ''Alu'' element may be a good property to consider when studying human evolution.
Most human ''Alu'' element insertions can be found in the corresponding positions in the genomes of other primates, but about 7,000 ''Alu'' insertions are unique to humans.
Impact in humans
''Alu'' elements have been proposed to affect gene expression
Gene expression is the process by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, protein or non-coding RNA, and ultimately affect a phenotype, as the final effect. ...
and been found to contain functional promoter regions for steroid hormone receptor
Steroid hormone receptors are found in the nucleus, cytosol, and also on the plasma membrane of target cells. They are generally intracellular receptors (typically cytoplasmic or nuclear) and initiate signal transduction for steroid hormones which ...
s. Due to the abundant content of CpG dinucleotides
The CpG sites or CG sites are regions of DNA where a cytosine nucleotide is followed by a guanine nucleotide in the linear DNA sequence, sequence of Base pair, bases along its Directionality (molecular biology), 5' → 3' direction. CpG sites o ...
found in ''Alu'' elements, these regions serve as a site of methylation
In the chemical sciences, methylation denotes the addition of a methyl group on a substrate, or the substitution of an atom (or group) by a methyl group. Methylation is a form of alkylation, with a methyl group replacing a hydrogen atom. These t ...
, contributing to up to 30% of the methylation sites in the human genome. ''Alu'' elements are also a common source of mutations in humans; however, such mutations are often confined to non-coding regions of pre-mRNA (introns
An intron is any nucleotide sequence within a gene that is not expressed or operative in the final RNA product. The word ''intron'' is derived from the term ''intragenic region'', i.e. a region inside a gene."The notion of the cistron .e., gene ...
), where they have little discernible impact on the bearer. Mutations in the introns (or non-coding regions of RNA) have little or no effect on phenotype of an individual if the coding portion of individual's genome does not contain mutations. The Alu insertions that can be detrimental to the human body are inserted into coding regions (exons
An exon is any part of a gene that will form a part of the final mature RNA produced by that gene after introns have been removed by RNA splicing. The term ''exon'' refers to both the DNA sequence within a gene and to the corresponding sequenc ...
) or into mRNA after the process of splicing.
However, the variation generated can be used in studies of the movement and ancestry of human populations, and the mutagenic effect of ''Alu'' and retrotransposons in general has played a major role in the evolution of the human genome. There are also a number of cases where ''Alu'' insertions or deletions are associated with specific effects in humans:
Associations with human disease
''Alu'' insertions are sometimes disruptive and can result in inherited disorders. However, most ''Alu'' variation acts as markers that segregate with the disease so the presence of a particular ''Alu'' allele
An allele (, ; ; modern formation from Greek ἄλλος ''állos'', "other") is a variation of the same sequence of nucleotides at the same place on a long DNA molecule, as described in leading textbooks on genetics and evolution.
::"The chro ...
does not mean that the carrier will definitely get the disease. The first report of ''Alu''-mediated recombination causing a prevalent inherited predisposition to cancer was a 1995 report about ''hereditary nonpolyposis colorectal cancer''. In the human genome, the most recently active have been the 22 AluY and 6 AluS Transposon Element subfamilies due to their inherited activity to cause various cancers. Thus due to their major heritable damage it is important to understand the causes that affect their transpositional activity.
The following human diseases have been linked with ''Alu'' insertions:
* Alport syndrome
Alport syndrome is a genetic disorder affecting around 1 in 5,000-10,000 children, characterized by glomerulonephritis, end-stage kidney disease, and hearing loss. Alport syndrome can also affect the eyes, though the changes do not usually affect ...
* Breast cancer
Breast cancer is cancer that develops from breast tissue. Signs of breast cancer may include a lump in the breast, a change in breast shape, dimpling of the skin, milk rejection, fluid coming from the nipple, a newly inverted nipple, or ...
* chorioretinal degeneration
* Diabetes mellitus
Diabetes, also known as diabetes mellitus, is a group of metabolic disorders characterized by a high blood sugar level (hyperglycemia) over a prolonged period of time. Symptoms often include frequent urination, increased thirst and increased ...
type II
* Ewing's sarcoma
* Familial hypercholesterolemia
Familial hypercholesterolemia (FH) is a genetic disorder characterized by high cholesterol levels, specifically very high levels of low-density lipoprotein (LDL cholesterol), in the blood and early cardiovascular disease. The most common mutati ...
* Hemophilia
Haemophilia, or hemophilia (), is a mostly hereditary, inherited genetic disorder that impairs the body's ability to coagulation, make blood clots, a process needed to hemostasis, stop bleeding. This results in people bleeding for a longer ti ...
* Leigh syndrome
Leigh syndrome (also called Leigh disease and subacute necrotizing encephalomyelopathy) is an inherited neurometabolic disorder that affects the central nervous system. It is named after Archibald Denis Leigh, a British neuropsychiatrist who fi ...
* mucopolysaccharidosis
Mucopolysaccharidoses are a group of metabolic disorders caused by the absence or malfunctioning of lysosomal enzymes needed to break down molecules called glycosaminoglycans (GAGs). These long chains of sugar carbohydrates occur within the c ...
VII
* Neurofibromatosis
Neurofibromatosis (NF) is a group of three conditions in which tumors grow in the nervous system. The three types are neurofibromatosis type I (NF1), neurofibromatosis type II (NF2), and schwannomatosis. In NF1 symptoms include light brown ...
*Macular degeneration
Macular degeneration, also known as age-related macular degeneration (AMD or ARMD), is a medical condition which may result in blurred or no vision in the center of the visual field. Early on there are often no symptoms. Over time, however, som ...
And the following diseases have been associated with single-nucleotide DNA variation
In genetics, a single-nucleotide polymorphism (SNP ; plural SNPs ) is a germline substitution of a single nucleotide at a specific position in the genome. Although certain definitions require the substitution to be present in a sufficiently larg ...
s in Alu elements affecting transcription levels:
* Alzheimer's disease
* Lung cancer
Lung cancer, also known as lung carcinoma (since about 98–99% of all lung cancers are carcinomas), is a malignant lung tumor characterized by uncontrolled cell growth in tissues of the lung. Lung carcinomas derive from transformed, malign ...
* Gastric cancer
Stomach cancer, also known as gastric cancer, is a cancer that develops from the lining of the stomach. Most cases of stomach cancers are gastric carcinomas, which can be divided into a number of subtypes, including gastric adenocarcinomas. Ly ...
Associated human mutations
* The ''ACE'' gene, encoding angiotensin-converting enzyme
Angiotensin-converting enzyme (), or ACE, is a central component of the renin–angiotensin system (RAS), which controls blood pressure by regulating the volume of fluids in the body. It converts the hormone angiotensin I to the active vasoconstr ...
, has 2 common variants, one with an ''Alu'' insertion (''ACE''-I) and one with the ''Alu'' deleted (''ACE''-D). This variation has been linked to changes in sporting ability: the presence of the ''Alu'' element is associated with better performance in endurance-oriented events (e.g. triathlons), whereas its absence is associated with strength- and power-oriented performance.
* The opsin
Animal opsins are G-protein-coupled receptors and a group of proteins made light-sensitive via a chromophore, typically retinal. When bound to retinal, opsins become Retinylidene proteins, but are usually still called opsins regardless. Most pro ...
gene duplication
Gene duplication (or chromosomal duplication or gene amplification) is a major mechanism through which new genetic material is generated during molecular evolution. It can be defined as any duplication of a region of DNA that contains a gene ...
which resulted in the re-gaining of trichromacy
Trichromacy or trichromatism is the possessing of three independent channels for conveying color information, derived from the three different types of cone cells in the eye. Organisms with trichromacy are called trichromats.
The normal expla ...
in Old World primates (including humans) is flanked by an ''Alu'' element, implicating the role of ''Alu'' in the evolution of three colour vision.
References
External links
*
*
{{DEFAULTSORT:Alu Sequence
Repetitive DNA sequences
Human genetics