Alu Element
   HOME

TheInfoList



OR:

An Alu element is a short stretch of DNA originally characterized by the action of the ''
Arthrobacter luteus ''Arthrobacter luteus'' (ALU) is a species of gram-positive bacteria in the genus ''Arthrobacter''. ''A. luteus'' is facultatively anaerobic, pleomorphic, branching, non-motile, non-sporulating, non- acid-fast, catalase-positive, and rod-sh ...
(Alu)''
restriction endonuclease A restriction enzyme, restriction endonuclease, REase, ENase or'' restrictase '' is an enzyme that cleaves DNA into fragments at or near specific recognition sites within molecules known as restriction sites. Restriction enzymes are one class o ...
. ''Alu'' elements are the most abundant
transposable element A transposable element (TE, transposon, or jumping gene) is a nucleic acid sequence in DNA that can change its position within a genome, sometimes creating or reversing mutations and altering the cell's genetic identity and genome size. Transp ...
s, containing over one million copies dispersed throughout the
human genome The human genome is a complete set of nucleic acid sequences for humans, encoded as DNA within the 23 chromosome pairs in cell nuclei and in a small DNA molecule found within individual mitochondria. These are usually treated separately as the n ...
. ''Alu'' elements were thought to be selfish or parasitic DNA, because their sole known function is self reproduction. However, they are likely to play a role in evolution and have been used as
genetic marker A genetic marker is a gene or DNA sequence with a known location on a chromosome that can be used to identify individuals or species. It can be described as a variation (which may arise due to mutation or alteration in the genomic loci) that can be ...
s. They are derived from the small cytoplasmic
7SL RNA The signal recognition particle RNA, (also known as 7SL, 6S or 4.5S RNA) is part of the signal recognition particle (SRP) ribonucleoprotein complex. SRP recognizes the signal peptide and binds to the ribosome, halting protein synthesis. Signal re ...
, a component of the
signal recognition particle The signal recognition particle (SRP) is an abundant, cytosolic, universally conserved ribonucleoprotein (protein- RNA complex) that recognizes and targets specific proteins to the endoplasmic reticulum in eukaryotes and the plasma membr ...
. ''Alu'' elements are highly conserved within primate
genome In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding ge ...
s and originated in the genome of an ancestor of Supraprimates. ''Alu'' insertions have been implicated in several inherited human diseases and in various forms of cancer. The study of Alu elements has also been important in elucidating human
population genetics Population genetics is a subfield of genetics that deals with genetic differences within and between populations, and is a part of evolutionary biology. Studies in this branch of biology examine such phenomena as adaptation, speciation, and pop ...
and the
evolution Evolution is change in the heritable characteristics of biological populations over successive generations. These characteristics are the expressions of genes, which are passed on from parent to offspring during reproduction. Variation ...
of
primate Primates are a diverse order of mammals. They are divided into the strepsirrhines, which include the lemurs, galagos, and lorisids, and the haplorhines, which include the tarsiers and the simians (monkeys and apes, the latter including huma ...
s, including the evolution of humans.


Alu family

The Alu family is a family of repetitive elements in
primate Primates are a diverse order of mammals. They are divided into the strepsirrhines, which include the lemurs, galagos, and lorisids, and the haplorhines, which include the tarsiers and the simians (monkeys and apes, the latter including huma ...
genomes, including the
human Humans (''Homo sapiens'') are the most abundant and widespread species of primate, characterized by bipedalism and exceptional cognitive skills due to a large and complex brain. This has enabled the development of advanced tools, culture, ...
genome In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding ge ...
. Modern ''Alu'' elements are about 300
base pair A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA ...
s long and are therefore classified as
short interspersed nuclear element Short interspersed nuclear elements (SINEs) are non-autonomous, non-coding transposable elements (TEs) that are about 100 to 700 base pairs in length. They are a class of retrotransposons, DNA elements that amplify themselves throughout eukaryoti ...
s (SINEs) among the class of repetitive DNA elements. The typical structure is 5' - Part A - A5TACA6 - Part B - PolyA Tail - 3', where Part A and Part B (also known as "left arm" and "right arm") are similar nucleotide sequences. Expressed another way, it is believed modern ''Alu'' elements emerged from a head to tail fusion of two distinct FAMs (fossil antique monomers) over 100 million years ago, hence its dimeric structure of two similar, but distinct monomers (left and right arms) joined by an A-rich linker. Both monomers are thought to have evolved from 7SL, also known as
SRP RNA The signal recognition particle RNA, (also known as 7SL, 6S or 4.5S RNA) is part of the signal recognition particle (SRP) ribonucleoprotein complex. SRP recognizes the signal peptide and binds to the ribosome, halting protein synthesis. Signal re ...
. The length of the polyA tail varies between ''Alu'' families. There are over one million ''Alu'' elements interspersed throughout the human genome, and it is estimated that about 10.7% of the human genome consists of ''Alu'' sequences. However, less than 0.5% are polymorphic (i.e., occurring in more than one form or morph). In 1988,
Jerzy Jurka Jerzy Władysław Jurka (June 4, 1950 – July 19, 2014) was a Polish-American computational and molecular biologist. He served as the assistant director of research at the Linus Pauling Institute prior to founding the Genetic Information Resear ...
and
Temple Smith Temple Ferris Smith (born March 7, 1939) is an emeritus professor in biomedical engineering who helped to develop the Smith-Waterman algorithm with Michael Waterman in 1981. The Smith-Waterman algorithm serves as the basis for multi sequence com ...
discovered that ''Alu'' elements were split in two major subfamilies known as AluJ (named after Jurka) and AluS (named after Smith), and other Alu subfamilies were also independently discovered by several groups. Later on, a sub-subfamily of AluS which included active Alu elements was given the separate name AluY. Dating back 65 million years, the AluJ lineage is the oldest and least active in the human genome. The younger AluS lineage is about 30 million years old and still contains some active elements. Finally, the AluY elements are the youngest of the three and have the greatest disposition to move along the human genome. The discovery of ''Alu'' subfamilies led to the hypothesis of master/source genes, and provided the definitive link between transposable elements (active elements) and interspersed repetitive DNA (mutated copies of active elements).


Related elements

B1 elements in rats and mice are similar to Alus in that they also evolved from 7SL RNA, but they only have one left monomer arm. 95% percent of human Alus are also found in chimpanzees, and 50% of B elements in mice are also found in rats. These elements are mostly found in introns and upstream regulatory elements of genes. The ancestral form of Alu and B1 is the fossil Alu monomer (FAM). Free-floating forms of the left and right arms exist, termed free left Alu monomers (FLAMs) and free right Alu monomers (FRAMs) respectively. A notable FLAM in primates is the BC200 lncRNA.


Sequence features

Two main promoter "boxes" are found in Alu: a 5' A box with the consensus , and a 3' B box with the consensus (IUPAC
nucleic acid notation The nucleic acid notation currently in use was first formalized by the International Union of Pure and Applied Chemistry (IUPAC) in 1970. This universally accepted notation uses the Roman characters G, C, A, and T, to represent the four nucleotides ...
).
tRNA Transfer RNA (abbreviated tRNA and formerly referred to as sRNA, for soluble RNA) is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes), that serves as the physical link between the mRNA and the amino ac ...
s, which are transcribed by
RNA polymerase III In eukaryote cells, RNA polymerase III (also called Pol III) is a protein that transcribes DNA to synthesize ribosomal 5S rRNA, tRNA and other small RNAs. The genes transcribed by RNA Pol III fall in the category of "housekeeping" genes whose e ...
, have a similar but stronger promoter structure. Both boxes are located in the left arm. Alu elements contain four or fewer
retinoic acid Retinoic acid (used simplified here for all-''trans''-retinoic acid) is a metabolite of vitamin A1 (all-''trans''-retinol) that mediates the functions of vitamin A1 required for growth and development. All-''trans''-retinoic acid is required in ...
response element hexamer sites in its internal promoter, with the last one overlapping with the "B box". In this 7SL ( SRP) RNA example below, functional hexamers are underlined using a solid line, with the non-functional third hexamer denoted using a dotted line: . The recognition sequence of the '' Alu I'' endonuclease is 5' ag/ct 3'; that is, the enzyme cuts the DNA segment between the
guanine Guanine () ( symbol G or Gua) is one of the four main nucleobases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, and thymine (uracil in RNA). In DNA, guanine is paired with cytosine. The guanine nucleoside is called ...
and
cytosine Cytosine () ( symbol C or Cyt) is one of the four nucleobases found in DNA and RNA, along with adenine, guanine, and thymine (uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attached (an am ...
residues (in lowercase above).


Alu elements

''Alu'' elements are responsible for regulation of tissue-specific genes. They are also involved in the transcription of nearby genes and can sometimes change the way a gene is expressed. ''Alu'' elements are
retrotransposon Retrotransposons (also called Class I transposable elements or transposons via RNA intermediates) are a type of genetic component that copy and paste themselves into different genomic locations (transposon) by converting RNA back into DNA through ...
s and look like DNA copies made from
RNA polymerase III In eukaryote cells, RNA polymerase III (also called Pol III) is a protein that transcribes DNA to synthesize ribosomal 5S rRNA, tRNA and other small RNAs. The genes transcribed by RNA Pol III fall in the category of "housekeeping" genes whose e ...
-encoded RNAs. ''Alu'' elements do not encode for protein products. They are replicated as any other DNA sequence, but depend on LINE retrotransposons for generation of new elements. ''Alu'' element replication and mobilization begins by interactions with
signal recognition particle The signal recognition particle (SRP) is an abundant, cytosolic, universally conserved ribonucleoprotein (protein- RNA complex) that recognizes and targets specific proteins to the endoplasmic reticulum in eukaryotes and the plasma membr ...
s (SRPs), which aid newly translated proteins to reach their final destinations. ''Alu'' RNA forms a specific RNA:protein complex with a protein heterodimer consisting of SRP9 and SRP14. SRP9/14 facilitates ''Alu'''s attachment to ribosomes that capture nascent L1 proteins. Thus, an ''Alu'' element can take control of the L1 protein's
reverse transcriptase A reverse transcriptase (RT) is an enzyme used to generate complementary DNA (cDNA) from an RNA template, a process termed reverse transcription. Reverse transcriptases are used by viruses such as HIV and hepatitis B to replicate their genomes, ...
, ensuring that the ''Alu'''s RNA sequence gets copied into the genome rather than the L1's mRNA. ''Alu'' elements in primates form a fossil record that is relatively easy to decipher because ''Alu'' element insertion events have a characteristic signature that is both easy to read and faithfully recorded in the genome from generation to generation. The study of ''Alu Y'' elements (the more recently evolved) thus reveals details of ancestry because individuals will most likely only share a particular ''Alu'' element insertion if they have a common ancestor. This is because insertion of an Alu element occurs only 100 - 200 times per million years, and no known mechanism of deletion of one has been found. Therefore, individuals with an element likely descended from an ancestor with one—and vice versa, for those without. In genetics, the presence or lack thereof of a recently inserted ''Alu'' element may be a good property to consider when studying human evolution. Most human ''Alu'' element insertions can be found in the corresponding positions in the genomes of other primates, but about 7,000 ''Alu'' insertions are unique to humans.


Impact in humans

''Alu'' elements have been proposed to affect
gene expression Gene expression is the process by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, protein or non-coding RNA, and ultimately affect a phenotype, as the final effect. The ...
and been found to contain functional promoter regions for
steroid hormone receptor Steroid hormone receptors are found in the nucleus, cytosol, and also on the plasma membrane of target cells. They are generally intracellular receptors (typically cytoplasmic or nuclear) and initiate signal transduction for steroid hormones which ...
s. Due to the abundant content of CpG dinucleotides found in ''Alu'' elements, these regions serve as a site of
methylation In the chemical sciences, methylation denotes the addition of a methyl group on a substrate, or the substitution of an atom (or group) by a methyl group. Methylation is a form of alkylation, with a methyl group replacing a hydrogen atom. These t ...
, contributing to up to 30% of the methylation sites in the human genome. ''Alu'' elements are also a common source of mutations in humans; however, such mutations are often confined to non-coding regions of pre-mRNA (
introns An intron is any nucleotide sequence within a gene that is not expressed or operative in the final RNA product. The word ''intron'' is derived from the term ''intragenic region'', i.e. a region inside a gene."The notion of the cistron .e., gene ...
), where they have little discernible impact on the bearer. Mutations in the introns (or non-coding regions of RNA) have little or no effect on phenotype of an individual if the coding portion of individual's genome does not contain mutations. The Alu insertions that can be detrimental to the human body are inserted into coding regions ( exons) or into mRNA after the process of splicing. However, the variation generated can be used in studies of the movement and ancestry of human populations, and the mutagenic effect of ''Alu'' and retrotransposons in general has played a major role in the evolution of the human genome. There are also a number of cases where ''Alu'' insertions or deletions are associated with specific effects in humans:


Associations with human disease

''Alu'' insertions are sometimes disruptive and can result in inherited disorders. However, most ''Alu'' variation acts as markers that segregate with the disease so the presence of a particular ''Alu''
allele An allele (, ; ; modern formation from Greek ἄλλος ''állos'', "other") is a variation of the same sequence of nucleotides at the same place on a long DNA molecule, as described in leading textbooks on genetics and evolution. ::"The chro ...
does not mean that the carrier will definitely get the disease. The first report of ''Alu''-mediated recombination causing a prevalent inherited predisposition to cancer was a 1995 report about ''hereditary nonpolyposis
colorectal cancer Colorectal cancer (CRC), also known as bowel cancer, colon cancer, or rectal cancer, is the development of cancer from the colon or rectum (parts of the large intestine). Signs and symptoms may include blood in the stool, a change in bowel m ...
''. In the human genome, the most recently active have been the 22 AluY and 6 AluS Transposon Element subfamilies due to their inherited activity to cause various cancers. Thus due to their major heritable damage it is important to understand the causes that affect their transpositional activity. The following human diseases have been linked with ''Alu'' insertions: *
Alport syndrome Alport syndrome is a genetic disorder affecting around 1 in 5,000-10,000 children, characterized by glomerulonephritis, end-stage kidney disease, and hearing loss. Alport syndrome can also affect the eyes, though the changes do not usually affect v ...
*
Breast cancer Breast cancer is cancer that develops from breast tissue. Signs of breast cancer may include a lump in the breast, a change in breast shape, dimpling of the skin, milk rejection, fluid coming from the nipple, a newly inverted nipple, or a re ...
*
chorioretinal degeneration The choroid, also known as the choroidea or choroid coat, is a part of the uvea, the Blood vessel, vascular layer of the eye, and contains connective tissues, and lies between the retina and the sclera. The human choroid is thickest at the far e ...
*
Diabetes mellitus Diabetes, also known as diabetes mellitus, is a group of metabolic disorders characterized by a high blood sugar level ( hyperglycemia) over a prolonged period of time. Symptoms often include frequent urination, increased thirst and increased ap ...
type II *
Ewing's sarcoma Ewing sarcoma is a type of cancer that forms in bone or soft tissue. Symptoms may include swelling and pain at the site of the tumor, fever, and a bone fracture. The most common areas where it begins are the legs, pelvis, and chest wall. In about ...
* Familial hypercholesterolemia * Hemophilia *
Leigh syndrome Leigh syndrome (also called Leigh disease and subacute necrotizing encephalomyelopathy) is an inherited neurometabolic disorder that affects the central nervous system. It is named after Archibald Denis Leigh, a British neuropsychiatrist who fi ...
*
mucopolysaccharidosis Mucopolysaccharidoses are a group of metabolic disorders caused by the absence or malfunctioning of lysosome, lysosomal enzymes needed to break down molecules called glycosaminoglycans (GAGs). These long chains of sugar carbohydrates occur within ...
VII *
Neurofibromatosis Neurofibromatosis (NF) is a group of three conditions in which tumors grow in the nervous system. The three types are neurofibromatosis type I (NF1), neurofibromatosis type II Neurofibromatosis type II (also known as MISME syndrome – multi ...
*
Macular degeneration Macular degeneration, also known as age-related macular degeneration (AMD or ARMD), is a medical condition which may result in blurred or no vision in the center of the visual field. Early on there are often no symptoms. Over time, however, som ...
And the following diseases have been associated with single-nucleotide DNA variations in Alu elements affecting transcription levels: *
Alzheimer's disease Alzheimer's disease (AD) is a neurodegeneration, neurodegenerative disease that usually starts slowly and progressively worsens. It is the cause of 60–70% of cases of dementia. The most common early symptom is difficulty in short-term me ...
*
Lung cancer Lung cancer, also known as lung carcinoma (since about 98–99% of all lung cancers are carcinomas), is a malignant lung tumor characterized by uncontrolled cell growth in tissue (biology), tissues of the lung. Lung carcinomas derive from tran ...
*
Gastric cancer Stomach cancer, also known as gastric cancer, is a cancer that develops from the lining of the stomach. Most cases of stomach cancers are gastric carcinomas, which can be divided into a number of subtypes, including gastric adenocarcinomas. Lymph ...


Associated human mutations

* The ''ACE'' gene, encoding angiotensin-converting enzyme, has 2 common variants, one with an ''Alu'' insertion (''ACE''-I) and one with the ''Alu'' deleted (''ACE''-D). This variation has been linked to changes in sporting ability: the presence of the ''Alu'' element is associated with better performance in endurance-oriented events (e.g. triathlons), whereas its absence is associated with strength- and power-oriented performance. * The
opsin Animal opsins are G-protein-coupled receptors and a group of proteins made light-sensitive via a chromophore, typically retinal. When bound to retinal, opsins become Retinylidene proteins, but are usually still called opsins regardless. Most pro ...
gene duplication Gene duplication (or chromosomal duplication or gene amplification) is a major mechanism through which new genetic material is generated during molecular evolution. It can be defined as any duplication of a region of DNA that contains a gene. ...
which resulted in the re-gaining of
trichromacy Trichromacy or trichromatism is the possessing of three independent channels for conveying color information, derived from the three different types of cone cells in the eye. Organisms with trichromacy are called trichromats. The normal expla ...
in Old World primates (including humans) is flanked by an ''Alu'' element, implicating the role of ''Alu'' in the evolution of three colour vision.


References


External links

* * {{DEFAULTSORT:Alu Sequence Repetitive DNA sequences Human genetics