Transcription factor 21 (TCF21), also known as pod-1, capsuling, or epicardin, is a
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respo ...
that in humans is encoded by the ''TCF21''
gene
In biology, the word gene (from , ; "...Wilhelm Johannsen coined the word gene to describe the Mendelian units of heredity..." meaning ''generation'' or ''birth'' or ''gender'') can have several different meanings. The Mendelian gene is a ba ...
on
chromosome 6
Chromosome 6 is one of the 23 pairs of chromosomes in humans. People normally have two copies of this chromosome. Chromosome 6 spans more than 170 million base pairs (the building material of DNA) and represents between 5.5 and 6% of the total ...
.
It is ubiquitously expressed in many tissues and cell types and highly significantly expressed in
lung
The lungs are the primary organs of the respiratory system in humans and most other animals, including some snails and a small number of fish. In mammals and most other vertebrates, two lungs are located near the backbone on either side of t ...
and
placenta
The placenta is a temporary embryonic and later fetal organ that begins developing from the blastocyst shortly after implantation. It plays critical roles in facilitating nutrient, gas and waste exchange between the physically separate mater ...
. TCF21 is crucial for the development of a number of cell types during
embryogenesis
An embryo is an initial stage of development of a multicellular organism. In organisms that reproduce sexually, embryonic development is the part of the life cycle that begins just after fertilization of the female egg cell by the male sperm ...
of the
heart
The heart is a muscular organ in most animals. This organ pumps blood through the blood vessels of the circulatory system. The pumped blood carries oxygen and nutrients to the body, while carrying metabolic waste such as carbon dioxide t ...
,
lung,
kidney
The kidneys are two reddish-brown bean-shaped organs found in vertebrates. They are located on the left and right in the retroperitoneal space, and in adult humans are about in length. They receive blood from the paired renal arteries; blood ...
,
and
spleen
The spleen is an organ found in almost all vertebrates. Similar in structure to a large lymph node, it acts primarily as a blood filter. The word spleen comes . .
TCF21 is also deregulated in several types of
cancer
Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal b ...
s, and thus known to function as a
tumor suppressor
A tumor suppressor gene (TSG), or anti-oncogene, is a gene that regulates a cell during cell division and replication. If the cell grows uncontrollably, it will result in cancer. When a tumor suppressor gene is mutated, it results in a loss or red ...
.
The ''TCF21'' gene also contains one of 27
SNPs
In genetics, a single-nucleotide polymorphism (SNP ; plural SNPs ) is a germline substitution of a single nucleotide at a specific position in the genome. Although certain definitions require the substitution to be present in a sufficiently larg ...
associated with increased risk of
coronary artery disease
Coronary artery disease (CAD), also called coronary heart disease (CHD), ischemic heart disease (IHD), myocardial ischemia, or simply heart disease, involves the reduction of blood flow to the heart muscle due to build-up of atherosclerotic pla ...
.
Discovery
TCF21 was discovered in 1998 when search for novel cell-type-specific bHLH proteins expressed in human and mouse kidneys by performing a search of the
expressed sequence tag In genetics, an expressed sequence tag (EST) is a short sub-sequence of a cDNA sequence. ESTs may be used to identify gene transcripts, and were instrumental in gene discovery and in gene-sequence determination. The identification of ESTs has proc ...
(EST) databases.
Because the transcript they found was highly expressed in visceral glomerular epithelial cells (
podocyte
Podocytes are cells in Bowman's capsule in the kidneys that wrap around capillaries of the glomerulus. Podocytes make up the epithelial lining of Bowman's capsule, the third layer through which filtration of blood takes place. Bowman's capsule f ...
s), TCF21 was initialled named Pod-1. Comparison of Pod-1 with previously characterized bHLH proteins identified Pod-1 as a novel member of a subfamily of bHLH proteins with important roles in mesodermal development.
The chromosomal location of Pod-1 in the mouse was then determined using an interspecific backcross panel along with genomic southern blot analysis to identify
restriction fragment length polymorphisms (RFLPs) between inbred mouse strains. Analysis showed Pod-1 to map to a region of mouse chromosome 10 that is syntenic with human chromosome 6q23-q24.
The tissue distribution of Pod-1 was determined by hybridization of a human multiple tissue
northern blot
The northern blot, or RNA blot,Gilbert, S. F. (2000) Developmental Biology, 6th Ed. Sunderland MA, Sinauer Associates. is a technique used in molecular biology research to study gene expression by detection of RNA (or isolated mRNA) in a sample.K ...
with a Pod-1 cDNA. A probe lacking the bHLH domain was used to minimize cross-reactivity along with high stringency hybridization and washing. Results showed that in humans and mice, Pod-1 was most highly expressed in the kidney, lung and heart, with selective expression at sites of epithelial-mesenchymal interaction in the kidney, lung, intestine and pancreas of developing mouse embryos.
RNA in situ hybridization using
33P-labeled riboprobes was used to identify the cell types that expressed Pod-1 in the developing kidney and other tissues. This revealed Pod-1 expression in mesenchymal cells and podocytes, with expression coinciding with the onset of podocyte differentiation.
It was found that expression of Pod-1 in embryonic kidney explants was inhibited through antisense oligonucleotides. This inhibition resulted in decreased mesenchymal cell condensation around the ureteric bud and a significant decrease in ureteric branching. Pod-1 was the first tissue-restricted bHLH protein to be identified in the developing kidney and tied to regulation of morphogenetic events.
In an effort to identify novel bHLH factors related to
dHAND and
eHAND (a novel subclass of cell type-restricted bHLH factors shown to play important roles in cardiac morphogenesis), they screened expressed sequence tag (EST) databases for sequences with homology to the bHLH regions of these factors.
The novel bHLH protein they identified in their search was also Pod-1, but they used the name capsulin.
Whole-mount in situ hybridization of Capsulin transcripts were used to define sites of expression, which showed to be specific to mesodermal precursor cells that surround the epithelium of the developing gastrointestinal, genitourinary and respiratory systems during mouse embryogenesis.
Expression patterns of capsulin mRNA in adult mouse tissues by Northern blot detected highest levels in the lungs, with lower levels in kidneys, heart and spleen. Capsulin transcripts were also found to mark the spiral septum of the heart and progenitor cells that give rise to the pericardium and coronary arteries.
Capsulin was translated in a rabbit reticulocyte lysate in the presence and absence of the widely expressed bHLH protein E12 and performed gel mobility shift assays with several E-box sequences as probes to test the protein's DNA binding activity.
Capsulin alone failed to bind any of the sequences tested. However, in the presence of E12 plus capsulin, a DNA complex was generated with the probe that migrated faster than the E12 homodimeric complex alone, representing the binding of capsulin/E12 heterodimers.
It was concluded that capsulin heterodimerizes with E12 and binds the specific E-box consensus sequence (CANNTG), though not activating transcription through this sequence on its own. Its restricted expression pattern and DNA binding activity identified Capsulin as a regulator of gene expression in specific subtypes of visceral mesodermal cells involved in organogenesis and in precursor cells that contribute to the pericardium, coronary arteries and regions of the heart.
Structure
Gene
The ''TCF21'' gene resides on chromosome 6 at the band 6q23.2 and includes 3
exon
An exon is any part of a gene that will form a part of the final mature RNA produced by that gene after introns have been removed by RNA splicing. The term ''exon'' refers to both the DNA sequence within a gene and to the corresponding sequen ...
s.
These three exons are associated with
CpG islands
The CpG sites or CG sites are regions of DNA where a cytosine nucleotide is followed by a guanine nucleotide in the linear sequence of bases along its 5' → 3' direction. CpG sites occur with high frequency in genomic regions called CpG isl ...
CGI1, CGI2, and CGI3.
DNA methylation
DNA methylation is a biological process by which methyl groups are added to the DNA molecule. Methylation can change the activity of a DNA segment without changing the sequence. When located in a gene promoter, DNA methylation typically acts t ...
analysis revealed
hypermethylation
In the chemical sciences, methylation denotes the addition of a methyl group on a substrate, or the substitution of an atom (or group) by a methyl group. Methylation is a form of alkylation, with a methyl group replacing a hydrogen atom. These t ...
at CGI1 and CGI3, but not CGI2 in samples from various cancer tissues.
Luciferase
Luciferase is a generic term for the class of oxidative enzymes that produce bioluminescence, and is usually distinguished from a photoprotein. The name was first used by Raphaël Dubois who invented the words ''luciferin'' and ''luciferase'', ...
reporter
A journalist is an individual that collects/gathers information in form of text, audio, or pictures, processes them into a news-worthy form, and disseminates it to the public. The act or process mainly done by the journalist is called journalism ...
assay
An assay is an investigative (analytic) procedure in laboratory medicine, mining, pharmacology, environmental biology and molecular biology for qualitatively assessing or quantitatively measuring the presence, amount, or functional activity of a ...
s with constructs covering CGI3 sequences in
sense
A sense is a biological system used by an organism for sensation, the process of gathering information about the world through the detection of Stimulus (physiology), stimuli. (For example, in the human body, the brain which is part of the cen ...
and
antisense
In molecular biology and genetics, the sense of a nucleic acid molecule, particularly of a strand of DNA or RNA, refers to the nature of the roles of the strand and its complement in specifying a sequence of amino acids. Depending on the context, ...
orientation demonstrated that CGI3 harbors a
promoter that directs the synthesis of a previously unknown
long non-coding RNA
Long non-coding RNAs (long ncRNAs, lncRNA) are a type of RNA, generally defined as transcripts more than 200 nucleotides that are not translated into protein. This arbitrary limit distinguishes long ncRNAs from small non-coding RNAs, such as mic ...
s (lncRNAs) in antisense orientation to TCF21. This lncRNA has been named TARID (for TCF21 antisense RNA inducing demethylation).
Protein
TCF21 is a member of the
bHLH (basic helix-loop-helix) family of
transcription factor
In molecular biology, a transcription factor (TF) (or sequence-specific DNA-binding factor) is a protein that controls the rate of transcription of genetic information from DNA to messenger RNA, by binding to a specific DNA sequence. The fu ...
s. This protein is predicted to span 179
amino acid residues
Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers specifically polypeptides formed from sequences of amino acids, the monomers of the polymer. A single amino acid monomer may ...
and contains a bHLH domain and an
arginine
Arginine is the amino acid with the formula (H2N)(HN)CN(H)(CH2)3CH(NH2)CO2H. The molecule features a guanidino group appended to a standard amino acid framework. At physiological pH, the carboxylic acid is deprotonated (−CO2−) and both the am ...
-rich sequence that may facilitate DNA binding.
Function
TCF21 encodes a
transcription factor
In molecular biology, a transcription factor (TF) (or sequence-specific DNA-binding factor) is a protein that controls the rate of transcription of genetic information from DNA to messenger RNA, by binding to a specific DNA sequence. The fu ...
of the
basic helix-loop-helix
BASIC (Beginners' All-purpose Symbolic Instruction Code) is a family of general-purpose, high-level programming languages designed for ease of use. The original version was created by John G. Kemeny and Thomas E. Kurtz at Dartmouth College ...
(bHLH) family, which manages cell-fate specification, commitment and differentiation in various cell lineages during development.
The TCF21 product is
mesoderm
The mesoderm is the middle layer of the three germ layers that develops during gastrulation in the very early development of the embryo of most animals. The outer layer is the ectoderm, and the inner layer is the endoderm.Langman's Medical E ...
-specific and expressed in embryonic
epicardium
The pericardium, also called pericardial sac, is a double-walled sac containing the heart and the roots of the great vessels. It has two layers, an outer layer made of strong connective tissue (fibrous pericardium), and an inner layer made of ...
,
mesenchyme
Mesenchyme () is a type of loosely organized animal embryonic connective tissue of undifferentiated cells that give rise to most tissues, such as skin, blood or bone. The interactions between mesenchyme and epithelium help to form nearly every o ...
-derived tissues of lung, gut, gonad, and both mesenchymal and glomerular epithelial cells in the kidney.
TCF21 is essential for regulating properties of the mesenchyme that are critically important for several aspects of lung and kidney morphogenesis.
TCF21 is also essential for
cardiac fibroblast cell fate, as demonstrated by the failed development of cardiac fibroblasts in mice lacking TCF21.
In the absence of TCF21, these fibroblast progenitor cells do not undergo
epithelial-to-mesenchymal transition (EMT). While TCF21-expressing epicardial cells are initially multipotent, they become committed to the fibroblast lineage over time. Those TCF21-expressing cells that do not commit to the fibroblast lineage lose this expression and remain undifferentiated epicardial cells or coronary vascular smooth muscle cells.
TCF21 is expressed in mesodermal cells in the proepicardial organ that give rise to
coronary artery
The coronary arteries are the arterial blood vessels of coronary circulation, which transport oxygenated blood to the heart muscle. The heart requires a continuous supply of oxygen to function and survive, much like any other tissue or organ of ...
smooth muscle cells
Smooth muscle is an involuntary non-striated muscle, so-called because it has no sarcomeres and therefore no striations (''bands'' or ''stripes''). It is divided into two subgroups, single-unit and multiunit smooth muscle. Within single-unit mus ...
(SMC) and loss of TCF21 results in increased expression of smooth muscle markers by cells on the heart surface consistent with premature SMC differentiation. This suggests that early expression of TCF21 is important for expansion of the SMC compartment of the coronary circulation, with persistent TCF21 expression being required for cardiac fibroblast development.
TCF21 activation is directed by an
antisense long non-coding RNA, TARID (TCF21 antisense RNA inducing demethylation). TARID activates TCF21 expression by inducing promoter demethylation and affects expression levels of target genes by functioning as epigenetic regulators of chromatin structure through interactions with
histone modifiers,
chromatin remodeling
Chromatin remodeling is the dynamic modification of chromatin architecture to allow access of condensed genomic DNA to the regulatory transcription machinery proteins, and thereby control gene expression. Such remodeling is principally carried out ...
complexes, transcriptional regulators, and/or
DNA methylation
DNA methylation is a biological process by which methyl groups are added to the DNA molecule. Methylation can change the activity of a DNA segment without changing the sequence. When located in a gene promoter, DNA methylation typically acts t ...
machinery.
Role in development
Since the identification of Tcf21 significance in various cell lineages, further research has expanded understanding of the essential roles of this gene.
TCF21 is essential for regulating properties of the mesenchyme that are critically important for several aspects of lung and kidney morphogenesis. Null TCF21 mutant mice are born but die shortly after due to severely hypoplastic lungs and kidneys that lack alveoli and mature glomeruli.
While TCF21 is exclusively expressed in the mesenchyme and podocytes, major defects are observed in adjacent epithelia of TCF21 mutant mice. In the kidney, TCF21 is required for conversion of condensing mesenchyme to epithelium of the nephron, branching morphogenesis and terminal differentiation of tubular epithelium. In the lung, TCF21 is required to correctly pattern the proximodistal axis of airway epithelium and for normal branching to occur.
TCF21 null mice also fail to form a spleen, as TCF21 acts after splenic specification to control morphogenetic expansion of the splenic anlage and in its absence, splenic precursor cells undergo apoptosis.
Since this splenic phenotype resembles that of mice lacking the homeobox genes Hox11 and Bapx1, it is possible that TCF21, together with Hox11, and Bapx1 control a common essential early step in the developmental pathway for spleen organogenesis.
TCF21 is essential for cardiac fibroblast cell fate, as demonstrated by the failed development of cardiac fibroblasts in mice lacking TCF21.
In the absence of TCF21, these fibroblast progenitor cells do not undergo epithelial-to-mesenchymal transition (EMT). While TCF21-expressing epicardial cells are initially multipotent, they become committed to the fibroblast lineage over time. Those TCF21-expressing cells that do not commit to the fibroblast lineage lose this expression and remain undifferentiated epicardial cells or coronary vascular smooth muscle cells.
TCF21 is expressed in mesodermal cells in the proepicardial organ that give rise to coronary artery smooth muscle cells (SMC) and loss of TCF21 results in increased expression of smooth muscle markers by cells on the heart surface consistent with premature SMC differentiation. This suggests that early expression of TCF21 is important for expansion of the SMC compartment of the coronary circulation, with persistent TCF21 expression being required for cardiac fibroblast development.
Male TCF21 knockout mice, which die at birth due to respiratory failure, are reported to have feminized genitalia, implicating TCF21 in mouse gonadal development/differentiation.
TCF21 transcriptionally represses steroidogenic factor 1 (Sf1), a gene expression regulator that mediates sexual differentiation and is involved in coordinating cell fate decisions in gonadal progenitors.
Without TCF21, normal gonad development is disrupted as a result of ectopic expression of Sf1, which leads to abnormal committing of urogenital progenitor cells to steroidogenic cell fates. In the XY gonad, this disruption in organization contributes to changes in testicular structure and vasculature.
Clinical significance
As a cancer suppressor
In humans, TCF21 has been identified as a candidate tumor suppressor gene and is frequently epigenetically silenced in various human cancers.
Restriction landmark genomic scanning (RLGS) along a region of recurrent loss of heterozygosity (LOH) at chromosome 6q23-q24 to profile DNA methylation was used to test the hypothesis that abnormal promoter methylation could help pinpoint the location of a candidate tumor suppressor in regions of LOH. 6q23-q24 was the chosen chromosomal region due to frequently described LOH in human head and neck squamous cell carcinomas (HNSCC) and non-small-cell lung cancers (NSCLC) as well as in other tumor types, but with no identified tumor suppressor.
Hypermethylation was found to occur frequently in the same RLGS loci in HNSCC and NSCLC. Sodium bisulfite sequencing further identified tumor-specific methylation of TCF21 when compared to normal controls. RNA samples were isolated from tumor tissues and analyzed to correlate the amount of TCF21 mRNA and DNA methylation in the samples. Overall, tumor samples with higher levels of CpG island hypermethylation had decreased TCF21 expression than normal controls.
Exogenous expression of TCF21 in cells with silenced endogenous TCF21 loci resulted in a reductions in tumor properties both in vitro and in vivo. Based on these results, it was concluded that TCF21 is a tumor suppressor gene, often silenced by hypermethylation in cancer.
TCF21 has also been linked to metastatic melanoma progression through the inhibition of the KISS1 metastasis-suppressor gene.
DNA methylation analysis in melanoma patient biopsies has demonstrated downregulation of TCF21 due to promoter hypermethylation, which also correlates with decreased survival in patients suffering from metastatic skin melanoma.
TCF21, together with E12 and TCF12, bind the KISS1 promoter, sustaining its activity. Without TCF21 to interact with the KISS1 promoter, KISS1 expression is lost. Melanoma cells overexpressing TCF21 have also been found to display reduced motility compared with vector-only control cells.
Tobacco-induced lung cancer research has found TCF21 to be among the genes identified as highly methylated at both high and low concentrations of cigarette smoke condensate (CSC). In the presence genistein, one of the soy-derived bioactive isoflavones, methylation of TCF21 is significantly reduced.
Genistein has been known to affect tumorigenesis through epigenetic regulation, such as chromatin configuration and DNA methylation, activating other tumor suppressor genes that affect cancer cell survival. These findings support the hypothesis that increasing hypermethylated tumor suppressor genes such as TCF21 is a potential chemopreventative pathway in tobacco-induced lung cancer.
TCF21 may have also therapeutic potential for breast cancer treatment, as downregulation of TCF21 has been implicated in breast cancer tumorigenesis and proliferation.
TCF21 mRNA expression is very low in breast cancer cells compared with normal breast epithelial cells. This low expression is also associated with large tumor size and lymph node metastasis. Breast cancer tissues exhibit significantly downregulated expression of TCF21 mRNA and TCF21 mRNA overexpression has been found to inhibit cancerous cell proliferation.
Clinical marker
TCF21 methylation has been considered as a potential clinical marker in the diagnosis of renal cell carcinoma. Accordingly, TCF21 methylation levels in urine samples may be a useful means of diagnosing renal cell carcinoma. Also TCF21 rs12190287 polymorphism can regulate TCF21 expression and may serve as a potential marker for genetic susceptibility to breast cancer.
Additionally, a multi-locus genetic risk score study based on a combination of 27 loci, including the TCF21 gene, identified individuals at increased risk for both incident and recurrent coronary artery disease events, as well as an enhanced clinical benefit from statin therapy. The study was based on a community cohort study (the Malmo Diet and Cancer study) and four additional randomized controlled trials of primary prevention cohorts (JUPITER and ASCOT) and secondary prevention cohorts (CARE and PROVE IT-TIMI 22).
References
Further reading
*
*
*
*
*
*
*
*
{{Use dmy dates, date=April 2017
Tumor suppressor genes