HOME

TheInfoList



OR:

Uncharacterized protein C14orf80 is a
protein Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, res ...
which in humans is encoded by the chromosome 14 open reading frame 80, C14orf80, gene.


Gene


Location

C14orf80 is located on
chromosome 14 Chromosome 14 is one of the 23 pairs of chromosomes in humans. People normally have two copies of this chromosome. Chromosome 14 spans about 107 million base pairs (the building material of DNA) and represents between 3 and 3.5% of the total DNA ...
(14q32.33) starting at 105,489,855bp and ending at 105,499,248bp. C14orf80 is 9,393 base pairs long and contains 11 exons that can be
alternatively spliced Alternative splicing, or alternative RNA splicing, or differential splicing, is an alternative splicing process during gene expression that allows a single gene to code for multiple proteins. In this process, particular exons of a gene may be ...
to form different mRNA variants.https://www.ncbi.nlm.nih.gov/gene/283643>


Variants

Transcription Transcription refers to the process of converting sounds (voice, music etc.) into letters or musical notes, or producing a copy of something in another medium, including: Genetics * Transcription (biology), the copying of DNA into RNA, the fir ...
of C14orf80 can produce 19 mRNA splice variants. Only six of these nineteen variants are predicted to not encode for a protein. Of the mRNA variants that have been found experimentally, the longest is 1,719 base pairs and produces a protein with 426 amino acids.


Expression

C14orf80 has been determined to be expressed in 77 types of tissues and 100
developmental stages Stages of development may refer to: Biology *Developmental biology, the study of the process by which animals and plants grow and develop *Prenatal development, also called fetal development, or embryology *Human development (biology), the proc ...
. It has also been determined to have a higher level of
expression Expression may refer to: Linguistics * Expression (linguistics), a word, phrase, or sentence * Fixed expression, a form of words with a specific meaning * Idiom, a type of fixed expression * Metaphorical expression, a particular word, phrase, o ...
in a few cases of
pancreatic The pancreas is an organ of the digestive system and endocrine system of vertebrates. In humans, it is located in the abdomen behind the stomach and functions as a gland. The pancreas is a mixed or heterocrine gland, i.e. it has both an endocr ...
and
prostate The prostate is both an accessory gland of the male reproductive system and a muscle-driven mechanical switch between urination and ejaculation. It is found only in some mammals. It differs between species anatomically, chemically, and phys ...
cancer Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal b ...
cells compared to normal tissue.


Homology


Paralogs

There are no
paralogs Sequence homology is the biological homology between DNA, RNA, or protein sequences, defined in terms of shared ancestry in the evolutionary history of life. Two segments of DNA can have shared ancestry because of three phenomena: either a sp ...
of C14orf80.


Orthologs

Using the
BLAST Blast or The Blast may refer to: *Explosion, a rapid increase in volume and release of energy in an extreme manner *Detonation, an exothermic front accelerating through a medium that eventually drives a shock front Film * ''Blast'' (1997 film), ...
program from NCBI, the
orthologs Sequence homology is the biological homology between DNA, RNA, or protein sequences, defined in terms of shared ancestry in the evolutionary history of life. Two segments of DNA can have shared ancestry because of three phenomena: either a sp ...
of C14orf80 were found to range from primates to
invertebrates Invertebrates are a paraphyletic group of animals that neither possess nor develop a vertebral column (commonly known as a ''backbone'' or ''spine''), derived from the notochord. This is a grouping including all animals apart from the chordat ...
. Below is a table that contains a variety of these orthologs.


Evolution rate

When compared to the slow-evolving cytochrome C gene and the fast-evolving
fibrinogen Fibrinogen (factor I) is a glycoprotein complex, produced in the liver, that circulates in the blood of all vertebrates. During tissue and vascular injury, it is converted enzymatically by thrombin to fibrin and then to a fibrin-based blood cl ...
gene, gene C14orf80 is also fast-evolving.


Protein


General properties

Uncharacterized protein C14orf80 is 426 amino acids long with a
molecular weight A molecule is a group of two or more atoms held together by attractive forces known as chemical bonds; depending on context, the term may or may not include ions which satisfy this criterion. In quantum physics, organic chemistry, and bioch ...
of 47 kDa. Its
isoelectric point The isoelectric point (pI, pH(I), IEP), is the pH at which a molecule carries no net electrical charge or is electrically neutral in the statistical mean. The standard nomenclature to represent the isoelectric point is pH(I). However, pI is also u ...
is 8.9.


Composition


Secondary structure

Uncharacterized protein C14orf80 is predicted to be entirely composed of
alpha helices The alpha helix (α-helix) is a common motif in the secondary structure of proteins and is a right hand-helix conformation in which every backbone N−H group hydrogen bonds to the backbone C=O group of the amino acid located four residues ear ...
. Using the program SOUSI-signal, it was predicted that uncharacterized protein C14orf80 does not contain a signal peptide and is a
soluble In chemistry, solubility is the ability of a substance, the solute, to form a solution with another substance, the solvent. Insolubility is the opposite property, the inability of the solute to form such a solution. The extent of the solubi ...
protein.


Function


Domains

Uncharacterized protein C14orf80 has two functional
domain Domain may refer to: Mathematics *Domain of a function, the set of input values for which the (total) function is defined **Domain of definition of a partial function **Natural domain of a partial function **Domain of holomorphy of a function * Do ...
s. The first domain is the domain of unknown function 4509 and the second is the domain of unknown function 4510. As their naming states the functions of these domains are still unknown. DUF4509 is located at amino acid 45 to amino acid 228. In this
domain of unknown function A domain of unknown function (DUF) is a protein domain that has no characterised function. These families have been collected together in the Pfam database using the prefix DUF followed by a number, with examples being DUF2992 and DUF1220. As of 201 ...
there is a conserved WLL sequence motif. DUF4510 is located at amino acid 263 to amino acid 425. In this domain of unknown function there are two conserved sequence motifs: LEA and WMD.


Post-translational modification

Uncharacterized protein C14orf80 is predicted to have
glycation Glycation (sometimes called non-enzymatic glycosylation) is the covalent attachment of a sugar to a protein or lipid. Typical sugars that participate in glycation are glucose, fructose, and their derivatives. Glycation is the non-enzymatic proces ...
and phosphorylation sites for
post-translational modification Post-translational modification (PTM) is the covalent and generally enzymatic modification of proteins following protein biosynthesis. This process occurs in the endoplasmic reticulum and the golgi apparatus. Proteins are synthesized by ribos ...
. Of these sites three are for glycation, eight are for serine phosphorylation and one site is for threonine phosphorylation.


Subcellular location

Uncharacterized protein C14orf80 is not predicted to be a
transmembrane A transmembrane protein (TP) is a type of integral membrane protein that spans the entirety of the cell membrane. Many transmembrane proteins function as gateways to permit the transport of specific substances across the membrane. They frequent ...
protein. It is mainly localized to the
golgi apparatus The Golgi apparatus (), also known as the Golgi complex, Golgi body, or simply the Golgi, is an organelle found in most eukaryotic cells. Part of the endomembrane system in the cytoplasm, it packages proteins into membrane-bound vesicles ...
but has been found in the
nucleus Nucleus ( : nuclei) is a Latin word for the seed inside a fruit. It most often refers to: *Atomic nucleus, the very dense central region of an atom * Cell nucleus, a central organelle of a eukaryotic cell, containing most of the cell's DNA Nucl ...
and
cytoplasm In cell biology, the cytoplasm is all of the material within a eukaryotic cell, enclosed by the cell membrane, except for the cell nucleus. The material inside the nucleus and contained within the nuclear membrane is termed the nucleoplasm. ...
also.


Interactions

Currently, there are 21 proteins that are predicted to interact with uncharacterized protein C14orf80. These 21 proteins were found using the databases Mentha, BioGRID, STRING,
GeneCards GeneCards is a database of human genes that provides genomic, proteomic, transcriptomic, genetic and functional information on all known and predicted human genes. It is being developed and maintained by the Crown Human Genome Center at the Wei ...
and IntAct. Below is a table of a variety of these 21 proteins.


Clinical significance

Uncharacterized protein C14orf80 has been associated with
tumors A neoplasm () is a type of abnormal and excessive growth of tissue. The process that occurs to form or produce a neoplasm is called neoplasia. The growth of a neoplasm is uncoordinated with that of the normal surrounding tissue, and persists ...
in the
breast The breast is one of two prominences located on the upper ventral region of a primate's torso. Both females and males develop breasts from the same embryological tissues. In females, it serves as the mammary gland, which produces and sec ...
, CNS, endometrium,
large intestine The large intestine, also known as the large bowel, is the last part of the gastrointestinal tract and of the digestive system in tetrapods. Water is absorbed here and the remaining waste material is stored in the rectum as feces before bein ...
, lung,
skin Skin is the layer of usually soft, flexible outer tissue covering the body of a vertebrate animal, with three main functions: protection, regulation, and sensation. Other animal coverings, such as the arthropod exoskeleton, have different de ...
, and
stomach The stomach is a muscular, hollow organ in the gastrointestinal tract of humans and many other animals, including several invertebrates. The stomach has a dilated structure and functions as a vital organ in the digestive system. The stomach i ...
.


References

{{reflist Human proteins