C1orf27
   HOME

TheInfoList



OR:

Uncharacterized protein Chromosome 1 Open Reading Frame 27 is a
protein Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respo ...
in humans, encoded by the C1orf27
gene In biology, the word gene (from , ; "...Wilhelm Johannsen coined the word gene to describe the Mendelian units of heredity..." meaning ''generation'' or ''birth'' or ''gender'') can have several different meanings. The Mendelian gene is a ba ...
. It is accession number NM_017847. This is a
membrane protein Membrane proteins are common proteins that are part of, or interact with, biological membranes. Membrane proteins fall into several broad categories depending on their location. Integral membrane proteins are a permanent part of a cell membrane ...
that is 3926
base pair A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA ...
s long with the most extensive string of
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha am ...
s being 454aa long. C1orf27 exhibits
cytoplasm In cell biology, the cytoplasm is all of the material within a eukaryotic cell, enclosed by the cell membrane, except for the cell nucleus. The material inside the nucleus and contained within the nuclear membrane is termed the nucleoplasm. The ...
ic expression in
epidermal The epidermis is the outermost of the three layers that comprise the skin, the inner layers being the dermis and hypodermis. The epidermis layer provides a barrier to infection from environmental pathogens and regulates the amount of water relea ...
tissues. Predicted associated biological processes of the gene include cell fate specification and developmental properties.


Gene


Locus

This gene is located on
chromosome 1 Chromosome 1 is the designation for the largest human chromosome. Humans have two copies of chromosome 1, as they do with all of the autosomes, which are the non-sex chromosomes. Chromosome 1 spans about 249 million nucleotide base pairs, which ar ...
at 1q31.1. It is encoded on the plus strand of DNA spanning from 186,344,406 to 186,390,514.


mRNA


Alternative splicing

There appear to be four
isoforms A protein isoform, or "protein variant", is a member of a set of highly similar proteins that originate from a single gene or gene family and are the result of genetic differences. While many perform the same or similar biological roles, some isof ...
due to splicing. Two of those are truncated on the 3' end of the protein from 266aa and 396aa. Additional location of alternative splice sites are from 79aa to 102aa and 246aa to 260aa.


Protein


General properties

The primary encoded protein of C1orf27 consists of 454
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha am ...
residues and is 3926
base pair A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA ...
s long. It consists of 14 total
exon An exon is any part of a gene that will form a part of the final mature RNA produced by that gene after introns have been removed by RNA splicing. The term ''exon'' refers to both the DNA sequence within a gene and to the corresponding sequen ...
s. The predicted
molecular weight A molecule is a group of two or more atoms held together by attractive forces known as chemical bonds; depending on context, the term may or may not include ions which satisfy this criterion. In quantum physics, organic chemistry, and bioch ...
of the primary, unmodified protein is approximately 51.1 kdal.


Aliases

As with many other genes, there are some common
aliases A pseudonym (; ) or alias () is a fictitious name that a person or group assumes for a particular purpose, which differs from their original or true name ( orthonym). This also differs from a new name that entirely or legally replaces an individu ...
found with this gene. Those aliases are Lymphocyte-Activation Gene-1 (LAG1) Interacting Protein, Transparent Testa Glabra 1 (TTG1), and Odorant Response Abnormal 4 (ODR4). The most common alias for C1orf27 is ODR4, and this is what most readily appears when searching the gene.


Composition

Computational analysis revealed the most abundant amino acid to be
leucine Leucine (symbol Leu or L) is an essential amino acid that is used in the biosynthesis of proteins. Leucine is an α-amino acid, meaning it contains an α-amino group (which is in the protonated −NH3+ form under biological conditions), an α- ca ...
at 10.1% of the total protein. The second most abundant was
serine Serine (symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − form un ...
which contributes to 8.6% of the total protein.
Glutamic acid Glutamic acid (symbol Glu or E; the ionic form is known as glutamate) is an α-amino acid that is used by almost all living beings in the biosynthesis of proteins. It is a non-essential nutrient for humans, meaning that the human body can synt ...
was third most abundant and contributes to 7.7% of the protein. This analysis also revealed that the protein appears to be deficient in
tryptophan Tryptophan (symbol Trp or W) is an α-amino acid that is used in the biosynthesis of proteins. Tryptophan contains an α-amino group, an α- carboxylic acid group, and a side chain indole, making it a polar molecule with a non-polar aromatic ...
as it only contributes to 1.1% of the protein. Based on the distribution of other amino acid types, there were five high scoring hydrophobic segments. There were also two
transmembrane domain A transmembrane domain (TMD) is a membrane-spanning protein domain. TMDs generally adopt an alpha helix topological conformation, although some TMDs such as those in porins can adopt a different conformation. Because the interior of the lipid bil ...
s located at 82-98aa and 432-449aa.


Post-translational modifications

C1orf27 is predicted to undergo multiple post translational modifications such as
glycosylation Glycosylation is the reaction in which a carbohydrate (or ' glycan'), i.e. a glycosyl donor, is attached to a hydroxyl or other functional group of another molecule (a glycosyl acceptor) in order to form a glycoconjugate. In biology (but not al ...
,
myristoylation Myristoylation is a lipidation modification where a myristoyl group, derived from myristic acid, is covalently attached by an amide bond to the alpha-amino group of an N-terminus, N-terminal glycine residue. Myristic acid is a 14-carbon saturat ...
, and
phosphorylation In chemistry, phosphorylation is the attachment of a phosphate group to a molecule or an ion. This process and its inverse, dephosphorylation, are common in biology and could be driven by natural selection. Text was copied from this source, wh ...
.


Interactions

There were eight interactions identified by Mentha. The first one was UFSP2 which hydrolyzes the
peptide bond In organic chemistry, a peptide bond is an amide type of covalent chemical bond linking two consecutive alpha-amino acids from C1 (carbon number one) of one alpha-amino acid and N2 (nitrogen number two) of another, along a peptide or protein cha ...
at the C-term gly of UFM1, a
ubiquitin Ubiquitin is a small (8.6 kDa) regulatory protein found in most tissues of eukaryotic organisms, i.e., it is found ''ubiquitously''. It was discovered in 1975 by Gideon Goldstein and further characterized throughout the late 1970s and 1980s. Fo ...
-like modifier protein bound to a number of target proteins. The second one was HSCB which acts as a co-chaperone in iron-sulfur cluster assembly in mitochondria. The third was GRB2 which is an adapter protein that provides a critical link between cell surface
growth factor receptor A growth factor receptor is a receptor that binds to a growth factor. Growth factor receptors are the first stop in cells where the signaling cascade for cell differentiation and proliferation begins. Growth factors, which are ligands that bind to ...
s and the Ras signaling pathway. The fourth was CYLD which is a
protease A protease (also called a peptidase, proteinase, or proteolytic enzyme) is an enzyme that catalyzes (increases reaction rate or "speeds up") proteolysis, breaking down proteins into smaller polypeptides or single amino acids, and spurring the ...
that cleaves Lys-63-linked polyubiquitin chains, controls regulation of cell survival, proliferation, and differentiation, and is required for normal cell cycle progress. The fifth was ATM which activates checkpoint signaling upon
double strand breaks DNA repair is a collection of processes by which a cell identifies and corrects damage to the DNA molecules that encode its genome. In human cells, both normal metabolic activities and environmental factors such as radiation can cause DNA da ...
, apaptosis, and genotoxic stress. The sixth was
FAM177A1 Family with sequence similarity 177 member A1 (FAM177A1) is a protein that in humans is encoded by the ''FAM177A1'' gene, previously known as C14orf24. The other member of this family is FAM177B. Function FAM177A1 has been linked to immune s ...
, the function of which is unknown. The last two were THID2 and Q81kP6 which are both in bacillus anthracis.


Subcellular localization

The c1orf27 protein is likely cytoplasmic. This was found with 55.5 reliability. The K-NN prediction was k=9/23 and the protein was found to be 55.6%
cytoplasm In cell biology, the cytoplasm is all of the material within a eukaryotic cell, enclosed by the cell membrane, except for the cell nucleus. The material inside the nucleus and contained within the nuclear membrane is termed the nucleoplasm. The ...
ic, 11.1%
mitochondrial A mitochondrion (; ) is an organelle found in the cells of most Eukaryotes, such as animals, plants and fungi. Mitochondria have a double membrane structure and use aerobic respiration to generate adenosine triphosphate (ATP), which is use ...
, 11.1%
vacuolar A vacuole () is a membrane-bound organelle which is present in Plant cell, plant and Fungus, fungal Cell (biology), cells and some protist, animal, and bacterial cells. Vacuoles are essentially enclosed compartments which are filled with water ...
, 11.1%
cytoskeletal The cytoskeleton is a complex, dynamic network of interlinking protein filaments present in the cytoplasm of all cells, including those of bacteria and archaea. In eukaryotes, it extends from the cell nucleus to the cell membrane and is compo ...
, and 11.1% golgi.


Structure

Alpha helices The alpha helix (α-helix) is a common motif in the secondary structure of proteins and is a right hand-helix conformation in which every backbone N−H group hydrogen bonds to the backbone C=O group of the amino acid located four residues ear ...
predicted in the c1orf27 protein are colored blue in the above picture.
Beta sheet The beta sheet, (β-sheet) (also β-pleated sheet) is a common motif of the regular protein secondary structure. Beta sheets consist of beta strands (β-strands) connected laterally by at least two or three backbone hydrogen bonds, forming a g ...
s are pictured by the red arrows.
Random coil In polymer chemistry, a random coil is a conformation of polymers where the monomer subunits are oriented randomly while still being bonded to adjacent units. It is not one specific shape, but a statistical distribution of shapes for all the cha ...
s are the purple strands between structures.


Expression

Overall, expression of c1orf27 seems to be ubiquitous.{{Cite web, url=https://www.ncbi.nlm.nih.gov/UniGene/ESTProfileViewer.cgi?uglist=Hs.371210, title=EST Profile - Hs.371210, last=Group, first=Schuler, website=www.ncbi.nlm.nih.gov, access-date=2018-05-06 Highest expression body sites (>50 TPM) were
bladder The urinary bladder, or simply bladder, is a hollow organ in humans and other vertebrates that stores urine from the kidneys before disposal by urination. In humans the bladder is a distensible organ that sits on the pelvic floor. Urine enters ...
,
bone marrow Bone marrow is a semi-solid tissue found within the spongy (also known as cancellous) portions of bones. In birds and mammals, bone marrow is the primary site of new blood cell production (or haematopoiesis). It is composed of hematopoietic ce ...
,
kidney The kidneys are two reddish-brown bean-shaped organs found in vertebrates. They are located on the left and right in the retroperitoneal space, and in adult humans are about in length. They receive blood from the paired renal arteries; blood ...
,
liver The liver is a major Organ (anatomy), organ only found in vertebrates which performs many essential biological functions such as detoxification of the organism, and the Protein biosynthesis, synthesis of proteins and biochemicals necessary for ...
,
pancreas The pancreas is an organ of the digestive system and endocrine system of vertebrates. In humans, it is located in the abdomen behind the stomach and functions as a gland. The pancreas is a mixed or heterocrine gland, i.e. it has both an end ...
,
parathyroid Parathyroid glands are small endocrine glands in the neck of humans and other tetrapods. Humans usually have four parathyroid glands, located on the back of the thyroid gland in variable locations. The parathyroid gland produces and secretes par ...
, and
vascular The blood vessels are the components of the circulatory system that transport blood throughout the human body. These vessels transport blood cells, nutrients, and oxygen to the tissues of the body. They also take waste and carbon dioxide away f ...
. Highest expression health sites (>50 TPM) were
adrenal tumor An adrenal tumor or adrenal mass is any benign or malignant neoplasms of the adrenal gland, several of which are notable for their tendency to overproduce endocrine hormones. Adrenal cancer is the presence of malignant adrenal tumors, and includes ...
s,
cervical In anatomy, cervical is an adjective that has two meanings: # of or pertaining to any neck. # of or pertaining to the female cervix: i.e., the ''neck'' of the uterus. *Commonly used medical phrases involving the neck are **cervical collar **cerv ...
tumors, and liver tumors. While both of these observations had relatively high TPM scores, there was still relatively low occurrence. This validates the assumption that expression is ubiquitous. There was moderate expression (>25 TPM) in the human
fetus A fetus or foetus (; plural fetuses, feti, foetuses, or foeti) is the unborn offspring that develops from an animal embryo. Following embryonic development the fetal stage of development takes place. In human prenatal development, fetal deve ...
, and expression increased with age. Expression was completely absent in the ears, esophagus,
lymph Lymph (from Latin, , meaning "water") is the fluid that flows through the lymphatic system, a system composed of lymph vessels (channels) and intervening lymph nodes whose function, like the venous system, is to return fluid from the tissues to ...
, nerve,
salivary gland The salivary glands in mammals are exocrine glands that produce saliva through a system of ducts. Humans have three paired major salivary glands (parotid, submandibular, and sublingual), as well as hundreds of minor salivary glands. Salivary gla ...
s,
thyroid The thyroid, or thyroid gland, is an endocrine gland in vertebrates. In humans it is in the neck and consists of two connected lobes. The lower two thirds of the lobes are connected by a thin band of tissue called the thyroid isthmus. The thy ...
,
tonsil The tonsils are a set of lymphoid organs facing into the aerodigestive tract, which is known as Waldeyer's tonsillar ring and consists of the adenoid tonsil, two tubal tonsils, two palatine tonsils, and the lingual tonsils. These organs play an ...
s, and
umbilical cord In placental mammals, the umbilical cord (also called the navel string, birth cord or ''funiculus umbilicalis'') is a conduit between the developing embryo or fetus and the placenta. During prenatal development, the umbilical cord is physiologic ...
. There was no expression in
bladder carcinoma Bladder cancer is any of several types of cancer arising from the tissues of the urinary bladder. Symptoms include blood in the urine, pain with urination, and low back pain. It is caused when epithelial cells that line the bladder become mali ...
despite expression being elevated in the bladder itself. There was high expression in
endothelial cells The endothelium is a single layer of squamous endothelial cells that line the interior surface of blood vessels and lymphatic vessels. The endothelium forms an interface between circulating blood or lymph in the lumen and the rest of the vessel ...
and neuronal cells but was undetectable in
glial cells Glia, also called glial cells (gliocytes) or neuroglia, are non-neuronal cells in the central nervous system (brain and spinal cord) and the peripheral nervous system that do not produce electrical impulses. They maintain homeostasis, form mye ...
and
neuropil Neuropil (or "neuropile") is any area in the nervous system composed of mostly unmyelinated axons, dendrites and glial cell processes that forms a synaptically dense region containing a relatively low number of cell bodies. The most prevalent anat ...
cells. Expression was also localized to the nucleoplasm and plasma membrane in humans but is localized to the
cytosol The cytosol, also known as cytoplasmic matrix or groundplasm, is one of the liquids found inside cells (intracellular fluid (ICF)). It is separated into compartments by membranes. For example, the mitochondrial matrix separates the mitochondri ...
in mice.


Homology


Paralogs

There were no
paralogs Sequence homology is the biological homology between DNA, RNA, or protein sequences, defined in terms of shared ancestry in the evolutionary history of life. Two segments of DNA can have shared ancestry because of three phenomena: either a spec ...
of C1orf27 identified in the human genome.


Orthologs

There were
orthologs Sequence homology is the biological homology between DNA, RNA, or protein sequences, defined in terms of shared ancestry in the evolutionary history of life. Two segments of DNA can have shared ancestry because of three phenomena: either a spec ...
identified in most animals for which there were complete genome data. The most distant, yet still relevant, orthologs identified were invertebrates from phylum
Cnidaria Cnidaria () is a phylum under kingdom Animalia containing over 11,000 species of aquatic animals found both in freshwater and marine environments, predominantly the latter. Their distinguishing feature is cnidocytes, specialized cells that th ...
.


Molecular Evolution

The ''m'' value, or number of corrected amino acid changes per 100 residues, for the C1orf27 gene was graphed against the species divergence in millions of years. When compared to divergence graphs of
fibrinogen Fibrinogen (factor I) is a glycoprotein complex, produced in the liver, that circulates in the blood of all vertebrates. During tissue and vascular injury, it is converted enzymatically by thrombin to fibrin and then to a fibrin-based blood clo ...
and
cytochrome C The cytochrome complex, or cyt ''c'', is a small hemeprotein found loosely associated with the inner membrane of the mitochondrion. It belongs to the cytochrome c family of proteins and plays a major role in cell apoptosis. Cytochrome c is hig ...
, it was determined that this gene closely resembles the evolutionary pattern observed in fibrinogen, suggesting a more rapid rate of
evolution Evolution is change in the heritable characteristics of biological populations over successive generations. These characteristics are the expressions of genes, which are passed on from parent to offspring during reproduction. Variation ...
. ''M'' values for C1orf27 were calculated using the percentage of identity, when compared to humans, observed in the mRNA sequences of the orthologs using the formula derived from the
Molecular Clock Hypothesis The molecular clock is a figurative term for a technique that uses the mutation rate of biomolecules to deduce the time in prehistory when two or more life forms diverged. The biomolecular data used for such calculations are usually nucleotid ...
.


References

Genes on human chromosome 1 Membrane proteins