ARMH3, or Armadillo Like Helical Domain Containing 3, also known as UPF0668 and ''c10orf76'', is a protein that in humans is encoded by the ''ARMH3'' gene. Its function is not currently known, but experimental evidence suggests that it may be involved in transcriptional regulation. The protein contains a conserved proline-rich motif, suggesting that it may participate in protein-protein interactions via an SH3-binding domain, although no such interactions have been experimentally verified. The well-conserved gene appears to have emerged in Fungi approximately 1.2 billion years ago. The locus is alternatively spliced and predicted to yield five protein variants, three of which contain a protein domain of unknown function, DUF1741.


Function

The protein has been found to contain a potential SH3-binding domain, a type of domain known to participate in protein-protein binding interactions; however, no protein interactions have been experimentally verified for c10orf76. A 2007 gene expression study found c10orf76 expression to vary inversely with the expression of several other genes, including NFYB, CCR5, and NSBP1, suggesting that the protein may function as a transcriptional regulator.


Homology

ARMH3 is well conserved throughout Eumetazoans. Some weakly similar orthologs (approximately 35% sequence identity) were identified in Parazoa (e.g., ''A. queenslandica'') and in Fungi, specifically Ascomycetes (e.g., ''A. oryzae''). The following table illustrates the sequence similarity between human c10orf76 protein and various orthologs. Similar sequences were identified with the BLAST and BLAT tools.


Gene


Characteristics

In humans, the ARMH3 gene, also known by the alias FLJ13114, spans 210,577 base pairs on the reverse strand of the long arm of chromosome 10. Its 26 alternatively spliced exons encode 5 potential transcript variants, the largest of which is 4101 base pairs in length. The human ARMH3 locus is flanked on the left and right sides by HPS6 and KCNIP2, respectively. HPS6 is a protein that may play a role in organelle biogenesis, and KCNIP2 is a voltage-gated potassium channel interacting protein. The same pattern is observed in the orthologous locus in mice, as well as in most other vertebrates.


Expression

The NCBI (GenBank) gene profile for c10orf76 labels the start of the first transcribed exon as the beginning of the gene. The primary promoter predicted by the El Dorado tool from Genomatix begins 519 base pairs upstream of this transcription start site. This promoter is predicted to be 658 base pairs in length and thus includes the first transcribed exon at its 3′ end. The c10orf76 locus is thought to be alternatively spliced into at least five unique isoforms, although it is unclear how this splicing is regulated. A second potential promoter, also predicted by El Dorado, likely drives expression of one of the shorter documented variants (positioned before exon 23).
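The promoter geometry described above can be checked with simple arithmetic. The sketch below assumes a coordinate system in which the transcription start site (TSS) is position 0 and upstream positions are negative. The function name `promoter_span` is hypothetical and is not part of El Dorado or any Genomatix tool.

```python
def promoter_span(upstream_start: int, length: int) -> tuple[int, int]:
    """Return (start, end) of a promoter relative to the TSS at position 0.

    Assumed convention: `start` is negative for upstream bases, and a
    positive `end` is the number of bases the promoter extends past the TSS.
    """
    start = -upstream_start
    end = start + length
    return start, end


# Values quoted in the text: begins 519 bp upstream, 658 bp long.
start, end = promoter_span(upstream_start=519, length=658)
overlap = max(0, end)  # bases of the promoter lying downstream of the TSS
print(start, end, overlap)
```

Under this convention the predicted promoter would extend about 139 base pairs past the transcription start site, which agrees with the statement that its 3′ end overlaps the first transcribed exon.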


Protein


Characteristics

The largest protein variant is 689 amino acids in length. It has a molecular mass of approximately 78.7 kDa and is isoelectric at pH 6.13. It may be secreted via a non-classical pathway. NCBI identifies a protein domain of unknown function between amino acids Asp435 and Leu671, known as DUF1741 (Domain of Unknown Function 1741). This domain is not known to exist in any other proteins.
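As a rough plausibility check on the figures above, the sketch below computes the average mass per residue and the length of the DUF1741 region. The variable names are illustrative, and the domain boundaries are assumed to be inclusive.

```python
# Values quoted in the text for the largest protein variant.
length_aa = 689                 # residues
mass_kda = 78.7                 # approximate molecular mass in kDa
duf_start, duf_end = 435, 671   # Asp435 to Leu671, assumed inclusive

# Average mass per residue in daltons.
avg_residue_da = mass_kda * 1000 / length_aa

# Length of the domain and the fraction of the protein it covers.
duf_length = duf_end - duf_start + 1
duf_fraction = duf_length / length_aa

print(f"{avg_residue_da:.1f} Da per residue")
print(f"DUF1741: {duf_length} residues ({duf_fraction:.0%} of the protein)")
```

An average of about 114 Da per residue is close to the commonly used rule of thumb of roughly 110 Da per residue, so the stated mass and length are mutually consistent. On the inclusive-boundary assumption, DUF1741 covers 237 residues, or about a third of the largest variant.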


Expression

A potential stem loop region at the 3′ end of the first exon (and thus, the end of the promoter) was predicted by the Dotlet program from ExPASy. This could serve to regulate protein translation. Also, an Alu segment in the 3′ untranslated region of the mature mRNA could serve as a potential translational regulatory mechanism. The protein has been found to be differentially expressed in some medical conditions and in response to certain cellular signals. For example, decreased c10orf76 expression is observed in patients with chronic B-cell lymphocytic leukemia. Decreased expression is also observed in cells treated with vascular endothelial growth factor. The protein is thought to be localized to the cytoplasm, although this is uncertain. It has also been predicted to be a 3-pass transmembrane protein. In addition, a mitochondrial sorting signal was identified at the beginning of one of the protein isoforms using MitoProt II (located at Met416 of the largest protein variant).


Structure

The structure of the c10orf76 protein has not been experimentally explored. The secondary structure is predicted to be completely helical in nature, with intervening regions of protein disorder. The potential SH3-binding domain is located on a predicted region of disorder, further supporting a protein-protein binding function for c10orf76. A helical region between amino acids 610-655 was predicted to be a coiled coil motif. A PHYRE2 protein structure prediction suggested that the first 200 residues of c10orf76 may share strong structural similarities with Symplekin, a nuclear-localized protein that is thought to be a scaffold component of the polyadenylation complex.


Predicted protein interactions

The expression of c10orf76 mRNA has been found to be inversely correlated with expression of various other mRNAs, including NFYB, CCR5, and NSBP1. Although this study and the predicted SH3-binding domain suggest that c10orf76 takes part in protein-protein binding interactions, none have been experimentally verified. A search of IntAct, MINT, and STRING also yielded zero predicted protein-protein interactions.


Predicted posttranslational modifications

The protein may be secreted via a non-classical pathway, which may underlie the functionality of some of the posttranslational modifications. There are ten conserved potential phosphorylation sites within the protein sequence. In addition, nine residues are confidently (>90%) predicted by NetOGlyc to undergo O-linked glycosylation, all residing within the low complexity region between Leu325 and Ser359.


Regions of potential research interest

The largest mRNA variant of c10orf76 encodes a protein with a proline-rich motif containing two PxxP domains, where "P" represents a proline residue and "x" represents any other amino acid (see the sequence below). These domains have been shown to participate in protein-protein binding interactions, specifically via the SH3 protein binding domain. The potential SH3-binding domain exists within a low complexity region with an unusually high number of amino acids with oxygen-containing side-groups. A NetOGlyc analysis of the region suggests that these residues are likely to undergo O-linked glycosylation and thus may serve to regulate binding to the potential SH3-binding domain.

325 L V T T P V S P A P T T P V T P L G T T P P S S 359

An Alu element was identified in the 3′-UTR of the longest mRNA transcript variant. It is unclear whether this sequence serves any functional or regulatory purpose, but there is existing evidence for Alu-mediated regulation of protein translation, so this cannot be ruled out for c10orf76.

The N-terminus of a short transcript variant (exons 17-26) was predicted to have a mitochondrial sorting signal with 96% confidence using the MitoProt II tool. It is unclear whether this is a uniquely transcribed variant or whether it results from cleavage of the full-size protein. There are no predicted alternative promoters upstream of this variant's first exon.
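The composition of the low complexity fragment printed earlier in this section can be tallied directly. The sketch below is only an illustration: it uses the fragment exactly as it appears in the text and counts serine and threonine residues (the hydroxyl-bearing residues that O-linked glycosylation can modify) along with prolines. It does not reproduce a NetOGlyc prediction, and the variable names are illustrative.

```python
# Fragment exactly as printed in the text (spaces removed).
fragment = "LVTTPVSPAPTTPVTPLGTTPPSS"

# Ser (S) and Thr (T) carry hydroxyl side chains and are the residues
# that O-linked glycosylation can modify.
hydroxyl = [aa for aa in fragment if aa in "ST"]
prolines = fragment.count("P")

print(len(fragment), len(hydroxyl), prolines)
```

In the printed fragment, serine, threonine, and proline together account for 17 of the 24 residues, which illustrates why the region is described as low complexity and rich in oxygen-containing side-groups.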


Model organisms

Model organisms have been used in the study of C10orf76 function. A conditional knockout mouse line called ''9130011E15Riktm1a(EUCOMM)Wtsi'' was generated at the Wellcome Trust Sanger Institute. Male and female animals underwent a standardized phenotypic screen to determine the effects of deletion. Additional screens performed:
- In-depth immunological phenotyping
- In-depth bone and cartilage phenotyping

