The CUB domain (for complement C1r/C1s, Uegf, Bmp1) is an evolutionarily conserved protein domain. It is a structural motif of approximately 110 residues found almost exclusively in extracellular and plasma membrane-associated proteins, many of which are developmentally regulated. These proteins are involved in a diverse range of functions, including complement activation, developmental patterning, tissue repair, axon guidance and angiogenesis, cell signalling, fertilisation, haemostasis, inflammation, neurotransmission, receptor-mediated endocytosis, and tumour suppression. Many CUB-containing proteins are peptidases belonging to MEROPS peptidase families M12A (astacin) and S1A (chymotrypsin).


Examples

Proteins containing a CUB domain include:
* Mammalian complement subcomponents C1s/C1r, which form the calcium-dependent complex C1, the first component of the classical pathway of the complement system.
* ''Cricetidae sp.'' (hamster) serine protease Casp, which degrades type I and IV collagen and fibronectin in the presence of calcium.
* Mammalian complement-activating component of Ra-reactive factor (RARF), a protease that cleaves the C4 component of complement.
* Vertebrate enteropeptidase, a type II membrane protein of the intestinal brush border, which activates trypsinogen.
* Vertebrate bone morphogenetic protein 1 (BMP-1), a protein which induces cartilage and bone formation and expresses metalloendopeptidase activity.
* Sea urchin blastula proteins BP10 and SpAN.
* ''C. elegans'' hypothetical proteins F42A10.8 and R151.5.
* Neuropilin (A5 antigen), a calcium-independent cell adhesion molecule that functions during the formation of certain neuronal circuits.
* Fibropellins I and III from ''Strongylocentrotus purpuratus'' (purple sea urchin).
* Mammalian hyaluronate-binding protein TSG-6 (or PS4), a serum- and growth factor-induced protein.
* Mammalian spermadhesins.
* ''Xenopus laevis'' embryonic protein UVS.2, which is expressed during dorsoanterior development.

Several of the above proteins consist of a catalytic domain together with several CUB domains interspersed by calcium-binding EGF domains.

Spermadhesins are a subdivision of the CUB domain family and form a major component of mammalian seminal fluid. They are polypeptides of 110-133 amino acids. Their binding activity, e.g. heparin and carbohydrate binding, enables their central role in promoting attachment of spermatozoa to carbohydrate groups on the glycoproteins found on the surface of oocytes. The spermadhesins from pigs, bulls and stallions show 40-98% similarity in their amino acid sequences, and all possess a disulphide bond between adjacent cysteine residues. The porcine spermadhesin polypeptides are encoded by five closely linked genes. Bovine spermadhesin relies on fewer genes, with only two associated with expression of this protein in bovine seminal fluid. Redundant genetic coding for spermadhesins has been observed in chimpanzees, dogs, and humans (Haase B, Schlötterer C, Hundrieser ME, Kuiper H, Distl O, Töpfer-Petersen E, Leeb T. Evolution of the spermadhesin gene family. Gene (2005) 352, pp. 20-29). The region corresponding to the spermadhesin genes in rat and mouse DNA is void of any spermadhesin coding sequence. These variations in the expression and genetic coding of spermadhesins are thought to result from evolutionary changes caused by mutations and deletions in genetic material.

Some CUB domains appear to be involved in oligomerisation and/or recognition of substrates and binding partners. For example, in the complement proteases, the CUB domains mediate dimerisation and binding to collagen-like regions of target proteins (e.g. C1q for C1r/C1s).

The structure of CUB domains consists of a beta-sandwich with a jelly-roll fold. Almost all CUB domains contain four conserved cysteines that probably form two disulphide bridges (C1-C2, C3-C4). The CUB1 domains of C1s and Map19 have calcium-binding sites.

Human genes encoding proteins containing this domain include:
* ATRN, ATRNL1, BMP1
* C1R, C1RL, C1S, CDCP2, CSMD1, CSMD2, CSMD3, CUBN, CUZD1
* DCBLD1, DCBLD2, DMBT1, DREG
* GPR126
* KREMEN1, KREMEN2
* LRP10, LRP12, LRP3
* MASP1, MASP2, MFRP
* NETO1, NETO2, NRP1, NRP2
* OVCH1, OVCH2
* PCOLCE, PCOLCE2, PDGFC, PDGFD, PRSS7
* RAMP
* SCUBE1, SCUBE2, SCUBE3, SEZ6, SEZ6L, SEZ6L2, ST14
* TLL1, TLL2, TMPRSS7, TNFAIP6
* psk-2


References

Incorporates content from InterPro entry IPR000858.