HOME

TheInfoList



OR:

Solenoid protein domains are a highly modular type of
protein domain In molecular biology, a protein domain is a region of a protein's polypeptide chain that is self-stabilizing and that folds independently from the rest. Each domain forms a compact folded three-dimensional structure. Many proteins consist of s ...
. They consist of a chain of nearly identical folds, often simply called
tandem repeats Tandem repeats occur in DNA when a pattern of one or more nucleotides is repeated and the repetitions are directly adjacent to each other. Several protein domains also form tandem repeats within their amino acid primary structure, such as armadil ...
. They are extremely common among all types of proteins, though exact figures are unknown.


"Repeats" in molecular biology

In
protein Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respo ...
s, a "repeat" is any sequence block that returns more than one time in the
sequence In mathematics, a sequence is an enumerated collection of objects in which repetitions are allowed and order matters. Like a set, it contains members (also called ''elements'', or ''terms''). The number of elements (possibly infinite) is calle ...
, either in an identical or a highly similar form. Repetitiveness does not in itself indicate anything about the structure of the protein. As a "rule of thumb", short repetitive sequences (e.g. those below the length of 10 amino acids) may be intrinsically disordered, and not part of any folded
protein domain In molecular biology, a protein domain is a region of a protein's polypeptide chain that is self-stabilizing and that folds independently from the rest. Each domain forms a compact folded three-dimensional structure. Many proteins consist of s ...
s. Repeats that are at least 30 to 40 amino acids long, are far more likely to be folded as part of a domain. Such long repeats are frequently indicative of the presence of a solenoid domain in the protein. Examples of disordered repetitive sequences include the 7-mer peptide repeats found in the RPB1 subunit of
RNA polymerase II RNA polymerase II (RNAP II and Pol II) is a multiprotein complex that transcribes DNA into precursors of messenger RNA (mRNA) and most small nuclear RNA (snRNA) and microRNA. It is one of the three RNAP enzymes found in the nucleus of eukaryoti ...
, or the tandem
beta-catenin Catenin beta-1, also known as beta-catenin (β-catenin), is a protein that in humans is encoded by the ''CTNNB1'' gene. Beta-catenin is a dual function protein, involved in regulation and coordination of cell–cell adhesion and gene transcripti ...
or
axin Axin-1 is a protein that in humans is encoded by the ''AXIN1'' gene. Function This gene encodes a cytoplasmic protein which contains a regulation of G-protein signaling (RGS) domain and a dishevelled and axin (DIX) domain. The encoded protei ...
binding linear motifs in APC (adenomatous polyposis coli). Examples of short repeats exhibiting ordered structures include the three-residue collagen repeat or the five-residue
pentapeptide repeat Pentapeptide repeats are a family of sequence motifs found in multiple tandem copies in protein molecules. Pentapeptide repeat proteins are found in all species, but they are found in many copies in cyanobacterial genomes. The repeats were first ...
that forms a
beta helix A beta helix is a tandem protein repeat structure formed by the association of parallel beta strands in a helical pattern with either two or three faces. The beta helix is a type of solenoid protein domain. The structure is stabilized by inter-st ...
structure.


Architecture of solenoid domains

Due to the identical form of their building blocks, solenoid domains can only assume a limited number of shapes. Two main topologies are possible: linear (or open, generally with some degree of helical curvature) and circular (or closed).


Linear (open) solenoids

If the two terminal repeats in a solenoid do not physically interact, it leads to an open or linear structure. Members of this group are frequently rod- or crescent-shaped. The number of individual repeats can range from 2 to over 50. A clear advantage of this topology is that both the N- and C-terminal ends are free to add new repeats and folds, or even remove existing ones during evolution without any gross impact on the structural stability of the entire domain. This type of domain is extremely common among extracellular segments of receptors or cell adhesion molecules. A non-exhaustive list of examples include: EGF repeats, cadherin repeats,
leucine-rich repeat A leucine-rich repeat (LRR) is a protein structural motif that forms an α/β horseshoe fold. It is composed of repeating 20–30 amino acid stretches that are unusually rich in the hydrophobic amino acid leucine. These tandem repeats common ...
s, HEAT repeats,
ankyrin repeat The ankyrin repeat is a 33-residue motif in proteins consisting of two alpha helices separated by loops, first discovered in signaling proteins in yeast Cdc10 and ''Drosophila'' Notch. Domains consisting of ankyrin tandem repeats mediate prot ...
s,
armadillo repeats An armadillo repeat is the name of a characteristic, repetitive amino acid sequence of about 40 residues in length that is found in many proteins. Proteins that contain armadillo repeats typically contain several tandemly repeated copies. Each a ...
, tetratricopeptide repeats, etc. Whenever a linear solenoid domain structure participates in protein-protein interactions, frequently at least 3 or more repetitive subunits form the ligand-binding sites. Thus - while individual repeats might have a (limited) ability to fold on their own – they usually cannot perform the functions of the entire domain alone.


Circular (closed) solenoids

In the case when the N- and C-terminal repeats lie in close physical contact in a solenoid domain, the result is a topologically compact, closed structure. Such domains typically display a high rotational symmetry (unlike open solenoids that only have translational symmetries), and assume a wheel-like shape. Because of the limitations of this structure, the number of individual repeats is not arbitrary. In the case of
WD40 repeat The WD40 repeat (also known as the WD or beta-transducin repeat) is a short structural motif of approximately 40 amino acids, often terminating in a tryptophan-aspartic acid (W-D) dipeptide. Tandem copies of these repeats typically fold togethe ...
s (perhaps the largest family of closed solenoids) the number of repeats can range from 4 to 10 (more usually between 5 and 7). Kelch repeats, beta-barrels and beta-trefoil repeats are further examples for this architecture. Closed solenoids frequently function as protein-protein interaction modules: it is possible that all repeats must be present to form the ligand-binding site if it is located at the centre or axis of the domain "wheel".


Repetitive supradomain modules

As common in biology, there are several borderline cases between solenoid architectures and regular protein domains. Proteins that contain tandem repeats of ordinary domains are very common in eukaryotes. Even if these domains are perfectly capable of folding on their own, some of them might bind together and assume a rigidly fixed orientation in the full protein. These supradomain modules can perform functions that its individual constituents are incapable of . A famous example is the case of tandem BRCT domains, found in the tumor suppressor protein
BRCA1 Breast cancer type 1 susceptibility protein is a protein that in humans is encoded by the ''BRCA1'' () gene. Orthologs are common in other vertebrate species, whereas invertebrate genomes may encode a more distantly related gene. ''BRCA1'' is a h ...
. While individual
BRCT domain BRCA1 C Terminus (BRCT) domain is a family of evolutionarily related proteins. It is named after the C-terminal domain of BRCA1, a DNA-repair protein that serves as a marker of breast cancer susceptibility. The BRCT domain is found predominant ...
s are found in certain proteins (e.g. some
DNA ligase DNA ligase is a specific type of enzyme, a ligase, () that facilitates the joining of DNA strands together by catalyzing the formation of a phosphodiester bond. It plays a role in repairing single-strand breaks in duplex DNA in living organ ...
s) binding DNA, these tandem BRCT domains evolved a novel function:
phosphorylated In chemistry, phosphorylation is the attachment of a phosphate group to a molecule or an ion. This process and its inverse, dephosphorylation, are common in biology and could be driven by natural selection. Text was copied from this source, whi ...
linear motif binding. In the case of
BRCA1 Breast cancer type 1 susceptibility protein is a protein that in humans is encoded by the ''BRCA1'' () gene. Orthologs are common in other vertebrate species, whereas invertebrate genomes may encode a more distantly related gene. ''BRCA1'' is a h ...
(and
MDC1 Mediator of DNA damage checkpoint protein 1 is a 2080 amino acid long protein that in humans is encoded by the ''MDC1'' gene located on the short arm (p) of chromosome 6. MDC1 protein is a regulator of the Intra-S phase and the G2/M cell cycle chec ...
), the peptide-binding groove lies in a cleft formed by the junction of the two domains. This elegantly explains why individual constituents of this supradomain block are incapable of ligand binding, while their proper assembly endows them with a novel function. Therefore, tandem BRCT domains can be regarded as a form of a single, linear solenoid domain as well.


References

{{Protein tandem repeats Protein tandem repeats Protein domains