A basic helix-loop-helix (bHLH) is a protein structural motif that characterizes one of the largest families of dimerizing transcription factors. It should not be confused with the helix-turn-helix domain.
The motif is characterized by two α-helices connected by a loop. In general, transcription factors including this domain are dimeric, each with one helix containing basic amino acid residues that facilitate DNA binding. In general, one helix is smaller, and, due to the flexibility of the loop, allows dimerization by folding and packing against another helix. The larger helix typically contains the DNA-binding regions. bHLH proteins typically bind to a consensus sequence called an E-box, CANNTG. The canonical E-box is CACGTG (palindromic), however some bHLH transcription factors, notably those of the bHLH-PAS family, bind to related non-palindromic sequences, which are similar to the E-box. bHLH TFs may homodimerize or heterodimerize with other bHLH TFs and form a large variety of dimers, each one with specific functions.
Examples of transcription factors containing a bHLH include:
bHLH transcription factors are often important in development or cell activity. BMAL1-Clock is a core transcription complex in the molecular circadian clock. Other genes, like c-Myc and HIF-1, have been linked to cancer due to their effects on cell growth and metabolism.
Since many bHLH transcription factors are heterodimeric, their activity is often highly regulated by the dimerization of the subunits. One subunit's expression or availability is often controlled, whereas the other subunit is constitutively expressed. Many of the known regulatory proteins, such as the Drosophila extramacrochaetae protein, have the helix-loop-helix structure but lack the basic region, making them unable to bind to DNA on their own. They are, however, able to form heterodimers with proteins that have the bHLH structure, and inactivate their abilities as transcription factors.
- 1989: Murre et al. showed that dimers of various bHLH proteins bind to a short DNA motif (later called E-Box). This E-box consists of the DNA sequence CANNTG, where N can be any nucleotide.
- 1994: Harrison's and Pabo's groups crystallize bHLH proteins bound to E-boxes, demonstrating that the parallel 4-helix bundle motif loop orients the basic sequences to interact with specific nucleotides in the major groove of the E-box.
- 1994: Wharton et al. identified asymmetric E-boxes bound by a subset of bHLH proteins with PAS domains (bHLH-PAS proteins), including Single-minded (Sim) and the aromatic hydrocarbon receptor.
- 1995: Semenza's group identifies hypoxia-inducible factor (HIF) as a bHLH-PAS heterodimer that binds a related asymmetric E-box.
- 2009: Grove, De Masi et al., identified novel short DNA motifs, bound by a subset of bHLH proteins, which they defined as "E-box-like sequences". These are in the form of CAYRMK, where Y stands for C or T, R is A or G, M is A or C and K is G or T.
Human proteins with helix-loop-helix DNA-binding domain
AHR; AHRR; ARNT; ARNT2; ARNTL; ARNTL2; ASCL1; ASCL2; ASCL3; ASCL4; ATOH1; ATOH7; ATOH8; BHLHB2; BHLHB3; BHLHB4; BHLHB5; BHLHB8; CLOCK; EPAS1; FERD3L; FIGLA; HAND1; HAND2; HES1; HES2; HES3; HES4; HES5; HES6; HES7; HEY1; HEY2; HIF1A; ID1; ID2; ID3; ID4; KIAA2018; LYL1; MASH1; MATH2; MAX; MESP1; MESP2; MIST1; MITF; MLX; MLXIP; MLXIPL; MNT; MSC; MSGN1; MXD1; MXD3; MXD4; MXI1; MYC; MYCL1; MYCL2; MYCN; MYF5; MYF6; MYOD1; MYOG; NCOA1; NCOA3; NEUROD1; NEUROD2; NEUROD4; NEUROD6; NEUROG1; NEUROG2; NEUROG3; NHLH1; NHLH2; NPAS1; NPAS2; NPAS3; NPAS4; OAF1; OLIG1; OLIG2; OLIG3; PTF1A; SCL; SCXB; SIM1; SIM2; SOHLH1; SOHLH2; SREBF1; SREBF2; TAL1; TAL2; TCF12; TCF15; TCF21; TCF3; TCF4; TCFL5; TFAP4; TFE3; TFEB; TFEC; TWIST1; TWIST2; USF1; USF2;
- ^ PDB: 1x0o; Card PB, Erbel PJ, Gardner KH (October 2005). "Structural basis of ARNT PAS-B dimerization: use of a common beta-sheet interface for hetero- and homodimerization". J. Mol. Biol. 353 (3): 664–77. doi:10.1016/j.jmb.2005.08.043. PMID 16181639.
- ^ Murre C, Bain G, van Dijk MA, Engel I, Furnari BA, Massari ME, Matthews JR, Quong MW, Rivera RR, Stuiver MH (June 1994). "Structure and function of helix-loop-helix proteins". Biochim. Biophys. Acta. 1218 (2): 129–35. doi:10.1016/0167-4781(94)90001-9. PMID 8018712.
- ^ Littlewood TD, Evan GI (1995). "Transcription factors 2: helix-loop-helix". Protein Profile. 2 (6): 621–702. PMID 7553065.
- ^ Massari ME, Murre C (January 2000). "Helix-loop-helix proteins: regulators of transcription in eucaryotic organisms". Mol. Cell. Biol. 20 (2): 429–40. doi:10.1128/MCB.20.2.429-440.2000. PMC 85097 . PMID 10611221.
- ^ Amoutzias, Grigoris D.; Robertson, David L.; Van de Peer, Yves; Oliver, Stephen G. (2008-05-01). "Choose your partners: dimerization in eukaryotic transcription factors". Trends in Biochemical Sciences. 33 (5): 220–229. doi:10.1016/j.tibs.2008.02.002. ISSN 0968-0004. PMID 18406148.
- ^ Lawrence Zipursky; Arnold Berk; Monty Krieger; Darnell, James E.; Lodish, Harvey F.; Kaiser, Chris; Matthew P Scott; Matsudaira, Paul T. McGill Lodish 5E Package - Molecular Cell Biology & McGill Activation Code. San Francisco: W. H. Freeman. ISBN 0-7167-8635-4.
- ^ a b Chaudhary J, Skinner MK (1999). "Basic helix-loop-helix proteins can act at the E-box within the serum response element of the c-fos promoter to influence hormone-induced promoter activation in Sertoli cells". Mol. Endocrinol. 13 (5): 774–86. doi:10.1210/mend.13.5.0271. PMID 10319327.
- ^ a b Amoutzias, Gregory D.; Robertson, David L.; Oliver, Stephen G.; Bornberg-Bauer, Erich (2004-03-01). "Convergent evolution of gene networks by single-gene duplications in higher eukaryotes". EMBO Reports. 5 (3): 274–279. doi:10.1038/sj.embor.7400096. ISSN 1469-221X. PMC 1299007 . PMID 14968135.
- ^ Cabrera CV, Alonso MC, Huikeshoven H (1994). "Regulation of scute function by extramacrochaete in vitro and in vivo". Development. 120 (12): 3595–603. PMID 7821225.
- ^ Murre C, McCaw PS, Vaessin H, et al. (1989). "Interactions between heterologous helix-loop-helix proteins generate complexes that bind specifically to a common DNA sequence". Cell. 58 (3): 537–44. doi:10.1016/0092-8674(89)90434-0. PMID 2503252.
- ^ Ellenberger T, Fass D, Arnaud M, Harrison SC (April 1994). "Crystal structure of transcription factor E47: E-box recognition by a basic region helix-loop-helix dimer". Genes Dev. 8 (8): 970–80. doi:10.1101/gad.8.8.970. PMID 7926781.
- ^ Ma PC, Rould MA, Weintraub H, Pabo CO (May 1994). "Crystal structure of MyoD bHLH domain-DNA complex: perspectives on DNA recognition and implications for transcriptional activation". Cell. 77 (3): 451–9. doi:10.1016/0092-8674(94)90159-7. PMID 8181063.
- ^ Wharton KA, Franks RG, Kasai Y, Crews ST (December 1994). "Control of CNS midline transcription by asymmetric E-box-like elements: similarity to xenobiotic responsive regulation". Development. 120 (12): 3563–9. PMID 7821222.
- ^ Wang GL, Jiang BH, Rue EA, Semenza GL (June 1995). "Hypoxia-inducible factor 1 is a basic helix-loop-helix-PAS heterodimer regulated by cellular O2 tension". Proc. Natl. Acad. Sci. U.S.A. 92 (12): 5510–4. doi:10.1073/pnas.92.12.5510. PMC 41725 . PMID 7539918.
- ^ Grove C, De Masi F, et al. (2009). "A multiparameter network reveals extensive divergence between C. elegans bHLH transcription factors". Cell. 138 (2): 314–27. doi:10.1016/j.cell.2009.04.058. PMC 2774807 . PMID 19632181.