Nucleic Acids Research, 2000, Vol. 28, No. 11 2229-2233
© 2000 Oxford University Press
Two tricks in one bundle: helixturnhelix gains enzymatic activity
Biochemistry Department, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX 75390-9038, USA
Received February 22, 2000; Revised and Accepted April 4, 2000.
| ABSTRACT |
|---|
|
|
|---|
Many examples of enzymes that have lost their catalytic activity and perform other biological functions are known. The opposite situation is rare. A previously unnoticed structural similarity between the
integrase family (Int) proteins and the AraC family of transcriptional activators implies that the Int family evolved by duplication of an ancient DNA-binding homeodomain-like module, which acquired enzymatic activity. The two helixturnhelix (HTH) motifs in Int proteins incorporate catalytic residues and participate in DNA binding. The active site of Int proteins, which include the type IB topoisomerases, is formed at the domain interface and the catalytic tyrosine residue is located in the second helix of the C-terminal HTH motif. Structural analysis of other tyrosine DNA-breaking/rejoining enzymes with similar enzyme mechanisms, namely prokaryotic topoisomerase I, topoisomerase II and archaeal topoisomerase VI, reveals that the catalytic tyrosine is placed in a HTH domain as well. Surprisingly, the location of this tyrosine residue in the structure is not conserved, suggesting independent, parallel evolution leading to the same catalytic function by homologous HTH domains. The tyrosine recombinases give a rare example of enzymes that evolved from ancient DNA-binding modules and present a unique case for homologous enzymatic domains with similar catalytic mechanisms but different locations of catalytic residues, which are placed at non-homologous sites. The wealth of biochemical, sequence and structural information accumulated over the years of molecular biology provides examples of proteins that change function in the course of evolution (14). Enzymes having a chemical requirement for invariant amino acids in the active site are particularly vulnerable to selection pressure. Using sequence similarity, one can detect proteins evolutionarily related to enzymes but lacking catalytic activity due to disruption of their active sites. These proteins may function, for example, as transcription regulators (4). Given that an overwhelming majority of homologs to such proteins are indeed enzymes and that the non-catalytic variants are uncommon (4), there is little doubt about the direction of evolution in these cases: the enzyme has lost its activity/acquired a new function. The reverse path of evolution is rather rare. There are few examples of normally non-enzymatic domains that gain catalytic activity (5,6), particularly for transcription regulators. One such example is discussed here.
The helixturnhelix (HTH) DNA-binding motif is ubiquitous and detected in many transcription regulators (79). HTH transcription factors are diversified across a variety of orthologous families and the HTH motif is incorporated into several structural scaffolds (9). The most common of these scaffolds, hereafter referred to as homeodomain-like (HHTH), has a hydrophobic core of two
-helices (helices B and C) completed by another, usually N-terminal,
-helix (helix A). This structure can be described as a right-handed three-helical bundle (Fig. 1b). Some examples of HHTH proteins are homeodomains, AraC-type transcriptional activators and members of the winged HTH family (HHTHw), typified by the C-terminal domain of catabolite gene activator protein (CAP) (7). HTH bundles can usually be distinguished from other three-helical structures by a sequence signal in the HTH motif (811). Very divergent representatives with known spatial structure can be recognized by the characteristic packing of
-helices B and C at nearly a right angle to each other (Fig. 1b, helices B1C1 and B2C2, Fig. 1c). The turn between
-helices B and C offsets
-helix C so that the N-terminal part of C is packed against the middle of B.
-Helix B is usually short (two or three turns) and C, which binds to the DNA major groove, is longer (12). A monophyletic origin for most HHTH proteins has been proposed (8).
|
Site-specific recombination allows living organisms to rearrange and redistribute their genetic content by cutting and rejoining DNA segments at specific sequences. Recombinases catalyze DNA breakage, strand exchange and ligation. One of the two major recombinase types, the
integrase family (Int), uses a tyrosine nucleophile in a reaction that proceeds through a stable 3'-phosphotyrosine DNAenzyme intermediate (13,14). The structures of several family members, namely bacteriophage
integrase (15), bacteriophage HP1 integrase (16), XerD from Escherichia coli (17) and Cre recombinase from bacteriophage P1 (Fig. 1a) (18), have recently been solved. The most extensive structural information obtained concerns the DNA-binding mode and mechanism of Cre enzyme (19,20). X-ray crystallography revealed that type IB topoisomerases (21), which include eukaryotic (22,23) and viral (24) enzymes, also belong to the Int family due to extensive conservation of the structural core, active site arrangement and the catalytic mechanism (25,26).
The Int family has always been treated as a unique fold without much structural similarity to other proteins (2729). SCOP (30,31) groups Int family structures into the fold named DNA-breaking/rejoining enzymes of
+ß class. CATH (32) places them in the mainly
class with non-bundle architecture. However, structure similarity searches with such programs as DALI and VAST initiated with Cre recombinase coordinates (18) (pdb entry 1crx, Fig. 1a) reveal a highly significant and striking match that spans the entire length of the MarA transcriptional activator molecule (33) (pdb entry 1bl0, Fig. 1b). DALI (34,35) superimposes 88 C
atoms of 1crx (322 residues) and 1bl0 (116 residues) with a Z score of 4.0, r.m.s.d. of 3.3 Å and 17% identity in the resulting sequence alignment (Fig. 1d). VAST (36) aligns 78 C
atoms of these proteins with a P value of 0.0002, r.m.s.d. of 2.5 Å and a sequence identity of 16.7%. Additionally, superposition of C
traces of Cre recombinase and MarA results in an almost perfect superposition of DNA molecules present in the crystals (Fig. 1ac) despite the fact that DNA coordinates were not used in r.m.s.d. minimization. Thus the modes of DNA binding are essentially identical for Cre recombinase and MarA. Such an extensive structural resemblance combined with similar substrate binding and non-random sequence identity (18%, Fig. 1c) argues for homology (3,37) between DNA-breaking/rejoining enzymes and MarA. Surprisingly, similarity between the two proteins remained unnoticed to date.
MarA is a member of the AraC family of transcription activators that control expression of a variety of genes (33). The MarA structure consists of two HHTH modules with a unique mutual arrangement, previously unrecognized for multi-HTH proteins, in which two HHTH domains are approximately related by a translation (33) (Fig. 1b). This arrangement results in tight packing of the two domains and places two almost parallel DNA-binding helices in the major groove at a separation of one DNA double helix turn (Fig. 1b). Both MarA domains have structural counterparts in the Cre recombinaseDNA complex and all six MarA
-helices are superimposable between the two proteins (Fig. 1). The homology of Cre and MarA suggested by structural, functional and sequence similarity implies that the catalytic segment of Int proteins consists of two consecutive HHTH domains. However, it is difficult to determine at present if the common ancestor of Int and MarA already contained two HHTH domains or if duplications in these proteins occurred in parallel. Interestingly, among the four articles describing different independently solved Int protein structures (1518), only one discusses the structural similarity of the first HHTH domain in Int proteins with the HTH motif of the catabolite activator protein DNA-binding domain (17). X-ray crystallography revealed that the second HHTH domain, which contains a catalytic tyrosine residue, is conformationally variable between different representatives of the family, as well as between different DNA complexes of the same Cre protein, and thus might fold into the HHTH structure upon DNA binding only (1518,2729). For example, in
integrase the catalytic tyrosine is modeled in a flexible ß-strand-like region. Such flexibility might be necessary for proper functioning of the enzymatic HHTH domain. It is well known that the active sites of many enzymes include regions of higher flexibility to accommodate changes in the substrate during catalysis. Therefore, it is likely that the second HHTH domain, which contains most of the active site residues (Fig. 1a and c), acquired some structural flexibility while the first HHTH domain, which is used mostly for DNA binding in a standard HTH-like manner, remained rigid.
Thus the Int family fold has likely evolved by a duplication of an ancient HHTH protein (Fig. 1a, red and blue). The first HHTH domain was elaborated with long insertions (Fig. 1a, gray) placed in the turn region (Fig. 1a, yellow) of the HTH motif. These insertions are structured in subdomains that contain small ß-sheets (Fig. 1a, gray). It is not unusual for HTH proteins to incorporate insertions in turn regions, found for example in the endonuclease FokI (38). The presence of these subdomains disrupting the HTH motif masks the sequence signal and prevents motif detection in Int proteins by sequence analysis. The first HHTH domain of Int proteins is used primarily for DNA binding while the second HHTH domain is adapted to a catalytic role.
The following question arises: are there other examples of HTH domains that are not only present in an enzyme as nucleotide-binding modules but possess enzymatic activity (i.e. carry at least some of the catalytic residues)? PDB (39,40) searches by DALI (34,35) and VAST (36,41) reveal domains of different topoisomerases that contain catalytic tyrosine residues as members of the HHTH fold. The presence of HHTH domains in type IA, II and VI topoisomerases (4244) has been detected previously (4446). Topoisomerase IA, II and VI HHTH domains contain a small amount of ß-sheet and should be classified as CAP-like winged HTH domains (Fig. 2a, c and d). Notably, all of these enzymes possess a catalytic mechanism similar to the one established for Int proteins, i.e. tyrosine is utilized as a nucleophile and found in an HHTH domain. The Int family includes type IB topoisomerases. Thus an evolutionary connection exists between all tyrosine DNA-breaking/rejoining enzymes with known structure, namely type IA, IB, II and VI topoisomerases, which all contain an enzymatic HHTH module. Structure superpositions of these domains in the four enzymes reveal that the position of the catalytic tyrosine residue is not structurally conserved (Fig. 2e). In the topoisomerase VI structure (44) Tyr103 is placed in
-helix B (Fig. 2a); in the Int family, including topoisomerase IB (21,22,24,47) and Cre recombinase (18), Tyr324 (Cre numbering) is incorporated in
-helix C (Fig. 2b); in topoisomerase IA (42) Tyr319 is at the C-terminal end of the first ß-stand in the wing segment of the HHTH domain (Fig. 2c); in topoisomerase II (43) Tyr782 is located after a long loop at the beginning of the second ß-strand in the wing (Fig. 2d). The sites in homologous HTH domains where catalytic tyrosines are located are not homologous; therefore, the catalytic properties of HTH domains in DNA-breaking/rejoining enzymes are likely to have evolved independently in parallel. Thus catalytic HHTH domains provide a unique example of homologous enzymes with a similar mechanism but different location of active site residues which are placed at non-homologous sites.
|
| ACKNOWLEDGEMENTS |
|---|
The author is grateful to Yuri Wolf for fruitful discussions and to Hong Zhang and Monica Horvath for critical reading of the manuscript and helpful comments.
| FOOTNOTES |
|---|
* Tel: +1 214 648 3386; Fax: +1 214 648 9099; Email: grishin@chop.swmed.edu
| REFERENCES |
|---|
|
|
|---|
-
1 Murzin,A.G. (1993) Trends Biochem. Sci., 18, 403405.[Web of Science][Medline]
2 Artymiuk,P.J., Poirrette,A.R., Rice,D.W. and Willett,P. (1997) Nature, 388, 3334.[Medline]
3 Murzin,A.G. (1998) Curr. Opin. Struct. Biol., 8, 380387.[Web of Science][Medline]
4 Aravind,L. and Koonin,E.V. (1998) Curr. Biol., 8, R111R113.[Web of Science][Medline]
5 Lorick,K.L., Jensen,J.P., Fang,S., Ong,A.M., Hatakeyama,S. and Weissman,A.M. (1999) Proc. Natl Acad. Sci. USA, 96, 1136411369.
6 Boerner,R.J., Consler,T.G., Gampe,R.T.,Jr, Weigl,D., Willard,D.H., Davis,D.G., Edison,A.M., Loganzo,F.,Jr, Kassel,D.B., Xu,R.X. et al. (1995) Biochemistry, 34, 1535115358.[Medline]
7 Wintjens,R. and Rooman,M. (1996) J. Mol. Biol., 262, 294313.[Web of Science][Medline]
8 Rosinski,J.A. and Atchley,W.R. (1999) J. Mol. Evol., 49, 301309.[Web of Science][Medline]
9 Aravind,L. and Koonin,E.V. (1999) Nucleic Acids Res., 27, 46584670.
10 Altschul,S.F., Madden,T.L., Schaffer,A.A., Zhang,J., Zhang,Z., Miller,W. and Lipman,D.J. (1997) Nucleic Acids Res., 25, 33893402.
11 Altschul,S.F. and Koonin,E.V. (1998) Trends Biochem. Sci., 23, 444447.[Web of Science][Medline]
12 Wintjens,R.T., Rooman,M.J. and Wodak,S.J. (1996) J. Mol. Biol., 255, 235253.[Web of Science][Medline]
13 Nunes-Duby,S.E., Kwon,H.J., Tirumalai,R.S., Ellenberger,T. and Landy,A. (1998) Nucleic Acids Res., 26, 391406.
14 Gopaul,D.N. and Duyne,G.D. (1999) Curr. Opin. Struct. Biol., 9, 1420.[Medline]
15 Kwon,H.J., Tirumalai,R., Landy,A. and Ellenberger,T. (1997) Science, 276, 126131.
16 Hickman,A.B., Waninger,S., Scocca,J.J. and Dyda,F. (1997) Cell, 89, 227237.[Web of Science][Medline]
17 Subramanya,H.S., Arciszewska,L.K., Baker,R.A., Bird,L.E., Sherratt,D.J. and Wigley,D.B. (1997) EMBO J., 16, 51785187.[Web of Science][Medline]
18 Guo,F., Gopaul,D.N. and van Duyne,G.D. (1997) Nature, 389, 4046.[Medline]
19 Gopaul,D.N., Guo,F. and Van Duyne,G.D. (1998) EMBO J., 17, 41754187.[Web of Science][Medline]
20 Guo,F., Gopaul,D.N. and Van Duyne,G.D. (1999) Proc. Natl Acad. Sci. USA, 96, 71437148.
21 Redinbo,M.R., Champoux,J.J. and Hol,W.G. (1999) Curr. Opin. Struct. Biol., 9, 2936.[Medline]
22 Redinbo,M.R., Stewart,L., Kuhn,P., Champoux,J.J. and Hol,W.G. (1998) Science, 279, 15041513.
23 Redinbo,M.R., Stewart,L., Champoux,J.J. and Hol,W.G. (1999) J. Mol. Biol., 292, 685696.[Web of Science][Medline]
24 Cheng,C., Kussie,P., Pavletich,N. and Shuman,S. (1998) Cell, 92, 841850.[Web of Science][Medline]
25 Sherratt,D.J. and Wigley,D.B. (1998) Cell, 93, 149152.[Web of Science][Medline]
26 Wigley,D.B. (1998) Structure, 6, 543548.[Medline]
27 Grindley,N.D. (1997) Curr. Biol., 7, R608R612.[Web of Science][Medline]
28 Lilley,D.M. (1997) Chem. Biol., 4, 717720.[Web of Science][Medline]
29 Yang,W. and Mizuuchi,K. (1997) Structure, 5, 14011406.[Medline]
30 Murzin,A.G., Brenner,S.E., Hubbard,T. and Chothia,C. (1995) J. Mol. Biol., 247, 536540.[Web of Science][Medline]
31 Hubbard,T.J., Ailey,B., Brenner,S.E., Murzin,A.G. and Chothia,C. (1999) Nucleic Acids Res., 27, 254256.
32 Orengo,C.A., Michie,A.D., Jones,S., Jones,D.T., Swindells,M.B. and Thornton,J.M. (1997) Structure, 5, 10931108.[Medline]
33 Rhee,S., Martin,R.G., Rosner,J.L. and Davies,D.R. (1998) Proc. Natl Acad. Sci. USA, 95, 1041310418.
34 Holm,L. and Sander,C. (1995) Trends Biochem. Sci., 20, 478480.[Web of Science][Medline]
35 Holm,L. and Sander,C. (1997) Nucleic Acids Res., 25, 231234.
36 Gibrat,J.F., Madej,T. and Bryant,S.H. (1996) Curr. Opin. Struct. Biol., 6, 377385.[Web of Science][Medline]
37 Russell,R.B., Saqi,M.A., Bates,P.A., Sayle,R.A. and Sternberg,M.J. (1998) Protein Eng., 11, 19.
38 Wah,D.A., Hirsch,J.A., Dorner,L.F., Schildkraut,I. and Aggarwal,A.K. (1997) Nature, 388, 97100.[Medline]
39 Bernstein,F.C., Koetzle,T.F., Williams,G.J., Meyer,E.F.,Jr, Brice,M.D., Rodgers,J.R., Kennard,O., Shimanouchi,T. and Tasumi,M. (1977) Eur. J. Biochem., 80, 319324.[Web of Science][Medline]
40 Abola,E.E., Sussman,J.L., Prilusky,J. and Manning,N.O. (1997) Methods Enzymol., 277, 556571.[Web of Science][Medline]
41 Marchler-Bauer,A., Addess,K.J., Chappey,C., Geer,L., Madej,T., Matsuo,Y., Wang,Y. and Bryant,S.H. (1999) Nucleic Acids Res., 27, 240243.
42 Lima,C.D., Wang,J.C. and Mondragon,A. (1994) Nature, 367, 138146.[Medline]
43 Berger,J.M., Gamblin,S.J., Harrison,S.C. and Wang,J.C. (1996) Nature, 379, 225232.[Medline]
44 Nichols,M.D., DeAngelis,K., Keck,J.L. and Berger,J.M. (1999) EMBO J., 18, 61776188.[Web of Science][Medline]
45 Murzin,A.G. (1994) Curr. Opin. Struct. Biol., 4, 441449.
46 Berger,J.M., Fass,D., Wang,J.C. and Harrison,S.C. (1998) Proc. Natl Acad. Sci. USA, 95, 78767881.
47 Shuman,S. (1998) Biochim. Biophys. Acta, 1400, 321337.[Medline]
48 Esnouf,R. (1997) J. Mol. Graph. Model., 15, 133138.
49 Kraulis,P. (1991) J. Appl. Crystallogr., 24, 946950.
This article has been cited by other articles:
![]() |
S. Balaji and L. Aravind The RAGNYA fold: a novel fold with multiple topological variants found in functionally diverse nucleic acid, nucleotide and peptide-binding proteins Nucleic Acids Res., September 27, 2007; 35(17): 5658 - 5671. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Martinez, J. A. Bengoechea, and F. Cuttitta Molecular Evolution of Proadrenomedullin N-Terminal 20 Peptide (PAMP): Evidence for Gene Co-Option Endocrinology, July 1, 2006; 147(7): 3457 - 3461. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Chen and P. A. Rice The Role of the Conserved Trp330 in Flp-mediated Recombination: FUNCTIONAL AND STRUCTURAL ANALYSIS J. Biol. Chem., June 27, 2003; 278(27): 24800 - 24807. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. M. Bankhead, B. J. Etzel, F. Wolven, S. Bordenave, J. L. Boldt, T. A. Larsen, and A. M. Segall Mutations at Residues 282, 286, and 293 of Phage {lambda} Integrase Exert Pathway-Specific Effects on Synapsis and Catalysis in Recombination J. Bacteriol., April 15, 2003; 185(8): 2653 - 2666. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||





