The HHpred interactive server for protein homology detection and structure prediction
Department of Protein Evolution, Max-Planck-Institute for Developmental Biology, Spemannstrasse 35, 72076 Tübingen, Germany
*To whom correspondence should be addressed. Tel: +49 7071 601 451; Fax: +49 7071 601 349; Email: johannes.soeding{at}tuebingen.mpg.de
Received February 14, 2005. Revised March 21, 2005. Accepted March 21, 2005.
| ABSTRACT |
|---|
HHpred is a fast server for remote protein homology detection and structure prediction and is the first to implement pairwise comparison of profile hidden Markov models (HMMs). It allows searching a wide choice of databases, such as the PDB, SCOP, Pfam, SMART, COGs and CDD. It accepts a single query sequence or a multiple alignment as input. Within only a few minutes it returns the search results in a user-friendly format similar to that of PSI-BLAST. Search options include local or global alignment and scoring secondary structure similarity. HHpred can produce pairwise query-template alignments, multiple alignments of the query with a set of templates selected from the search results, as well as 3D structural models that are calculated by the MODELLER software from these alignments. A detailed help facility is available. As a demonstration, we analyze the sequence of SpoVT, a transcriptional regulator from Bacillus subtilis. HHpred can be accessed at http://protevo.eb.tuebingen.mpg.de/hhpred.
| INTRODUCTION |
|---|
It is well known that sequence search methods such as BLAST, FASTA or PSI-BLAST (1–3) are of prime importance for biological research because functional information about a protein or gene can be inferred from homologous proteins or genes identified in a sequence search. But quite often no significant relationship to a protein of known function can be established. This is certainly the case for the most interesting group of proteins, those for which no ortholog has yet been studied.
It is less well known that in cases where conventional sequence search methods fail, the recently developed, highly sensitive methods for homology detection or structure prediction [see, e.g. (4–11) and the descriptions and links at http://bioinfo.pl/Meta/servers.html] quite often allow inferences to be made from more remotely homologous relationships (12–17). If the relationship is so remote that no common function can be assumed, one can generally still derive hypotheses about possible mechanisms, active site positions and residues, or the class of substrate bound (18,19). When a homologous protein with known structure can be identified, its structure can be used as a template to model the 3D structure of the protein of interest (5), since even remotely homologous proteins generally have quite similar 3D structures (20). The 3D model may then help to generate hypotheses to guide experiments.
The primary aim in developing HHpred was to provide biologists with a method for sequence database searching that is as easy to use as BLAST or PSI-BLAST and yet competitive in sensitivity with the most powerful servers for structure prediction that are currently available. We believe that HHpred is unique in the advantages it offers:
Speed: A search with a 300 residue sequence through the Protein Data Bank (PDB) (~9000 HMMs) takes ~1 min.
Databases: A wide range of regularly updated structure and protein family databases can be searched: the PDB (21), SCOP (22), Pfam (23), SMART (24), COG (25) and CDD (26).
User-friendliness: Search results are presented in an easy-to-read format that is similar to PSI-BLAST. The summary hit list includes E-values and true probabilities. Alignments contain annotation about secondary structure, consensus sequences and position-specific reliability and they can be augmented by representative sequences from the underlying multiple alignments.
Flexibility: We try to offer the user maximum control and flexibility. The user can paste a custom input query alignment, search in local or global alignment mode, realign alignments with other parameters and edit the query-template (multiple) alignment with which to launch the comparative modelling.
Multi-domain proteins: HHpred has been designed to work equally well for single-domain and multi-domain query sequences. It can therefore be used to predict domain boundaries.
Documentation: A comprehensive help facility is available.
Selectivity: High-scoring false positives have systematically been reduced by developing a protocol for building query and database alignments that suppresses non-homologous sequences (J. Söding, to be published).
Sensitivity: HHpred is among the most sensitive servers for remote homology detection. A comparison of the new version HHpred2.1 with the servers that took part in the recent structure prediction benchmark CAFASP4 (27) can be viewed at http://protevo.eb.tuebingen.mpg.de/hhpred/hhpred_in_CAFASP4.html. In a recent study (28), in which we benchmarked HHsearch, the method for HMM–HMM comparison employed by our server, together with PSI-BLAST, HMMER, PROF_SIM and COMPASS (3,6,7,29), HHsearch was found to possess the highest sensitivity and alignment accuracy.
| METHODS AND INPUT PARAMETERS |
|---|
In the first step, an alignment of homologs is built for the query sequence by multiple iterations of PSI-BLAST searches against the non-redundant database from NCBI. The maximum number of PSI-BLAST iterations and the E-value threshold can be specified on the start page (Figure 1). Instead of a single sequence, the user may also enter a multiple alignment to jumpstart PSI-BLAST, or skip the PSI-BLAST iterations altogether by setting the maximum number of PSI-BLAST iterations to zero.
The user can further specify a minimum coverage of the query by the PSI-BLAST matches. With a value of 50%, at least half of the query residues must be aligned (covered) with residues from the matched sequence in order for it to enter into the profile. Similarly, a minimum sequence identity of the PSI-BLAST match to the query sequence can be demanded. Our benchmarks (data not published) have shown that a value between 20 and 25% improves selectivity without compromising sensitivity. The final alignment from PSI-BLAST is annotated with the predicted secondary structure and confidence values from PSIPRED (30).
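The two filters can be illustrated with a short sketch. The Python below is not the server's code. It assumes that the query and a PSI-BLAST match are given as equal-length aligned strings with `-` for gaps, and that sequence identity is measured over the columns in which both sequences have a residue (the text above does not state the denominator). The function names and the toy sequences are hypothetical.

```python
def coverage(query: str, match: str) -> float:
    """Fraction of query residues aligned to a residue (not a gap) in the match."""
    pairs = [(q, m) for q, m in zip(query, match) if q != "-"]
    covered = sum(1 for _, m in pairs if m != "-")
    return covered / len(pairs)


def identity(query: str, match: str) -> float:
    """Fraction of identical residues among columns where both rows have a residue."""
    pairs = [(q, m) for q, m in zip(query, match) if q != "-" and m != "-"]
    if not pairs:
        return 0.0
    return sum(1 for q, m in pairs if q == m) / len(pairs)


def passes_filters(query: str, match: str,
                   min_cov: float = 0.5, min_id: float = 0.2) -> bool:
    """Keep a match only if it meets both the coverage and the identity threshold."""
    return coverage(query, match) >= min_cov and identity(query, match) >= min_id


query = "MKTAYIAKQR"
good = "MKTAHIA---"   # covers 7 of 10 query residues, 6 of 7 aligned pairs identical
short = "MKT-------"  # covers only 3 of 10 query residues

print(passes_filters(query, good))   # True
print(passes_filters(query, short))  # False
```

With the 50% coverage setting, the second match is rejected even though every one of its aligned residues is identical to the query.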
In the next step, a profile HMM is generated from the multiple alignment that includes the information about predicted secondary structure. A profile HMM is a concise statistical description of the underlying alignment. For each column in the multiple alignment that has a residue in the query sequence, an HMM column is created that contains the probabilities of each of the 20 amino acids, plus 4 probabilities that describe how often amino acids are inserted and deleted at this position (insert open/extend, delete open/extend). These insert/delete probabilities are translated into position-specific gap penalties when an HMM is aligned to a sequence or to another HMM.
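As a rough illustration of how the amino acid part of such HMM columns arises, the sketch below counts residues in those alignment columns where the query (taken here to be the first row) has a residue. It is a deliberate simplification: it uses raw relative frequencies without sequence weighting or pseudocounts, and it leaves out the insert/delete probabilities and the secondary structure annotation. The function names and the toy alignment are hypothetical.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def match_columns(alignment):
    """Indices of alignment columns in which the query (first row) has a residue."""
    return [i for i, c in enumerate(alignment[0]) if c != "-"]


def column_frequencies(alignment, col):
    """Relative amino acid frequencies in one column, ignoring gaps."""
    counts = Counter(seq[col] for seq in alignment if seq[col] in AMINO_ACIDS)
    total = sum(counts.values())
    return {aa: counts[aa] / total for aa in AMINO_ACIDS}


alignment = [
    "MK-TA",  # query
    "MR-TA",
    "MKGSA",
    "LK-T-",
]

cols = match_columns(alignment)  # the third column is skipped: the query has a gap there
profile = [column_frequencies(alignment, c) for c in cols]

print(cols)             # [0, 1, 3, 4]
print(profile[0]["M"])  # 0.75
```

Each entry of `profile` holds 20 numbers that sum to one. A full profile HMM column would in addition carry the four insert/delete probabilities described above.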
The query HMM is then compared with each HMM in the selected database. The database HMMs have been precalculated and also contain secondary structure information, either predicted by PSIPRED, or assigned from 3D structure by DSSP (31). The database search is performed with the HHsearch software for HMM–HMM comparison (28). Compared to methods that rely on pairwise comparison of simple sequence profiles, HHsearch gains sensitivity by using position-specific gap penalties. If the default setting Score secondary structure is active, a score for the secondary structure similarity is added to the total score. This increases the sensitivity for homologous proteins considerably (28). As a possible drawback, it may lead to marginally significant scores for structurally analogous, but non-homologous proteins.
The user can choose between local and global alignment mode. In global mode alignments extend in both directions up to the end of either the query or the database HMM. No penalties are charged for end gaps. In local mode, the highest-scoring local alignment is determined, which can start and end anywhere with respect to the compared HMMs. It is recommended to use the local alignment mode as a default setting since it has been shown in our benchmarks to be on average more sensitive in detecting remote relationships as well as being more robust in the estimation of statistical significance values. A global search might be appropriate when one expects the database entries to be (at least marginally) similar over their full length with the query sequence. In most cases it will be advisable to run a search in both modes to gain confidence in one's results.
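The difference between the two modes can be sketched with a small dynamic programming example. This is not the HHsearch algorithm: it assumes a precomputed matrix of column-pair scores and a single linear gap penalty instead of position-specific gap penalties, and the function name `alignment_score` is hypothetical. In local mode the score is floored at zero and the best cell anywhere is taken. In global mode end gaps are free, but the alignment must run to the end of the query or of the template.

```python
def alignment_score(scores, gap=1.0, mode="local"):
    """Best alignment score for a matrix of column-pair scores.

    mode="local":  highest-scoring local alignment (may start and end anywhere).
    mode="global": alignment must reach the end of either profile;
                   leading and trailing end gaps are not penalized.
    """
    if mode not in ("local", "global"):
        raise ValueError("mode must be 'local' or 'global'")
    n, m = len(scores), len(scores[0])
    # Row 0 and column 0 stay at zero, so leading end gaps are free in both modes.
    dp = [[0.0] * (m + 1) for _ in range(n + 1)]
    best_local = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cell = max(
                dp[i - 1][j - 1] + scores[i - 1][j - 1],  # pair two columns
                dp[i - 1][j] - gap,                       # gap in the template
                dp[i][j - 1] - gap,                       # gap in the query
            )
            if mode == "local":
                cell = max(cell, 0.0)  # a local alignment may restart anywhere
            dp[i][j] = cell
            best_local = max(best_local, cell)
    if mode == "local":
        return best_local
    # Free trailing end gaps: take the best cell in the last row or last column.
    return max(max(dp[n][1:]), max(dp[i][m] for i in range(1, n + 1)))


# Toy score matrix: a well-conserved core (columns 2 and 3) flanked by poor ends.
scores = [
    [-2, -1, -1, -1],
    [-1,  3, -1, -1],
    [-1, -1,  3, -1],
    [-1, -1, -1, -2],
]

print(alignment_score(scores, mode="local"))   # 6.0
print(alignment_score(scores, mode="global"))  # 4.0
```

In this toy case the local score is higher because the poorly matching flanks do not have to be included, which is in line with the recommendation to use local mode when only a core region is expected to be homologous.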
| EXAMPLE ANALYSIS |
|---|
As an example we analyze the sequence of Stage V sporulation protein T (SpoVT) from Bacillus subtilis that is known to regulate forespore-specific σG-dependent transcription (32) (annotated as transcriptional regulator in GenBank). Input parameters are set as shown in Figure 1. The results consist of two parts (Figure 2): a summary list with matching database sequences (templates) and a list of query–template alignments below.
The first column of the summary hit list has indices that link to the corresponding alignment further down. Next are the first 30 characters from the description of the HMM. The Prob column lists the probability in percent that the database match is a true positive, i.e. that it is homologous to the query sequence at least in some core part. This is the most relevant statistical measure of significance and can be interpreted quite literally. The true-positive probability is a conservative measure in the sense that it corrects for occasional high-scoring false positives. [The major cause of high-scoring false positives is corrupted alignments that contain non-homologous sequences which slipped in during the automated alignment-building with PSI-BLAST; see (28) for details.] The E-values in HHpred are defined in the same way as in BLAST or PSI-BLAST: the E-value for a sequence match is the expected number of false positives per database search with a score at least as good as the score of this sequence match. But it is important to note that, in contrast to the true-positive probability, HHpred E-values do not take into account the secondary structure similarity. Hits can therefore be significant by the true-positive probability criterion even when the E-value is ~1. The P-value is equal to the E-value divided by the number of HMMs in the searched database. The Score column gives the total score, which includes the score from the secondary structure comparison that is listed in the next column (SS). Cols contains the total number of matched columns in the query–template alignment, and the remaining columns describe the range of aligned residues in the query and template.
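The relation between the two significance values can be written out explicitly. In the expression below, the symbol N_DB for the number of HMMs in the searched database is our notation, and the numbers are purely illustrative (a database of 9000 HMMs and a hit with an E-value of 0.9), not taken from Figure 2.

```latex
P = \frac{E}{N_{\mathrm{DB}}},
\qquad \text{e.g.} \quad
E = 0.9,\; N_{\mathrm{DB}} = 9000
\;\Rightarrow\;
P = \frac{0.9}{9000} = 10^{-4}
```

The same P-value therefore corresponds to a larger E-value when a larger database is searched.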
From the summary list in Figure 2 it is evident that the SpoVT protein consists of two domains, one from residue 1 to ~51 and the other from residue 52 to 178. The N-terminal domain has two significant hits in SCOP at ranks 1 and 3. The first hit is the DNA-binding domain of transition-state regulator AbrB (33), a known close homolog of SpoVT. AbrB is a protein that is broadly represented in bacterial species and is involved in switching from exponential growth to stationary phase by integrating a great number of environmental factors. The second hit is to MazE, the antidote of the antidote-toxin addiction module MazEF (34). How can both AbrB and MazE be homologous to the query if they are not even classified into the same class, let alone fold or superfamily, by the SCOP database? Can the match with MazE be a false positive despite the rather significant 84% probability?
To elucidate this, we can look at the SpoVT–MazE alignment below. Five representative (i.e. maximally diverse) sequences from each of the two underlying alignments are shown for each HMM. (Their amino acids can be colored by biochemical properties by pressing one of the radio buttons entitled color alignments above the summary hit list.) First we note that the predicted secondary structure of SpoVT (sequence Q ss_pred) agrees very well with the actual secondary structure of MazE determined by the program DSSP (sequence T ss_dssp). Second, the hydrophobicity pattern in the aligned HMMs looks quite similar, which is especially evident with the coloring. Third, the HMM–HMM alignment contains a single gap in MazE at a position where some sequences in SpoVT also exhibit a gap. All in all, the alignment looks very much like what one would expect for a distant homologous relationship.
The conflict posed by the manifest homology between MazE and AbrB and their grossly different structural topology prompted us to undertake a thorough bioinformatic investigation of the AbrB-like superfamily and to redetermine the AbrB structure by NMR (M. Coles and S. Djuranovic et al., manuscript submitted, PDB ID: 1YFB). Indeed, we found that the published structure of AbrB (PDB ID: 1EKT) is incorrect and that the correct structure for AbrB places it in the same superfamily as MazE.
Hits 2 and 4–9 in the summary list are all proteins from the same SCOP fold d.110. Clicking on the SCOP family IDs opens a window with the corresponding entry in SCOP. Irrespective of the specific significance values, the fact that so many quite divergent members from the same two superfamilies d.110.2 (GAF-domain) and d.110.3 (PAS-domain) appear among the best hits strongly indicates that these are not high-scoring chance hits but true homologs. Whether the C-terminal domain looks more like a GAF or a PAS domain, we can now generate an approximate structural model that could help us to guide experiments to investigate what regulatory substrate this domain may actually bind (32).
By clicking Create CM Model one can select the templates to be used for comparative modelling. HHpred then returns a multiple alignment in PIR format with the query sequence and the selected templates. This alignment may be edited by the user and then fed to the MODELLER software (35), accessible via the MPI toolkit for users of HHpred.
A very useful feature is the possibility to view and manually improve the query alignment that was used to generate the query HMM; via the tab Edit Query Alignment the user can modify the query alignment that appears in a text field and start a new search with the modified alignment.
By pressing Realign at the top, the user may also realign the identified templates in the summary hit list with different parameters without the need to rerun the database search. One can change the alignment mode from global to local, set the number of representative sequences or use filters to narrow down the set of sequences allowed into the query and template alignments. To search another database with the same query HMM, the user can select Restart with Query HMM.
| CONCLUSION |
|---|
Whenever biologists cannot get satisfactory results from BLAST, PSI-BLAST or other database searches due to insignificant matches with proteins of known structure or function, they should consider using one of the recently developed sensitive structure prediction and homology detection servers (4–11) that are listed, for instance, on the LiveBench/CAFASP site at http://bioinfo.pl/Meta/servers.html. Among these servers, HHpred offers a high degree of flexibility and user-friendliness combined with excellent sensitivity. In contrast to methods based on profile–profile comparison, HHpred exploits the information that is contained in insert and delete probabilities by including them in a statistical framework. But the speed of HHpred is perhaps the most important advantage, considering that the best-ranked servers in CAFASP4 generally take hours or even days to return a prediction. The speed enables the user to tweak the performance and gain confidence in the results by modifying input alignments, search parameters or selected databases on a trial and error basis.
| ACKNOWLEDGEMENTS |
|---|
We would like to thank Michael Remmert for his valuable help in setting up the web interface. We thank Sergej Djuranovic for first pointing out the HHpred prediction for the SpoVT C-terminal domain. J.S. is indebted to Alex Diemand for assistance in preparing the screenshots. Many thanks to all users who helped to improve our software with their questions and feedback. Funding to pay the Open Access publication charges for this article was provided by the Max-Planck Society.
Conflict of interest statement. None declared.
| REFERENCES |
|---|
- Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J. (1990) Basic local alignment search tool. J. Mol. Biol., 215, 403–410.
- Pearson, W.R. (1991) Searching protein sequence libraries: comparison of the sensitivity and selectivity of the Smith-Waterman and FASTA algorithms. Genomics, 11, 635–650.
- Altschul, S.F., Madden, T.L., Schäffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res., 25, 3389–3402.
- Pietrokovski, S. (1996) Searching databases of conserved sequence regions by aligning protein multiple-alignments. Nucleic Acids Res., 24, 3836–3845.
- Rychlewski, L., Zhang, B., Godzik, A. (1998) Fold and function predictions for Mycoplasma genitalium proteins. Fold Des., 3, 229–238.
- Yona, G. and Levitt, M. (2002) Within the twilight zone: a sensitive profile–profile comparison tool based on information theory. J. Mol. Biol., 315, 1257–1275.
- Sadreyev, R.I. and Grishin, N.V. (2003) COMPASS: a tool for comparison of multiple protein alignments with assessment of statistical significance. J. Mol. Biol., 326, 317–336.
- von Öhsen, N., Sommer, I., Zimmer, R. (2003) Profile–profile alignment: a powerful tool for protein structure prediction. Pac. Symp. Biocomput., 252–263.
- Panchenko, A.R. (2003) Finding weak similarities between proteins by sequence profile comparison. Nucleic Acids Res., 31, 683–689.
- Fischer, D. (2003) 3D-SHOTGUN: a novel, cooperative, fold-recognition meta-predictor. Proteins, 51, 434–441.
- Ginalski, K., Elofsson, A., Fischer, D., Rychlewski, L. (2003) 3D-Jury: a simple approach to improve protein structure predictions. Bioinformatics, 19, 1015–1018.
- Venclovas, C. and Thelen, M.P. (2000) Structure-based predictions of Rad1, Rad9, Hus1 and Rad17 participation in sliding clamp and clamp-loading complexes. Nucleic Acids Res., 28, 2481–2493.
- Zheng, M., Ginalski, K., Rychlewski, L., Grishin, N. (2005) Protein domain of unknown function DUF1023 is an alpha/beta hydrolase. Proteins, 59, 1–6.
- Ginalski, K., Rychlewski, L., Baker, D., Grishin, N.V. (2004) Protein structure prediction for the male-specific region of the human Y chromosome. Proc. Natl Acad. Sci. USA, 101, 2305–2310.
- Rand, T.A., Ginalski, K., Grishin, N.V., Wang, X. (2004) Biochemical identification of Argonaute 2 as the sole protein required for RNA-induced silencing complex activity. Proc. Natl Acad. Sci. USA, 101, 14385–14389.
- Pawlak, S.D., Radlinska, M., Chmiel, A.A., Bujnicki, J.M., Skowronek, K.J. (2005) Inference of relationships in the twilight zone of homology using a combination of bioinformatics and site-directed mutagenesis: a case study of restriction endonucleases Bsp6I and PvuII. Nucleic Acids Res., 33, 661–671.
- Kihara, D. and Skolnick, J. (2004) Microbial genomes have over 72% structure assignment by the threading algorithm PROSPECTOR_Q. Proteins, 55, 464–473.
- Todd, A.E., Orengo, C.A., Thornton, J.M. (2001) Evolution of function in protein superfamilies, from a structural perspective. J. Mol. Biol., 307, 1113–1143.
- Pawlowski, K., Jaroszewski, L., Rychlewski, L., Godzik, A. (2000) Sensitive sequence comparison as protein function predictor. Pac. Symp. Biocomput., 42–53.
- Kinch, L. and Grishin, N. (2002) Evolution of protein structures and functions. Curr. Opin. Struct. Biol., 12, 400–408.
- Bourne, P.E., Addess, K.J., Bluhm, W.F., Chen, L., Deshpande, N., Feng, Z., Fleri, W., Green, R., Merino-Ott, J.C., Townsend-Merino, W., et al. (2004) The distribution and query systems of the RCSB protein data bank. Nucleic Acids Res., 32, D223–D225.
- Murzin, A.G., Brenner, S.E., Hubbard, T., Chothia, C. (1995) SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol., 247, 536–540.
- Sonnhammer, E.L., Eddy, S.R., Birney, E., Bateman, A., Durbin, R. (1998) Pfam: multiple sequence alignments and HMM-profiles of protein domains. Nucleic Acids Res., 26, 320–322.
- Ponting, C.P., Schultz, J., Milpetz, F., Bork, P. (1999) SMART: identification and annotation of domains from signalling and extracellular protein sequences. Nucleic Acids Res., 24, 229–232.
- Tatusov, R.L., Fedorova, N.D., Jackson, J.D., Jacobs, A.R., Kiryutin, B., Koonin, E.V., Krylov, D.M., Mazumder, R., Mekhedov, S.L., Nikolskaya, A.N., et al. (2003) The COG database: an updated version includes eukaryotes. BMC Bioinformatics, 4, 41.
- Marchler-Bauer, A., Panchenko, A., Shoemaker, B., Thiessen, P., Geer, L., Bryant, S. (2002) CDD: a database of conserved domain alignments with links to domain three-dimensional structure. Nucleic Acids Res., 30, 281–283.
- Fischer, D., Rychlewski, L., Dunbrack, R.L.J., Ortiz, A.R., Elofsson, A. (2003) CAFASP3: the third critical assessment of fully automated structure prediction methods. Proteins, 53, 503–516.
- Söding, J. (2005) Protein homology detection by HMM–HMM comparison. Bioinformatics, 21, 951–960.
- Eddy, S.R. (1998) Profile hidden Markov models. Bioinformatics, 14, 755–763.
- Jones, D.T. (1999) Protein secondary structure prediction based on position-specific scoring matrices. J. Mol. Biol., 292, 195–202.
- Kabsch, W. and Sander, C. (1983) Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers, 22, 2577–2637.
- Dong, T.C., Cutting, S.M., Lewis, R.J. (2004) DNA-binding studies on the Bacillus subtilis transcriptional regulator and AbrB homologue, SpoVT. FEMS Microbiol. Lett., 233, 247–256.
- O'Reilly, M. and Devine, K.M. (1997) Expression of AbrB, a transition state regulator from Bacillus subtilis, is growth phase dependent in a manner resembling that of Fis, the nucleoid binding protein from Escherichia coli. J. Bacteriol., 179, 522–529.
- Kamada, K., Hanaoka, F., Burley, S.K. (2003) Crystal structure of the MazE/MazF complex: molecular bases of antidote-toxin recognition. Mol. Cell, 11, 875–884.
- Sali, A. and Blundell, T.L. (1993) Comparative protein modelling by satisfaction of spatial restraints. J. Mol. Biol., 234, 779–815.
||||
![]() |
S. V. Novoselov, G. V. Kryukov, X.-M. Xu, B. A. Carlson, D. L. Hatfield, and V. N. Gladyshev Selenoprotein H Is a Nucleolar Thioredoxin-like Protein with a Unique Expression Pattern J. Biol. Chem., April 20, 2007; 282(16): 11960 - 11968. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Sukackaite, A. Lagunavicius, K. Stankevicius, C. Urbanke, C. Venclovas, and V. Siksnys Restriction endonuclease BpuJI specific for the 5'-CCCGT sequence is related to the archaeal Holliday junction resolvase family Nucleic Acids Res., April 1, 2007; 35(7): 2377 - 2389. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. D. Silva, L. Shen, V. Tcherepanov, C. Watson, and C. Upton Predicted function of the vaccinia virus G5R protein Bioinformatics, December 1, 2006; 22(23): 2846 - 2850. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Dlakic DUF283 domain of Dicer proteins has a double-stranded RNA-binding fold Bioinformatics, November 15, 2006; 22(22): 2711 - 2714. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Liao and M. Kielian Site-Directed Antibodies against the Stem Region Reveal Low pH-Induced Conformational Changes of the Semliki Forest Virus Fusion Protein J. Virol., October 1, 2006; 80(19): 9599 - 9607. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. D. Ellermeier and R. Losick Evidence for a novel protease governing regulated intramembrane proteolysis and resistance to antimicrobial peptides in Bacillus subtilis Genes & Dev., July 15, 2006; 20(14): 1911 - 1922. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Soding, M. Remmert, and A. Biegert HHrep: de novo protein repeat detection and the origin of TIM barrels. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W137 - W142. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Biegert, C. Mayer, M. Remmert, J. Soding, and A. N. Lupas The MPI Bioinformatics Toolkit for protein sequence analysis. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W335 - W339. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Soding, M. Remmert, A. Biegert, and A. N. Lupas HHsenser: exhaustive transitive profile search using HMM-HMM comparison. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W374 - W378. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Becker, V. Meyer, H. Madaoui, and R. Guerois Detection of a tandem BRCT in Nbs1 and Xrs2 with functional implications in the DNA damage response Bioinformatics, June 1, 2006; 22(11): 1289 - 1292. [Abstract] [Full Text] [PDF] |
||||