Skip Navigation

Nucleic Acids Research 2005 33(Web Server Issue):W289-W294; doi:10.1093/nar/gki390
This Article
Right arrow Abstract Freely available
Right arrow Print PDF (3766K) Freely available
Right arrow Screen PDF (427K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Simossis, V. A.
Right arrow Articles by Heringa, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Simossis, V. A.
Right arrow Articles by Heringa, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2005. Published by Oxford University Press. All rights reserved
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions{at}oupjournals.org


Article

PRALINE: a multiple sequence alignment toolbox that integrates homology-extended and secondary structure information

V. A. Simossis1 and J. Heringa1,2,*

1Bioinformatics Section, Faculty of Sciences, Vrije Universiteit De Boelelaan 1081A, 1081 HV, Amsterdam, The Netherlands 2Centre for Integrative Bioinformatics VU (IBIVU), Faculty of Sciences and Faculty of Earth & Life Sciences, Vrije Universiteit De Boelelaan 1081A, 1081 HV, Amsterdam, The Netherlands

*To whom correspondence should be addressed. Tel: +31 20 5987649; Fax: +31 20 5987653; Email: heringa{at}cs.vu.nl

Received February 11, 2005. Revised March 10, 2005. Accepted March 10, 2005.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 HOMOLOGY-EXTENDED MULTIPLE...
 INTEGRATION OF SECONDARY...
 PROFILE PRE-PROCESSING AND...
 THE NEW PRALINE SERVER
 SAMPLE OUTPUTS
 REFERENCES
 
PRofile ALIgNEment (PRALINE) is a fully customizable multiple sequence alignment application. In addition to a number of available alignment strategies, PRALINE can integrate information from database homology searches to generate a homology-extended multiple alignment. PRALINE also provides a choice of seven different secondary structure prediction programs that can be used individually or in combination as a consensus for integrating structural information into the alignment process. The program can be used through two separate interfaces: one has been designed to cater to more advanced needs of researchers in the field, and the other for standard construction of high confidence alignments. The web-based output is designed to facilitate the comprehensive visualization of the generated alignments by means of five default colour schemes based on: residue type, position conservation, position reliability, residue hydrophobicity and secondary structure, depending on the options set. A user can also define a custom colour scheme by selecting which colour will represent one or more amino acids in the alignment. All generated alignments are also made available in the PDF format for easy figure generation for publications. The grouping of sequences, on which the alignment is based, can also be visualized as a dendrogram. PRALINE is available at http://ibivu.cs.vu.nl/programs/pralinewww/.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 HOMOLOGY-EXTENDED MULTIPLE...
 INTEGRATION OF SECONDARY...
 PROFILE PRE-PROCESSING AND...
 THE NEW PRALINE SERVER
 SAMPLE OUTPUTS
 REFERENCES
 
The alignment of two or more sequences has become an essential sequence analysis technique in biological research. State-of-the-art multiple sequence alignment (MSA) methods, such as T-COFFEE (1) and MUSCLE (2), as well as other MSA methods available to date, perform alignments by only using the sequences in the given set. Although they use profile technology to match distant sequence sets, they do not use further homology information for the sequences that are available in current sequence databases. The benefit of using homologous information to align distant sequences has been shown in a number of studies (3), while the use of profiles to represent the additional homologous information has been shown to have many advantages (4,5). For this reason, the PRALINE toolbox (6,7) has been recently re-designed to include homology-extended multiple alignment (8), where as an initial step a profile for each sequence in a given set is built by using PSI-BLAST (9,10) and the progressive alignment then proceeds using the PSI-BLAST profiles instead of the given sequences. This approach has been previously applied with success to local pairwise alignment methods for homology modelling (1115) and is extended in PRALINE for global MSA. The recently updated MAFFT alignment tool (3,16) also uses homologous sequences to improve the alignment quality of distant sequences. However, in the MAFFT approach, the additional information is not incorporated in profiles for each of the query sequences, but homologous sequences are added to the original set and then aligned together using the various MAFFT alignment strategies. In the end, the homologous sequences are removed, leaving the aligned original sequences to form the final alignment.

In this paper we present the new web server for the PRALINE toolbox (6,7), where we have added two new alignment features: homology-extended multiple alignment (8) and the integration of predicted secondary structure information with iteration capabilities (V. A. Simossis and J. Heringa, submitted for publication). We show results for the cytochrome P450 HOMSTRAD (17) sequence set as an example to demonstrate how the homology-extended strategy and integrating secondary structure information, in combination with the visualization possibilities of the server output can lead to meaningful interpretations. Details about the PRALINE strategies and optimizations have been described previously (68,18).


    HOMOLOGY-EXTENDED MULTIPLE ALIGNMENT
 TOP
 ABSTRACT
 INTRODUCTION
 HOMOLOGY-EXTENDED MULTIPLE...
 INTEGRATION OF SECONDARY...
 PROFILE PRE-PROCESSING AND...
 THE NEW PRALINE SERVER
 SAMPLE OUTPUTS
 REFERENCES
 
The homology-extended MSA strategy enriches the information for each of the sequences in a given set by collecting putative homologous sequences. Each sequence is submitted as a query to PSI-BLAST over a database of choice [default: non-redundant (NR)]. The resulting PSI-BLAST alignments are then filtered for redundancy (100% sequence identity). In the event that no hits or only redundant hits are detected, the PSI-BLAST E-value threshold is automatically adjusted to a 10-fold less stringent setting (e.g. from 10 x 10–6 to 10 x 10–5) and the query is re-submitted. Once all the sequences to be aligned have at least one additional putative homologue, each PSI-BLAST alignment is converted into a profile and progressively aligned. A more detailed account of the PRALINE homology-extended multiple alignment algorithm and its performance is available in Ref. (8).

The advantage of this strategy is that it uses a much larger amount of position-specific information in the homology-extended profiles to score the alignment of two or more positions. As a result, the cases that benefit the most are those that evolution has changed so extensively (<30% identity) that the homology (common ancestry) between them is almost undetectable when compared directly (8).

In Table 1, the performance of the homology-extended alignment strategy on 254 HOMSTRAD (17) multiple alignment cases has been compared with the state-of-the-art methods T-COFFEEv2.03 and MUSCLEv3.51. The results show that for the strictest quality measure, column scoring, the overall improvement of the PRALINEPSI strategy is >3.5% relative to T-COFFEE and MUSCLE. Moreover, the improvement is >5% for the most distant and difficult test cases with sequences <30% sequence identity. In addition, PRALINEPSI has also been compared with the PRALINE standard global progressive alignment strategy (PRALINEBASIC) (6) and the PRALINEBASIC and PRALINEPSI strategies with integrated predicted [PSIPRED (19) and YASPIN (20)] secondary structure information, respectively, named as PRALINEBASIC-PSIPRED, PRALINEBASIC-YASPIN, PRALINEPSI-PSIPRED and PRALINEPSI-YASPIN. The latter secondary structure-guided alignment strategies of PRALINE are discussed in the next section. As shown in Table 1, the improvement in alignment quality achieved by homology-extended alignment (PRALINEPSI) as compared with other methods is significant in the more difficult alignment cases with average sequence identity percentages <60%. As would be expected, in the easier alignment cases that share >60% sequence identity, all the alignments are of comparable high quality.


View this table:
[in this window]
[in a new window]
 
Table 1 The quality assessment of 254 HOMSTRAD multiple alignment cases generated by different alignment strategies

 
When used as an option on the server, the homology-extended alignment strategy can further be customized by manually entering the desired iteration count, starting E-value cut-off and database to be searched by PSI-BLAST for the building of the homology-extended profiles (default: 3 iterations, starting with a cut-off of 10 x 10–6 on the NR database). The default parameters have been optimized by testing different settings on the HOMSTRAD database of structural alignments (8).


    INTEGRATION OF SECONDARY STRUCTURE
 TOP
 ABSTRACT
 INTRODUCTION
 HOMOLOGY-EXTENDED MULTIPLE...
 INTEGRATION OF SECONDARY...
 PROFILE PRE-PROCESSING AND...
 THE NEW PRALINE SERVER
 SAMPLE OUTPUTS
 REFERENCES
 
The rule-of-thumb that structure is more conserved than sequence is a well-documented fact (2124). As a result, many studies have shown that its use to guide sequence alignment improves alignment quality, especially between distant sequences (68,1115,25). To this end, we have devised a secondary structure scoring scheme for the alignment algorithm that combines exchange weights from four types of matrices: sequence or profile positions that have not been assigned the same secondary structure class are scored using a generic matrix (default: BLOSUM62), otherwise the positions that have matching helix, strand or coil assignments use the Lüthy (26) helix-, strand- and coil-specific matrices, respectively. The use of the secondary structure information significantly improves the PRALINEBASIC alignment quality and also boosts the PRALINEPSI alignments in the very difficult alignment cases <20% sequence identity (V. A. Simossis and J. Heringa, submitted for publication). In Table 1, it is clearly shown that the use of the secondary structure is beneficial for PRALINEBASIC (>4% improvement in cases with <60% identity), albeit not as significant as the improvements seen with PRALINEPSI.

The secondary structure integration options of PRALINE involve the use of any one of the seven prediction methods that are listed [PHDpsi (27), PROFsec (B. Rost, unpublished data), SSPRO 2.01 (28), YASPIN (20), PSIPRED (19), JNET (29) and PREDATOR (30,31)] to predict the secondary structure of the input sequences. In addition, the user can optionally select to also search the Protein Data Bank (PDB) to find 3D structure information for the input sequences and use the DSSP-derived secondary structure for the alignment. If both DSSP and a prediction method are selected, the predictions will only be integrated into the alignment for those sequences that do not have a PDB entry. Finally, in the same list as the seven prediction methods, an optimally segmented (24) or majority voting consensus can be alternatively used that currently combines the predictions of PROFsec, YASPIN and PSIPRED.


    PROFILE PRE-PROCESSING AND ITERATION
 TOP
 ABSTRACT
 INTRODUCTION
 HOMOLOGY-EXTENDED MULTIPLE...
 INTEGRATION OF SECONDARY...
 PROFILE PRE-PROCESSING AND...
 THE NEW PRALINE SERVER
 SAMPLE OUTPUTS
 REFERENCES
 
PRALINE provides a number of alignment strategies, such as profile pre-processing and iterative alignment optimization (6,7). The secondary structure-guided strategies using PHD, PROFsec, JNET and SSPRO, and the profile pre-processing strategies can be set to use consistency information to drive subsequent alignment rounds (iterations), each time drawing upon the theoretically higher quality information from the previous cycle. A detailed account of these strategies can be found in previously published work (6,7,18,25,32,33).


    THE NEW PRALINE SERVER
 TOP
 ABSTRACT
 INTRODUCTION
 HOMOLOGY-EXTENDED MULTIPLE...
 INTEGRATION OF SECONDARY...
 PROFILE PRE-PROCESSING AND...
 THE NEW PRALINE SERVER
 SAMPLE OUTPUTS
 REFERENCES
 
The PRALINE program is designed to use two or more input protein sequences in the FASTA format (34). The proposed maximum number of sequences that should be submitted to the server is set to 500 with length 2000, but this is mainly to limit the server load and is not the limit of the PRALINE program. In addition, owing to the long running time needed for strategies, such as PRALINEPSI, an optional email notification can be requested that is delivered upon a completion of the job and contains the link to the results and some statistics on the resulting alignment.

Similar to the previous version of the server (18), the gap opening and gap extension penalties and the amino acid substitution matrix can be manually set if needed [default: 12, 1 with BLOSUM62 (35)] for any of the PRALINE alignment strategies. The results page is automatically displayed once the job is complete and contains various sections depending on the options selected (Figure 1). In order to provide all generated files for the user, there is a link to download a compressed file with all the results in the job directory [Figure 1, (D)] and also individual links that allow the user to download specific files related to each sequence in the set (e.g. a PSI-BLAST profile or a secondary structure file) [Figure 1, (E)].



View larger version (36K):
[in this window]
[in a new window]
 
Figure 1 The PRALINE results page headers. A: The subtitle indicating which iteration results are presented on this page (only available if iteration >0 is selected). B: The time taken to run the job and statistics related to the visible alignment. C: The links to all other available iteration cycle results (only available if iteration >0 is selected). D: The link to download all job files as a compressed file. E: Links to tabulated specific file types. F: Links to iteration-specific output files (only available if iteration >0 is selected). G: The button that hides/reveals the profile pre-processing scores of the sequence set (only available if profile pre-processing is selected). H: The buttons that switch between colour schemes. I: The button that generates and opens a PDF version of the alignment in the visible colour scheme.

 
If the iteration number selected is >0, a subtitle informs the user which iteration cycle results are presented on the page [Figure 1, (A)]. The alignment from each iteration cycle is presented on a different page and is accessible by the corresponding links [Figure 1, (C)]. In addition, it informs the user of the total time taken for the process to complete, provides some statistics related to the visible alignment [Figure 1, (B)] and if the iterations were halted due to alignment convergence or limit cycle convergence and which iteration was the last (not applicable in the Figure 1 example). In the case of iteration-specific output, such as alignment of the iteration or secondary structure prediction, additional links are displayed [Figure 1, (F)].

If profile pre-processing is selected the user has the option of viewing the profile pre-processing scores for all pairwise alignments for deriving an optimum cut-off value [Figure 1, (G)].

Finally, depending on the selected parameters of the job, a series of buttons allows switching between the available colour-coded views [Figure 1, (H)] [details about the colour schemes are described in (18)]. At any point, the visible alignment can be converted into a PDF for printing or further manipulation [Figure 1, (I)]. The remaining of the results page consists of a short description of the visible colour scheme with a key to the colours, after which the colour-coded alignment follows (an example of the conservation and the secondary structure colour-coding is shown in Figure 2).



View larger version (84K):
[in this window]
[in a new window]
 
Figure 2 The PRALINEPSI P450 alignment using both PROFsec and DSSP secondary structure integration settings. The alignment has been sectioned to focus on the regions containing the conserved motifs of the cytochrome P450 enzymes (signified by the black bars above the rulers). (A) The oxygen-binding motif, (B) the ExxR motif and (C) the haem-binding motif. For each section, the top colour scheme shows conservation levels according to the colour key and the bottom one shows the secondary structure each residue belongs to (red: helix; green: strand; and clear: coil). The ruler on top of each alignment block shows which parts of the alignment are visible.

 

    SAMPLE OUTPUTS
 TOP
 ABSTRACT
 INTRODUCTION
 HOMOLOGY-EXTENDED MULTIPLE...
 INTEGRATION OF SECONDARY...
 PROFILE PRE-PROCESSING AND...
 THE NEW PRALINE SERVER
 SAMPLE OUTPUTS
 REFERENCES
 
Owing to the large number of possible outputs, we have provided a set of nine representative sample outputs for the P450 alignment on the server, each one representing a different combination of PRALINE strategies and settings. These examples are intended as supplementary material to this article and can be accessed through a dedicated link on the server pages or directly at http://ibivu.cs.vu.nl/programs/pralinewww/example/. They can also be used as an indication of CPU times needed by each of the PRALINE strategies.

In Figure 2, we illustrate sections of the PRALINEPSI alignment of the ‘p450’ HOMSTRAD sequence set (21% average sequence identity) using both DSSP (36) and PROFsec secondary structure integration settings. The colour schemes in the figure are for positional conservation and secondary structure. The secondary structure information for each sequence in this alignment has been derived by using DSSP, since all the sequences have a corresponding PDB structure.

The cytochrome P450 enzymes primarily act as oxidases in multi-component electron transport chains to break down naturally occurring toxins and mutagens. The structure is almost triangular, with the C-terminal part being mostly helical, while the N-terminal part is more ß-sheet rich. The signature motif of P450 enzymes is the haem-binding site, which is often represented as FxxGxxxCxG (Figure 2C). Other conserved regions include the motif A(A/G)x(E/D)T (Figure 2A) where the threonine (T) residue is part of the oxygen-binding site and an invariant ExxR sequence (Figure 2B). The ExxR and the C residue at the haem-binding site are the only completely conserved amino acids in P450s. These well-documented details are straightforwardly visualized in the PRALINE output conservation colour scheme, while the secondary structure view allows us to relate them in a structural context. As stated in the literature (37), the oxygen binding and ExxR motifs are each part of two distinct C-terminal helices, while the haem-binding motif flanks the N-terminal end of the last helix. Owing to space limitations the alignment has been sectioned to concentrate on these regions, but the full alignment can be viewed online in example 9 of the supplementary material.


    ACKNOWLEDGEMENTS
 
The authors would like to thank the Vrije Universiteit Amsterdam for funding this project. Special thanks are also due to Drs Franca Fraternali, Jens Kleinjung and John Romein for help with debugging and server testing. Funding to pay the Open Access publication charges of this article was provided by the Vrije Universiteit Amsterdam.

Conflict of interest statement. None declared.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 HOMOLOGY-EXTENDED MULTIPLE...
 INTEGRATION OF SECONDARY...
 PROFILE PRE-PROCESSING AND...
 THE NEW PRALINE SERVER
 SAMPLE OUTPUTS
 REFERENCES
 

  1. Notredame, C., Higgins, D.G., Heringa, J. (2000) T-Coffee: a novel method for fast and accurate multiple sequence alignment J. Mol. Biol., 302, 205–217[CrossRef][Web of Science][Medline] .

  2. Edgar, R.C. (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput Nucleic Acids Res., 32, 1792–1797[Abstract/Free Full Text] .

  3. Katoh, K., Kuma, K., Toh, H., Miyata, T. (2005) MAFFT version 5: improvement in accuracy of multiple sequence alignment Nucleic Acids Res., 33, 511–518[Abstract/Free Full Text] .

  4. Wang, G. and Dunbrack, R.L., Jr. (2004) Scoring profile-to-profile sequence alignments Protein Sci., 13, 1612–1626[CrossRef][Web of Science][Medline] .

  5. Edgar, R.C. and Sjolander, K. (2004) A comparison of scoring functions for protein sequence profile alignment Bioinformatics, 20, 1301–1308[Abstract/Free Full Text] .

  6. Heringa, J. (1999) Two strategies for sequence comparison: profile-preprocessed and secondary structure-induced multiple alignment Comput. Chem., 23, 341–364[CrossRef][Web of Science][Medline] .

  7. Heringa, J. (2002) Local weighting schemes for protein multiple sequence alignment Comput. Chem., 26, 459–477[CrossRef][Web of Science][Medline] .

  8. Simossis, V.A., Kleinjung, J., Heringa, J. (2005) Homology-extended sequence alignment Nucleic Acids Res., 33, 816–824[Abstract/Free Full Text] .

  9. Altschul, S.F. and Koonin, E.V. (1998) Iterated profile searches with PSI-BLAST—a tool for discovery in protein databases Trends Biochem. Sci., 23, 444–447[CrossRef][Web of Science][Medline] .

  10. Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D.J. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs Nucleic Acids Res., 25, 3389–3402[Abstract/Free Full Text] .

  11. Chung, R. and Yona, G. (2004) Protein family comparison using statistical models and predicted structural information BMC Bioinformatics, 5, 183[CrossRef][Medline] .

  12. Ginalski, K., Pas, J., Wyrwicz, L.S., von Grotthuss, M., Bujnicki, J.M., Rychlewski, L. (2003) ORFeus: detection of distant homology using sequence profiles and predicted secondary structure Nucleic Acids Res., 31, 3804–3807[Abstract/Free Full Text] .

  13. Ginalski, K., von Grotthuss, M., Grishin, N.V., Rychlewski, L. (2004) Detecting distant homology with Meta-BASIC Nucleic Acids Res., 32, W576–W581[Abstract/Free Full Text] .

  14. Soding, J. (2004) Protein homology detection by HMM–HMM comparison Bioinformatics, 21, 951–960 .

  15. von Ohsen, N., Sommer, I., Zimmer, R., Lengauer, T. (2004) Arby: automatic protein structure prediction using profile–profile alignment and confidence measures Bioinformatics, 20, 2228–2235[Abstract/Free Full Text] .

  16. Katoh, K., Misawa, K., Kuma, K., Miyata, T. (2002) MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform Nucleic Acids Res., 30, 3059–3066[Abstract/Free Full Text] .

  17. Mizuguchi, K., Deane, C.M., Blundell, T.L., Overington, J.P. (1998) HOMSTRAD: a database of protein structure alignments for homologous families Protein Sci., 7, 2469–2471[Web of Science][Medline] .

  18. Simossis, V.A. and Heringa, J. (2003) The PRALINE online server: optimising progressive multiple alignment on the web Comput. Biol. Chem., 27, 511–519[CrossRef][Web of Science][Medline] .

  19. Jones, D.T. (1999) Protein secondary structure prediction based on position-specific scoring matrices J. Mol. Biol., 292, 195–202[CrossRef][Web of Science][Medline] .

  20. Lin, K., Simossis, V.A., Taylor, W.R., Heringa, J. (2005) A simple and fast secondary structure prediction method using hidden neural networks Bioinformatics, 21, 152–159[Abstract/Free Full Text] .

  21. Chothia, C. and Lesk, A.M. (1986) The relation between the divergence of sequence and structure in proteins EMBO J., 5, 823–826[Web of Science][Medline] .

  22. Rost, B. (1999) Twilight zone of protein sequence alignments Protein Eng., 12, 85–94[Abstract/Free Full Text] .

  23. Sander, C. and Schneider, R. (1991) Database of homology-derived protein structures and the structural meaning of sequence alignment Proteins, 9, 56–68[CrossRef][Web of Science][Medline] .

  24. Simossis, V.A. and Heringa, J. (2004) The influence of gapped positions in multiple sequence alignments on secondary structure prediction methods Comput. Biol. Chem., 28, 351–366[CrossRef][Web of Science][Medline] .

  25. Heringa, J. (2000) Computational methods for protein secondary structure prediction using multiple sequence alignments Curr. Protein Pept. Sci., 1, 273–301[CrossRef][Medline] .

  26. Lüthy, R., McLachlan, A.D., Eisenberg, D. (1991) Secondary structure-based profiles: use of structure-conserving scoring tables in searching protein sequence databases for structural similarities Proteins, 10, 229–239[CrossRef][Web of Science][Medline] .

  27. Przybylski, D. and Rost, B. (2002) Alignments grow, secondary structure prediction improves Proteins, 46, 197–205[CrossRef][Web of Science][Medline] .

  28. Pollastri, G., Przybylski, D., Rost, B., Baldi, P. (2002) Improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles Proteins, 47, 228–235[CrossRef][Web of Science][Medline] .

  29. Cuff, J.A. and Barton, G.J. (2000) Application of multiple sequence alignment profiles to improve protein secondary structure prediction Proteins, 40, 502–511[CrossRef][Web of Science][Medline] .

  30. Frishman, D. and Argos, P. (1996) Incorporation of non-local interactions in protein secondary structure prediction from the amino acid sequence Protein Eng., 9, 133–142[Abstract/Free Full Text] .

  31. Frishman, D. and Argos, P. (1997) Seventy-five percent accuracy in protein secondary structure prediction Proteins, 27, 329–335[CrossRef][Web of Science][Medline] .

  32. Simossis, V.A. and Heringa, J. (2004) Integrating protein secondary structure prediction and multiple sequence alignment Curr. Protein Pept. Sci., 5, 249–266[CrossRef][Web of Science][Medline] .

  33. Simossis, V.A., Kleinjung, J., Heringa, J. (2003) An overview of multiple sequence alignment In Baxevanis, A.D. (Ed.). Current Protocols in Bioinformatics., NY John Wiley pp. 3.7.1–3.7.25 .

  34. Pearson, W.R. (2000) Flexible sequence similarity searching with the FASTA3 program package Methods Mol. Biol., 132, 185–219[Medline] .

  35. Dayhoff, M.O., Barker, W.C., Hunt, L.T. (1983) Establishing homologies in protein sequences Methods Enzymol., 91, 524–545[Web of Science][Medline] .

  36. Kabsch, W. and Sander, C. (1983) Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features Biopolymers, 22, 2577–2637[CrossRef][Web of Science][Medline] .

  37. In Ortiz de Montellano, P.R. (Ed.). Cytochrome P450: Structure, Mechanism, and Biochemistry, 2nd edn, (1995) NY Plenum Press .


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
C. Kemena and C. Notredame
Upcoming challenges for multiple sequence alignment methods in the high-throughput era
Bioinformatics, October 1, 2009; 25(19): 2455 - 2465.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
M. R. Aniba, S. Siguenza, A. Friedrich, F. Plewniak, O. Poch, A. Marchler-Bauer, and J. D. Thompson
Knowledge-based expert systems and a proof-of-concept case study for multiple sequence alignment construction and analysis
Brief Bioinform, January 1, 2009; 10(1): 11 - 23.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Pei, M. Tang, and N. V. Grishin
PROMALS3D web server for accurate multiple protein sequence and structure alignments
Nucleic Acids Res., July 1, 2008; 36(suppl_2): W30 - W34.
[Abstract] [Full Text] [PDF]


Home page
JCBHome page
W. Fei, G. Shui, B. Gaeta, X. Du, L. Kuerschner, P. Li, A. J. Brown, M. R. Wenk, R. G. Parton, and H. Yang
Fld1p, a functional homologue of human seipin, regulates the size of lipid droplets in yeast
J. Cell Biol., February 6, 2008; 180(3): 473 - 482.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. N.I. Pang, K. Lin, M. A. Wouters, J. Heringa, and R. A. George
Identifying foldable regions in protein sequence from the hydrophobic signal
Nucleic Acids Res., February 2, 2008; 36(2): 578 - 588.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Pei, B.-H. Kim, M. Tang, and N. V. Grishin
PROMALS web server for accurate multiple protein sequence alignments
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W649 - W652.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
K. A. Feenstra, W. Pirovano, K. Krab, and J. Heringa
Sequence harmony: detecting functional specificity from alignments
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W495 - W498.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Pei and N. V. Grishin
PROMALS: towards accurate multiple sequence alignments of distantly related proteins
Bioinformatics, April 1, 2007; 23(7): 802 - 808.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
W. Pirovano, K. A. Feenstra, and J. Heringa
Sequence comparison by sequence harmony identifies subtype-specific functional sites
Nucleic Acids Res., December 2, 2006; 34(22): 6540 - 6548.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. Zhang, C. J. Martyniuk, and V. L. Trudeau
SANTA domain: a novel conserved protein module in Eukaryota with potential involvement in chromatin regulation
Bioinformatics, October 15, 2006; 22(20): 2459 - 2462.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
F. Armougom, S. Moretti, O. Poirot, S. Audic, P. Dumas, B. Schaeli, V. Keduas, and C. Notredame
Expresso: automatic incorporation of structural information in multiple sequence alignments using 3D-Coffee.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W604 - W608.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
K. Mukherjee and T. R. Burglin
MEKHLA, a Novel Domain with Similarity to PAS Domains, Is Fused to Plant Homeodomain-Leucine Zipper III Proteins.
Plant Physiology, April 1, 2006; 140(4): 1142 - 1150.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (3766K) Freely available
Right arrow Screen PDF (427K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Simossis, V. A.
Right arrow Articles by Heringa, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Simossis, V. A.
Right arrow Articles by Heringa, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?