Published online 20 January 2005
Article |
MAFFT version 5: improvement in accuracy of multiple sequence alignment
1 Bioinformatics Center, Institute for Chemical Research, Kyoto University Uji, Kyoto 611-0011, Japan 2 Biohistory Research Hall Takatsuki, Osaka 569-1125, Japan 3 Department of Electrical Engineering and Bioscience, Science and Engineering, Waseda University Tokyo 169-8555, Japan 4 Department of Biophysics, Graduate School of Science, Kyoto University Kyoto 606-8502, Japan
*To whom correspondence should be addressed. Tel: +81 774 38 3119; Fax: +81 774 38 3059; Email: kkatoh{at}kuicr.kyoto-u.ac.jp
Received October 14, 2004. Revised November 16, 2004. Accepted December 29, 2004.
| ABSTRACT |
|---|
|
|
|---|
The accuracy of multiple sequence alignment program MAFFT has been improved. The new version (5.3) of MAFFT offers new iterative refinement options, H-INS-i, F-INS-i and G-INS-i, in which pairwise alignment information are incorporated into objective function. These new options of MAFFT showed higher accuracy than currently available methods including TCoffee version 2 and CLUSTAL W in benchmark tests consisting of alignments of >50 sequences. Like the previously available options, the new options of MAFFT can handle hundreds of sequences on a standard desktop computer. We also examined the effect of the number of homologues included in an alignment. For a multiple alignment consisting of
8 sequences with low similarity, the accuracy was improved (210 percentage points) when the sequences were aligned together with dozens of their close homologues (E-value < 1051020) collected from a database. Such improvement was generally observed for most methods, but remarkably large for the new options of MAFFT proposed here. Thus, we made a Ruby script, mafftE.rb, which aligns the input sequences together with their close homologues collected from SwissProt using NCBI-BLAST. | INTRODUCTION |
|---|
|
|
|---|
Multiple alignment is an important tool for computational analysis of nucleotide or amino acid sequences. MAFFT (1) is one of the fastest methods among the currently available multiple alignment tools (2), and used in several projects, such as Pfam (3), ASTRAL (4) and MEROPS (5). In MAFFT, an initial alignment is constructed by the progressive method (6,7) and then refined by the iterative refinement method (8,9). The outline of procedure of the previous version of MAFFT is briefly explained below and in the lower part of Table 1. A user can select an appropriate strategy from the fastest one (FFT-NS-1) to the most accurate one (FFT-NS-i).
|
Progressive alignment (1). A rough distance between every pair of input sequences is rapidly calculated based on the number of 6-tuples shared by the two sequences (1,10,11). A guide tree is constructed from the distances with the UPGMA method (12) with modified linkage (see supplementary material on our web page, http://www.biophys.kyoto-u.ac.jp/~katoh/programs/align/mafft/suppl/). Input sequences are progressively aligned (6,7) following the branching order of the guide tree. This procedure is referred to as FFT-NS-1.
Progressive alignment (2). The initial distance matrix is less reliable than that based on all pairwise alignments. We can obtain more reliable distance matrix by using the FFT-NS-1 alignment (1,11,13). Progressive alignment is re-performed based on the new tree calculated from the new distance matrix. This method is referred to as FFT-NS-2.
Iterative refinement. The FFT-NS-2 alignment is further improved by the iterative refinement method (8,9) that optimizes the weighted sum-of-pairs (WSP) score proposed by Gotoh (14), using an approximate group-to-group alignment algorithm (1) and the tree-dependent restricted partitioning technique (15). This procedure is referred to as FFT-NS-i.
For the progressive alignment processes, a fast Fourier transform (FFT) approximation (1) is used in the FFT-NS-2, FFT-NS-1 and FFT-NS-i options (collectively denoted as FFT-NS-[12i] hereafter). When the sequences under consideration are highly conserved, these options require CPU times effectively proportional to average sequence length L for amino acid or nucleotide sequence alignments consisting homologues of a single gene. Note that it is not L log L, although FFT takes L log L operations. This is because CPU time required by the FFT phase is much smaller than that by the dynamic programming (DP) phase, unless extremely long sequences like genomic sequences are input. The slopes of the FFT-NS-[2i] lines are indeed near to 1 in Figure 1A.
|
MAFFT also offers three options NW-NS-[12i] with full DP (16) for the same part. In contrast to the case of FFT-NS-[12i], the time complexity of full DP options is proportional to L 2, independently to the similarity among input sequences. In most cases, an FFT-based strategy results in the same alignment as that by the corresponding option with full DP. The difference in accuracy was not statistically significant between the alignments generated with and without the FFT approximation in all cases we tested.
The parameters of MAFFT, such as gap penalties and scoring matrix, have not been minutely investigated because of limitation in the number of reference alignments available when it had been developed. Recently, large reference alignment databases, such as HOMSTRAD (17), SABmark (18) and PREFAB (11), have been independently established. They are valuable resources to select good parameters for a sequence alignment program as well as to evaluate the performance of a program.
CLUSTAL W is a widely utilized program of multiple sequence alignment. Other algorithms have tried to improve on the accuracy of CLUSTAL W. Gotoh (19) developed PRRP/N and found significant improvement by the iterative refinement method that uses the WSP score as objective function. TCoffee (20) employs progressive strategy but achieved the highest accuracy (1,21,22). This is because TCoffee constructs a multiple sequence alignment by combining information derived from heterogeneous sources, such as a global multiple alignment and local alignments. Although this ability is of great value, TCoffee requires a large CPU time proportional to N 3. Thus, it is hard to apply TCoffee to a large alignment consisting of dozens of sequences. The accuracy of TCoffee has recently been further improved in version 2 when compared with version 1. Recently, Edgar (11,23) implemented progressive and iterative refinement alignment strategies in MUSCLE. Although the algorithms of major options of MUSCLE seem similar to NW-NS-[2i] explained above, MUSCLE has an original option, MUSCLE-fast, which is faster and less accurate than other options of MUSCLE in most cases. Do et al. (manuscript submitted) proposed a new method, PROBCONS, whose accuracy is comparable or slightly higher than TCoffee version 2.
In MAFFT version 5.3, (i) parameters were optimized based on a number of reference alignments and (ii) three new strategies (G-INS-i, H-INS-i and F-INS-i; collectively denoted as [GHF]-INS-i hereafter) were introduced. In an attempt to improve alignment accuracy, [GHF]-INS-i uses a TCoffee-like approach (20) in incorporating all pairwise alignment information into an objective function. Iterative refinement for the objective function can be performed in reasonable time, as [GHF] INS-i was designed to have low computational complexity.
It was suggested that the accuracy of a multiple alignment of distantly related sequences is improved if they are aligned together with a number of their homologues (24), because the information from many sequences is expected to reduce the noise (19). However, this possibility has not been quantitatively examined for recently developed methods. We also evaluated the effect of the number of homologues involved in an alignment.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Introducing pairwise alignment information
MAFFT version 5.3 has three new iterative refinement options, G-INS-i, H-INS-i and F-INS-i. In these three strategies, all pairwise alignment information are included when constructing a multiple alignment. Three different algorithms for all pairwise alignment were tested; G-INS-i uses global alignment with an FFT approximation (1), whereas the other two options ([HF]-INS-i) incorporate local alignment information. To obtain local alignment information, H-INS-i uses the fasta34 program of the FASTA version 3.4t24 (25). F-INS-i uses a modified fasta34 program, in which the SmithWaterman optimization is disabled. The [HF]-INS-i options do not use FFT.
The outlines of the algorithms of [GHF]-INS-i are as follows:
- An initial distance matrix is constructed from the pairwise scores, instead of shared 6-tuples, using the equation shown in (1) and a guide tree is build with the UPGMA method with modified linkage. Unlike FFT-NS-[2i] explained above, the re-construction of guide tree is not performed, because it did not provide significant improvement in accuracy in our tests.
- Each pairwise alignment is divided into gap-free segments and number n is assigned to each segment. The information of these segments is stored in a set of arrays, score [S(s, t, n)], which represents the alignment score of the nth gap-free segment between sequences s and t, length [L(s, t, n)] of the aligned segment (s, t, n), position [P(s, t, n)] in each of sequences s and t, and the importance value [E(s, t, n)] that is calculated, as described below, from the score of the segment and how frequently the residues are involved in gap-free segments. We denote (s, t, p, q)
P (s, t, n) if the pth site of sequence s is aligned to the qth site of sequence t in aligned segment (s, t, n).
- The frequency value f(s, p), which represents how frequently the pth site of sequence s is involved in gap-free segments, is calculated as
where wt is the weighting factor [for definition see (7)] for sequence t. The importance value E(s, t, n) for aligned segment is calculated as
We define the importance matrix I(s, t, p, q) between the pth site of sequence s and the qth site of sequence t as

- An alignment of a subset of given sequences, which is generated during the procedures of progressive and iterative refinement methods, is referred to as group. To align groups i and j, matrix H(groupi, groupj, p, q) is constructed as
where A(s, p) is the pth amino acid residue on sequence s.
is a score between a pair of amino acids a and b. The score matrices examined in this study are described in the parameter optimization section. wst is a weighting factor between sequences s and t. A weighting scheme proposed by Thompson et al. (7) is used in progressive alignment stage, and a weighting scheme proposed by Gotoh (14) is used in iterative refinement stage. WI is a weighting factor, which was set at 2.7 in the current version, for the importance value. The alignment between two groups is computed by applying the DP algorithm (16) to matrix H(,,,) at each step of progressive alignment. The alignment produced by this procedure is referred to as [GHF]-INS-1. Of these three progressive strategies, we evaluated the H-INS-1 option only.
- The [GHF]-INS-1 alignment is improved by the iterative refinement method ([GHF]-INS-i), which optimizes an objective score defined as the summation of the WSP score (14) and the importance values defined above. This score is referred to as WSP+I in this paper.
Performance evaluation
An up-to-date version of HOM39 (26) was extracted from the July 2004 release of HOMSTRAD (17) (http://www-cryst.bioc.cam.ac.uk/~homstrad/) based on two criteria used in (26). HOMSTRAD is a curated database of structural alignments of homologous proteins whose coordinates are available. Each entry of HOMSTRAD, a structural alignment, is extended by introducing homologous sequences with CLUSTAL W. Only the alignments based on structural superposition were used in this study. Out of 1033 entries of the HOMSTRAD, 55 entries (19.7% pairwise identity, 7.69 sequences and 159 aligned residues on average) were extracted for the evaluation of alignment accuracy. This dataset is referred to as HOM+0 in this paper.
We made the HOM+20, HOM+50 and HOM+100 datasets by extending each entry of HOM+0 in a way similar to PREFAB (11). Amino acid sequences similar (E-value < 1010) to each member of an entry were collected from the SwissProt database (rel. 43) using BLAST (27) and added to the entry. If more than n (=20, 50 or 100) sequences were collected, we randomly selected n sequences to be added. Only amino acid positions of the sequences that were reported to show significant similarity by BLAST were added. The accuracy of an alignment was measured by the fraction of columns aligned identically to the reference alignment. When we evaluated the accuracy, the n sequences added to the HOM+n were removed.
SABmark (18) version 1.65 was downloaded from http://bioinformatics.vub.ac.be/databases/databases.html. SABmark is designed to assess the performance of protein sequence alignment algorithms and consists of two parts, the Twilight Zone set (with very low similarity; referred to as the TWI set in this paper) and the Superfamily set (with low similarity; referred to as SUP). The TWI set was mainly used in the present study to examine the abilities of algorithms for aligning distantly related sequences. The TWI set was also extended in the same manner as described above. These are hereafter referred to as TWI+n (n = 0, 20 and 50). The accuracy value f D, the ratio of the number of correctly aligned residues divided by the length of reference alignment, was calculated using the score.pl script provided by the authors of SABmark. The accuracies were separately considered for two subsets. One subset (denoted as TWIf+n) includes only the sequence pairs classified to the same family by Van Walle et al. (18), and the other subset (denoted as TWIs+n) consists of the sequence pairs classified not to the same family but to the same superfamily.
The PREFAB (11) version 3 dataset was downloaded from http://www.drive5.com/muscle/prefab.htm. The accuracy was measured using Q, the number of correctly aligned residue pairs divided by the number of residue pairs in the reference alignment (11).
Parameter optimization
Gap-opening penalty Sop and the offset value Sa [for definitions see (1)] were determined to provide the highest accuracy of the FFT-NS-2 strategy for the TWIf+0 set using golden section search (28). We examined five scoring matrices (BLOSUM45, 62, 80, JTT100 and JTT200) and selected the matrix providing the highest accuracy.
Availability
MAFFT was written in C, and runs on Linux, Mac OS X and the Cygwin environment on Windows. The MAFFT package is available at http://www.biophys.kyoto-u.ac.jp/~katoh/programs/align/mafft/. The fasta34 program of the FASTA package (25) must be installed to run the H-INS-i option. The F-INS-i option requires a one-line modification of the source code of the fasta34 program (see supplementary material on our web page, http://www.biophys.kyoto-u.ac.jp/~katoh/programs/align/mafft/suppl/). The G-INS-i option requires no additional package. The performances were measured on a 2.8 GHz Xeon processor with 1 GB of RAM running SuSE Linux 9.0. The gcc version 3.3.1 compiler was used with the -O3 optimization option.
We also made a Ruby script mafftE.rb that aligns input sequences together with their homologues automatically collected from local database or SwissProt using NCBI-BLAST.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
We evaluated the performance of MAFFT version 5.3 using the HOM, TWIf, TWIs and PREFAB datasets, and compared it with those of several methods developed by other groups, including TCoffee version 2.02 (20), CLUSTAL W version 1.83 (7), PROBCONS version 1.06 and MUSCLE version 3.41 (11,23). As for MUSCLE, the most accurate option was used, which probably corresponds to NW-NS-i of MAFFT (see Table 1).
Improvement in accuracy attained by new parameter set
The golden section search provided the optimal parameters of 1.53 for Sop and 0.123 for Sa when the FFT-NS-2 option was applied to the TWIf+0. Of the five matrices examined, BLOSUM62 showed the highest accuracy. The improvement in accuracy from the previous parameter set (JTT200, Sop = 2.40, Sa = 0.06) was
5 percentage points. Although the new parameter set was not optimal for other options or for other datasets (HOM, TWIs and PREFAB), the new parameter set provided generally higher accuracy than the previous parameter set by
5 percentage points, as shown in Supplementary Material. Accuracy values for various combinations of Sop and Sa are also shown in Supplementary Material. Note that comparisons between MAFFT and other methods based on other than TWIf+0 are important, because FFT-NS-2 option of MAFFT was tuned for TWIf+0. The results of the comparisons are described below.
Improvement in accuracy by introducing pairwise alignment information
Tables 2, 3 and 4 show the results of benchmark tests using the HOM, TWIf, TWIs and PREFAB datasets, respectively. The difference in accuracy between [GHF]-INS-i and FFT-NS-i was at most 6%, which corresponded to the improvement by introducing the strategy presented in this paper. As noted in Materials and Methods, the WI value was set at 2.7 in the calculations shown in Tables 2, 3 and 4. However, the optimal WI value that provided the highest accuracy differed depending on the number of homologues, sequence similarity and other conditions. The accuracy values on various WI value are shown in Supplementary Material. For new strategies presented here, further investigation is necessary to determine the optimal parameters including the WI values and the parameters for pairwise alignments.
|
|
|
The difference in accuracy among newly proposed strategies, G-INS-i, H-INS-i and F-INS-i was small. A slight tendency was observed that G-INS-i is suitable for alignments consisting of large number of sequences, whereas F-INS-i and H-INS-i are suitable for alignments consisting of small number of sequences. In addition, G-INS-i is expected to be not suitable for alignments with large gaps, as it uses all pairwise global alignments. Although G-INS-i uses an FFT approximation for the all pairwise alignment process, its accuracy was virtually identical to that of a strategy in which full DP was performed for this process (data not shown).
Reducing CPU time
In the [GHF]-INS-i strategies, all pairwise alignment and iterative refinement processes are time consuming. Figure 1 AD shows the CPU time as the function of the sizes of sequence data generated by the ROSE program (29). The CPU time consumed by all pairwise alignment process can be reduced by approximations, such as banded alignment implemented in FASTA (F-INS-i uses it) or the FFT approximation (G-INS-i uses it), although the latter is effective only for highly conserved sequences.
The CPU time for the iterative refinement process can be slightly reduced by always accepting the new alignment produced by group-to-group re-alignment without calculation of WSP+I score. Note that the WSP+I score may become worse by re-alignment, as MAFFT employs an approximate DP algorithm for group-to-group alignment described previously (1). According to our test, this rough iterative strategy gave less accurate results for alignments of few sequences, such as HOM+0, TWIf+0 and TWIs+0. However, this strategy performed well for alignments composed of a large number of sequences, such as HOM+50, +100, TWIf+50 and TWIs+50. Its accuracy was sometimes rather higher than that of G-INS-i (data not shown).
FFT-NS-i is the most accurate option in the previous version. The accuracy has been improved by introducing the new options. However, when the highest accuracy is not required for the alignment, the FFT-NS-i option may be still useful, considering the balance between computational time and accuracy.
Comparison with other methods
For alignments involving dozens of sequences (HOM+n, TWIs+n and PREFAB; n
0), new strategies presented here ([GHF]-INS-i) outperformed other methods, including TCoffee and PROBCONS, in accuracy as shown in Tables 2, 3 and 4. According to the PREFAB test (Table 4), the difference was large when the input sequences were distantly related.
Results of the HOM+0 and TWI+0 tests are shown in Tables 2 and 3, respectively. In these cases, the number of input sequences is small (
8). We carried out two more tests based on BAliBASE (30) and the SUP set of SABmark, for which close homologues have not been added. PROBCONS outperformed [GHF]-INS-i by 13 percentage points for BAliBASE, whereas [GHF]-INS-i options outperformed PROBCONS by 13 percentage points for the SUP set of SABmark. In most of cases of such small alignments, TCoffee, PROBCONS and three new strategies ([GHF]-INS-i) of MAFFT were small in accuracy. However, for the HOM+0 test, the accuracy of PROBCONS was remarkably high as shown in Table 2; the difference from H-INS-i was 5% and statistically significant according to the Wilcoxon test.
The CPU times of [GHF]-INS-i were several times smaller than those of TCoffee and PROBCONS.
Effect of the number of homologues involved in an alignment
As expected, the accuracy of a multiple alignment tended to increase with increasing number of homologues involved in the alignment. Although observed more or less for most methods, this tendency was remarkable for MAFFT. The improvement by adding close homologues became small, when the position-specific gap penalty (1) was disabled (data not shown). Thus, this technique probably contributed to the improvement. The position-specific gap penalty (1,7,9,23) was motivated by a consideration as follows. In a group-to-group alignment process, each group of sequences may contain gaps. If the gap is newly introduced at the same position as one of such existing gaps, the new gap should be less penalized, because the new and existing gaps are probably resulting from a single insertion or deletion event.
According to the HOM tests (Table 2), the improvements by adding 50100 homologues were at most
10 percentage points and statistically significant according to both the Wilcoxon test and the Friedman test. The improvement was comparable with that by introducing the structural information of one or two proteins (26), and rather larger than that by modification of algorithm (from FFT-NS-i to [GHF]-INS-i) presented in this paper. Similar results were obtained for the TWIf+n and TWIs+n datasets shown in Table 3. The accuracy values under various conditions (n and E-values) are shown in Supplementary Material. The maximum accuracy was obtained in the case of n > 50 or n > 100 and threshold of E-value = 1051020.
These results suggest the importance of including a number of homologues for obtaining an accurate sequence alignment. An ability to handle a large number of sequences is therefore important for a multiple sequence alignment program.
Perspectives
There are several issues for further improvement in accuracy and speed. (i) TCoffee has a merit that it can combine alignments based on different principles. O'Sullivan et al. (26) reported that the accuracy of a multiple alignment is improved when structural information of many proteins is included. We are planning to enable MAFFT to include structural information. (ii) There might be a more efficient way to collect and select the homologues from databases for improving the accuracy of an alignment. For example, we should exclude very close homologues of a sequence already involved in the alignment, because such close homologues are expected to bring little information.
| SUPPLEMENTARY MATERIAL |
|---|
|
|
|---|
Supplementary Material is available at NAR Online.
| ACKNOWLEDGEMENTS |
|---|
This work was supported in part by a Grant-in-Aid for Creative Scientific Research and a Grant for the Biodiversity Research of the 21st Century COE (A14) from the Ministry of Education, Culture, Sports, Science and Technology of Japan. One of the authors (H.T.) is financially supported by PDBj (BIRD). Funding to pay the Open Access publication charges for this article was provided by the Ministry of Education, Culture, Sports, Science and Technology of Japan.
| REFERENCES |
|---|
|
|
|---|
- Katoh, K., Misawa, K., Kuma, K., Miyata, T. (2002) MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform Nucleic Acids Res., 30, 30593066
[Abstract/Free Full Text] . - Grasso, C. and Lee, C. (2004) Combining partial order alignment and progressive multiple sequence alignment increases alignment speed and scalability to very large alignment problems Bioinformatics, 20, 15461556
[Abstract/Free Full Text] . - Bateman, A., Coin, L., Durbin, R., Finn, R.D., Hollich, V., Griffiths-Jones, S., Khanna, A., Marshall, M., Moxon, S., Sonnhammer, E.L., Studholme, D.J., Yeats, C., Eddy, S.R. (2004) The pfam protein families database Nucleic Acids Res., 32, D138D141
[Abstract/Free Full Text] . - Chandonia, J.M., Hon, G., Walker, N.S., Lo Conte, L., Koehl, P., Levitt, M., Brenner, S.E. (2004) The ASTRAL compendium in 2004 Nucleic Acids Res., 32, D189D192
[Abstract/Free Full Text] . - Rawlings, N.D., Tolle, D.P., Barrett, A.J. (2004) MEROPS: the peptidase database Nucleic Acids Res., 32, D160D164
[Abstract/Free Full Text] . - Feng, D.F. and Doolittle, R.F. (1987) Progressive sequence alignment as a prerequisite to correct phylogenetic trees J. Mol. Evol., 25, 351360[Web of Science][Medline] .
- Thompson, J.D., Higgins, D.G., Gibson, T.J. (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice Nucleic Acids Res., 22, 46734680
[Abstract/Free Full Text] . - Berger, M.P. and Munson, P.J. (1991) A novel randomized iterative strategy for aligning multiple protein sequences Comput. Appl. Biosci., 7, 479484
[Abstract/Free Full Text] . - Gotoh, O. (1993) Optimal alignment between groups of sequences and its application to multiple sequence alignment Comput. Appl. Biosci., 9, 361370
[Abstract/Free Full Text] . - Jones, D.T., Taylor, W.R., Thornton, J.M. (1992) The rapid generation of mutation data matrices from protein sequences Comput. Appl. Biosci., 8, 275282
[Abstract/Free Full Text] . - Edgar, R.C. (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput Nucleic Acids Res., 32, 17921797
[Abstract/Free Full Text] . - Sokal, R.R. and Michener, C.D. (1958) A statistical method for evaluating systematic relationships Univ. Kans. Scientif. Bull., 28, 14091438 .
- Tateno, Y., Ikeo, K., Imanishi, T., Watanabe, H., Endo, T., Yamaguchi, Y., Suzuki, Y., Takahashi, K., Tsunoyama, K., Kawai, M., Kawanishi, Y., Naitou, K., Gojobori, T. (1997) Evolutionary motif and its biological and structural significance J. Mol. Evol., 44, S38S43 .
- Gotoh, O. (1995) A weighting system and algorithm for aligning many phylogenetically related sequences Comput. Appl. Biosci., 11, 543551
[Abstract/Free Full Text] . - Hirosawa, M., Totoki, Y., Hoshida, M., Ishikawa, M. (1995) Comprehensive study on iterative algorithms of multiple sequence alignment Comput. Appl. Biosci., 11, 1318
[Abstract/Free Full Text] . - Needleman, S.B. and Wunsch, C.D. (1970) A general method applicable to the search for similarities in the amino acid sequence of two proteins J. Mol. Biol., 48, 443453[CrossRef][Web of Science][Medline] .
- Stebbings, L.A. and Mizuguchi, K. (2004) HOMSTRAD: recent developments of the homologous protein structure alignment database Nucleic Acids Res., 32, D203D207
[Abstract/Free Full Text] . - Van Walle, I., Lasters, I., Wyns, L. (2004) Align-ma new algorithm for multiple alignment of highly divergent sequences Bioinformatics, 20, 14281435
[Abstract/Free Full Text] . - Gotoh, O. (1996) Significant improvement in accuracy of multiple protein sequence alignments by iterative refinement as assessed by reference to structural alignments J. Mol. Biol., 264, 823838[CrossRef][Web of Science][Medline] .
- Notredame, C., Higgins, D.G., Heringa, J. (2000) T-Coffee: a novel method for fast and accurate multiple sequence alignment J. Mol. Biol., 302, 205217[CrossRef][Web of Science][Medline] .
- Lassmann, T. and Sonnhammer, E.L. (2002) Quality assessment of multiple alignment programs FEBS Lett., 529, 126130[CrossRef][Web of Science][Medline] .
- Raghava, G.P., Searle, S.M., Audley, P.C., Barber, J.D., Barton, G.J. (2003) OXBench: a benchmark for evaluation of protein multiple sequence alignment accuracy BMC Bioinformatics, 4, 47[CrossRef][Medline] .
- Edgar, R.C. (2004) MUSCLE: a multiple sequence alignment method with reduced time and space complexity BMC Bioinformatics, 5, 113[CrossRef][Medline] .
- Thompson, J.D., Plewniak, F., Thierry, J., Poch, O. (2000) DbClustal: rapid and reliable global multiple alignments of protein sequences detected by database searches Nucleic Acids Res., 28, 29192926
[Abstract/Free Full Text] . - Pearson, W.R. and Lipman, D.J. (1988) Improved tools for biological sequence comparison Proc. Natl Acad. Sci. USA, 85, 24442448
[Abstract/Free Full Text] . - O'Sullivan, O., Suhre, K., Abergel, C., Higgins, D.G., Notredame, C. (2004) 3DCoffee: combining protein sequences and structures within multiple sequence alignments J. Mol. Biol., 340, 385395[CrossRef][Web of Science][Medline] .
- Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D.J. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs Nucleic Acids Res., 25, 33893402
[Abstract/Free Full Text] . - Press, W.H., Teukolsky, S.A., Vetterling, W.T., Flannery, B.P. Numerical Recipes in C: The Art of Scientific Computing, 2nd edn, (1995) Cambridge Cambridge University Press .
- Stoye, J., Evers, D., Meyer, F. (1997) Generating benchmarks for multiple sequence alignments and phylogenetic reconstructions Proc. Int. Conf. Intell. Syst. Mol. Biol., 5, pp. 303306[Medline] .
- Thompson, J.D., Plewniak, F., Poch, O. (1999) BAliBASE: a benchmark alignment database for the evaluation of multiple alignment programs Bioinformatics, 15, 8788
[Abstract/Free Full Text] .
This article has been cited by other articles:
![]() |
Y. Ogino, H. Katoh, S. Kuraku, and G. Yamada Evolutionary History and Functional Characterization of Androgen Receptor Genes in Jawed Vertebrates Endocrinology, December 1, 2009; 150(12): 5415 - 5427. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Ertas, B. M. von Reumont, J.-W. Wagele, B. Misof, and T. Burmester Hemocyanin Suggests a Close Relationship of Remipedia and Hexapoda Mol. Biol. Evol., December 1, 2009; 26(12): 2711 - 2718. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Simon, S. Strauss, A. von Haeseler, and H. Hadrys A Phylogenomic Approach to Resolve the Basal Pterygote Divergence Mol. Biol. Evol., December 1, 2009; 26(12): 2719 - 2730. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. F. Gruber, R. DeSalle, E. K. Lienau, D. Tchernov, V. A. Pieribone, and H.-T. Kao Novel Internal Regions of Fluorescent Proteins Undergo Divergent Evolutionary Patterns Mol. Biol. Evol., December 1, 2009; 26(12): 2841 - 2848. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Lee, R. Rentzsch, and C. Orengo GeMMA: functional subfamily classification within superfamilies of predicted protein structural domains Nucleic Acids Res., November 18, 2009; (2009) gkp1049v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. V. Kononenko, V. A. Mitkevich, G. C. Atkinson, T. Tenson, V. I. Dubovaya, L. Yu. Frolova, A. A. Makarov, and V. Hauryliuk GTP-dependent structural rearrangement of the eRF1:eRF3 complex and eRF3 sequence motifs essential for PABP binding Nucleic Acids Res., November 11, 2009; (2009) gkp908v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. M. Rodriguez, M. D. Golemba, R. H. Campos, K. Trono, and L. R. Jones Bovine leukemia virus can be classified into seven genotypes: evidence for the existence of two novel clades J. Gen. Virol., November 1, 2009; 90(11): 2788 - 2797. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Pang, J. P. Wenger, K. Saathoff, G. J. Peel, J. Wen, D. Huhman, S. N. Allen, Y. Tang, X. Cheng, M. Tadege, et al. A WD40 Repeat Protein from Medicago truncatula Is Necessary for Tissue-Specific Anthocyanin and Proanthocyanidin Biosynthesis But Not for Trichome Development Plant Physiology, November 1, 2009; 151(3): 1114 - 1129. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. David, N. W.G. Chen, A. Pedrosa-Harand, V. Thareau, M. Sevignac, S. B. Cannon, D. Debouck, T. Langin, and V. Geffroy A Nomadic Subtelomeric Disease Resistance Gene Cluster in Common Bean Plant Physiology, November 1, 2009; 151(3): 1048 - 1065. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Nobre, P. Eggleton, and D. K. Aanen Vertical transmission as the key to the colonization of Madagascar by fungus-growing termites? Proc R Soc B, October 14, 2009; (2009) rspb.2009.1373v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
The Genolevures Consortium, J.-L. Souciet, B. Dujon, C. Gaillardin, M. Johnston, P. V. Baret, P. Cliften, D. J. Sherman, J. Weissenbach, E. Westhof, et al. Comparative genomics of protoploid Saccharomycetaceae Genome Res., October 1, 2009; 19(10): 1696 - 1709. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Uranishi, A. S. Zolotukhin, S. Lindtner, S. Warming, G.-M. Zhang, J. Bear, N. G. Copeland, N. A. Jenkins, G. N. Pavlakis, and B. K. Felber The RNA-binding Motif Protein 15B (RBM15B/OTT3) Acts as Cofactor of the Nuclear Export Receptor NXF1 J. Biol. Chem., September 18, 2009; 284(38): 26106 - 26116. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. McCutcheon, B. R. McDonald, and N. A. Moran Convergent evolution of metabolic roles in bacterial co-symbionts of insects PNAS, September 8, 2009; 106(36): 15394 - 15399. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Tanaka, S. Furukawa, N. Nikoh, T. Sasaki, and T. Fukatsu Complete WO Phage Sequences Reveal Their Dynamic Evolutionary Trajectories and Putative Functional Elements Required for Integration into the Wolbachia Genome Appl. Envir. Microbiol., September 1, 2009; 75(17): 5676 - 5686. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Kong, G. Zhou, U. Avci, X. Gu, C. Jones, Y. Yin, Y. Xu, and M. G. Hahn Two Poplar Glycosyltransferase Genes, PdGATL1.1 and PdGATL1.2, Are Functional Orthologs to PARVUS/AtGATL1 in Arabidopsis Mol Plant, September 1, 2009; 2(5): 1040 - 1050. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Dong, G. He, S. Zhang, and Z. Zhang Evolution of Olfactory Receptor Genes in Primates Dominated by Birth-and-Death Process Gen Biol Evol, August 20, 2009; 2009(0): 258 - 264. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Gouveia-Oliveira, F. S. Roque, R. Wernersson, T. Sicheritz-Ponten, P. W. Sackett, A. Molgaard, and A. G. Pedersen InterMap3D: predicting and visualizing co-evolving protein residues Bioinformatics, August 1, 2009; 25(15): 1963 - 1965. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Fourie, E. T. Steenkamp, T. R. Gordon, and A. Viljoen Evolutionary Relationships among the Fusarium oxysporum f. sp. cubense Vegetative Compatibility Groups Appl. Envir. Microbiol., July 15, 2009; 75(14): 4770 - 4781. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Mosquna, A. Katz, E. L. Decker, S. A. Rensing, R. Reski, and N. Ohad Regulation of stem cell maintenance by the Polycomb protein FIE has been conserved during land plant evolution Development, July 15, 2009; 136(14): 2433 - 2444. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Davey and R. S. Currah Atradidymella muscivora gen. et sp. nov. (Pleosporales) and its anamorph Phoma muscivora sp. nov.: A new pleomorphic pathogen of boreal bryophytes Am. J. Botany, July 1, 2009; 96(7): 1281 - 1288. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. de Wet, B. Slippers, O. Preisig, B. D. Wingfield, P. Tsopelas, and M. J. Wingfield Molecular and morphological characterization of Dothiorella casuarini sp. nov. and other Botryosphaeriaceae with diplodia-like conidia Mycologia, July 1, 2009; 101(4): 503 - 511. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Niimura On the Origin and Evolution of Vertebrate Olfactory Receptor Genes: Comparative Genome Analysis Among 23 Chordate Species Gen Biol Evol, June 22, 2009; 2009(0): 34 - 44. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Liu, S. Raghavan, S. Nelesen, C. R. Linder, and T. Warnow Rapid and Accurate Large-Scale Coestimation of Sequence Alignments and Phylogenetic Trees Science, June 19, 2009; 324(5934): 1561 - 1564. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Sun, Y. Cai, L. Liu, F. Yu, M. L. Farrell, W. McKendree, and W. Farmerie ESPRIT: estimating species richness using large collections of 16S rRNA pyrosequences Nucleic Acids Res., June 1, 2009; 37(10): e76 - e76. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Oetjen and B. Reinhold-Hurek Characterization of the DraT/DraG System for Posttranslational Regulation of Nitrogenase in the Endophytic Betaproteobacterium Azoarcus sp. Strain BH72 J. Bacteriol., June 1, 2009; 191(11): 3726 - 3735. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Chen, S. Kshirsagar, I. Jensen, K. Lau, R. Covarrubias, S. F. Schluter, and J. J. Marchalonis Characterization of arrangement and expression of the T cell receptor {gamma} locus in the sandbar shark PNAS, May 26, 2009; 106(21): 8591 - 8596. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. S. Weber III, T. B. Waltzek, D. A. Young, E. L. Twitchell, A. E. Gates, A. Vagelli, G. R. Risatti, R. P. Hedrick, and S. Frasca Jr Systemic iridovirus infection in the Banggai cardinalfish (Pterapogon kauderni Koumans 1933) J Vet Diagn Invest, May 1, 2009; 21(3): 306 - 320. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Fu, O. Bannach, H. Chen, J.-H. Teune, A. Schmitz, G. Steger, L. Xiong, and W. B. Barbazuk Alternative splicing of anciently exonized 5S rRNA regulates plant transcription factor TFIIIA Genome Res., May 1, 2009; 19(5): 913 - 921. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. Waterhouse, J. B. Procter, D. M. A. Martin, M. Clamp, and G. J. Barton Jalview Version 2--a multiple sequence alignment editor and analysis workbench Bioinformatics, May 1, 2009; 25(9): 1189 - 1191. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-H. Kim, T. Ebersole, N. Kouprina, V. N. Noskov, J.-I. Ohzeki, H. Masumoto, B. Mravinac, B. A. Sullivan, A. Pavlicek, S. Dovat, et al. Human gamma-satellite DNA maintains open chromatin structure and protects a transgene from epigenetic silencing Genome Res., April 1, 2009; 19(4): 533 - 544. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Ast, A. Gruber, S. Schmitz-Esser, H. E. Neuhaus, P. G. Kroth, M. Horn, and I. Haferkamp Diatom plastids depend on nucleotide import from the cytosol PNAS, March 3, 2009; 106(9): 3621 - 3626. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Schulz, M. Gottschling, I. G. Bravo, U. Wittstatt, E. Stockfleth, and I. Nindl Genomic characterization of the first insectivoran papillomavirus reveals an unusually long, second non-coding region and indicates a close relationship to Betapapillomavirus J. Gen. Virol., March 1, 2009; 90(3): 626 - 633. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. del Viso, A. F. Puebla, C. M. Fusari, A. C. Casabuono, A. S. Couto, H. G. Pontis, H. E. Hopp, and R. A. Heinz Molecular Characterization of a Putative Sucrose:Fructan 6-Fructosyltransferase (6-SFT) of the Cold-Resistant Patagonian Grass Bromus pictus Associated With Fructan Accumulation Under Low Temperatures Plant Cell Physiol., March 1, 2009; 50(3): 489 - 503. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A Jurado-Rivera, A. P Vogler, C. A.M Reid, E. Petitpierre, and J. Gomez-Zurita DNA barcoding insect-host plant associations Proc R Soc B, February 22, 2009; 276(1657): 639 - 648. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. M. Eriksson, A. K. Clarke, L.-G. Franzen, M. Kuylenstierna, K. Martinez, and H. Blanck Community-Level Analysis of psbA Gene Sequences and Irgarol Tolerance in Marine Periphyton Appl. Envir. Microbiol., February 15, 2009; 75(4): 897 - 906. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Hamada, H. Kiryu, K. Sato, T. Mituyama, and K. Asai Prediction of RNA secondary structure using generalized centroid estimators Bioinformatics, February 15, 2009; 25(4): 465 - 473. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Lassmann, O. Frings, and E. L. L. Sonnhammer Kalign2: high-performance multiple alignment of protein and nucleotide sequences allowing external features Nucleic Acids Res., February 1, 2009; 37(3): 858 - 865. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Wilkins, H. Nahal, J. Foong, N. J. Provart, and M. M. Campbell Expansion and Diversification of the Populus R2R3-MYB Family of Transcription Factors Plant Physiology, February 1, 2009; 149(2): 981 - 993. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Polacek, J. E. Foley, and E. Harris Conformational Changes in the Solution Structure of the Dengue Virus 5' End in the Presence and Absence of the 3' Untranslated Region J. Virol., January 15, 2009; 83(2): 1161 - 1166. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. C. Almeida and R. DeSalle Orthology, Function and Evolution of Accessory Gland Proteins in the Drosophila repleta Group Genetics, January 1, 2009; 181(1): 235 - 245. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kuraku, A. Meyer, and S. Kuratani Timing of Genome Duplications Relative to the Origin of the Vertebrates: Did Cyclostomes Diverge before or after? Mol. Biol. Evol., January 1, 2009; 26(1): 47 - 59. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. A. Martin, D. Miranda-Saavedra, and G. J. Barton Kinomer v. 1.0: a database of systematically classified eukaryotic protein kinases Nucleic Acids Res., January 1, 2009; 37(suppl_1): D244 - D250. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Kutsukake, N. Nikoh, H. Shibao, C. Rispe, J.-C. Simon, and T. Fukatsu Evolution of Soldier-Specific Venomous Protease in Social Aphids Mol. Biol. Evol., December 1, 2008; 25(12): 2627 - 2641. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Chalkia, N. Nikolaidis, W. Makalowski, J. Klein, and M. Nei Origins and Evolution of the Formin Multigene Family That Is Involved in the Formation of Actin Filaments Mol. Biol. Evol., December 1, 2008; 25(12): 2717 - 2733. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. H. Degnan and N. A. Moran Diverse Phage-Encoded Toxins in a Protective Insect Endosymbiont Appl. Envir. Microbiol., November 1, 2008; 74(21): 6782 - 6791. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. M. Jaklitsch, K. Poldmaa, and G. J. Samuels Reconsideration of Protocrea (Hypocreales, Hypocreaceae). Mycologia, November 1, 2008; 100(6): 962 - 984. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Gajecka, A. J. Gentles, A. Tsai, D. Chitayat, K. L. Mackay, C. D. Glotzbach, M. R. Lieber, and L. G. Shaffer Unexpected complexity at breakpoint junctions in phenotypically normal individuals and mechanisms involved in generating balanced translocations t(1;22)(p36;q13) Genome Res., November 1, 2008; 18(11): 1733 - 1742. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. N. Hayes, D. Diez, N. Joannin, W. Honda, M. Kanehisa, M. Wahlgren, C. E. Wheelock, and S. Goto varDB: a pathogen-specific sequence database of protein families involved in antigenic variation Bioinformatics, November 1, 2008; 24(21): 2564 - 2565. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Soria-Carrasco and J. Castresana Estimation of Phylogenetic Inconsistencies in the Three Domains of Life Mol. Biol. Evol., November 1, 2008; 25(11): 2319 - 2329. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Luque, M. L. Riera-Alberola, A. Andujar, and J. A. G. Ochoa de Alda Intraphylum Diversity and Complex Evolution of Cyanobacterial Aminoacyl-tRNA Synthetases Mol. Biol. Evol., November 1, 2008; 25(11): 2369 - 2389. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Yang, U. C. Kalluri, S. Jawdy, L. E. Gunter, T. Yin, T. J. Tschaplinski, D. J. Weston, P. Ranjan, and G. A. Tuskan The F-Box Gene Family Is Expanded in Herbaceous Annual Plants Relative to Woody Perennial Plants Plant Physiology, November 1, 2008; 148(3): 1189 - 1200. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Jayawardane, G. C. Russell, J. Thomson, D. Deane, H. Cox, D. Gatherer, M. Ackermann, D. M. Haig, and J. P. Stewart A captured viral interleukin 10 gene with cellular exon structure J. Gen. Virol., October 1, 2008; 89(10): 2447 - 2455. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Daiyasu, H. Saino, H. Tomoto, M. Mizutani, K. Sakata, and H. Toh Computational and Experimental Analyses of Furcatin Hydrolase for Substrate Specificity Studies of Disaccharide-specific Glycosidases J. Biochem., October 1, 2008; 144(4): 467 - 475. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Ahola, T. Aittokallio, M. Vihinen, and E. Uusipaikka Model-based prediction of sequence alignment quality Bioinformatics, October 1, 2008; 24(19): 2165 - 2171. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Pang, G. J. Peel, S. B. Sharma, Y. Tang, and R. A. Dixon A transcript profiling approach reveals an epicatechin-specific glucosyltransferase expressed in the seed coat of Medicago truncatula PNAS, September 16, 2008; 105(37): 14210 - 14215. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Shi, J. B. Franklin, J. T. Yelinek, I. Ebersberger, G. Warren, and C. Y. He Centrin4 coordinates cell and nuclear division in T. brucei J. Cell Sci., September 15, 2008; 121(18): 3062 - 3070. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Igarashi, T. Ishida, C. Hori, and M. Samejima Characterization of an Endoglucanase Belonging to a New Subfamily of Glycoside Hydrolase Family 45 of the Basidiomycete Phanerochaete chrysosporium Appl. Envir. Microbiol., September 15, 2008; 74(18): 5628 - 5634. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Miller, L. J. Jensen, F. Diella, C. Jorgensen, M. Tinti, L. Li, M. Hsiung, S. A. Parker, J. Bordeaux, T. Sicheritz-Ponten, et al. Linear Motif Atlas for Phosphorylation-Dependent Signaling Sci. Signal., September 2, 2008; 1(35): ra2 - ra2. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Barbeyron, S. L'Haridon, G. Michel, and M. Czjzek Mariniflexile fucanivorans sp. nov., a marine member of the Flavobacteriaceae that degrades sulphated fucans from brown algae Int J Syst Evol Microbiol, September 1, 2008; 58(9): 2107 - 2113. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Schmitz-Esser, I. Haferkamp, S. Knab, T. Penz, M. Ast, C. Kohl, M. Wagner, and M. Horn Lawsonia intracellularis Contains a Gene Encoding a Functional Rickettsia-Like ATP/ADP Translocase for Host Exploitation J. Bacteriol., September 1, 2008; 190(17): 5746 - 5752. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Liu, X. Xu, and G. D. Stormo The cis-regulatory map of Shewanella genomes Nucleic Acids Res., September 1, 2008; 36(16): 5376 - 5390. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. C. Almeida and R. DeSalle Evidence of Adaptive Evolution of Accessory Gland Proteins in Closely Related Species of the Drosophila repleta Group Mol. Biol. Evol., September 1, 2008; 25(9): 2043 - 2053. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. G. Hall How Well Does the HoT Score Reflect Sequence Alignment Accuracy? Mol. Biol. Evol., August 1, 2008; 25(8): 1576 - 1580. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Bayram, C. Biesemann, S. Krappmann, P. Galland, and G. H. Braus More Than a Repair Enzyme: Aspergillus nidulans Photolyase-like CryA Is a Regulator of Sexual Development Mol. Biol. Cell, August 1, 2008; 19(8): 3254 - 3262. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Knies, K. K. Dang, T. J. Vision, N. G. Hoffman, R. Swanstrom, and C. L. Burch Compensatory Evolution in RNA Secondary Structures Increases Substitution Rate Variation among Sites Mol. Biol. Evol., August 1, 2008; 25(8): 1778 - 1787. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Katoh and H. Toh Recent developments in the MAFFT multiple sequence alignment program Brief Bioinform, July 1, 2008; 9(4): 286 - 298. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-M. Bourbon Comparative genomics supports a deep evolutionary origin for the large, four-module transcriptional mediator complex Nucleic Acids Res., July 1, 2008; 36(12): 3993 - 4008. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Moretti, A. Wilm, D. G. Higgins, I. Xenarios, and C. Notredame R-Coffee: a web server for accurately aligning noncoding RNA sequences Nucleic Acids Res., July 1, 2008; 36(suppl_2): W10 - W13. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Pei, M. Tang, and N. V. Grishin PROMALS3D web server for accurate multiple protein sequence and structure alignments Nucleic Acids Res., July 1, 2008; 36(suppl_2): W30 - W34. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Loytynoja and N. Goldman Phylogeny-Aware Gap Placement Prevents Errors in Sequence Alignment and Evolutionary Analysis Science, June 20, 2008; 320(5883): 1632 - 1635. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kuraku, Y. Takio, K. Tamura, H. Aono, A. Meyer, and S. Kuratani Noncanonical role of Hox14 revealed by its expression patterns in lamprey and shark PNAS, May 6, 2008; 105(18): 6679 - 6683. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Quan, M. van Vuuren, P. G. Howell, D. Groenewald, and A. J. Guthrie Molecular epidemiology of the African horse sickness virus S10 gene J. Gen. Virol., May 1, 2008; 89(5): 1159 - 1168. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Wilm, D. G. Higgins, and C. Notredame R-Coffee: a method for multiple alignment of non-coding RNA Nucleic Acids Res., May 1, 2008; 36(9): e52 - e52. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Balke, J. Gomez-Zurita, I. Ribera, A. Viloria, A. Zillikens, J. Steiner, M. Garcia, L. Hendrich, and A. P. Vogler Ancient associations of aquatic beetles and tank bromeliads in the Neotropical forest canopy PNAS, April 29, 2008; 105(17): 6356 - 6361. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Pei, B.-H. Kim, and N. V. Grishin PROMALS3D: a tool for multiple protein sequence and structure alignments Nucleic Acids Res., April 1, 2008; 36(7): 2295 - 2300. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. M. Bushell, C. Sollner, B. Schuster-Boeckler, A. Bateman, and G. J. Wright Large-scale screening for novel low-affinity extracellular protein interactions Genome Res., April 1, 2008; 18(4): 622 - 630. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Moore and R. G. Allaby TreeMos: a high-throughput phylogenomic approach to find and visualize phylogenetic mosaicism Bioinformatics, March 1, 2008; 24(5): 717 - 718. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Pirovano, K. A. Feenstra, and J. Heringa PRALINETM: a strategy for improved multiple alignment of transmembrane proteins Bioinformatics, February 15, 2008; 24(4): 492 - 497. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Michael, G. Trave, C. Ramu, C. Chica, and T. J. Gibson Discovery of candidate KEN-box motifs using Cell Cycle keyword enrichment combined with native disorder prediction and motif conservation Bioinformatics, February 15, 2008; 24(4): 453 - 457. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. G. Conte, S. Gaillard, N. Lanau, M. Rouard, and C. Perin GreenPhylDB: a database for plant comparative genomics Nucleic Acids Res., January 11, 2008; 36(suppl_1): D991 - D998. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Hashimoto, A. C. Yoshizawa, S. Okuda, K. Kuma, S. Goto, and M. Kanehisa The repertoire of desaturases and elongases reveals fatty acid variations in 56 eukaryotic genomes J. Lipid Res., January 1, 2008; 49(1): 183 - 191. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Casola, D. Hucks, and C. Feschotte Convergent Domestication of pogo-like Transposases into Centromere-Binding Proteins in Fission Yeast and Mammals Mol. Biol. Evol., January 1, 2008; 25(1): 29 - 41. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Wang, R. R. Gutell, and D. P. Miranker Biclustering as a method for RNA local multiple sequence alignment Bioinformatics, December 15, 2007; 23(24): 3289 - 3296. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. A. Watkins, D. Maiguel, Z. Jia, and J. Pevsner Evidence for 26 distinct acyl-coenzyme A synthetase genes in the human genome J. Lipid Res., December 1, 2007; 48(12): 2736 - 2750. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Schmidt von Braun, A. Sabetti, P. J. Hanic-Joyce, J. Gu, E. Schleiff, and P. B. M. Joyce Dual targeting of the tRNA nucleotidyltransferase in plants: not just the signal J. Exp. Bot., December 1, 2007; 58(15-16): 4083 - 4093. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Schlegel, O. Mirus, A. von Haeseler, and E. Schleiff The Tetratricopeptide Repeats of Receptors Involved in Protein Translocation across Membranes Mol. Biol. Evol., December 1, 2007; 24(12): 2763 - 2774. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Golubchik, M. J. Wise, S. Easteal, and L. S. Jermiin Mind the Gaps: Evidence of Bias in Estimates of Multiple Sequence Alignments Mol. Biol. Evol., November 1, 2007; 24(11): 2433 - 2442. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||






























