Nucleic Acids Research, 2001, Vol. 29, No. 1 61-65
© 2001 Oxford University Press
PALIa database of Phylogeny and ALIgnment of homologous protein structures
1Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560 012, India and 2Department of Biotechnology, Indian Institute of Technology, Kharagpur 721 302, India
PALI (release 1.2) contains three-dimensional (3-D) structure-dependent sequence alignments as well as structure-based phylogenetic trees of homologous protein domains in various families. The data set of homologous protein structures has been derived by consulting the SCOP database (release 1.50) and the data set comprises 604 families of homologous proteins involving 2739 protein domain structures with each family made up of at least two members. Each member in a family has been structurally aligned with every other member in the same family (pairwise alignment) and all the members in the family are also aligned using simultaneous superposition (multiple alignment). The structural alignments are performed largely automatically, with manual interventions especially in the cases of distantly related proteins, using the program STAMP (version 4.2). Every family is also associated with two dendrograms, calculated using PHYLIP (version 3.5), one based on a structural dissimilarity metric defined for every pairwise alignment and the other based on similarity of topologically equivalent residues. These dendrograms enable easy comparison of sequence and structure-based relationships among the members in a family. Structure-based alignments with the details of structural and sequence similarities, superposed coordinate sets and dendrograms can be accessed conveniently using a web interface. The database can be queried for protein pairs with sequence or structural similarities falling within a specified range. Thus PALI forms a useful resource to help in analysing the relationship between sequence and structure variation at a given level of sequence similarity. PALI also contains over 653 orphans (single member families). Using the web interface involving PSI_BLAST and PHYLIP it is possible to associate the sequence of a new protein with one of the families in PALI and generate a phylogenetic tree combining the query sequence and proteins of known 3-D structure. The database with the web interfaced search and dendrogram generation tools can be accessed at http://pauling.mbu.iisc.ernet.in/~pali .
* To whom correspondence should be addressed. Tel: +91 80 309 2837; Fax: +91 80 360 0535; Email: ns{at}mbu.iisc.ernet.in
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
T. J. Wheeler and J. D. Kececioglu Multiple alignment by aligning alignments Bioinformatics, July 1, 2007; 23(13): i559 - i568. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. M. Leslin, A. Abyzov, and V. A. Ilyin TOPOFIT-DB, a database of protein structural alignments based on the TOPOFIT method Nucleic Acids Res., January 12, 2007; 35(suppl_1): D317 - D321. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Tyagi, P. Sharma, C. S. Swamy, F. Cadet, N. Srinivasan, A. G. de Brevern, and B. Offmann Protein Block Expert (PBE): a web-based protein structure analysis server using a structural alphabet. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W119 - W123. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. S. Gowri, O. Krishnadev, C. S. Swamy, and N. Srinivasan MulPSSM: a database of multiple position-specific scoring matrices of protein domain families Nucleic Acids Res., January 1, 2006; 34(suppl_1): D243 - D246. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Anand, V.S. Gowri, and N. Srinivasan Use of multiple profiles corresponding to a sequence alignment enables effective detection of remote homologues Bioinformatics, June 15, 2005; 21(12): 2821 - 2826. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. S. Gowri, S. B. Pandit, P. S. Karthik, N. Srinivasan, and S. Balaji Integration of related sequences with protein three-dimensional structural families in an updated version of PALI database Nucleic Acids Res., January 1, 2003; 31(1): 486 - 488. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. H. Shin, H. Yokota, R. Kim, and S.-H. Kim Crystal structure of conserved hypothetical protein Aq1575 from Aquifexaeolicus PNAS, June 11, 2002; 99(12): 7980 - 7985. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Mallika, A. Bhaduri, and R. Sowdhamini PASS2: a semi-automated database of Protein Alignments Organised as Structural Superfamilies Nucleic Acids Res., January 1, 2002; 30(1): 284 - 288. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. B. Pandit, D. Gosar, S. Abhiman, S. Sujatha, S. S. Dixit, N. S. Mhatre, R. Sowdhamini, and N. Srinivasan SUPFAM--a database of potential protein superfamily relationships derived by comparing sequence-based and structure-based families: implications for structural genomics and function annotation in genomes Nucleic Acids Res., January 1, 2002; 30(1): 289 - 293. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Balaji and N. Srinivasan Use of a database of structural alignments and phylogenetic trees in investigating the relationship between sequence and structural variability among homologous proteins Protein Eng. Des. Sel., April 1, 2001; 14(4): 219 - 226. [Abstract] [Full Text] [PDF] |
||||



