Published online 24 February 2004
Nucleic Acids Research, 2004, Vol. 32, No. 4 1298-1307
© 2004 Oxford University Press
LEON: multiple aLignment Evaluation Of Neighbours
Laboratoire de Biologie et Genomique Structurales, Institut de Génétique et de Biologie Moléculaire et Cellulaire, CNRS/INSERM/ULP, BP 163, 67404 Illkirch Cedex, France
*To whom correspondence should be addressed. Tel: +33 3 88 65 32 00; Fax: +33 3 88 65 32 01; Email: poch{at}igbmc.u-strasbg.fr
Sequence alignments are fundamental to a wide range of applications, including database searching, functional residue identification and structure prediction techniques. These applications predict or propagate structural/functional/evolutionary information based on a presumed homology between the aligned sequences. If the initial hypothesis of homology is wrong, no subsequent application, however sophisticated, can be expected to yield accurate results. Here we present a novel method, LEON, to predict homology between proteins based on a multiple alignment of complete sequences (MACS). In MACS, weak signals from distantly related proteins can be considered in the overall context of the family. Intermediate sequences and the combination of individual weak matches are used to increase the significance of low-scoring regions. Residue composition is also taken into account by incorporation of several existing methods for the detection of compositionally biased sequence segments. The accuracy and reliability of the predictions is demonstrated in large-scale comparisons with structural and sequence family databases, where the specificity was shown to be >99% and the sensitivity was estimated to be
76%. LEON can thus be used to reliably identify the complex relationships between large multidomain proteins and should be useful for automatic high-throughput genome annotations, 2D/3D structure predictions, proteinprotein interaction predictions etc.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
M. G. Conte, S. Gaillard, N. Lanau, M. Rouard, and C. Perin GreenPhylDB: a database for plant comparative genomics Nucleic Acids Res., January 11, 2008; 36(suppl_1): D991 - D998. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Muller, Y. Oma, L. Vallar, E. Friederich, O. Poch, and B. Winsor Sequence and Comparative Genomic Analysis of Actin-related Proteins Mol. Biol. Cell, December 1, 2005; 16(12): 5736 - 5748. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Prigent, J. C. Thierry, O. Poch, and F. Plewniak DbW: automatic update of a functional family-specific multiple alignment Bioinformatics, April 15, 2005; 21(8): 1437 - 1442. [Abstract] [Full Text] [PDF] |
||||


