Nucleic Acids Research, Vol 25, Issue 9 1665-1677, Copyright © 1997 by Oxford University Press
AF Neuwald, JS Liu, DJ Lipman and CE Lawrence
Biologists often gain structural and functional insights into a protein
sequence by constructing a multiple alignment model of the family. Here a
program called Probe fully automates this process of model construction
starting from a single sequence. Central to this program is a powerful new
method to locate and align only those, often subtly, conserved patterns
essential to the family as a whole. When applied to randomly chosen
proteins, Probe found on average about four times as many relationships as
a pairwise search and yielded many new discoveries. These include: an
obscure subfamily of globins in the roundworm Caenorhabditis elegans ; two
new superfamilies of metallohydrolases; a lipoyl/biotin swinging arm domain
in bacterial membrane fusion proteins; and a DH domain in the yeast Bud3
and Fus2 proteins. By identifying distant relationships and merging
families into superfamilies in this way, this analysis further confirms the
notion that proteins evolved from relatively few ancient sequences.
Moreover, this method automatically generates models of these ancient
conserved regions for rapid and sensitive screening of sequences.
ARTICLES
Extracting protein alignment models from the sequence database
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA. neuwald@ncbi.nlm.nih.gov
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. Heger, S. Mallick, C. Wilton, and L. Holm The global trace graph, a novel paradigm for searching protein sequence databases Bioinformatics, September 15, 2007; 23(18): 2361 - 2367. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Papadopoulos and R. Agarwala COBALT: constraint-based alignment tool for multiple protein sequences Bioinformatics, May 1, 2007; 23(9): 1073 - 1079. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. K. DiTursi, S.-J. Kwon, P. J. Reeder, and J. S. Dordick Bioinformatics-driven, rational engineering of protein thermostability Protein Eng. Des. Sel., November 1, 2006; 19(11): 517 - 524. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-N. Hung, E. Rangarajan, C. Munger, G. Nadeau, T. Sulea, and A. Matte Crystal Structure of TDP-Fucosamine Acetyltransferase (WecD) from Escherichia coli, an Enzyme Required for Enterobacterial Common Antigen Synthesis. J. Bacteriol., August 1, 2006; 188(15): 5606 - 5617. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Lee and W. Hong Diverse membrane-associated proteins contain a novel SMP domain FASEB J, February 1, 2006; 20(2): 202 - 206. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Xu, C. He, Z. Zhang, and Y. Chen MKLP1 requires specific domains for its dendritic targeting J. Cell Sci., February 1, 2006; 119(3): 452 - 458. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-M. Frere, M. Galleni, K. Bush, and O. Dideberg Is it necessary to change the classification of {beta}-lactamases? J. Antimicrob. Chemother., June 1, 2005; 55(6): 1051 - 1053. [Full Text] [PDF] |
||||
![]() |
A. Roesner, C. Fuchs, T. Hankeln, and T. Burmester A Globin Gene of Ancient Evolutionary Origin in Lower Vertebrates: Evidence for Two Distinct Globin Families in Animals Mol. Biol. Evol., January 1, 2005; 22(1): 12 - 20. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Raphael, D. Zhi, H. Tang, and P. Pevzner A novel method for multiple alignment of sequences with repeated and shuffled elements Genome Res., November 1, 2004; 14(11): 2336 - 2346. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. M. Iyer, K. S. Makarova, E. V. Koonin, and L. Aravind Comparative genomics of the FtsK-HerA superfamily of pumping ATPases: implications for the origins of chromosome segregation, cell division and viral capsid packaging Nucleic Acids Res., October 5, 2004; 32(17): 5260 - 5279. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Garau, I. Garcia-Saez, C. Bebrone, C. Anne, P. Mercuri, M. Galleni, J.-M. Frere, and O. Dideberg Update of the Standard Numbering Scheme for Class B {beta}-Lactamases Antimicrob. Agents Chemother., July 1, 2004; 48(7): 2347 - 2349. [Full Text] [PDF] |
||||
![]() |
Y. Okamoto, J. Morishita, K. Tsuboi, T. Tonai, and N. Ueda Molecular Characterization of a Phospholipase D Generating Anandamide and Its Congeners J. Biol. Chem., February 13, 2004; 279(7): 5298 - 5305. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Teufel, V. Saudek, J.-P. Ledig, A. Bernhardt, S. Boularand, A. Carreau, N. J. Cairns, C. Carter, D. J. Cowley, D. Duverger, et al. Sequence Identification and Characterization of Human Carnosinase and a Closely Related Non-specific Dipeptidase J. Biol. Chem., February 14, 2003; 278(8): 6521 - 6531. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. R. Panchenko Finding weak similarities between proteins by sequence profile comparison Nucleic Acids Res., January 15, 2003; 31(2): 683 - 689. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. K. Koretke, R. B. Russell, and A. N. Lupas Fold recognition without folds Protein Sci., June 1, 2002; 11(6): 1575 - 1579. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Burmester, B. Ebner, B. Weich, and T. Hankeln Cytoglobin: A Novel Globin Type Ubiquitously Expressed inVertebrate Tissues Mol. Biol. Evol., April 1, 2002; 19(4): 416 - 421. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Purkayastha, L. A. McCue, and K. A. McDonough Identification of a Mycobacterium tuberculosis Putative Classical Nitroreductase Gene Whose Expression Is Coregulated with That of the acr Gene within Macrophages, in Standing versus Shaking Cultures, and under Low Oxygen Conditions Infect. Immun., March 1, 2002; 70(3): 1518 - 1529. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. R. Panchenko and S. H. Bryant A comparison of position-specific score matrices based on sequence and structure alignments Protein Sci., February 1, 2002; 11(2): 361 - 370. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. O'Connell, L. Wang, S. Robert, C. A. Jones, R. Saint, and R. S. Jones Polycomblike PHD Fingers Mediate Conserved Interaction with Enhancer of Zeste Protein J. Biol. Chem., November 9, 2001; 276(46): 43065 - 43073. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Florczyk, L. A. McCue, R. F. Stack, C. R. Hauer, and K. A. McDonough Identification and Characterization of Mycobacterial Proteins Differentially Expressed under Standing and Shaking Culture Conditions, Including Rv2623 from a Novel Class of Putative ATP-Binding Proteins Infect. Immun., September 1, 2001; 69(9): 5777 - 5785. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. V. Grigoriev, C. Zhang, and S.-H. Kim Sequence-based detection of distantly related proteins with the same fold Protein Eng. Des. Sel., July 1, 2001; 14(7): 455 - 458. [Full Text] [PDF] |
||||
![]() |
J. A. Lewis and G. F. Hatfull Control of directionality in integrase-mediated recombination: examination of recombination directionality factors (RDFs) including Xis and Cox proteins Nucleic Acids Res., June 1, 2001; 29(11): 2205 - 2216. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Kwak, L. A. McCue, K. Trczianka, and K. E. Kendrick Identification and Characterization of a Developmentally Regulated Protein, EshA, Required for Sporogenic Hyphal Branches in Streptomyces griseus J. Bacteriol., May 15, 2001; 183(10): 3004 - 3015. [Abstract] [Full Text] |
||||
![]() |
G. L. Moore, C. D. Maranas, S. Lutz, and S. J. Benkovic Predicting crossover generation in DNA shuffling PNAS, March 13, 2001; 98(6): 3226 - 3231. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. J. Sofia, G. Chen, B. G. Hetzler, J. F. Reyes-Spindola, and N. E. Miller Radical SAM, a novel protein superfamily linking unresolved steps in familiar biosynthetic pathways with radical mechanisms: functional characterization using new analysis and information visualization methods Nucleic Acids Res., March 1, 2001; 29(5): 1097 - 1106. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. F. Neuwald and T. Hirano HEAT Repeats Associated with Condensins, Cohesins, and Other Complexes Involved in Chromosome-Related Functions Genome Res., October 1, 2000; 10(10): 1445 - 1452. [Abstract] [Full Text] |
||||
![]() |
L. Aravind, K. S. Makarova, and E. V. Koonin SURVEY AND SUMMARY: Holliday junction resolvases and related nucleases: identification of new families, phyletic distribution and evolutionary trajectories Nucleic Acids Res., September 15, 2000; 28(18): 3417 - 3432. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. F. Neuwald and A. Poleksic PSI-BLAST searches using hidden Markov models of structural repeats: prediction of an unusual sliding DNA clamp and of {beta}-propellers in UV-damaged DNA-binding protein Nucleic Acids Res., September 15, 2000; 28(18): 3570 - 3580. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. A. McCue, K. A. McDonough, and C. E. Lawrence Functional Classification of cNMP-binding Proteins and Nucleotide Cyclases with Implications for Novel Regulatory Pathways in Mycobacterium tuberculosis Genome Res., February 1, 2000; 10(2): 204 - 219. [Abstract] [Full Text] |
||||
![]() |
D. S. Gonzalez and I. K. Jordan The {alpha}-Mannosidases: Phylogeny and Adaptive Diversification Mol. Biol. Evol., February 1, 2000; 17(2): 292 - 300. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. R. Sunyaev, F. Eisenhaber, I. V. Rodchenkov, B. Eisenhaber, V. G. Tumanyan, and E. N. Kuznetsov PSIC: profile extraction from sequence alignments with position-specific counts of independent observations Protein Eng. Des. Sel., May 1, 1999; 12(5): 387 - 394. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Rost Twilight zone of protein sequence alignments Protein Eng. Des. Sel., February 1, 1999; 12(2): 85 - 94. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. L. Afonso, E. R. Tulman, Z. Lu, E. Oma, G. F. Kutish, and D. L. Rock The Genome of Melanoplus sanguinipes Entomopoxvirus J. Virol., January 1, 1999; 73(1): 533 - 552. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. F. Neuwald, L. Aravind, J. L. Spouge, and E. V. Koonin AAA+: A Class of Chaperone-Like ATPases Associated with the Assembly, Operation, and Disassembly of Protein Complexes Genome Res., January 1, 1999; 9(1): 27 - 43. [Abstract] [Full Text] |
||||
![]() |
T. Tugal, X. H. Zou-Yang, K. Gavin, D. Pappin, B. Canas, R. Kobayashi, T. Hunt, and B. Stillman The Orc4p and Orc5p Subunits of the Xenopus and Human Origin Recognition Complex Are Related to Orc1p and Cdc6p J. Biol. Chem., December 4, 1998; 273(49): 32421 - 32429. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Aurora and G. D. Rose Seeking an ancient enzyme in Methanococcus jannaschii using ORF, a program based on predicted secondary structure comparisons PNAS, March 17, 1998; 95(6): 2818 - 2823. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. H. Heaton, W. M. Dlakic, M. Dlakic, and T. D. Gelehrter Identification and cDNA Cloning of a Novel RNA-binding Protein That Interacts with the Cyclic Nucleotide-responsive Sequence in the Type-1 Plasminogen Activator Inhibitor mRNA J. Biol. Chem., January 26, 2001; 276(5): 3341 - 3347. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. J. Chave, I. E. Auger, J. Galivan, and T. J. Ryan Molecular Modeling and Site-directed Mutagenesis Define the Catalytic Motif in Human gamma -Glutamyl Hydrolase J. Biol. Chem., December 15, 2000; 275(51): 40365 - 40370. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Spencer, A. R. Clarke, and T. R. Walsh Novel Mechanism of Hydrolysis of Therapeutic beta -Lactams by Stenotrophomonas maltophilia L1 Metallo-beta -lactamase J. Biol. Chem., August 31, 2001; 276(36): 33638 - 33644. [Abstract] [Full Text] [PDF] |
||||














