Nucleic Acids Research, Vol 27, Issue 19 3899-3910, Copyright © 1999 by Oxford University Press
N Stojanovic, L Florea, C Riemer, D Gumucio, J Slightom, M Goodman, W Miller and R Hardison
Conserved segments in DNA or protein sequences are strong candidates for
functional elements and thus appropriate methods for computing them need to
be developed and compared. We describe five methods and computer programs
for finding highly conserved blocks within previously computed multiple
alignments, primarily for DNA sequences. Two of the methods are already in
common use; these are based on good column agreement and high information
content. Three additional methods find blocks with minimal evolutionary
change, blocks that differ in at most k positions per row from a known
center sequence and blocks that differ in at most k positions per row from
a center sequence that is unknown a priori. The center sequence in the
latter two methods is a way to model potential binding sites for known or
unknown proteins in DNA sequences. The efficacy of each method was
evaluated by analysis of three extensively analyzed regulatory regions in
mammalian beta-globin gene clusters and the control region of bacterial
arabinose operons. Although all five methods have quite different
theoretical underpinnings, they produce rather similar results on these
data sets when their parameters are adjusted to best approximate the
experimental data. The optimal parameters for the method based on
information content varied little for different regulatory regions of the
beta- globin gene cluster and hence may be extrapolated to many other
regulatory regions. The programs based on maximum allowed mismatches per
row have simple parameters whose values can be chosen a priori and thus
they may be more useful than the other methods when calibration against
known functional sites is not available.
ARTICLES
Comparison of five methods for finding conserved sequences in multiple alignments of gene regulatory regions
Department of Computer Science and Engineering, The Pennsylvania State University, University Park 16802, USA.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
F. Huffner, R. Niedermeier, and S. Wernicke Techniques for Practical Fixed-Parameter Algorithms The Computer Journal, January 1, 2008; 51(1): 7 - 25. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kumar and A. Filipski Multiple sequence alignment: In pursuit of homologous DNA positions Genome Res., February 1, 2007; 17(2): 127 - 135. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Wang, Y. Zhang, Y. Cheng, Y. Zhou, D. C. King, J. Taylor, F. Chiaromonte, J. Kasturi, H. Petrykowska, B. Gibb, et al. Experimental validation of predicted mammalian erythroid cis-regulatory modules Genome Res., December 1, 2006; 16(12): 1480 - 1492. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Siepel, G. Bejerano, J. S. Pedersen, A. S. Hinrichs, M. Hou, K. Rosenbloom, H. Clawson, J. Spieth, L. W. Hillier, S. Richards, et al. Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes Genome Res., August 1, 2005; 15(8): 1034 - 1050. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. T. Dermitzakis, E. Kirkness, S. Schwarz, E. Birney, A. Reymond, and S. E. Antonarakis Comparison of Human Chromosome 21 Conserved Nongenic Sequences (CNGs) With the Mouse and Dog Genomes Shows That Their Selective Constraint Is Independent of Their Genic Environment Genome Res., May 1, 2004; 14(5): 852 - 859. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Kolbe, J. Taylor, L. Elnitski, P. Eswara, J. Li, W. Miller, R. Hardison, and F. Chiaromonte Regulatory Potential Scores From Genome-Wide Three-Way Alignments of Human, Mouse, and Rat Genome Res., April 1, 2004; 14(4): 700 - 707. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Chapman, I. J. Donaldson, J. Gilbert, D. Grafham, J. Rogers, A. R. Green, and B. Gottgens Analysis of Multiple Genomic Sequence Alignments: A Web Resource, Online Tools, and Lessons Learned From Analysis of Mammalian SCL Loci Genome Res., February 1, 2004; 14(2): 313 - 318. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Berezikov, V. Guryev, R. H.A. Plasterk, and E. Cuppen CONREAL: Conserved Regulatory Elements Anchored Alignment Algorithm for Identification of Transcription Factor Binding Sites by Phylogenetic Footprinting Genome Res., January 1, 2004; 14(1): 170 - 178. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. H. Margulies, M. Blanchette, NISC Comparative Sequencing Program, D. Haussler, and E. D. Green Identification and Characterization of Multi-Species Conserved Sequences Genome Res., December 1, 2003; 13(12): 2507 - 2518. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. T. Dermitzakis, A. Reymond, N. Scamuffa, C. Ucla, E. Kirkness, C. Rossier, and S. E. Antonarakis Evolutionary Discrimination of Mammalian Conserved Non-Genic Sequences (CNGs) Science, November 7, 2003; 302(5647): 1033 - 1035. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Florea, M. McClelland, C. Riemer, S. Schwartz, and W. Miller EnteriX 2003: visualization tools for genome alignments of Enterobacteriaceae Nucleic Acids Res., July 1, 2003; 31(13): 3527 - 3532. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. A. Frazer, L. Elnitski, D. M. Church, I. Dubchak, and R. C. Hardison Cross-Species Sequence Comparisons: A Review of Methods and Available Resources Genome Res., January 1, 2003; 13(1): 1 - 12. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. T. Morishige, K. L. Childs, L. D. Moore, and J. E. Mullet Targeted Analysis of Orthologous Phytochrome A Regions of the Sorghum, Maize, and Rice Genomes using Comparative Gene-Island Sequencing Plant Physiology, December 1, 2002; 130(4): 1614 - 1625. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. E. Balmer and R. Blomhoff Gene expression regulation by retinoic acid J. Lipid Res., November 1, 2002; 43(11): 1773 - 1808. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Colinas, K. Birnbaum, and P. N. Benfey Using Cauliflower to Find Conserved Non-Coding Regions in Arabidopsis Plant Physiology, June 1, 2002; 129(2): 451 - 454. [Full Text] [PDF] |
||||
![]() |
N. J. Kaplinsky, D. M. Braun, J. Penterman, S. A. Goff, and M. Freeling Utility and distribution of conserved noncoding sequences in the grasses PNAS, April 30, 2002; 99(9): 6147 - 6151. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. Kondrashov and S. A. Shabalina Classification of common conserved sequences in mammalian intergenic regions Hum. Mol. Genet., March 1, 2002; 11(6): 669 - 674. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. DeSilva, L. Elnitski, J. R. Idol, J. L. Doyle, W. Gan, J. W. Thomas, S. Schwartz, N. L. Dietrich, S. M. Beckstrom-Sternberg, J. C. McDowell, et al. Generation and Comparative Analysis of ~3.3 Mb of Mouse Genomic Sequence Orthologous to the Region of Human Chromosome 7q11.23 Implicated in Williams Syndrome Genome Res., January 1, 2002; 12(1): 3 - 15. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. P. Lane, T. Cutforth, J. Young, M. Athanasiou, C. Friedman, L. Rowen, G. Evans, R. Axel, L. Hood, and B. J. Trask Genomic analysis of orthologous mouse and human olfactory receptor loci PNAS, June 19, 2001; 98(13): 7390 - 7395. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Florea, C. Riemer, S. Schwartz, Z. Zhang, N. Stojanovic, W. Miller, and M. McClelland Web-based visualization tools for bacterial genome alignments Nucleic Acids Res., September 15, 2000; 28(18): 3486 - 3496. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Krivan and W. W. Wasserman A Predictive Model for Regulatory Sequences Directing Liver-Specific Transcription Genome Res., September 1, 2001; 11(9): 1559 - 1566. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. J. Kaplinsky, D. M. Braun, J. Penterman, S. A. Goff, and M. Freeling Utility and distribution of conserved noncoding sequences in the grasses PNAS, April 30, 2002; 99(9): 6147 - 6151. [Abstract] [Full Text] [PDF] |
||||







