Skip Navigation

This Article
Right arrow Full Text Freely available
Right arrow Print PDF (583K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (53)
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Stojanovic, N.
Right arrow Articles by Hardison, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Stojanovic, N.
Right arrow Articles by Hardison, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, Vol 27, Issue 19 3899-3910, Copyright © 1999 by Oxford University Press


ARTICLES

Comparison of five methods for finding conserved sequences in multiple alignments of gene regulatory regions

N Stojanovic, L Florea, C Riemer, D Gumucio, J Slightom, M Goodman, W Miller and R Hardison
Department of Computer Science and Engineering, The Pennsylvania State University, University Park 16802, USA.

Conserved segments in DNA or protein sequences are strong candidates for functional elements and thus appropriate methods for computing them need to be developed and compared. We describe five methods and computer programs for finding highly conserved blocks within previously computed multiple alignments, primarily for DNA sequences. Two of the methods are already in common use; these are based on good column agreement and high information content. Three additional methods find blocks with minimal evolutionary change, blocks that differ in at most k positions per row from a known center sequence and blocks that differ in at most k positions per row from a center sequence that is unknown a priori. The center sequence in the latter two methods is a way to model potential binding sites for known or unknown proteins in DNA sequences. The efficacy of each method was evaluated by analysis of three extensively analyzed regulatory regions in mammalian beta-globin gene clusters and the control region of bacterial arabinose operons. Although all five methods have quite different theoretical underpinnings, they produce rather similar results on these data sets when their parameters are adjusted to best approximate the experimental data. The optimal parameters for the method based on information content varied little for different regulatory regions of the beta- globin gene cluster and hence may be extrapolated to many other regulatory regions. The programs based on maximum allowed mismatches per row have simple parameters whose values can be chosen a priori and thus they may be more useful than the other methods when calibration against known functional sites is not available.
Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
The Computer JournalHome page
F. Huffner, R. Niedermeier, and S. Wernicke
Techniques for Practical Fixed-Parameter Algorithms
The Computer Journal, January 1, 2008; 51(1): 7 - 25.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
S. Kumar and A. Filipski
Multiple sequence alignment: In pursuit of homologous DNA positions
Genome Res., February 1, 2007; 17(2): 127 - 135.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
H. Wang, Y. Zhang, Y. Cheng, Y. Zhou, D. C. King, J. Taylor, F. Chiaromonte, J. Kasturi, H. Petrykowska, B. Gibb, et al.
Experimental validation of predicted mammalian erythroid cis-regulatory modules
Genome Res., December 1, 2006; 16(12): 1480 - 1492.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
A. Siepel, G. Bejerano, J. S. Pedersen, A. S. Hinrichs, M. Hou, K. Rosenbloom, H. Clawson, J. Spieth, L. W. Hillier, S. Richards, et al.
Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes
Genome Res., August 1, 2005; 15(8): 1034 - 1050.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
E. T. Dermitzakis, E. Kirkness, S. Schwarz, E. Birney, A. Reymond, and S. E. Antonarakis
Comparison of Human Chromosome 21 Conserved Nongenic Sequences (CNGs) With the Mouse and Dog Genomes Shows That Their Selective Constraint Is Independent of Their Genic Environment
Genome Res., May 1, 2004; 14(5): 852 - 859.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
D. Kolbe, J. Taylor, L. Elnitski, P. Eswara, J. Li, W. Miller, R. Hardison, and F. Chiaromonte
Regulatory Potential Scores From Genome-Wide Three-Way Alignments of Human, Mouse, and Rat
Genome Res., April 1, 2004; 14(4): 700 - 707.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
M. A. Chapman, I. J. Donaldson, J. Gilbert, D. Grafham, J. Rogers, A. R. Green, and B. Gottgens
Analysis of Multiple Genomic Sequence Alignments: A Web Resource, Online Tools, and Lessons Learned From Analysis of Mammalian SCL Loci
Genome Res., February 1, 2004; 14(2): 313 - 318.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
E. Berezikov, V. Guryev, R. H.A. Plasterk, and E. Cuppen
CONREAL: Conserved Regulatory Elements Anchored Alignment Algorithm for Identification of Transcription Factor Binding Sites by Phylogenetic Footprinting
Genome Res., January 1, 2004; 14(1): 170 - 178.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
E. H. Margulies, M. Blanchette, NISC Comparative Sequencing Program, D. Haussler, and E. D. Green
Identification and Characterization of Multi-Species Conserved Sequences
Genome Res., December 1, 2003; 13(12): 2507 - 2518.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
E. T. Dermitzakis, A. Reymond, N. Scamuffa, C. Ucla, E. Kirkness, C. Rossier, and S. E. Antonarakis
Evolutionary Discrimination of Mammalian Conserved Non-Genic Sequences (CNGs)
Science, November 7, 2003; 302(5647): 1033 - 1035.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. Florea, M. McClelland, C. Riemer, S. Schwartz, and W. Miller
EnteriX 2003: visualization tools for genome alignments of Enterobacteriaceae
Nucleic Acids Res., July 1, 2003; 31(13): 3527 - 3532.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
K. A. Frazer, L. Elnitski, D. M. Church, I. Dubchak, and R. C. Hardison
Cross-Species Sequence Comparisons: A Review of Methods and Available Resources
Genome Res., January 1, 2003; 13(1): 1 - 12.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
A. C.W. May
Definition of the tempo of sequence diversity across an alignment and automatic identification of sequence motifs: Application to protein homologous families and superfamilies
Protein Sci., December 1, 2002; 11(12): 2825 - 2835.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
D. T. Morishige, K. L. Childs, L. D. Moore, and J. E. Mullet
Targeted Analysis of Orthologous Phytochrome A Regions of the Sorghum, Maize, and Rice Genomes using Comparative Gene-Island Sequencing
Plant Physiology, December 1, 2002; 130(4): 1614 - 1625.
[Abstract] [Full Text] [PDF]


Home page
J. Lipid Res.Home page
J. E. Balmer and R. Blomhoff
Gene expression regulation by retinoic acid
J. Lipid Res., November 1, 2002; 43(11): 1773 - 1808.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
J. Colinas, K. Birnbaum, and P. N. Benfey
Using Cauliflower to Find Conserved Non-Coding Regions in Arabidopsis
Plant Physiology, June 1, 2002; 129(2): 451 - 454.
[Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
N. J. Kaplinsky, D. M. Braun, J. Penterman, S. A. Goff, and M. Freeling
Utility and distribution of conserved noncoding sequences in the grasses
PNAS, April 30, 2002; 99(9): 6147 - 6151.
[Abstract] [Full Text] [PDF]


Home page
Hum Mol GenetHome page
A. S. Kondrashov and S. A. Shabalina
Classification of common conserved sequences in mammalian intergenic regions
Hum. Mol. Genet., March 1, 2002; 11(6): 669 - 674.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
U. DeSilva, L. Elnitski, J. R. Idol, J. L. Doyle, W. Gan, J. W. Thomas, S. Schwartz, N. L. Dietrich, S. M. Beckstrom-Sternberg, J. C. McDowell, et al.
Generation and Comparative Analysis of ~3.3 Mb of Mouse Genomic Sequence Orthologous to the Region of Human Chromosome 7q11.23 Implicated in Williams Syndrome
Genome Res., January 1, 2002; 12(1): 3 - 15.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
R. P. Lane, T. Cutforth, J. Young, M. Athanasiou, C. Friedman, L. Rowen, G. Evans, R. Axel, L. Hood, and B. J. Trask
Genomic analysis of orthologous mouse and human olfactory receptor loci
PNAS, June 19, 2001; 98(13): 7390 - 7395.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. Florea, C. Riemer, S. Schwartz, Z. Zhang, N. Stojanovic, W. Miller, and M. McClelland
Web-based visualization tools for bacterial genome alignments
Nucleic Acids Res., September 15, 2000; 28(18): 3486 - 3496.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
W. Krivan and W. W. Wasserman
A Predictive Model for Regulatory Sequences Directing Liver-Specific Transcription
Genome Res., September 1, 2001; 11(9): 1559 - 1566.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
N. J. Kaplinsky, D. M. Braun, J. Penterman, S. A. Goff, and M. Freeling
Utility and distribution of conserved noncoding sequences in the grasses
PNAS, April 30, 2002; 99(9): 6147 - 6151.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.