Skip Navigation

This Article
Right arrow Print PDF (521K)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (8)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Cuticchia, A. J.
Right arrow Articles by Arnold, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Cuticchia, A. J.
Right arrow Articles by Arnold, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 1992, Vol. 20, No. 14 3651-3657
© 1992


GENOME STRUCTURE AND MAPPING

The application of markov chain analysis to oligonucleotide frequency prediction and physical mappingof Drosophila melanogaster

A. J. Cuticchia, R. Ivarie1 and J. Arnold1,*

Department of Medicine, Johns Hopkins Medical School Baltimore, MD 1Department of Genetics, The University of Georgia Athens, GA 30602, USA

*To whom correspondence should be addressed

Received March 31, 1992. Accepted June 12, 1992.

Here we compare several methods for predicting oligonucleotide frequencies in 691 kb of Drosophila melanogaster DNA. As in previous work on Escherichia coll and Saccharomyces cerevlslae, a relatively simple equation based on tetranucleotide frequencies can be used in predicting frequencies of higher order oligonucleotides. For example, the mean of observed/expected abundances of 4,096 hexamers was 1.07 with a sample standard deviation of .55. This simple predictor arises by considering each base on the sense strand of D.melanogaster to depend only on the three bases 5' to it (a 3rd order Markov chain) and is more accurate than the random predictor. This equation Is useful in predicting restriction enzyme fragment sizes, selecting restriction enzymes that cut preferentially in coding vs noncoding regions, and in selecting probes to fingerprint clones in contig mapping. Once again, this equation well predicts the occurrence of higher order oligonucleotides, supporting our hypothesis that this predictor holds In evolutlonarily diverse organisms. When ranked from highest to lowest abundance, the observed frequencies of oligomers of a given length are closely tracked by the predicted abundances of a 3rd order Markov chain. Through use of the dependence of oligomer frequencies on base composition, we report a list of oligomers that will be useful for the completion of a cosmid physical map of D.melanogaster. Presently, the library is such that it will be possible to construct large contigs using only 30 oligonucleotide probes to fingerprint cosmids.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Stat Methods Med ResHome page
B. Weir
Analysis of DNA sequences
Statistical Methods in Medical Research, November 1, 1993; 2(3): 225 - 239.
[Abstract] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.