Published online 20 October 2005
Article |
Draft versus finished sequence data for DNA and protein diagnostic signature development
Pathogen Bio-Informatics, Lawrence Livermore National Laboratory PO Box 808, L-174, Livermore, CA 94551, USA
*To whom correspondence should be addressed. Tel: +1 925 422 4317; Fax: +1 925 423 6437; Email: gardner26{at}llnl.gov
Received September 23, 2005. Accepted September 27, 2005.
Sequencing pathogen genomes is costly, demanding careful allocation of limited sequencing resources. We built a computational Sequencing Analysis Pipeline (SAP) to guide decisions regarding the amount of genomic sequencing necessary to develop high-quality diagnostic DNA and protein signatures. SAP uses simulations to estimate the number of target genomes and close phylogenetic relatives (near neighbors or NNs) to sequence. We use SAP to assess whether draft data are sufficient or finished sequencing is required using Marburg and variola virus sequences. Simulations indicate that intermediate to high-quality draft with error rates of 103105 (
8x coverage) of target organisms is suitable for DNA signature prediction. Low-quality draft with error rates of
1% (3x to 6x coverage) of target isolates is inadequate for DNA signature prediction, although low-quality draft of NNs is sufficient, as long as the target genomes are of high quality. For protein signature prediction, sequencing errors in target genomes substantially reduce the detection of amino acid sequence conservation, even if the draft is of high quality. In summary, high-quality draft of target and low-quality draft of NNs appears to be a cost-effective investment for DNA signature prediction, but may lead to underestimation of predicted protein signatures.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. P. Malanoski, B. Lin, and D. A. Stenger A model of base-call resolution on broad-spectrum pathogen detection resequencing DNA microarrays Nucleic Acids Res., April 15, 2008; (2008) gkm1156v1. [Abstract] [Full Text] [PDF] |
||||
