Skip Navigation

Nucleic Acids Research 2004 32(17):5183-5191; doi:10.1093/nar/gkh850
This Article
Right arrow Full Text Freely available
Right arrow Print PDF (390K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (3)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Li, M.
Right arrow Articles by Li, L. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Li, M.
Right arrow Articles by Li, L. M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Published online 30 September 2004

Nucleic Acids Research, Vol. 32 No. 17 © Oxford University Press 2004; all rights reserved

Adjust quality scores from alignment and improve sequencing accuracy

Ming Li, Magnus Nordborg and Lei M. Li*

Computational Biology, University of Southern California, Los Angeles, CA, USA

* To whom correspondence should be addressed. Tel: +1 213 740 2407; Fax: +1 213 740 2437; Email: lilei{at}usc.edu

Received July 9, 2004; Revised and Accepted September 8, 2004

In shotgun sequencing, statistical reconstruction of a consensus from alignment requires a model of measurement error. Churchill and Waterman proposed one such model and an expectation–maximization (EM) algorithm to estimate sequencing error rates for each assembly matrix. Ewing and Green defined Phred quality scores for base-calling from sequencing traces by training a model on a large amount of data. However, sample preparations and sequencing machines may work under different conditions in practice and therefore quality scores need to be adjusted. Moreover, the information given by quality scores is incomplete in the sense that they do not describe error patterns. We observe that each nucleotide base has its specific error pattern that varies across the range of quality values. We develop models of measurement error for shotgun sequencing by combining the two perspectives above. We propose a logistic model taking quality scores as covariates. The model is trained by a procedure combining an EM algorithm and model selection techniques. The training results in calibration of quality values and leads to a more accurate construction of consensus. Besides Phred scores obtained from ABI sequencers, we apply the same technique to calibrate quality values that come along with Beckman sequencers.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Genome ResHome page
W. Qu, S.-i. Hashimoto, and S. Morishita
Efficient frequency-based de novo short-read clustering for error trimming in next-generation sequencing
Genome Res., July 1, 2009; 19(7): 1309 - 1315.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
F. De Bona, S. Ossowski, K. Schneeberger, and G. Ratsch
Optimal spliced alignments of short sequence reads
Bioinformatics, August 15, 2008; 24(16): i174 - i180.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
P. L.F. Johnson and M. Slatkin
Inference of population genetic parameters in metagenomics: A clean look at messy data
Genome Res., October 1, 2006; 16(10): 1320 - 1327.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.