Nucleic Acids Research, 1995, Vol. 23, No. 15 2900-2908
© 1995
COMPUTATIONAL BIOLOGY |
A frameshift error detection algorithm for DNA sequencing projects
Institut de Geèneètique et Microbiologie, Universoteé Paris-Sud Baätiment 400, 9405 Orsay Cedex, France 1Laboratorie de Chimie Bacteèrienne CNRS, 3 Chemin Joseph Aiguier, BP 7, 13277 Marseille Cedex 9, France
*To Whom correspondence should be addressed
Received April 24, 1995. Accepted June 20, 1995.
During the determination of DNA sequences, frameshif errors are not the most frequent but they are the most bothersome as they corrupt the amino acid sequence over several residues. Detection of such errors by sequence alignment is only possible when related sequences are found in the databases. To avoid this limitation, we have developed a new tool based on the distribution of non-overlapping 3-tuples or 6-tuples In the three frames of an ORF. The method relies upon the result of a correspondence analysis. It has been extensively tested on Bacillus subtllls and Saccharomyces cerevlsiae sequences and has also been examined with human sequences. The results indicate that It can detect frameshift errors affecting as few as 20 bp with a low rate of false positives (no more than 1.0/1000 bp scanned). The proposed algorithm can be used to scan a large collection of data, but it is mainly intended for laboratory practice as a tool for checking the quality of the sequences produced during a sequencing project.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
N. Gupta, J. Benhamida, V. Bhargava, D. Goodman, E. Kain, I. Kerman, N. Nguyen, N. Ollikainen, J. Rodriguez, J. Wang, et al. Comparative proteogenomics: Combining mass spectrometry and comparative genomics to analyze multiple genomes Genome Res., July 1, 2008; 18(7): 1133 - 1142. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Moon, Y. Byun, H.-J. Kim, S. Jeong, and K. Han Predicting genes expressed via -1 and +1 frameshifts Nucleic Acids Res., September 15, 2004; 32(16): 4884 - 4892. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Perriere and J. Thioulouse Use and misuse of correspondence analysis in codon usage studies Nucleic Acids Res., October 15, 2002; 30(20): 4548 - 4555. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Mathe, M.-F. Sagot, T. Schiex, and P. Rouze Current methods of gene prediction, their strengths and weaknesses Nucleic Acids Res., October 1, 2002; 30(19): 4103 - 4117. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Médigue, M. Rose, A. Viari, and A. Danchin Detecting and Analyzing DNA Sequencing Errors: Toward a Higher Quality of the Bacillus subtilis Genome Sequence Genome Res., November 1, 1999; 9(11): 1116 - 1127. [Abstract] [Full Text] |
||||

