Published online 14 April 2005
Article |
Masking repeats while clustering ESTs
1Computational Biology Unit, University of Bergen Bergen, Norway 2Department of Informatics, University of Bergen Bergen, Norway
*To whom correspondence should be addressed. Tel: +47 55584068; Fax: +47 55584199; Email: coward{at}ii.uib.no
Received December 20, 2004. Revised March 10, 2005. Accepted March 28, 2005.
A problem in EST clustering is the presence of repeat sequences. To avoid false matches, repeats have to be masked. This can be a time-consuming process, and it depends on available repeat libraries. We present a fast and effective method that aims to eliminate the problems repeats cause in the process of clustering. Unlike traditional methods, repeats are inferred directly from the EST data, we do not rely on any external library of known repeats. This makes the method especially suitable for analysing the ESTs from organisms without good repeat libraries. We demonstrate that the result is very similar to performing standard repeat masking before clustering.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. Masoudi-Nejad, S. Goto, R. Jauregui, M. Ito, S. Kawashima, Y. Moriya, T. R. Endo, and M. Kanehisa EGENES: Transcriptome-Based Plant Database of Genes with Metabolic Pathway Information and Expressed Sequence Tag Indices in KEGG Plant Physiology, June 1, 2007; 144(2): 857 - 866. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Malde, K. Schneeberger, E. Coward, and I. Jonassen RBR: library-less repeat detection for ESTs Bioinformatics, September 15, 2006; 22(18): 2232 - 2236. [Abstract] [Full Text] [PDF] |
||||

