IsoFinder: computational prediction of isochores in genome sequences
Departamento de Genética, Instituto de Biotecnología, Facultad de Ciencias, Universidad de Granada and 1 Departamento de Física Aplicada II, Universidad de Málaga, Spain
* To whom correspondence should be addressed. Tel: +34 958243261; Fax: +34 958244073; Email: oliver{at}ugr.es
Received January 22, 2004; Revised March 4, 2004; Accepted March 25, 2004
Isochores are long genome segments homogeneous in G+C. Here, we describe an algorithm (IsoFinder) running on the web (http://bioinfo2.ugr.es/IsoF/isofinder.html) able to predict isochores at the sequence level. We move a sliding pointer from left to right along the DNA sequence. At each position of the pointer, we compute the mean G+C values to the left and to the right of the pointer. We then determine the position of the pointer for which the difference between left and right mean values (as measured by the t-statistic) reaches its maximum. Next, we determine the statistical significance of this potential cutting point, after filtering out short-scale heterogeneities below 3 kb by applying a coarse-graining technique. Finally, the program checks whether this significance exceeds a probability threshold. If so, the sequence is cut at this point into two subsequences; otherwise, the sequence remains undivided. The procedure continues recursively for each of the two resulting subsequences created by each cut. This leads to the decomposition of a chromosome sequence into long homogeneous genome regions (LHGRs) with well-defined mean G+C contents, each significantly different from the G+C contents of the adjacent LHGRs. Most LHGRs can be identified with Bernardi's isochores, given their correlation with biological features such as gene density, SINE and LINE (short, long interspersed repetitive elements) densities, recombination rate or single nucleotide polymorphism variability. The resulting isochore maps are available at our web site (http://bioinfo2.ugr.es/isochores/), and also at the UCSC Genome Browser (http://genome.cse.ucsc.edu/).
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
M. Hackenberg and R. Matthiesen Annotation-Modules: a tool for finding significant combinations of multisource annotations for gene lists Bioinformatics, June 1, 2008; 24(11): 1386 - 1393. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Gao and C.-T. Zhang GC-Profile: a web-based tool for visualizing and analyzing the variation of GC content in genomic sequences. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W686 - W691. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Costantini, O. Clay, F. Auletta, and G. Bernardi An isochore map of human chromosomes. Genome Res., April 1, 2006; 16(4): 536 - 541. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Clay and G. Bernardi How Not to Search for Isochores: A Reply to Cohen et al Mol. Biol. Evol., December 1, 2005; 22(12): 2315 - 2317. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Gueguen Sarment: Python modules for HMM analysis and partitioning of sequences Bioinformatics, August 15, 2005; 21(16): 3427 - 3428. [Abstract] [Full Text] [PDF] |
||||



