Nucleic Acids Research, 2001, Vol. 29, No. 22 4633-4642
© 2001 Oxford University Press
REPuter: the manifold applications of repeat analysis on a genomic scale
Faculty of Technology, University of Bielefeld, PO Box 10 01 31, D-33501 Bielefeld, Germany, 1Artemis Pharmaceuticals, Neurather Ring 1, 51063 Köln, Germany and 2Max Planck Institute for Molecular Genetics, Department for Computational Molecular Biology, Ihnestrasse 73, 14195 Berlin, Germany
The repetitive structure of genomic DNA holds many secrets to be discovered. A systematic study of repetitive DNA on a genomic or inter-genomic scale requires extensive algorithmic support. The REPuter program described herein was designed to serve as a fundamental tool in such studies. Efficient and complete detection of various types of repeats is provided together with an evaluation of significance and interactive visualization. This article circumscribes the wide scope of repeat analysis using applications in five different areas of sequence analysis: checking fragment assemblies, searching for low copy repeats, finding unique sequences, comparing gene structures and mapping of cDNA/EST sequences.
* To whom correspondence should be addressed. Tel: +49 521 106 2906; Fax: +49 521 106 6411; Email: kurtz{at}techfak.uni-bielefeld.de
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. R. Shak, J. J. Dick, R. J. Meinersmann, G. I. Perez-Perez, and M. J. Blaser Repeat-Associated Plasticity in the Helicobacter pylori RD Gene Family J. Bacteriol., November 15, 2009; 191(22): 6900 - 6910. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Turmel, C. Otis, and C. Lemieux The Chloroplast Genomes of the Green Algae Pedinomonas minor, Parachlorella kessleri, and Oocystis solitaria Reveal a Shared Ancestry between the Pedinomonadales and Chlorellales Mol. Biol. Evol., October 1, 2009; 26(10): 2317 - 2331. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Mrazek Finding sequence motifs in prokaryotic genomes--a brief practical guide for a microbiologist Brief Bioinform, September 1, 2009; 10(5): 525 - 536. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Becher, A. Deymonnaz, and P. Heiber Efficient computation of all perfect repeats in genomic sequences of up to half a gigabyte, with a case study on the human genome Bioinformatics, July 15, 2009; 25(14): 1746 - 1753. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Turmel, M.-C. Gagnon, C. J. O'Kelly, C. Otis, and C. Lemieux The Chloroplast Genomes of the Green Algae Pyramimonas, Monomastix, and Pycnococcus Shed New light on the Evolutionary History of Prasinophytes and the Origin of the Secondary Chloroplasts of Euglenids Mol. Biol. Evol., March 1, 2009; 26(3): 631 - 648. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Rosenstein, C. Nerz, L. Biswas, A. Resch, G. Raddatz, S. C. Schuster, and F. Gotz Genome Analysis of the Meat Starter Culture Bacterium Staphylococcus carnosus TM300 Appl. Envir. Microbiol., February 1, 2009; 75(3): 811 - 822. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. R. Miller, A. L. Delcher, S. Koren, E. Venter, B. P. Walenz, A. Brownley, J. Johnson, K. Li, C. Mobarry, and G. Sutton Aggressive assembly of pyrosequencing reads with mates Bioinformatics, December 15, 2008; 24(24): 2818 - 2824. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. W. Innes, C. Ameline-Torregrosa, T. Ashfield, E. Cannon, S. B. Cannon, B. Chacko, N. W.G. Chen, A. Couloux, A. Dalwani, R. Denny, et al. Differential Accumulation of Retroelements and Diversification of NB-LRR Disease Resistance Genes in Duplicated Regions following Polyploidy in the Ancestor of Soybean Plant Physiology, December 1, 2008; 148(4): 1740 - 1759. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Wawrzynski, T. Ashfield, N. W.G. Chen, J. Mammadov, A. Nguyen, R. Podicheti, S. B. Cannon, V. Thareau, C. Ameline-Torregrosa, E. Cannon, et al. Replication of Nonautonomous Retroelements in Soybean Appears to Be Both Recent and Common Plant Physiology, December 1, 2008; 148(4): 1760 - 1771. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Merkel and N. Gemmell Detecting short tandem repeats from genome data: opening the software black box Brief Bioinform, September 1, 2008; 9(5): 355 - 366. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Klasson, T. Walker, M. Sebaihia, M. J. Sanders, M. A. Quail, A. Lord, S. Sanders, J. Earl, S. L. O'Neill, N. Thomson, et al. Genome Evolution of Wolbachia Strain wPip from the Culex pipiens Group Mol. Biol. Evol., September 1, 2008; 25(9): 1877 - 1887. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Thomas-Chollier, O. Sand, J.-V. Turatsinze, R. Janky, M. Defrance, E. Vervisch, S. Brohee, and J. van Helden RSAT: regulatory sequence analysis tools Nucleic Acids Res., July 1, 2008; 36(suppl_2): W119 - W127. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Gao and C.-T. Zhang Origins of Replication in Sorangium cellulosum and Microcystis aeruginosa DNA Res, June 1, 2008; 15(3): 169 - 171. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Hu, W. Fan, B. Han, H. Liu, D. Zheng, Q. Li, W. Dong, J. Yan, M. Gao, C. Berry, et al. Complete Genome Sequence of the Mosquitocidal Bacterium Bacillus sphaericus C3-41 and Comparison with Those of Closely Related Bacillus Species J. Bacteriol., April 15, 2008; 190(8): 2892 - 2902. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Brinkrolf, S. Ploger, S. Solle, I. Brune, S. S. Nentwich, A. T. Huser, J. Kalinowski, A. Puhler, and A. Tauch The LacI/GalR family transcriptional regulator UriR negatively controls uridine utilization of Corynebacterium glutamicum by binding to catabolite-responsive element (cre)-like sequences Microbiology, April 1, 2008; 154(4): 1068 - 1081. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-M. Chaw, A. Chun-Chieh Shih, D. Wang, Y.-W. Wu, S.-M. Liu, and T.-Y. Chou The Mitochondrial Genome of the Gymnosperm Cycas taitungensis Contains a Novel Family of Short Interspersed Elements, Bpu Sequences, and Abundant RNA Editing Sites Mol. Biol. Evol., March 1, 2008; 25(3): 603 - 615. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. R. Smith and R. W. Lee Mitochondrial Genome of the Colorless Green Alga Polytomella capuana: A Linear Molecule with an Unprecedented GC Content Mol. Biol. Evol., March 1, 2008; 25(3): 487 - 496. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. QUERA RAP: A computer program for exploring similarities in behavior sequences using random projections Behav Res Methods, February 1, 2008; 40(1): 21 - 32. [Abstract] [PDF] |
||||
![]() |
C. M. Bergman and H. Quesneville Discovering and detecting transposable elements in genome sequences Brief Bioinform, November 1, 2007; 8(6): 382 - 392. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Grissa, G. Vergnaud, and C. Pourcel CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats Nucleic Acids Res., July 13, 2007; 35(suppl_2): W52 - W57. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. G.W.M. Schijlen, C.H. R. de Vos, S. Martens, H. H. Jonker, F. M. Rosin, J. W. Molthoff, Y. M. Tikunov, G. C. Angenent, A. J. van Tunen, and A. G. Bovy RNA Interference Silencing of Chalcone Synthase, the First Step in the Flavonoid Biosynthesis Pathway, Leads to Parthenocarpic Tomato Fruits Plant Physiology, July 1, 2007; 144(3): 1520 - 1530. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-L. Lee, R. K. Jansen, T. W. Chumley, and K.-J. Kim Gene Relocations within Chloroplast Genomes of Jasminum and Menodora (Oleaceae) Are Due to Multiple, Overlapping Inversions Mol. Biol. Evol., May 1, 2007; 24(5): 1161 - 1180. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Flores, L. Morales, C. Gonzaga-Jauregui, R. Dominguez-Vidana, C. Zepeda, O. Yanez, M. Gutierrez, T. Lemus, D. Valle, Ma. C. Avila, et al. Inaugural Article: Recurrent DNA inversion rearrangements in the human genome PNAS, April 10, 2007; 104(15): 6099 - 6106. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Robbens, E. Derelle, C. Ferraz, J. Wuyts, H. Moreau, and Y. Van de Peer The Complete Chloroplast and Mitochondrial DNA Sequence of Ostreococcus tauri: Organelle Genomes of the Smallest Eukaryote Are Examples of Compaction Mol. Biol. Evol., April 1, 2007; 24(4): 956 - 968. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. E. Timme, J. V. Kuehl, J. L. Boore, and R. K. Jansen A comparative analysis of the Lactuca and Helianthus (Asteraceae) plastid genomes: identification of divergent regions and categorization of shared repeats Am. J. Botany, March 1, 2007; 94(3): 302 - 312. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Bakalis, C.S. Iliopoulos, C. Makris, S. Sioutas, E. Theodoridis, A. Tsakalidis, and K. Tsichlas Locating Maximal Multirepeats in Multiple Strings Under Various Constraints The Computer Journal, March 1, 2007; 50(2): 178 - 185. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Jobling, I. C. C. Lo, D. J. Turner, G. R. Bowden, A. C. Lee, Y. Xue, D. Carvalho-Silva, M. E. Hurles, S. M. Adams, Y. M. Chang, et al. Structural variation on the short arm of the human Y chromosome: recurrent multigene deletions encompassing Amelogenin Y Hum. Mol. Genet., February 1, 2007; 16(3): 307 - 316. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Lindroos, O. Vinnere, A. Mira, D. Repsilber, K. Naslund, and S. G. E. Andersson Genome Rearrangements, Deletions, and Amplifications in the Natural Population of Bartonella henselae J. Bacteriol., November 1, 2006; 188(21): 7426 - 7439. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. W. Chumley, J. D. Palmer, J. P. Mower, H. M. Fourcade, P. J. Calie, J. L. Boore, and R. K. Jansen The Complete Chloroplast Genome Sequence of Pelargonium x hortorum: Organization and Evolution of the Largest and Most Highly Rearranged Chloroplast Genome of Land Plants Mol. Biol. Evol., November 1, 2006; 23(11): 2175 - 2190. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Bonemann, M. Stiens, A. Puhler, and A. Schluter Mobilizable IncQ-Related Plasmid Carrying a New Quinolone Resistance Gene, qnrS2, Isolated from the Bacterial Community of a Wastewater Treatment Plant. Antimicrob. Agents Chemother., September 1, 2006; 50(9): 3075 - 3080. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. P. Duffy, A. M. Young, B. Morin, C. J. Lucarotti, B. F. Koop, and D. B. Levin Sequence Analysis and Organization of the Neodiprion abietis Nucleopolyhedrovirus Genome J. Virol., July 15, 2006; 80(14): 6952 - 6963. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Dezulian, M. Schaefer, R. Wiese, D. Weigel, and D. H. Huson CrossLink: visualization and exploration of sequence relationships between (micro) RNAs. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W400 - W404. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Stiens, S. Schneiker, M. Keller, S. Kuhn, A. Puhler, and A. Schluter Sequence Analysis of the 144-Kilobase Accessory Plasmid pSmeSM11a, Isolated from a Dominant Sinorhizobium meliloti Strain Identified during a Long-Term Field Release Experiment. Appl. Envir. Microbiol., May 1, 2006; 72(5): 3662 - 3672. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Gerlach and R. Giegerich GUUGle: a utility for fast exact matching under RNA complementary rules including G-U base pairing Bioinformatics, March 15, 2006; 22(6): 762 - 764. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Martins, D. de Sousa, K. Proite, P. Guimaraes, M. Moretzsohn, and D. Bertioli New softwares for automated microsatellite marker development Nucleic Acids Res., February 21, 2006; 34(4): e31 - e31. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-I. Won, S. Park, J.-H. Yoon, and S.-W. Kim An efficient approach for sequence matching in large DNA databases Journal of Information Science, February 1, 2006; 32(1): 88 - 104. [Abstract] [PDF] |
||||
![]() |
M. W. E. J. Fiers, H. van de Wetering, T. H. J. M. Peeters, J. J. van Wijk, and J.-P. Nap DNAVis: interactive visualization of comparative genome annotations Bioinformatics, February 1, 2006; 22(3): 354 - 355. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. J. P. Sobreira, A. M. Durham, and A. Gruber TRAP: automated classification, quantification and annotation of tandemly repeated sequences Bioinformatics, February 1, 2006; 22(3): 361 - 362. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Morgulis, E. M. Gertz, A. A. Schaffer, and R. Agarwala WindowMasker: window-based masker for sequenced genomes Bioinformatics, January 15, 2006; 22(2): 134 - 141. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Huang, S.-P. Yang, A. T. Chinwalla, L. W. Hillier, P. Minx, E. R. Mardis, and R. K. Wilson Application of a superword array in genome assembly Nucleic Acids Res., January 5, 2006; 34(1): 201 - 205. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-F. Pombert, C. Otis, C. Lemieux, and M. Turmel The Chloroplast Genome Sequence of the Green Alga Pseudendoclonium akinetum (Ulvophyceae) Reveals Unusual Structural Features and New Insights into the Branching Order of Chlorophyte Lineages Mol. Biol. Evol., September 1, 2005; 22(9): 1903 - 1918. [Abstract] [Full Text] [PDF] |
||||
![]() |
K.-J. Kim, K.-S. Choi, and R. K. Jansen Two Chloroplast DNA Inversions Originated Simultaneously During the Early Evolution of the Sunflower Family (Asteraceae) Mol. Biol. Evol., September 1, 2005; 22(9): 1783 - 1792. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Szczepanowski, S. Braun, V. Riedel, S. Schneiker, I. Krahn, A. Puhler, and A. Schluter The 120 592 bp IncF plasmid pRSB107 isolated from a sewage-treatment plant encodes nine different antibiotic-resistance determinants, two iron-acquisition systems and other putative virulence-associated functions Microbiology, April 1, 2005; 151(4): 1095 - 1111. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Campagna, C. Romualdi, N. Vitulo, M. Del Favero, M. Lexa, N. Cannata, and G. Valle RAP: a new computer program for de novo identification of repeated sequences in whole genomes Bioinformatics, March 1, 2005; 21(5): 582 - 588. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Zhang and M. S. Waterman An Eulerian path approach to local multiple alignment for DNA sequences PNAS, February 1, 2005; 102(5): 1285 - 1290. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. A. Brayton, L. S. Kappmeyer, D. R. Herndon, M. J. Dark, D. L. Tibbals, G. H. Palmer, T. C. McGuire, and D. P. Knowles Jr. Complete genome sequencing of Anaplasma marginale reveals that the surface is skewed to two superfamilies of outer membrane proteins PNAS, January 18, 2005; 102(3): 844 - 849. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Warren, W. W. L. Hsiao, H. Kudo, M. Myhre, M. Dosanjh, A. Petrescu, H. Kobayashi, S. Shimizu, K. Miyauchi, E. Masai, et al. Functional Characterization of a Catabolic Plasmid from Polychlorinated- Biphenyl-Degrading Rhodococcus sp. Strain RHA1 J. Bacteriol., November 15, 2004; 186(22): 7783 - 7795. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. C. Minion, E. J. Lefkowitz, M. L. Madsen, B. J. Cleary, S. M. Swartzell, and G. G. Mahairas The Genome Sequence of Mycoplasma hyopneumoniae Strain 232, the Agent of Swine Mycoplasmosis J. Bacteriol., November 1, 2004; 186(21): 7123 - 7133. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Szczepanowski, I. Krahn, B. Linke, A. Goesmann, A. Puhler, and A. Schluter Antibiotic multiresistance plasmid pRSB101 isolated from a wastewater treatment plant is related to plasmids residing in phytopathogenic bacteria and carries eight different resistance determinants including a multidrug transport system Microbiology, November 1, 2004; 150(11): 3613 - 3630. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. A. Pevzner, H. Tang, and G. Tesler De Novo Repeat Classification and Fragment Assembly Genome Res., September 1, 2004; 14(9): 1786 - 1796. [Abstract] [Full Text] [PDF] |
||||
![]() |
G Van Buggenhout, C Melotte, B Dutta, G Froyen, P Van Hummelen, P Marynen, G Matthijs, T de Ravel, K Devriendt, J P Fryns, et al. Mild Wolf-Hirschhorn syndrome: micro-array CGH analysis of atypical 4p16.3 deletions enables refinement of the genotype-phenotype map J. Med. Genet., September 1, 2004; 41(9): 691 - 698. [Full Text] [PDF] |
||||
![]() |
C. T. Storlazzi, T. Fioretos, K. Paulsson, B. Strombeck, C. Lassen, T. Ahlgren, G. Juliusson, F. Mitelman, M. Rocchi, and B. Johansson Identification of a commonly amplified 4.3 Mb region with overexpression of C8FW, but not MYC in MYC-containing double minutes in myeloid malignancies Hum. Mol. Genet., July 15, 2004; 13(14): 1479 - 1485. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-F. Pombert, C. Otis, C. Lemieux, and M. Turmel The Complete Mitochondrial DNA Sequence of the Green Alga Pseudendoclonium akinetum (Ulvophyceae) Highlights Distinctive Evolutionary Trends in the Chlorophyta and Suggests a Sister-Group Relationship Between the Ulvophyceae and Chlorophyceae Mol. Biol. Evol., May 1, 2004; 21(5): 922 - 935. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Rise, K. R. von Schalburg, G. D. Brown, M. A. Mawer, R. H. Devlin, N. Kuipers, M. Busby, M. Beetz-Sargent, R. Alberto, A. R. Gibbs, et al. Development and Application of a Salmonid EST Database and cDNA Microarray: Data Mining and Interspecific Hybridization Characteristics Genome Res., March 1, 2004; 14(3): 478 - 490. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Ben Zakour, M. Gautier, R. Andonov, D. Lavenier, M.-F. Cochet, P. Veber, A. Sorokin, and Y. Le Loir GenoFrag: software to design primers optimized for whole genome scanning by long-range PCR amplification Nucleic Acids Res., January 2, 2004; 32(1): 17 - 24. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Schluter, H. Heuer, R. Szczepanowski, L. J. Forney, C. M. Thomas, A. Puhler, and E. M. Top The 64 508 bp IncP-1{beta} antibiotic multiresistance plasmid pB10 isolated from a waste-water treatment plant provides evidence for recombination between members of different branches of the IncP-1{beta} group Microbiology, November 1, 2003; 149(11): 3139 - 3153. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Healy, E. E. Thomas, J. T. Schwartz, and M. Wigler Annotating Large Genomes With Exact Word Matches Genome Res., October 1, 2003; 13(10): 2306 - 2315. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Eder, M. Ventura, M. Ianigro, M. Teti, M. Rocchi, and N. Archidiacono Chromosome 6 Phylogeny in Primates and Centromere Repositioning Mol. Biol. Evol., September 1, 2003; 20(9): 1506 - 1512. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Li and M. S. Waterman Estimating the Repeat Structure and Length of DNA Sequences Using {ell}-Tuples Genome Res., August 1, 2003; 13(8): 1916 - 1922. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. O. Glockner, M. Kube, M. Bauer, H. Teeling, T. Lombardot, W. Ludwig, D. Gade, A. Beck, K. Borzym, K. Heitmann, et al. Complete genome sequence of the marine planctomycete Pirellula sp. strain 1 PNAS, July 8, 2003; 100(14): 8298 - 8303. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. Dong, L. Roy, M. Freeling, V. Walbot, and V. Brendel ZmDB, an integrated database for maize genome research Nucleic Acids Res., January 1, 2003; 31(1): 244 - 247. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. Bao, Y. Tian, W. Li, Z. Xu, Z. Xuan, S. Hu, W. Dong, J. Yang, Y. Chen, Y. Xue, et al. A Complete Sequence of the T. tengcongensis Genome Genome Res., May 1, 2002; 12(5): 689 - 700. [Abstract] [Full Text] [PDF] |
||||


















