| Nucleic Acids Research | Pages |
Analyzing genomes with cumulative skew diagrams
Introduction
Results
Microorganisms
AT versus GC skew
Genome rearrangements
Other genomes
Discussion
Acknowledgements
References
Analyzing genomes with cumulative skew diagrams
ABSTRACT
INTRODUCTION
The human mitochondrial genome of <17 kilobase pairs (kb) was sequenced 17 years ago (1). Today, the 50-750 times longer complete genome sequences of the yeast Saccharomyces cerevisiae (2), bacteria Bacillus subtilis (3), Borrelia burgdorferi (4), Escherichia coli (5), Haemophilus influenzae (6), Helicobacter pylori (7), Mycoplasma genitalium (8), Mycoplasma pneumoniae (9), Synechocystis sp. strain PCC6803 (10) and archaea Archaeoglobus fulgidus (11), Methanococcus jannaschii (12) and Methanobacterium thermoautotrophicum (13) are available. In silico analysis of such sequences, cataloguing open reading frames (ORFs) and other structural elements, accelerates molecular biology studies by traditional methods.
Studies of genome replication and rearrangements can also benefit from the analysis of complete sequences. I demonstrate this here by exploiting an early hypothesis of differential mutation occurring in leading and lagging strands due to asymmetry of the replication mechanism: combined with natural selection, this factor may be responsible for a skewed base distribution along the genome (14-16). A report on the E.coli genome sequence (5) included a GC skew plot, earlier used for smaller regions by Lobry (15), for the whole chromosome. The skew, calculated as (G-C)/(G+C) for a window sliding along the sequence, was shown to switch polarity in the vicinity of the terminus (ter) and origin (ori) of replication, with the leading strand manifesting a positive skew. Such plots may not always be very illustrative due to visible fluctuations for a small window size, while larger windows may hide precise coordinates of polarity switches. Also, as shown below, some of the local polarity switches are important.
Instead, one can use a far more convenient cumulative GC skew: a sum of (G-C)/(G+C) in adjacent windows from an arbitrary start to a given point in a sequence. Similar to integration of the skew function over the DNA length, this value reaches its global maximum at the E.coli terminus, while the minimum resides over the replication origin. Figure 1A gives an example of the M.pneumoniae sequence (9), where polarity switches are much harder to detect in the skew plot, compared to E.coli (5), but the cumulative plot readily reveals them.
Figure 1. Cumulative diagrams. Organism and E.coli strain names are color-coded as shown in each diagram. Y axis (note how the ranges vary between different panels) shows the cumulative skew, calculated as described in the text. To avoid the dependence on the window size w and chromosome length c, the skew values are multiplied by w/c. For all diagrams, w/c <0.5%. To allow for plotting graphs for several genomes, the bp coordinate is replaced by a percentage of the genome size on the X axis; zero is selected at base pair 1 of each sequence. The point where a plot intersects the right-hand vertical axis gives a total skew value over the whole genome (for any base pair in a circular sequence, selected as a starting point, the total skew would be the same as well as the distance between global extremities). GC skew versus cumulative skew, putative origin and terminus locations (A). GC diagrams for circular bacterial chromosomes (B), linear chromosome and plasmids (C), circular archaeal chromosomes (D). GC and AT diagrams for two E.coli strains, vertical lines at 39 and 55% show locations of an inversion and deletion, respectively. (E) GC diagrams and second order polynomial trendlines for adenoviral and mitochondrial genomes; for the latter, oriL is marked by the vertical line, oriH is at 0/100% (F). For an imaginary genome which underwent no other sequence changes apart from constant rate strand-specific substitutions (with constant skew values for each strand) such a plot will consist of straight lines. Plots of cumulative GC (or AT) skew are termed GC (or AT) diagrams in this paper. For sequences starting at a ter/ori site, diagrams will have characteristic V/inverted V-shapes for bi-directional replication between singular ori and ter. The two global extremities separated by half of the chromosome length will indicate the points where the properties of two strands switch. Recent sequence inversions will be visible as local extremities located at the inversion boundaries.
RESULTS
Microorganisms
Analysis of 14 microbial genomes has shown that GC diagram extremities nearly coincide in every case with known and putative ori/ter sites, the distances between global extremities are very close to 50% of the respective genome lengths and the minimum is always located at the origin (Fig. 1A-D and Table 1). The largest discrepancy is with the B.subtilis terminus assigned to a region between a pair of terminator sequences, arresting a replication fork near the rtp gene at 2017 kb (3). However, the diagram maximum is located at 1.94 Mb, precisely between rtp and another pair that includes a recently identified terminator next to glnA (17).
A simple GC skew analysis has been presented for the B.burgdorferi sequence (4). While finding a polarity switch for its linear chromosome, it failed to detect any in its multiple plasmids. However, diagrams for some of them (e.g., lp25 and lp17; Fig. 1C) display clear V-shapes, suggesting centrally located origins, as is the case for the chromosome (18).
The diagrams for the archaea M.jannaschii and M.thermoautotrophicum also feature pairs of global extremities half-chromosomes away from each other (Fig. 1D). Near the minimum of the M.thermoautotrophicum diagram there is a putative homolog of the B.subtilis gene soj (10 kb from the B.subtilis origin), which may be involved in replication control and chromosome partitioning (19).
Table 1
| Organism | Replication | GC diagram | AT diagram | ||||
| (bacteria, archaea) | ori | ter | min | max | min-max | min | max |
| Diagrams in the opposite phase (GC minimum is AT maximum) | |||||||
| E.coli (see Fig. 1E) | 84.5 | 34.1 | 84.5 | 33.4 | 48.9 | 43.1 | 84.5 |
| H.influenzae | 32.9 | 80.1 | 33.1 | 80.4 | 47.3 | 82.4 | 32.5 |
| C.trachomatisa | n/a | n/a | 68.9 | 18.6 | 49.7 | 25.2 | 68.9 |
| M.tuberculosisa | 0 | n/a | 0 | 46.2 | 46.2 | 48.4 | 0 |
| T.palliduma | n/a | n/a | 35.5 | 87.0 | 48.5 | 87.0 | 35.5 |
| B.burgdorferi | 50.1 | 0 | 49.5 | 0 | 49.5 | 0 | 49.5 |
| Diagrams in the same phase (GC minimum is AT minimum) | |||||||
| B.subtilis | 0 | 47.8 | 0 | 46.0 | 46.0 | 0 | 46.0 |
| M.pneumoniae | 25.1 | n/a | 24.8 | 74.1 | 49.3 | 25.1 | 63.8 |
| M.genitalium | 0 | n/a | 0 | 50.4 | 49.6 | 0 | 61.2 |
| H.pylori | n/a | n/a | 0b | 48.7 | 48.7 | 0 | 51.9 |
| M.jannaschii | n/a | n/a | 41.4 | 98.7 | 42.5 | 41.4 | 98.7 |
| M.thermoautotrophicum | n/a | n/a | 74.7 | 23.2 | 48.5 | 96.9 | 22.9 |
Table 1 includes the data on three genomes awaiting completion and publication: Treponema pallidum (sequence available at http://www.tigr.org), Mycobacterium tuberculosis (http://www.sanger.ac.uk) and Chlamydia trachomatis (http://chlamydia-www.berkeley.edu:4231). Partial contiguous sequences can also be analyzed with this method: a segment of Mycobacterium leprae genome (GenBank accession number L39923) features a minimum for oriC site between dnaA and dnaN on its diagram (data not shown).
AT versus GC skew
The cumulative diagrams provide an easy way to compare the behavior of AT and GC skews over a genome. Unlike the GC skew, an AT diagram maximum does not always correspond to a terminus site but to an origin of replication instead. Also, there are frequent shifts of the positions of AT diagram peaks relative to those in GC diagrams (Table 1, also see Fig. 1E). Such markedly dissimilar behavior of the skews is likely to result from different mechanisms of respective base pair substitution in the above organisms.
Local minima and maxima in AT diagrams correspond to less pronounced local extremities of GC diagrams (Fig. 1E). This may reflect a faster `smoothing' of sharp distortions that are caused by genome rearrangements, indicating a higher rate of substitutions contributing to the GC skew.
Genome rearrangements
Local extremities may be introduced by sequence inversions or direct translocations to another half of a chromosome (swapping the leading and lagging strands), or integration of foreign DNA into the chromosome. To confirm this, the AT and GC diagrams have been compared for two E.coli strains: MG1655 (5) and W3110, available at http://mol.genes.nig.ac.jp/ecoli. The main differences (as clearly seen in Fig. 1E) are located in two regions: a tiny peak in the GC diagram at the 39% position indicates an inversion, while convergence at 55% points to either a smaller-size inversion or presence of a short fragment with a local skew opposite to that of the surrounding region in MG1655, missing in W3110. Zooming in to the plot, one finds no counterpart of that fragment in the W3110 diagram. In other words, this inverted fragment, possibly acquired by horizontal transfer, was deleted at this position in W3110.
Indeed, one will find an inversion between positions 1 756 051-1 775 569 in MG1655 and 1 760 880-1 780 398 in W3110. At 55%, a new cryptic prophage (missing in W3110) was detected between positions 2 556 711 and 2 563 508 in MG1655 (5). As expected, the local GC skew of this prophage is positive, while the chromosome GC skew around the insertion site is negative. This example shows how the long-range (up to the whole genome length) sequence comparisons can be facilitated with the aid of cumulative diagrams.
Other genomes
While there are examples of V-shaped GC diagrams reflecting bi-directional replication in viruses, e.g. SV40 (20) and chloroplasts, e.g., Euglena gracilis (21), one can attempt to interpret diagram shapes resulting from otherreplication mechanisms.
Sequential replication of two strands of the human adenovirus type 40 linear DNA (22) leaves one of them in single-stranded state while another is being duplicated (23). The time the DNA spends single-stranded decreases linearly from one end of the molecule to another, and an integral of such linear function will be an upward pointing parabola. In fact, its GC diagram has a nearly parabolic shape (Fig. 1F).
In vertebrate mitochondria each DNA strand also has its own replication origin: the heavy (H) strand is synthesized first from its oriH origin until the light (L) strand origin oriL is reached; then the remainder of the H strand and the whole L strand are replicated (24). This process in effect divides the DNA into two segments between oriH and oriL, so that in the first segment (2/3 of the genome) the oriH, and in the second segment the oriL spend the longest time single-stranded. The GC diagram of the H strand of the human mitochondrion (Fig. 1F, trendline shown for the longest segment) reflects this mechanism in that it consists of two distinct parts both displaying parabolic trends (with the GC skew increase towards oriH and oriL in each segment). This holds for all 20 vertebrate mitochondria analyzed (although their total skews vary) and, as one would expect, the closest diagrams are those between closely related species, e.g., human and chimpanzee (data not shown).
Eucaryotic chromosome replication is more complex, it starts from multiple origins. In S.cerevisiae, autonomous replicating sequences (ARS) constitute parts of such origins (25), located close to each telomere and at several other sites. GC diagrams for the yeast chromosomes show the most distinct local minima near telomeres but not at other ARS sites (data not shown). Nucleotide substitution mechanisms may differ from those in bacteria and also be chromosome-specific. In fact, diagrams drastically vary between different chromosomes: an AT diagram for chromosome III and GC diagram for chromosome I are both positive (their counterparts negative), while both diagrams for chromosome II are negative and relatively close to each other.
DISCUSSION
With the aid of cumulative skew diagrams, positive (and close to constant for long genome stretches in some of the organisms) leading strand GC skew is observed in 12 out of 14 microbial genomes. This may be related to a presumably average constant time which equal segments of the leading strand spend single-stranded (being more prone to damage) during chromosomal replication.
The diagrams of the differently replicated mitochondrial and adenoviral genomes support this hypothesis. Strand composition bias in relatively short human mitochondrial DNA (26) may result from the prolonged exposure of the H strand during replication with polymerization rate some 200-fold slower than that of E.coli (25). As demonstrated in Figure 1F, the GC skew of this strand increases towards oriH and oriL, the longest exposed sites within respective replication segments.
Two origins (one for each strand) are located on the opposite ends of the adenovirus genome. Starting from either end, replication proceeds in the first phase along one strand to produce a full duplex while the other parental strand is gradually displaced as a single strand (23). It means that in the displaced strand the exposure of DNA to damage will decrease linearly from the point where the synthesis starts on the other, template strand. In the second phase of replication the displaced strand becomes a template, and its previously most exposed 5[prime]-end again remains single-stranded until the end of the synthesis. The GC diagram of the adenovirus type 40 genome is strikingly close to a perfect parabola (Fig. 1F), indicating that its derivative (non-cumulative GC skew) also linearly decreases from the left 5[prime]-end of the shown sequence, changing from positive to negative at ~44% of the genome length. Replication may start at either origin, so both strands have a higher GC skew at their 5[prime]-ends.
Spontaneous deamination of C or 5-methylcytosine on the single-stranded template can lead to pairing it with A and a relative abundance of G and T on that strand since deamination rates raise over 100-fold when DNA is single-stranded (27). Mismatch repair peculiarities, such as the fact that a C-C mispair is a very poor substrate for the methyl-directed repair pathway (28) may also contribute to the positive GC skew of the leading strand.
However, it is difficult to propose a single mechanism that can also account for all the differences in the AT diagrams in the above genomes. Clearly, mutational sequence changes arise from multiple independent and organism-specific factors related to relative asymmetries and fidelities of replication and repair processes combined with natural selection. As shown earlier (15,5), the GC skew is most pronounced in intergenic regions and third codon positions where relaxation of selective pressure increases bias, as might be expected with mutational bias.
A recent review (29) strongly advocated DNA strand substitution asymmetry being a result of unequal exposure of the two strands to damage and of differential opportunity for repair during transcription (and admitted that this hypothesis cannot explain the bias in mitochondria). The direction of transcription, rather than replication, would then determine the strand sequence change. While such transcription-coupled repair contribution to the observed bias cannot be ruled out, it does not seem to have a global effect since the numbers of ORFs transcribed in the direction of replication vary between the genomes analyzed in this paper and can be as low as 55% for E.coli (5). A notable distortion to the right of the B.subtilis diagram maximum (Fig. 1B) corresponds to a large group of genes (from yonP to yomA) encoded on the lagging strand, while 75% of B.subtilis ORFs are collinear with the direction of replication (3). Other divergently transcribed regions do not produce any comparable distortions, so this peak probably indicates a recent inversion or an insertion of foreign DNA. One also has to bear in mind that the direction of transcription may be a consequence of earlier inversions.
Of all analyzed bacterial genome sequences, only the Synechocystis sp. diagram (Fig. 1B) displays an atypical behavior. Its skew amplitude is very low, the dnaA gene is between the global minimum at zero and maximum at 87%. The replication mechanism of this chromosome is unknown and seems to differ from that of other bacteria in the resulting nucleotide substitution effects.
Little is known about replication in archaea and it is hard to explain the diagram behavior of A.fulgidus (Fig. 1D) without speculating about large rearrangements or a possible case of more than one pair of origins and termini: it features two global minima and two global maxima located between them. This resembles the behavior of diagrams for eucaryotic chromosomes. One can observe a certain degree of similarity between the diagrams of A.fulgidus and M.jannaschii: their global and prominent local extremities are unexpectedly close (e.g. the 60% position) and many of the changes occur in the opposite phase. However, this is likely to be a coincidence for the two genomes that differ in size by 500 kb, with starts arbitrarily located upstream of the repeat-rich regions (11,12). Repeats are found in some of the large local peaks (data not shown) suggesting potential points of rearrangements.
Homology searches with sequences from the regions around global extremities have not revealed any consensus for potential replication origins or termini in archaea. However, the fact that circular chromosomes of M.jannaschii and, more clearly, M.thermoautotrophicum are divided into two nearly equal parts by their GC diagrams suggests that the similarity with bacterial diagrams may be not coincidental. The regions surrounding the diagram extremities can be further tested experimentally for their ability to replicate. Such studies may also determine if the prominent local (in addition to the global) extremities point to more than one pair of ori/ter sites in A.fulgidus and M.jannaschii.
Microbial diagrams show potential rearrangement events, with disruptions of linearity of up to 5% of a genome length. In addition to homologous recombination, one can envisage a mechanism for such inversions related to the observed nucleoid structure of E.coli, with 20-50 loops emanating from a dense node of DNA (30). A hypothetical event involving two double-strand breaks at the root of one loop and a subsequent recombination could produce a loop-sized inversion. If the diagram non-linearities indeed reflect inversions undergone by the analyzed genomes, this may represent a way the microorganisms regulate expression or even `create new genes' randomly combining parts of existing ORFs.
Relative numbers and sizes of diagram distortions may be indicative of the genome stability in different organisms, characterized by the quantities of horizontally transferred DNA in their chromosomes and recombination frequencies. Any distortion manifests a change in the compositional properties of DNA, thus reflecting a local balance of many mutational events (substitutions and rearrangements) and selective constraints. Other estimates of the relative genome stability are provided by the data on some 1500 DNA uptake sites in H.influenzae (31), anddiverse gene arrangement in different strains of H.pylori (32), although the latter fact has been recently contested (33). These findings are in agreement with the more distorted patterns in the diagrams of those organisms. The `jagged' diagrams of the archaea may reflect the mutation forces specific to their extreme environments or the similarity of their replication machinery to that of eucaryotic genomes (11-13). Independently replicating genome halves seem to accumulate different amounts of rearrangements, this (and unequal lengths of the `halves') may explain the non-zero total skews at the 100% coordinate (Fig. 1).
As demonstrated for two E.coli strains (Fig. 1E), long-range homologies and sequence rearrangements (e.g., an inversion of 20 kb) can be easily found using cumulative diagrams for the whole genome sequence comparisons. Even the diagrams which follow complex patterns and are harder to interpret may help with comparative analysis: e.g., remarkably similar diagrams of chloroplasts of higher plants point to several clear differences between their genomes (data not shown). The simplicity and effective visual representation of cumulative diagrams warrant their usefulness for other studies of global sequence organization.
While this paper was under review, I became aware of a publication by Freeman et al. (34) who analyzed a subset (nine microbial genomes) of the data using plots of cumulative purine, keto and coding-strand excess, calculated by `walking' along a sequence in the four nucleotide space (a similar method was earlier described in ref. 35). Global extremities in their plots are also close to the known ori/ter sites but not in every case. The reason for this is clear, from the important differences between the AT and GC skews discussed above and summarized in Table 1. Purine and keto excesses are aggregate measures which fail to detect such differences in the contributions of the AT and GC skews.
Diagrams for the genomes not shown here are available at http://www.mpimg-berlin-dahlem.mpg.de/~andy/diagrams.
ACKNOWLEDGEMENTS
Thanks are due to Igor Ivanov for stimulating discussions and helpful suggestions, Hans Lehrach for continuing support, Leo Schalkwyk, Elmar Maier and John O'Brien for critical reading of the manuscript. Generous data release policies of the organizations governing the Web sites mentioned in the text, are gratefully acknowledged. This work was in part supported by the grant 01KW 9608 of the BMBF Human Genome Project.
REFERENCES
This article has been cited by other articles:
This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals
Comments and feedback: www-admin{at}oup.co.uk
Last modification: 13 May 1998
Copyright©Oxford University Press, 1998.
![]()
CiteULike
Connotea
Del.icio.us What's this?
![]()
![]()

![]()
![]()
![]()
N. V. Sernova and M. S. Gelfand
Identification of replication origins in prokaryotic genomes
Brief Bioinform,
September 1, 2008;
9(5):
376 - 391.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
J. Uchiyama, M. Rashel, I. Takemura, H. Wakiguchi, and S. Matsuzaki
In Silico and In Vivo Evaluation of Bacteriophage {phi}EF24C, a Candidate for Treatment of Enterococcus faecalis Infections
Appl. Envir. Microbiol.,
July 1, 2008;
74(13):
4149 - 4163.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
A. Kulinska, M. Czeredys, F. Hayes, and G. Jagura-Burdzy
Genomic and Functional Characterization of the Modular Broad-Host-Range RA3 Plasmid, the Archetype of the IncU Group
Appl. Envir. Microbiol.,
July 1, 2008;
74(13):
4119 - 4132.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
I. Anderson, J. Rodriguez, D. Susanti, I. Porat, C. Reich, L. E. Ulrich, J. G. Elkins, K. Mavromatis, A. Lykidis, E. Kim, et al.
Genome Sequence of Thermofilum pendens Reveals an Exceptional Loss of Biosynthetic Pathways without Genome Reduction
J. Bacteriol.,
April 15, 2008;
190(8):
2957 - 2965.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
D. R. Smith and R. W. Lee
Mitochondrial Genome of the Colorless Green Alga Polytomella capuana: A Linear Molecule with an Unprecedented GC Content
Mol. Biol. Evol.,
March 1, 2008;
25(3):
487 - 496.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
A. Necsulea and J. R. Lobry
A New Method for Assessing the Effect of Replication on DNA Base Composition Asymmetry
Mol. Biol. Evol.,
October 1, 2007;
24(10):
2169 - 2179.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
M. Ventura, C. Canchaya, A. Tauch, G. Chandra, G. F. Fitzgerald, K. F. Chater, and D. van Sinderen
Genomics of Actinobacteria: Tracing the Evolutionary History of an Ancient Phylum
Microbiol. Mol. Biol. Rev.,
September 1, 2007;
71(3):
495 - 548.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
F. Gao and C.-T. Zhang
DoriC: a database of oriC regions in bacterial genomes
Bioinformatics,
July 15, 2007;
23(14):
1866 - 1867.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
H. Yukawa, C. A. Omumasaba, H. Nonaka, P. Kos, N. Okai, N. Suzuki, M. Suda, Y. Tsuge, J. Watanabe, Y. Ikeda, et al.
Comparative analysis of the Corynebacterium glutamicum group and complete genome sequence of strain R
Microbiology,
April 1, 2007;
153(4):
1042 - 1058.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
E. P.C. Rocha, M. Touchon, and E. J. Feil
Similar compositional biases are caused by very different mutational effects
Genome Res.,
December 1, 2006;
16(12):
1537 - 1547.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
D. L. Maeder, I. Anderson, T. S. Brettin, D. C. Bruce, P. Gilna, C. S. Han, A. Lapidus, W. W. Metcalf, E. Saunders, R. Tapia, et al.
The Methanosarcina barkeri Genome: Comparative Analysis with Methanosarcina acetivorans and Methanosarcina mazei Reveals Extensive Rearrangement within Methanosarcinal Genomes
J. Bacteriol.,
November 15, 2006;
188(22):
7922 - 7931.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
R. T. DeBoy, E. F. Mongodin, J. B. Emerson, and K. E. Nelson
Chromosome Evolution in the Thermotogales: Large-Scale Inversions and Strain Diversification of CRISPR Sequences.
J. Bacteriol.,
April 1, 2006;
188(7):
2364 - 2374.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
R. Frutos, A. Viari, C. Ferraz, A. Morgat, S. Eychenie, Y. Kandassamy, I. Chantal, A. Bensaid, E. Coissac, N. Vachiery, et al.
Comparative Genomic Analysis of Three Strains of Ehrlichia ruminantium Reveals an Active Process of Genome Size Plasticity.
J. Bacteriol.,
April 1, 2006;
188(7):
2533 - 2542.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
S. Ozaki, K. Fujimitsu, H. Kurumizaka, and T. Katayama
The DnaA homolog of the hyperthermophilic eubacterium Thermotoga maritima forms an open complex with a minimal 149-bp origin region in an ATP-dependent manner
Genes Cells,
April 1, 2006;
11(4):
425 - 438.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
J. Hirsch, V. Lefort, M. Vankersschaver, A. Boualem, A. Lucas, C. Thermes, Y. d'Aubenton-Carafa, and M. Crespi
Characterization of 43 Non-Protein-Coding mRNA Genes in Arabidopsis, Including the MIR162a-Derived Transcripts
Plant Physiology,
April 1, 2006;
140(4):
1192 - 1204.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
Y. Azuma, H. Hirakawa, A. Yamashita, Y. Cai, M. A. Rahman, H. Suzuki, S. Mitaku, H. Toh, S. Goto, T. Murakami, et al.
Genome Sequence of the Cat Pathogen, Chlamydophila felis
DNA Res,
January 1, 2006;
13(1):
15 - 23.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
H. Jeong, J. H. Yim, C. Lee, S.-H. Choi, Y. K. Park, S. H. Yoon, C.-G. Hur, H.-Y. Kang, D. Kim, H. H. Lee, et al.
Genomic blueprint of Hahella chejuensis, a marine microbe producing an algicidal agent
Nucleic Acids Res.,
December 13, 2005;
33(22):
7066 - 7073.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
Y. Sakaguchi, T. Hayashi, K. Kurokawa, K. Nakayama, K. Oshima, Y. Fujinaga, M. Ohnishi, E. Ohtsubo, M. Hattori, and K. Oguma
The genome sequence of Clostridium botulinum type C neurotoxin-converting phage and the molecular mechanisms of unstable lysogeny
PNAS,
November 29, 2005;
102(48):
17472 - 17477.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
A. M. Hauth, U. G. Maier, B. F. Lang, and G. Burger
The Rhodomonas salina mitochondrial genome: bacteria-like operons, compact gene arrangement and complex repeat region
Nucleic Acids Res.,
August 5, 2005;
33(14):
4433 - 4442.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
A. Tauch, O. Kaiser, T. Hain, A. Goesmann, B. Weisshaar, A. Albersmeier, T. Bekel, N. Bischoff, I. Brune, T. Chakraborty, et al.
Complete Genome Sequence and Analysis of the Multiresistant Nosocomial Pathogen Corynebacterium jeikeium K411, a Lipid-Requiring Bacterium of the Human Skin Flora
J. Bacteriol.,
July 1, 2005;
187(13):
4671 - 4682.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
M. N. Price, E. J. Alm, and A. P. Arkin
Interruptions in gene expression drive highly expressed operons to the leading strand of DNA replication
Nucleic Acids Res.,
June 7, 2005;
33(10):
3224 - 3234.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
D. A. Garsin, J. Urbach, J. C. Huguet-Tapia, J. E. Peters, and F. M. Ausubel
Construction of an Enterococcus faecalis Tn917-Mediated-Gene-Disruption Library Offers Insight into Tn917 Insertion Patterns
J. Bacteriol.,
November 1, 2004;
186(21):
7280 - 7289.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
N. S. Baliga, R. Bonneau, M. T. Facciotti, M. Pan, G. Glusman, E. W. Deutsch, P. Shannon, Y. Chiu, R. S. Weng, R. R. Gan, et al.
Genome sequence of Haloarcula marismortui: A halophilic archaeon from the Dead Sea
Genome Res.,
November 1, 2004;
14(11):
2221 - 2234.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
M. Touchon, A. Arneodo, Y. d'Aubenton-Carafa, and C. Thermes
Transcription-coupled and splicing-coupled strand asymmetries in eukaryotic genomes
Nucleic Acids Res.,
September 23, 2004;
32(17):
4969 - 4978.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
K. Ueda, A. Yamashita, J. Ishikawa, M. Shimada, T.-o Watsuji, K. Morimura, H. Ikeda, M. Hattori, and T. Beppu
Genome sequence of Symbiobacterium thermophilum, an uncultivable bacterium that depends on microbial commensalism
Nucleic Acids Res.,
September 21, 2004;
32(16):
4937 - 4944.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
J. D. Jaffe, N. Stange-Thomann, C. Smith, D. DeCaprio, S. Fisher, J. Butler, S. Calvo, T. Elkins, M. G. FitzGerald, N. Hafez, et al.
The Complete Genome and Proteome of Mycoplasma mobile
Genome Res.,
August 1, 2004;
14(8):
1447 - 1461.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
P. Mackiewicz, J. Zakrzewska-Czerwinska, A. Zawilak, M. R. Dudek, and S. Cebrat
Where does bacterial replication start? Rules for predicting the oriC region
Nucleic Acids Res.,
July 16, 2004;
32(13):
3781 - 3791.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
E. P. C. Rocha
The replication-related organization of bacterial genomes
Microbiology,
June 1, 2004;
150(6):
1609 - 1627.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
J. Westberg, A. Persson, A. Holmberg, A. Goesmann, J. Lundeberg, K.-E. Johansson, B. Pettersson, and M. Uhlen
The Genome Sequence of Mycoplasma mycoides subsp. mycoides SC Type Strain PG1T, the Causative Agent of Contagious Bovine Pleuropneumonia (CBPP)
Genome Res.,
February 1, 2004;
14(2):
221 - 227.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
H. Schmidt and M. Hensel
Pathogenicity Islands in Bacterial Pathogenesis
Clin. Microbiol. Rev.,
January 1, 2004;
17(1):
14 - 56.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
B. R. Berquist and S. DasSarma
An Archaeal Chromosomal Autonomously Replicating Sequence Element from an Extreme Halophile, Halobacterium sp. Strain NRC-1
J. Bacteriol.,
October 15, 2003;
185(20):
5959 - 5966.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
E. S. Miller, E. Kutter, G. Mosig, F. Arisaka, T. Kunisawa, and W. Ruger
Bacteriophage T4 Genome
Microbiol. Mol. Biol. Rev.,
March 1, 2003;
67(1):
86 - 156.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
Y. Zivanovic, P. Lopez, H. Philippe, and P. Forterre
Pyrococcus genome comparison evidences chromosome shuffling-driven evolution
Nucleic Acids Res.,
May 1, 2002;
30(9):
1902 - 1910.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
Q. Bao, Y. Tian, W. Li, Z. Xu, Z. Xuan, S. Hu, W. Dong, J. Yang, Y. Chen, Y. Xue, et al.
A Complete Sequence of the T. tengcongensis Genome
Genome Res.,
May 1, 2002;
12(5):
689 - 700.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()