| Nucleic Acids Research | Pages |
Comparative study of overlapping genes in the genomes of Mycoplasma genitalium and Mycoplasma pneumoniae
Introduction
Materials and Methods
Results and Discussion
Acknowledgements
References
Comparative study of overlapping genes in the genomes of Mycoplasma genitalium and Mycoplasma pneumoniae
ABSTRACT
INTRODUCTION
Many overlapping genes have been identified in the genomes of prokaryotes, bacteriophages, animal viruses and mitochondria, some of which have been reported to have functional roles such as in translational coupling (1-5) and negative translational coupling (6,7). Nevertheless, their evolutionary origin, i.e. how they have emerged, is not clearly understood.
We systematically analyzed all overlapping genes in genomes of two closely related species (Table 1). Mycoplasma genitalium (8) and Mycoplasma pneumoniae (9) were selected for our analysis, as the evolutionary distance of these two species is the closest among the 17 species whose complete genomes are currently available (as of October 1998). Many parts of these two genomes (Fig.
Table 1.
| M.genitalium | M.pneumoniae | |
| Genome size (bp) | 580 074 | 816 394 |
| Number of genes (CDS) | 480 | 677 |
| Total coding regions | 523 714 bp (90.3%) | 710 090 bp (87.0%) |
| Total overlapping regions | 2603 bp (0.45%) | 4894 bp (0.60%) |
There are 162 overlapping gene pairs in the genome of M.genitalium according to the TIGR annotation. The genome of M.pneumoniae, on the other hand, contains 203 overlapping gene pairs. There are 135 homologous overlapping gene pairs which exist in both species. The other 27 and 68 overlapping gene pairs are found only in M.genitalium and M.pneumoniae, respectively. The comparative analysis of these two genomes allows us to propose a model of how overlapping genes have emerged over the course of evolution. In particular, careful comparisons were made for the homologous genes that are overlapped in one species but not in the other.
Figure 1. Three patterns of overlapping genes.
MATERIALS AND METHODS
The whole genome sequence of M.genitalium with annotation (updated in 1998) was downloaded from the TIGR Microbe Database (http://www.tigr.org/tdb/mdb/mdb.html , updated version), and that of M.pneumoniae was from The Mycoplasma Pneumoniae Genome Project (http://mail.zmbh.uni-heidelberg.de/M-Pneumo niae/MP-Home.html ). Information on homologous parts of these two genomes was also obtained from The Mycoplasma Pneumoniae Genome Project.
In this paper, overlapping genes are defined as a pair of adjacent genes whose coding regions are partly overlapping. We first list all overlapping genes in their genomes according to the annotations in the databases. For each overlapping gene pair in one species, we aligned the sequence, using ClustalW (11), with the sequence of the homologous part of the other species. We then classified all the cases according to the three directional patterns as described in Figure
For those genes that overlap in one species but not in the other, we made careful analyses in order to infer the cause of the overlapping. Inferred causes of these events of overlapping were then classified into several types.
Figure 2. 4-base overlapping genes. Further analyses were conducted for those genes whose 3[prime] ends were elongated by more than 15 amino acids compared with their homologous genes in the other species. FASTA (12) (GenBank version Release 109.0) was used for searching homologous genes in bacteria other than Mycoplasma, and homology of the elongated regions was inspected to see if the regions contain any functionally important sequences. Furthermore, MOTIFS (GCG program package version Unix-8.1 of the Genetics Computer Group, WI) was used for examining possible motifs in these regions. Annotation of homologous genes is sometimes not in agreement between the two genomes. In particular, many annotational differences were found in determining start codons; i.e. the beginnings of coding regions. This is because the sequence annotations of these two species were made by different software: BLAZE (13) for M.genitalium and FRAMES (GCG program package version Unix-8.1 of the Genetics Computer Group, WI) for M.pneumoniae. We excluded from our discussion those homologous genes whose start codon was assigned differently by the two software packages.
RESULTS AND DISCUSSION
Table 2 summarizes the numbers of overlapping gene pairs in these two genomes. Most overlapping genes are uni-directional, though there are a few end-on overlapping genes. Interestingly, there is only one case of a head-on overlapping gene.
Table 2.
| ->-> | ->[larr] | [larr] -> | |
| in M.genitalium | 134 | 25 | 3 |
| in M.pneumoniae | 180 | 20 | 3 |
| in both species | 119 | 15 | 1 |
| only in M.genitalium | 15 | 10 | 2 |
| only in M.pneumoniae | 61 | 5 | 2 |
Table 3.
| ttaa | ttag | ctaa | ctag | |
| M.genitalium | 2 | 5 | 1 | 0 |
| M.pneumoniae | 3 | 2 | 0 | 3 |
Table 4.
| taatg | tagtg | |
| M.genitalium | 34 | 3 |
| M.pneumoniae | 52 | 2 |
Table 5.
| Deletion | Point | Frame | Unknown | |
| End-on (-> [larr]) | 5 | 1 | 2 | 4 |
| Uni-directional (->->) | 24 | 1 | 1 | 7 |
| Total | 29 | 2 | 3 | 11 |
Out of the end-on overlapping genes (the direction of which is ->[larr]), many overlap only 1 or 4 bases. Of the 45 overlapping gene pairs (two species together), 16 overlap only 4 bases (Table 3). Mycoplasma genitalium and M.pneumoniae use TAA and TAG for their stop codons. As shown in Figure
Out of the 314 uni-directional overlapping gene pairs (->->), 91 (29.0%) are overlapping only 1 base (Table 4). The overlapped base is either the middle A in the sequence TAATG, which includes TAA for a stop codon of one gene and ATG for a start codon of the other, or the middle G in TAGTG, which includes TAG for the stop codon and GTG for the start codon.
|
Table 6.
|
Table 7.
The cause of each case of gene overlapping in one species was inferred from the non-overlapping gene sequence in the other species, and categorized as described with examples in Figure
Estimated sequence error rate was reported to be <1 in 10 000 bases in M.genitalium (8). The probability of a particular stop codon being replaced due to sequence error is, thus, one in thousands. We therefore consider that sequence errors do not influence the results of our analyses.
All overlapping gene pairs in M.genitalium and M.pneumoniae, and their homologous genes in the other species, length of overlapping regions, and direction of genes are listed in Tables 6 and 6. Those genes that overlap in one species but not in the other are indicated by an asterisk in the remark column. Genes marked with - in the column are those which we excluded from our analyses due to annotational difference or absence of homologous genes.
While there is a total of 30 end-on overlapping gene pairs (->[larr]), there are only five head-on overlapping gene pairs ([larr]->). In addition, most uni-directional overlapping genes (->->) were caused by elongation of the 3[prime] ends of the preceding genes, not by elongation of the 5[prime] ends of the subsequent genes. From these observations, we conclude that many overlapping genes were caused by elongation of the 3[prime] end of a coding region, nearly concomitant with the loss of its stop codon.
There are seven cases in which gene elongation in one species has presumably occurred by more than 15 amino acids (Table 8). The FASTA search revealed that, for three of the seven cases, certain elongation was also found in other bacteria. However, the elongated regions are not well conserved between the species. Furthermore, the MOTIFS search found no known motifs in the elongated regions for all the seven cases. These results suggest that the elongated regions have little or no functional role that is biologically important.
Overlapping genes might have been thought of as the results of evolutionary pressure to minimize genome size. However, our analysis indicates that many overlapping genes, at least in the genomes of M.genitalium and M.pneumoniae, are due primarily to incidental elongation of coding regions.
ACKNOWLEDGEMENTS
We thank Rintaro Saito and Masahiko Wada for their support in computer programing. This work was supported in part by a Grant-in-Aid for Scientific Research on Priority Areas Genome Science from The Ministry of Education, Science, Sports and Culture in Japan.
Table 8.
| Genes | Homolog | Elongation | Motifs |
| MG029 | Y967_METJA | + | 0/0 |
| MG034 | KITH_BACSU | - | 0/2 |
| MG176 | RS_SYNY3 | - | 0/1 |
| MG248 | YQFN_BACSU | + | 0/0 |
| MG364 | ABRA_PLAFF | + | 0/0 |
| G07_orf215 | RS6_HAEIN | - | 0/1 |
| G12_orf282a | RNC_BACSU | - | 0/1 |
a
![]() |
b
![]() |
c
|
Figure 3. Causes of overlapping genes. (a) Deletion of stop codon. Deletion of a segment that includes a stop codon in one of two adjacent non-overlapping genes can result in overlapping genes. (b) Point mutation at stop codon. A stop codon (TAA or TAG) in one of two adjacent non-overlapping genes has been lost due to a point mutation, elongating the genes coding region and resulting in overlapping genes. (c) Frame-shift. Frame-shift mutation in coding region of one of two adjacent non-overlapping genes can cause elongation of the gene, resulting in overlapping genes.
REFERENCES
This article has been cited by other articles:
This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals
Comments and feedback: www-admin{at}oup.co.uk
Last modification: 26 Mar 1999
Copyright©Oxford University Press, 1999.
![]()
CiteULike
Connotea
Del.icio.us What's this?
![]()
![]()

![]()
![]()
![]()
L.-W. Jiang, K.-L. Lin, and C. L. Lu
OGtree: a tool for creating genome trees of prokaryotes based on overlapping genes
Nucleic Acids Res.,
July 1, 2008;
36(suppl_2):
W475 - W480.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
R. Belshaw, O. G. Pybus, and A. Rambaut
The evolution of genome compression and genomic novelty in RNA viruses
Genome Res.,
October 1, 2007;
17(10):
1496 - 1504.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
C. Kingsford, A. L. Delcher, and S. L. Salzberg
A Unified Model Explaining the Offsets of Overlapping and Near-Overlapping Prokaryotic Genes
Mol. Biol. Evol.,
September 1, 2007;
24(9):
2091 - 2098.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
S. Pasek, A. Bergeron, J.-L. Risler, A. Louis, E. Ollivier, and M. Raffinot
Identification of genomic features using microsyntenies of domains: Domain teams
Genome Res.,
June 1, 2005;
15(6):
867 - 874.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
K. R. Sakharkar, M. K. Sakharkar, C. Verma, and V. T. K. Chow
Comparative study of overlapping genes in bacteria, with special reference to Rickettsia prowazekii and Rickettsia conorii
Int J Syst Evol Microbiol,
May 1, 2005;
55(3):
1205 - 1209.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
Z. I. Johnson and S. W. Chisholm
Properties of overlapping genes are conserved across microbial genomes
Genome Res.,
November 1, 2004;
14(11):
2268 - 2272.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
D. R. Denver, S. L. Swenson, and M. Lynch
An Evolutionary Analysis of the Helix-Hairpin-Helix Superfamily of DNA Repair Glycosylases
Mol. Biol. Evol.,
October 1, 2003;
20(10):
1603 - 1611.
[Abstract]
[Full Text]
![]()
![]()
![]()

![]()
![]()
![]()
T. Dandekar, M. Huynen, J. T. Regula, B. Ueberle, C. U. Zimmermann, M. A. Andrade, T. Doerks, L. Sanchez-Pulido, B. Snel, M. Suyama, et al.
Re-annotating the Mycoplasma pneumoniae genome sequence: adding value, function and reading frames
Nucleic Acids Res.,
September 1, 2000;
28(17):
3278 - 3288.
[Abstract]
[Full Text]
[PDF]
![]()
This Article ![]()
![]()
Abstract
![]()
Print PDF (279K)
![]()
Alert me when this article is cited
![]()
Alert me if a correction is posted
![]()
Services ![]()
![]()
Email this article to a friend
![]()
Similar articles in this journal
![]()
Similar articles in ISI Web of Science
![]()
Similar articles in PubMed
![]()
Alert me to new issues of the journal
![]()
Add to My Personal Archive
![]()
Download to citation manager
![]()
Search for citing articles in:
ISI Web of Science (24)
![]()
Request Permissions ![]()
Commercial Re-use Guidelines
for Open Access NAR Content
![]()
Google Scholar ![]()
![]()
Articles by Fukuda, Y.
![]()
Articles by Tomita, M.
![]()
Search for Related Content
![]()
PubMed ![]()
![]()
PubMed Citation
![]()
Articles by Fukuda, Y.
![]()
Articles by Tomita, M.
![]()
Social Bookmarking ![]()
![]()
What's this?








