ABSTRACT
Induction of transcription into long CTG[middot]CAG repeats contained on plasmids in Escherichia coli is showntoincrease the frequency of deletions within the repeat sequences. This elevated genetic instability was detected because active transcription into the triplet repeat influenced the growth transitions of the host cell, allowing advantageous growth for cells harboring plasmids with deleted repeat sequences. The variety of deletion products observed in separate cultures suggests that transcription altered the metabolism of the DNA in a manner that produced random length changes in the repeat sequence. For cultures containing plasmids without active transcription into the triplet repeat, or those maintained in exponential growth, deletions occurred within the repeat at a lower frequency (5-20-fold lower). In these incubations the extent of deletions was proportional to the number of cell divisions and many repeat lengths were observed within each culture, suggesting that the decrease in average repeat length at long incubation times was due to multiple small deletions. These observations show that deletions within long CTG[middot]CAG repeats contained on plasmids in E.coli occur via more than one pathway and their level of genetic instability is altered by the enzymatic processes occurring upon the DNA.
Simple repetitive DNA sequences, which are widespread throughout natural genomes, are polymorphic in length due to intrinsic genetic instabilities (1 -3 ). The recent surge of interest in the factors influencing such instabilities is due to the association of simple repetitive DNA sequences with a variety of human diseases (4 ,5 ). For example, hereditary non-polyposis colon cancer cell lines have a high frequency of length changes in specific dinucleotide repeats (6 ,7 ) and unusual mutation events, involving the expansion of triplet repeat sequences (TRS), are associated with a number of neurogenetic disorders (for recent reviews see 8 -12 ). The propensity for TRS to expand provides an explanation for the non-Mendelian inheritance patterns (termed anticipation) of their associated diseases; with progression through a pedigree, the TRS become longer and the disease has an earlier age of onset and an increased severity.
Some features of the events leading to certain diseases associated with expansion of TRS are known (11 ,12 ). The expansion events fall into two categories, termed Types 1 and 2 (12 ). In Type 1 expansions, the TRS is always CTG[middot]CAG and is in a coding segment of the gene. In this case expansion events occur in small steps and the net effect is to produce proteins with longer tracts of polyglutamines. It is likely that these mutations lead to disease proteins with a gain of function that is particularly deleterious in neurons. Type 2 expansion events are associated with multi-system disorders and have a number of differences; various types of sequences are found and the TRS is not located within a coding region of the gene. All of the Type 2 expansion events are extremely large (hundreds of copies of the repeat for disease alleles).
Although progress has been made in understanding the pathologies of diseases associated with genetic instability of simple repetitive DNA sequences, the mechanism of expansion of TRS remains unclear. A number of studies have shown that sequences which undergo expansion can adopt unusual structures (reviewed in 13 ) which may cause aberrant metabolism of the DNA. To attempt to elucidate these mechanisms, we have initiated in vivo studies in Escherichia coli. Maintenance of long TRS in bacterial systems is difficult since they are deleted readily to sequences of heterogeneous length when contained on plasmids (14 ,15 ). Despite these problems, expanded products of some TRS have been obtained in E.coli (14 ,16 ), although the relationship of these observations to eukaryotic systems remains to be determined.
Herein we show that the deletion of long CTG[middot]CAG tracts from plasmids in E.coli is increased upon active transcription into the repeat sequence. An elevated frequency of deletions is observed because cells harboring plasmids with deleted repeat sequences have a growth advantage as the cultures pass through stationary phase (17 ). In the absence of transcription, a lower rate of deletions is observed and their extent was dependent on the number of cell division events. Hence, at least two pathways leading to deletions of TRS exist in E.coli.
Plasmids used in these experiments contained repeating CTG[middot]CAG inserts, which may also be designated TGC[middot]GCA or GCT[middot]AGC, have been described previously (14 ,17 ). The orientation of this TRS with respect to the direction of replication within these plasmids has been defined (14 ) as follows: plasmids containing the CTG sequence as the leading strand template are designated orientation I; plasmids containing the CTG sequence as the lagging strand template are designated orientation II. Figure 1 shows maps of pRW3247 and pRW3268 with the restriction sites relevant to this study. Both plasmids contain the sequence (GCT)27ACT(GCT)40ACT(GCT)106 [referred to as (CTG)175[middot](CAG)175 for convenience] in orientation II; note that pRW3247 has an intact promoter for lacZ', but pRW3268 does not. Equivalent plasmids with the same TRS in orientation I are pRW3248 (intact lacZ' promoter) and pRW3269 (deleted lacZ' promoter). In some experiments plasmid pIQ-kan (18 ) was used to provide the lacIQ repressor inside the cell.
Large amounts of plasmids were prepared by alkaline lysis of a 1 l culture (LB + antibiotic) grown to stationary phase and were purified by CsCl/ethidium bromide ultracentrifugation overnight (19 ). Purification of plasmids from 10 ml cultures was performed using the standard alkaline lysis miniprep procedure (19 ).
All cloning procedures and experiments were conducted in E.coli HB101 as described previously (17 ). After transformations, bacteria were allowed to recover at 37oC in SOC medium (2% Bacto-tryptone, 0.5% yeast extract, 10 mM NaCl, 2.5 mM KCl, 10 mM MgCl2, 10 mM MgSO4, 20 mM glucose, pH 7.0) for 30-45 min and then grown up in liquid culture with antibiotic selection; ampicillin and kanamycin were included at final concentrations of 100 and 50 [mu]g/ml respectively. When required, IPTG was included in the growth medium at a final concentration of 1 mM. To check the efficiency of each electroporation, 20 [mu]l of the transformation mixture were grown overnight at 37oC on LB-agar plates containing antibiotic.
The genetic stability of plasmids containing TRS was analyzed under a variety of conditions. In the basic procedure, E.coli HB101 was transformed with the appropriate plasmid and an aliquot was inoculated into glass tubes (150 mm length * 17 mm diameter) containing 10 ml LB with antibiotic. For incubations containing IPTG, the transformed cells were first grown for 2 h without IPTG to allow cell recovery in the absence of transcription from the lacZ' promoter and this culture was then diluted into fresh medium with or without IPTG.
Incubation of the liquid cultures was continued for the required time at 37oC and a shaking rate of 250 r.p.m. Growth of the cultures was monitored by optical density readings at 600 nm. The bacteria were subcultured into fresh liquid medium as appropriate: inoculations from stationary phase cultures were performed after 24 h (OD600 >2) with a dilution factor of 107; inoculations from logarithmic phase cultures were performed after 12 h (OD600 0.3-0.8) with a dilution factor of 4 * 106. The cells from each culture were harvested and stored at -80oC until the end of the experiment and then all DNA minipreps were performed at the same time.
Restriction digests were performed following the manufacturer's instructions and the samples were radiolabeled at the EcoRI site using E.coli DNA polymerase I Klenow fragment incubated with [[alpha]-32P]dATP (19 ). The samples were analyzed by electrophoresis through 5% polyacrylamide gels in TBE buffer and the radioactive fragments were observed by autoradiography or with a phosphorimager (400S, Molecular Dynamics). Quantitation of the gels was performed directly upon the phosphorimage.
Long tracts of CTG[middot]CAG repeats are prone to deletion over extended periods of growth in E.coli. Previous investigations (14 ,17 ) showed that (CTG)175[middot](CAG)175 was more stable when the CTG strand was the leading strand template for replication (orientation I) compared with when it was the lagging strand template (orientation II). For all plasmids containing long CTG[middot]CAG tracts, deletions were observed within the TRS. However, the frequency of deletions varied widely between different vectors. Diverse aspects of DNA metabolism (e.g. recombination, repair, replication, transcription) were proposed to explain such variations, but the relative importance of each was not known. Thus the present experiments were undertaken to identify the factors influencing the frequencies of deletions of long CTG[middot]CAG repeats from plasmids in E.coli.
In an earlier study (14 ) the following pairs of plasmids were produced: pRW3247 (orientation II) and pRW3248 (orientation I); pRW3268 (orientation II) and pRW3269 (orientation I). These plasmids all contain (GCT)27ACT(GCT)40ACT(GCT)106 [referred to as (CTG)175[middot](CAG)175], but growth of E.coli harboring these plasmids produced different frequencies of deletions within each of the TRS (14 ). An obvious difference in the two sets of plasmids is that pRW3247 and pRW3248 have an intact promoter for lacZ', whereas pRW3268 and pRW3269 do not; thus the TRS is located inside a transcription unit for pRW3247 and pRW3248, but not for pRW3268 and pRW3269 (Fig. 1 ).
Thus far, we have shown that active transcription into the TRS of plasmids in E.coli HB101 produced an increased frequency of deletions in repeat sequences from cultures that passed through stationary phase compared with those maintained in exponential growth. Since pRW3268 and pRW3269 do not have a promoter for lacZ', the frequency of deletions from the TRS in these plasmids should not be influenced by growth phase. Therefore, experiments were performed with pRW3268 and pRW3269 to test this hypothesis. Since transcription from the lacZ' promoter was not possible for these plasmids, in these experiments pIQ-kan was not co-transformed into the bacteria and IPTG was not included in the growth medium.
Figure
After electroporation of plasmids containing the TRS into E.coli HB101, the transformation mixture was halved and grown for many generations with the cultures either maintained in exponential phase or allowed to pass through stationary phase (after every 24 generations). At each subculturing a portion of the culture was harvested. The plasmids were purified and the lengths of fragments containing the TRS from pRW3268 (TRS in orientation II, no lacZ' promoter) were determined (Fig. 4 ); samples from cultures harboring the plasmid with an intact lacZ' promoter and the TRS in orientation II (pRW3247) that were maintained in exponential growth were also analyzed for comparison with our previous observations (17 ). Quantitation of the data showed that there was little influence of growth phase on the frequency of deletions in the plasmid without the lacZ' promoter and the TRS in orientation II (pRW3268), since cultures maintained in exponential growth or allowed to pass through stationary phase had ~80% full-length product after 72 generations. This is in contrast to cultures harboring the plasmid with the lacZ' promoter and the TRS in orientation II (pRW3247), which had a dramatically higher extent of deletions when allowed to pass through stationary phase (17 ). For both types of growth conditions, no deletions were detected in pRW3269 (TRS in orientation I, no lacZ' promoter) (data not shown).
In the experiments with cultures which contained pIQ-kan as well as the plasmids with the TRS (Fig. 2 ), cultures containing pRW3268 and maintained in exponential growth or allowed to pass through stationary phase had ~65% full-length product after 72 generations. Thus although co-transformation of pIQ-kan decreased the growth rate of the host cell (data not shown), this had little effect upon genetic stability of the TRS in pRW3268 (no lacZ' promoter). In contrast, comparison of the experiments with plasmids containing an intact lacZ' promoter (pRW3247 and pRW3248) (compare Fig. 2 with fig. 3 of 17 ) shows that co-transformation of pIQ-kan did alter the genetic stability of the TRS. In E.coli HB101 not containing pIQ-kan there was transcription into the TRS of pRW3247 and pRW3248 (data not shown), probably due to the high copy number of the plasmid titrating out the normal level of repressor in cells; the increased amount of deletion in cultures that passed through stationary phase was due to a growth advantage for cells harboring plasmids with deletions (17 ). The reduced effects on the genetic stability of pRW3247 and pRW3248 by induction of transcription from the lacZ' promoter with IPTG (Fig. 2 ) were because the decreased growth rate of bacteria harboring pIQ-kan (data not shown) reduced the growth advantage of cells harboring plasmids with deleted TRS. Thus, although transcription clearly affects the frequency of deletions within the TRS, other factors also influence the observed genetic stability of the repeat sequence.
Figure
Growth curve measurements of cultures harboring each of the plasmids showed that there was less interaction between the growth characteristics of E.coli HB101 and plasmids with long TRS if the plasmids did not contain the lacZ' promoter (pRW3268 and pRW3269) (data not shown). Hence, the absence of an effect of growth phase on the deletion frequency within the TRS of plasmids without an intact lacZ' promoter (Fig. 4 ) was as predicted. Plasmids without transcription from lacZ' are almost completely stable in orientation I (pRW3269) and in orientation II (pRW3268) have a deletion frequency that is dependent only on the number of cell division events.
We wished to determine whether the advantageous propagation of cells harboring deleted repeats was due to effects of transcription at a specific growth phase of the culture. Thus, an experiment was performed in which the plasmid harboring an intact lacZ' promoter with the TRS in orientation II (pRW3247) was transformed into E.coli HB101 containing pIQ-kan and at each subculturing the cells were diluted into medium with or without IPTG.
The strategy and results of the experiment are shown in Figure 5 . The proportion of plasmid containing full-length TRS was quantitated from restriction digests analyzed on polyacrylamide gels (data not shown). The results showed that the genetic stability of the TRS was determined by the state of the parent culture regarding transcription into the TRS. If the parent culture had transcription into the TRS then, after passage through stationary phase, its daughter cultures had increased amounts of deletions with and without IPTG. Conversely, if there was no active transcription into the TRS in the parent culture, the daughter cultures had relatively little change in their frequency of deleted products, even when active transcription into their TRS was induced by the presence of IPTG. In a control experiment the cultures were washed and incubated with medium with or without IPTG immediately prior to subculturing; as expected, the presence or absence of IPTG had no effect at this stage (data not shown).
Thus active transcription into (CTG)175[middot](CAG)175 contained on plasmids influences the transitions between stationary and exponential growth. This produces advantageous growth for cells harboring plasmids with deleted TRS and increases the observed amount of deletions in the TRS in cultures that pass through stationary phase.
The genetic stability of simple repeating sequences contained on plasmids in E.coli is influenced by many factors. Long CTG[middot]CAG repeats were shown to have a higher frequency of deletions upon active transcription into the TRS. This elevated frequency of deletions (5-20-fold increase for each growth cycle, depending on the experiment) was not proportional to the number of generations of the culture, but was produced upon passage of cultures through stationary phase. This observation explains earlier experiments which detected a growth advantage for cells harboring plasmids which contained deleted TRS (17 ). These results are unlikely to be due to an elevated copy number of plasmids harboring deletions because the size of the TRS has little effect on the copy number of these plasmids (data not shown). Although growth advantages were obtained only for plasmids with active transcription into the TRS, we are uncertain whether the destabilizing influence was transcription or some other event dependent upon transcription, such as translation. Plasmids without active transcription into the TRS also produced deletions within the repeat, but at the lower frequency of cultures maintained in exponential growth and the extent was proportional to the number of cell division events. Hence, deletions from long CTG[middot]CAG repeats on plasmids in E.coli occur via more than one pathway.
The molecular processes responsible for the effect of transcription on deletion frequency are uncertain from this in vivo study. However, examination of the patterns of deletions from different plasmids with the TRS in orientation II (CTG sequence as lagging strand template) reveals details of the mechanism by which transcription influences the stability of long TRS. For example, in conditions providing a growth advantage for cells with deleted products, passage through stationary phase of cultures which harbor plasmids with transcription into the TRS (pRW3247 co-transformed with pIQ-kan and in the presence of IPTG), the pattern of deletions varied in different experiments and the frequency of deletions varied by up to 2-fold (see for example Figs 2 and 3 ). However, each culture contained plasmids with relatively few different lengths of TRS. Comparison of separate inoculations showed that a wide variety of lengths of TRS were produced (compare Figs 2 and 3 ); frequently the deletions produced only plasmids containing small repeats.
Figure
Analysis of growth conditions which did not give an advantage for cells with deleted products, plasmids without transcription into the TRS (e.g. pRW3268 and pRW3247 co-transformed with pIQ-kan and in the absence of IPTG) or cultures maintained in exponential growth, shows that the patterns of deletion products were similar in different experiments, in agreement with earlier observations (14 ). Also, the observed frequency of deletions was similar in separate experiments. In these samples it was usual for many lengths of TRS to be observed in each culture. Thus deletions occurred via a step-wise mechanism which produced a gradual decrease in the average length of repeats as the incubation time of the culture was increased (Fig. 6 A). These types of deletion events were influenced by the orientation of the repeat sequence with respect to the direction of replication, suggesting that they were generated by slippage of the DNA at the replication fork (14 ). It has been proposed previously that the intrahelical process of slipped strand mispairing is likely to be the major factor in initial length changes of short repeated motifs (22 ); multiple events would be required to produce the short TRS observed at long generation times of the cultures in our experiments.
An additional mechanism for generating deletions was dominant under growth conditions which produced an advantage for plasmids harboring a smaller TRS. Upon passage of the cultures through stationary phase, the growth advantage associated with plasmids harboring shorter lengths of TRS meant that the bacterial culture became biased towards the shortest length produced during that incubation (Fig. 6 B). Induction of transcription into the repeat sequence produced random length changes, giving a variety of deletion sizes in different experiments. A plausible mechanism for generating random length changes which could encompass almost the full length of the TRS is the formation of hairpin structures within single-stranded regions of the DNA. Stable hairpin structures have been observed in vitro within short single-stranded oligonucleotides of CTG[middot]CAG repeats and in plasmids containing long CTG[middot]CAG repeats (reviewed in 13 ).
Our data cannot determine whether the deletions due to transcription into the TRS were produced during exponential growth of the culture (as illustrated in Fig. 6 B) or only if the cultures entered stationary phase. Rare deletion events that occurred during exponential growth may not have been detected by our analysis, but the reduced lag phase of cells harboring such plasmids (17 ) could have produced their accumulation upon passage through stationary phase. The cellular physiology of stationary phase cells is dramatically different from cells in exponential growth (23 ,24 ) and the fitness of a culture continues to change during long incubations in stationary phase (25 ). It is possible that interactions between the replication and transcription machinery produced the growth disadvantages that we observed for cultures harboring plasmids with long TRS as they passed through stationary phase.
Studies in a variety of experimental systems have suggested that problems occur when a replication fork meets a transcription complex on plasmid DNA (26 -29 ). Such encounters occur during normal cell growth, so the cell must have evolved efficient ways to circumvent these interactions (30 ). However, under certain conditions errors may be more likely. The observation that DNA synthesis does not proceed efficiently through long CTG[middot]CAG repeats in vitro (31 ,32 ) suggests that polymerase reactions through TRS may be prone to aberrations in vivo. The presence of transcription complexes within the TRS may enhance the error rate of replication and, therefore, produce the high frequency of deletions reported here.
Our experiments show only that induction of transcription from the lacZ' promoter had an effect on the genetic stability of (CTG)175[middot](CAG)175. Such effects have been demonstrated for other promoters and simple repeating sequences. In E.coli the genetic stability of plasmids containing monomeric repeat sequences [(A)34[middot](T)34 and (G)34[middot](C)34] cloned downstream of the lacZ' promoter was decreased upon induction of transcription (28 ). Also, recent studies in yeast showed that transcription produced an increase in the polymorphism of dinucleotide repeats (33 ) and that high levels of transcription increased the rate of spontaneous mutations (34 ).
The discovery described herein that transcription can influence the genetic stability of simple repeating sequences prompted us to re-evaluate earlier investigations which used repetitive DNA to study unusual DNA structures. Other TRS associated with human diseases, CGG[middot]CCG and AAG[middot]CTT, were also observed to be less stable in plasmids in E.coli when cloned within a transcription unit (15 ; Ohshima, Montermini, Pandolfo and Wells, unpublished data). Similarly, during analysis of the formation of Z-DNA, dinucleotide repeats were observed to undergo more deletions when located in transcribed regions of plasmids (35 -38 ). Thus transcription has destabilizing effects on the genetic propagation of a number of different simple repetitive DNA sequences.
The relevance of our observations on the stability of TRS to their associated human diseases is unclear. The identification that more than one pathway produces genetic instabilities in long TRS is consistent with the suggestion that different mechanisms produce the distinct characteristics of disease Types 1 and 2 (12 ). Since the regulation of transcription is specific for each promoter, this may provide one biochemical explanation by which different pathways can produce expansion. In all situations in which the expansion of a TRS is associated with a human disease, the TRS is located within a gene (11 ,12 ). Thus there is potential for transcription-associated events to influence the stability of TRS in all disorders. Our observations in E.coli show that the type of genetic instability associated with a particular TRS is determined by the processes occurring upon it. The fundamental nature of these processes (transcription and replication) suggests that such instabilities are likely to be conserved through evolution, although the details may vary in different systems.
These studies were supported by grants from the National Institutes of Health (GM52982) and the Robert A.Welch Foundation. We thank D.M.J.Lilley and D.Chen for provision of the plasmid pIQ-kan and S.Kang, K.Ohshima, E.Lawson, R.Iyer and L.Bowater for helpful discussions.
*To whom correspondence should be addressed. Tel: +1 713 677 7651; Fax: +1 713 677 7689; Email: rwells@ibt.tamu.edu
Present addresses: +Imperial Cancer Research Fund, Clare Hall Laboratories, South Mimms, Hertfordshire EN6 3LD, UK and [sect]Department of Genetics of Microorganisms, Institute of Microbiology and Immunology, University of Lodz, 12/16 Banacha St., 90-237 Lodz and Center for Microbiology and Virology, Polish Academy of Sciences, 106 Lodowa St., 93-232 Lodz, Poland
REFERENCES

