Published online 8 November 2004
Nucleic Acids Research, Vol. 32 No. 19 © Oxford University Press 2004; all rights reserved
Replication-mediated instability of the GAA triplet repeat mutation in Friedreich ataxia
1 Department of Biochemistry and Molecular Biology and 2 Department of Pediatrics, University of Oklahoma Health Sciences Center, Oklahoma City, OK 73104, USA, 3 Bruce Leroy Centre for Genetic Health Research, Murdoch Children's Research Institute, 4 Department of Pediatrics, University of Melbourne, Australia, 5 BioGeM Consortium, 6 IEOS-CNR and 7 DBPCM, Federico II University, Naples, Italy and 8 Department of Genetics, Louisiana State University Health Sciences Center, New Orleans, LA, USA
* To whom correspondence should be addressed at 975 NE, 10th, BRC458, Oklahoma City, OK 73104, USA. Tel: +1 405 271 1360; Fax: +1 405 271 3910; Email: Sanjay-Bidichandani{at}ouhsc.edu
Received July 22, 2004; Revised and Accepted October 22, 2004
| ABSTRACT |
|---|
|
|
|---|
Friedreich ataxia is caused by the expansion of a polymorphic and unstable GAA triplet repeat in the FRDA gene, but the mechanisms for its instability are poorly understood. Replication of (GAATTC)n sequences (9105 triplets) in plasmids propagated in Escherichia coli displayed length- and orientation-dependent instability. There were small length variations upon replication in both orientations, but large contractions were frequently observed when GAA was the lagging strand template. DNA replication was also significantly slower in this orientation. To evaluate the physiological relevance of our findings, we analyzed peripheral leukocytes from human subjects carrying repeats of similar length (8107 triplets). Analysis of 9400 somatic FRDA molecules using small-pool PCR revealed a similar mutational spectrum, including large contractions. The threshold length for the initiation of somatic instability in vivo was between 40 and 44 triplets, corresponding to the length of a eukaryotic Okazaki fragment. Consistent with the stabilization of premutation alleles during germline transmission, we also found that instability of somatic cells in vivo and repeats propagated in E.coli were abrogated by (GAGGAA)n hexanucleotide interruptions. Our data demonstrate that the GAA triplet repeat mutation in Friedreich ataxia is destabilized, frequently undergoing large contractions, during DNA replication.
| INTRODUCTION |
|---|
|
|
|---|
Friedreich ataxia, the most prevalent inherited ataxia, is caused by abnormal expansion of a GAA triplet repeat sequence in intron 1 of the FRDA gene (1,2) (http://www.geneclinics.org/). This sequence is both polymorphic and genetically unstable (1,3,4). Normal alleles, which contain <32 triplets, are classified as being either short normal (SN; <12 triplets) or long normal (LN;
12 triplets). Disease-causing expanded (E) alleles, containing 661700 triplets, interfere with gene transcription at the FRDA locus, thereby resulting in frataxin deficiency and Friedreich ataxia (59). SN and LN alleles are stably transmitted from parent to offspring (3,4). However, E alleles display intergenerational instability, usually contracting by 2030% upon paternal transmission, and showing equal tendencies for expansion and contraction during maternal transmission (10,11). Rare, intermediate-sized alleles, containing 3365 uninterrupted GAA triplets, are termed premutation (PM) alleles because they may undergo hyperexpansion upon intergenerational transmission, expanding by >5- to 10-fold to generate E alleles in offspring [(3,4,12,13); unpublished data]. PM alleles and those at the long end of the LN spectrum may be interrupted by (GAGGAA)n hexanucleotides, which are believed to result in their stabilization during germline transmission (3). By analyzing individual somatic genomes from peripheral blood leukocytes using small-pool PCR (SP-PCR), we previously showed that E alleles are extremely unstable in vivo and show a marked predilection for large contractions (14). Unstable triplet repeats are known to cause at least 15 inherited diseases (15,16). So far, only three of the ten possible triplet repeat motifs are known to be associated with disease: (CGGCCG)n, (CTGCAG)n, and (GAATTC)n. Instability of (CGGCCG)n and (CTGCAG)n repeat sequences in bacteria and yeast is dependent upon both repeat length and the orientation of the repeat relative to the origin of replication (1726). For example, the (CTGCAG)n repeat tract is more unstable, with a significant tendency for contractions, when CTG is the template for lagging strand synthesis. Either strand of the (CGGCCG)n or (CTGCAG)n repeat sequence (i.e. the CNG triplet motif) has a tendency to form hairpin structures. The relative thermodynamic stability of the hairpin formed by single-stranded CTG versus CAG on the lagging strand template is thought to be the basis for the contraction bias in both bacterial and yeast systems (17).
Recent evidence suggests the possibility that GAA and TTC single-stranded sequences may form hairpin structures (27), which is a well-known property of both the CNG triplet repeats (28). However, the polypurinepolypyrimidine (GAATTC)n repeat differs from the two unstable CNG triplet repeats in its ability to adopt triplex structures in supercoiled templates (5,7,2932). Although a triplex structure formed by the (GAATTC)n repeat is believed to be the molecular basis for the transcriptional interference afforded by expanded FRDA alleles (57,9,30,33), its role in mediating genetic instability is less clear. We proposed that errors during lagging strand synthesis could be the basis for the genetic instability of the (GAATTC)n repeat based on our previous observation of large, spontaneous GAA contractions in somatic cells in vivo (14), the known disparity in the relative affinities of replication protein A (RPA) for polypurine versus polypyrimidine sequences (34) and the propensity of single-stranded (GAA)n sequence to adopt altered structural conformations (35,36). Furthermore, Ohshima et al. (6) had previously reported orientation-dependent instability of (GAATTC)n repeats, noting increased smearing of their cloned inserts after restriction digest, indicative of a contraction bias in plasmids when GAA was the template for lagging strand synthesis.
Therefore, in this study we conducted a more detailed investigation of the effect of orientation of DNA replication on the instability of repeats, cloned from human carriers of several FRDA alleles, in plasmids propagated in Escherichia coli. We also evaluated the instability of similar repeat lengths in peripheral blood leukocytes from human donors using SP-PCR. Here, we show that while small slippage-type events occurred in both orientations, large contractions of (GAATTC)n sequences occurred primarily when GAA was the template for lagging strand synthesis. DNA replication was significantly impeded in this orientation, and the incremental accumulation of large contractions was coincident with the phase of maximal plasmid replication. We also observed the same spectrum of mutations in peripheral leukocytes in vivo in human subjects carrying FRDA alleles of similar length, including complete reversions to the normal/premutation length. During the preparation of this manuscript, similar findings were reported using (GAATTC)n containing plasmids in yeast (37). This indicates that the mechanism of the strand-specific differential instability of (GAATTC)n repeats appears to be conserved among prokaryotes and eukaryotes, and the contraction bias is seen not only in prokaryotes and simple eukaryotes, but in human somatic cells as well. The threshold length for the initiation of somatic instability in vivo coincided with the length of a eukaryotic Okazaki fragment (38,39). (GAGGAA)n hexanucleotide interruptions stabilized the (GAATTC)n sequence in E.coli and human somatic cells, constituting the first direct evidence for a stabilizing role of interruptions in the context of the human genome in vivo.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Plasmid construction
Genomic DNA from whole blood of human subjects was used to amplify 9, 21, 44, 81, 107, 119 and 163 pure (GAATTC)n repeats and 114 (GAATTC)n repeats containing a (GAGGAA)n hexanucleotide repeat interruption in intron 1 of the FRDA gene, using the following primers (40): GAA-104F (5'-GGCTTAAACTTCCCACACGTGTT-3') and GAA-629R (5'-AGGACCATCATGGCCACACTT-3'), followed by nested PCR using one of the following primer pairs: gaapst1-F (5'-GCTCCGCTGCAGCCCAGTATCTACTAAAAAATAC-3') and gaaxba1-R (5'-GATGCGTCTAGACGCGCGACACCACGCCCGGCTAAC-3'), or ttcpst1-F (5'-GCTCCGCTGCAGCGCGCGACACCACGCCCGGCTAAC-3') and ttcxba1-R (5'-GATGCGTCTAGACCCAGTATCTACTAAAAAATAC-3').
The PCR products were digested with PstI and XbaI, which recognize sequences located at the 5' end of the forward and reverse primers, respectively. The fragments containing (GAATTC)n repeats along with minimal flanking sequence (38 bp 5' and 35 bp 3' to the repeat) were ligated into the PstI and XbaI sites of pUC19. The pUC19 vector used contains the ColE1 origin and a portion of the ROP gene making it a low copy number plasmid. The primer pairs determined the orientation of the repeat tract: GAA orientation (gaapst1-F/gaaxba1-R) or TTC orientation (ttcpst1-F/ttcxba1-R) (Figure 1). The following recombinant plasmids were selected for mutation analysis: 9, 21, 48, 82, 105 and 111 repeats in the GAA orientation, and 8, 21, 48, 79, 105 and 108 repeats in the TTC orientation. Repeat lengths and orientation were determined by sequencing. The constructs containing (GAA)9, (GAA)48, (GAA)105, (GAA)111, (TTC)8, (TTC)48, (TTC)79, (TTC)105 and (TTC)108 were pure (GAATTC)n repeats, and the other constructs contained either a single non-GAA interruption within the repeat tract as follows: (GAA)21 = (GAA)17(A)(GAA)4; (GAA)82 = (GAA)70(AA)(GAA)11; (TTC)21 = (TTC)4(TT)(TTC)17 or a (GAGGAA)5 hexanucleotide interruption along with additional non-GAA sequence as follows: (GAA)101* = (GAA)60GA(GAA)2GAGGA(GAA)18(GAGGAA)5(GAA)8; (TTC)95* = (TTC)8(TTCCTC)5(TTC)18TCCTC(TTC)2TC(TTC)54. A control, non-repeat containing plasmid with a 289 bp insert of random sequence (51% G/C content) was created by amplifying exon 5 of the APTX (aprataxin) gene using primers: aptx5pst1-F (5'-GCTCCGCTGCAGGTCTGTTTCCTTCTCTTGT-3') and aptx5xba1-R (5'-GATGCGTCTAGAGGAGCCAGCAGCACTACC-3'), which was similarly digested and inserted into the PstI and XbaI sites in pUC19.
|
Analysis of (GAATTC)n repeat instability
Individual bacterial (DH5
) colonies obtained by plating glycerol stocks of sequence verified plasmids were grown in liquid culture (LuriaBertani + 100 µg/ml ampicillin) at 30°C for 1224 h. PCR of individual colonies was performed to confirm that the inserts were full-length repeat tracts immediately prior to initiation of cultures. Colonies representing each repeat length and orientation of replication were cultured in triplicate, so that at each hour (between 1224 h) three separate 5 ml cultures could be harvested for each construct. Detailed studies were carried out for GAA-82, TTC-79 and a random sequence insert, in triplicate. Each culture was treated as follows: 1 ml was used to estimate the OD600 as a measure of cell density (to plot bacterial growth curves), 1 ml was used to set up a glycerol stock (for mutation studies, see below) and the rest was used for plasmid DNA isolation (to measure the amount of plasmid replication). Plasmid DNA was applied to a Zeta-Probe® membrane using a Bio-Dot® microfiltration apparatus (BioRad), probed with
-ATP32-labeled pUC19-R oligonucleotide and quantified by densitometry. Bacterial growth curves and plasmid replication results were used to determine the time points for detailed mutation analysis: 12, 16, 20 h (log phase), and 24 h (stationary phase) of culture. We specifically avoided multiple sub-culturing of E.coli and only analyzed the mutational profile over a single log phase, which resulted in low mutation frequencies (>80% of final inserts were full-length), thus minimizing the bias for contractions and multiple mutations involving the same repeat tract. Previous reports indicate that transformation of triplet repeat-containing plasmids per se increases the instability of the repeat tract (41). Therefore, glycerol stocks for GAA-82 and TTC-79 at 12, 16, 20 and 24 h time points were plated and incubated at 37°C for 16 h. PCR of individual colonies was performed to assess (GAATTC)n repeat instability using primers: GS-F = (5'-CCCAGTATCTACTAAAAAATAC-3') and GS-R (5'-ACACCACGCCCGGCTAACTTTTC-3'). Relative sizes of PCR products were determined by electrophoresis on 3% agarose gels, coupled with direct sequencing of selected products. Approximately 100 colonies were analyzed for each orientation and time point, in triplicate, and instability was defined as the percentage of the total number of PCR products amplified whose length was altered after replication in E.coli. Large contractions were defined as products that lost >50% of their initial repeat length.
To compare E.coli growth curves and instability of pure versus interrupted (GAATTC)n repeat tracts, 5 ml cultures were inoculated with colonies containing plasmids with either the pure or interrupted repeat tract inserted in both orientations. Cultures were grown at 30°C for 1024 h, in triplicate, such that three cultures for each construct were harvested every hour. Growth curves were generated as described above. Repeat tract instability was analyzed after 22 h of growth, as described previously. Approximately 75 colonies were analyzed for each of the four constructs, in triplicate.
SP-PCR analysis
The SP-PCR analysis was carried out as described previously (14,42). Briefly, serial dilutions of human genomic DNA, ranging from 6600 pg, were prepared in siliconized microfuge tubes. PCR was performed using GAA-104F and GAA-629R, which allowed accurate sizing of alleles used in the present study. PCR products were resolved by electrophoresis on 12% agarose gels (2% gels were used for the calculation of threshold length for the initiation of instability), and bands detected by Southern blotting using an end-labeled (TTC)9 oligonucleotide probe. The calculation of the average number of individual FRDA molecules per reaction (the gene equivalent) was performed by Poisson analysis as described previously (42,43). The average quantity of genomic DNA required for the amplification of one FRDA molecule was 13.4 pg (95% CI 10.616.1 pg). For each genomic DNA sample, multiple reactions were performed using small pools of 2.525 individual FRDA molecules per reaction to detect mutations. Mutation loads were calculated as the proportion of molecules that differed by >5% in size from the constitutional (most common) allele determined by direct sequencing.
Statistical methods
Comparison of medians was carried out using the MannWhitney U test, means were compared using the t-test, and frequencies were compared by
2 analysis.
| RESULTS |
|---|
|
|
|---|
Large contractions of (GAATTC)n occur when GAA is the template for lagging strand synthesis
(GAATTC)n repeat tracts (n = 9, 21, 48, 80 and 105 triplets) were cloned in both orientations relative to the ColE1 origin of replication, such that either GAA or TTC would serve as the template for lagging strand synthesis (Figure 1). Replication of these cloned (GAATTC)n triplet repeats, which were propagated in E.coli (DH5
), displayed length-dependent instability. Whereas repeat tracts containing
21 triplets were completely stable, repeat tracts containing 48 and
80 triplets were moderately unstable and the construct containing
100 triplets in the GAA orientation was highly unstable (data not shown). We selected the moderately unstable repeat tracts containing
80 triplets to perform a detailed analysis of the effects of differential orientation of replication on GAA triplet repeat instability, which was accomplished by analyzing multiple individual replication events in bacterial colonies. The (GAATTC)n sequence was found to be significantly more unstable when GAA was the template for lagging strand synthesis (GAA orientation; GAA-82) compared with TTC (TTC orientation; TTC-79) (Figure 2A). The same orientation-dependent instability was also noted when (GAATTC)n repeats were cloned in plasmids pBluescript II (Stratagene), pCR2.1 and pCR3.1 (Invitrogen) and replicated in DH5
and TOP10 (Invitrogen) strains of E.coli (data not shown).
|
To further examine the role of replication in GAA triplet repeat instability, we analyzed the mutational frequencies and spectra during the entire period of active replication of plasmids in E.coli. Throughout the log phase of E.coli growth, both bacterial growth and plasmid DNA replication were significantly slower when GAA was the template for lagging strand synthesis (Figure 2B and C). A random sequence insert of the same size showed the same profile as when TTC was the lagging strand template (Figure 2B and C). Even when cultures were grown for 40 h, well into the stationary phase, bacterial density was significantly lower for GAA-82 than for either TTC-79 or the random sequence control (data not shown). These data indicate that DNA replication is significantly slower when GAA is the template for lagging strand synthesis.
Repeat instability was analyzed for GAA-82 and TTC-79 after 12, 16, 20 (log phase), and 24 h (stationary phase) of culture. Instability was significantly greater in the GAA orientation, and it increased throughout the exponential phase of plasmid replication (Figure 2D). Contractions were much more prevalent than expansions in both orientations. However, analysis of all contraction events from 1224 h revealed that large contractions, involving >50% of the original length, were observed primarily when GAA was the template for lagging strand synthesis (Figure 2A and E). Large contractions in the TTC orientation were significantly less frequent (P < 0.001), and of lesser magnitude [median size of contractions = 25.5 versus 49.5 triplets (P < 0.001)]. Furthermore, despite the relatively blunted exponential replication phase, there was an incremental accumulation of large contractions throughout the log phase in the GAA orientation (Figure 2F), and indeed, most of the observed instability was due to these large contractions (Figure 2D and F).
We did not observe large expansions in either orientation of replication. Small contractions and expansions (involving <10% of the original tract length) were equally frequent in the GAA or TTC orientations (P = 0.41 and P = 0.37, respectively). Large contractions were far more prevalent than small contractions only in the GAA orientation (P < 0.001) (Figure 2E). In summary, we noted large contractions predominantly when GAA is the lagging strand template, and this was coincident with plasmid DNA replication.
Similar mutational spectrum is observed at the FRDA locus in human somatic cells in vivo
To test the physiological relevance of our observations, we performed SP-PCR analysis of individual FRDA molecules containing repeat lengths similar to those tested in E.coli, derived from genomic DNA of peripheral blood leukocytes of human subjects. We have previously optimized this technique for the analysis of GAA triplet repeat instability at the FRDA locus (14). Analysis of 2520 individual FRDA molecules from four heterozygous carriers of constitutional alleles with 78 (n = 524 molecules), 81 (n = 700 molecules), 91 (n = 328 molecules) and 105 (n = 968 molecules) uninterrupted triplet repeats revealed the same spectrum of mutations observed in E.coli: small expansions and contractions, large contractions and a paucity of large expansions or intermediate-sized contractions (Figure 3A and B). Of the 2520 molecules analyzed, we observed 283 variant bands (mutation load = 11.2%), with a significant contraction bias [55 (2.2%) expansions versus 228 (9%) contractions, P < 0.001; Figure 3B]. In contrast to replication in E.coli, somatic instability in leukocytes resulted in far more small versus large contractions. However, the extent of the large contractions was similar in the two systems, with most large contractions resulting in almost complete reversion to the normal size, and very few intermediate-sized contractions (Figure 3B). It should be noted that the frequency of large contractions is most likely underestimated in our SP-PCR assay since all four carriers used in this study also have a normal allele, which makes it nearly impossible to detect complete reversion events of individual FRDA molecules (Figure 3A).
|
The threshold length for the initiation of somatic instability at the FRDA locus coincides with the length of a eukaryotic Okazaki fragment
Using SP-PCR, we previously showed that the threshold length for the initiation of somatic instability in vivo is between 26 and 44 uninterrupted GAA triplet repeats (14). Here, using the same technique, we analyzed a total of 5593 individual FRDA molecules with constitutional allele sizes ranging from 8 to 66 uninterrupted GAA triplet repeats to more precisely define the threshold length. SP-PCR of (GAA)820 (n = 349 molecules), (GAA)30 (n = 560 molecules) and (GAA)39 (n = 1150 molecules) showed complete stability (Figure 4A and B) (data not shown). In contrast, SP-PCR analysis of (GAA)44 (n = 2304 molecules) and (GAA)66 (n = 1230 molecules) showed somatic instability in vivo, with mutation loads of 6.3 and 30%, respectively (Figure 4C and D). This indicates that the threshold length for the initiation of somatic instability in peripheral leukocytes in vivo at the FRDA locus is between 40 and 44 uninterrupted triplet repeats. The (GAA)39 allele, which showed no somatic instability (Figure 3B), is known to have undergone hyperexpansion in one of three germline transmissions, producing a (GAA)650 allele in the offspring. Therefore, it seems that the minimum length required for the initiation of somatic instability at the FRDA locus may be longer than that required for germline instability.
|
(GAGGAA)n hexanucleotide interruptions stabilize the (GAATTC)n triplet repeat at the FRDA locus in somatic cells in vivo
To test whether (GAGGAA)n hexanucleotide interruptions in (GAATTC)n repeat tracts cause stabilization of triplet repeats, as has been observed in E.coli (44) and predicted in human germline transmissions (3), SP-PCR analysis was performed using leukocytic DNA from carriers of two similar (GAATTC)n alleles. One carrier had a pure (GAA)107 allele, and another a (GAA)114* allele interrupted with a (GAGGAA)5 hexanucleotide sequence close to the 3' end of the repeat tract [(GAA)76(GAGGGA)(GAA)18(GAGGAA)5(GAA)8]. SP-PCR of 1316 individual FRDA molecules from the carrier of the pure (GAA)107 allele revealed 294 variant bands (mutation load = 22.3%), ranging in size from 63 to 136 triplets (Figure 5A). In contrast, SP-PCR analysis of 1294 individual FRDA molecules from the carrier of the interrupted (GAA)114* allele showed no somatic variability (Figure 5B). This indicates that the (GAGGAA)n hexanucleotide interruption, which is frequently seen in FRDA alleles with >27 triplets (3), stabilizes the repeat tract in somatic cells in vivo. This is an interesting result since the hexanucleotide interruption maps close to the 3' end of the (GAA)114* allele, leaving a tract of 76 uninterrupted GAA triplets, which is greater than the threshold length for the initiation of somatic instability (Figure 4).
|
(GAGGAA)n hexanucleotide interruptions stabilize the (GAATTC)n triplet repeat when propagated in plasmids in E.coli
In order to test whether the (GAGGAA)n interruption also stabilizes (GAATTC)n repeats in E.coli, we cloned PCR products obtained from leukocytic DNA of the two individuals mentioned above, one having a pure (GAA)107 allele, and another having an interrupted (GAA)114* allele. The resulting PCR products were inserted into pUC19 in both orientations relative to the ColE1 origin of replication, generating the following constructs: (GAA)111, (GAA)101*, (TTC)108 and (TTC)95* (see Materials and Methods for exact sequences). Bacterial growth curves were determined for each of the four constructs as an indication of plasmid replication (as shown in Figure 2). The pure (GAATTC)n repeat resulted in slower growth when GAA was the template for lagging strand synthesis [compare (GAA)111 with (TTC)108; Figure 6A], as was previously observed with shorter repeat tracts (Figure 2B). However, comparison of (GAA)111 versus (GAA)101* revealed similar growth patterns, indicating that plasmid replication was similarly impeded by the interrupted GAA tract. Despite the similar patterns of bacterial growth observed for pure and interrupted GAA constructs, the pure repeat sequence was found to be significantly more unstable than the interrupted repeat sequence in both the GAA (92 versus 31.9%, P < 0.001) and TTC (29.6 versus 7.6%; P = 0.004) orientations (Figure 6C), and this instability was composed almost entirely of contractions (data not shown). Interestingly, the stabilizing effect of the hexanucleotide interruption was more pronounced in the GAA orientation (Figure 6C). In the TTC orientation, the (GAGGAA)n interrupted repeat tract was very stable (Figure 6C), similar to the result we observed for this sequence in human somatic cells in vivo (Figure 5B).
|
| DISCUSSION |
|---|
|
|
|---|
Friedreich ataxia is a recessive disease and asymptomatic heterozygotes, i.e. individuals who have one FRDA allele containing a fully expanded repeat, represent 0.51% of the IndoEuropean population (2,4). Consequently, most Friedreich ataxia patients inherit fully expanded alleles (typically containing 6001200 triplets). Hyperexpansion of PM alleles as a means of inheriting the FRDA mutation occurs very rarely (3,4,12,13). This is in contrast to all the other triplet repeat diseases, which are dominantly inherited, wherein de novo expansions are much more frequent. We therefore believe that the reversal of a fully expanded allele to the normal size in somatic cells, rather than prevention of germline hyperexpansion, represents an appropriate strategy for the treatment of Friedreich ataxia. We have previously shown that, unlike the expansion bias of CTG repeat in myotonic dystrophy (4550), fully expanded GAA triplet repeat alleles at the FRDA locus have a marked tendency to contract in somatic cells in vivo, in some cases even reverting to the normal/premutation size range (14). Our goal is to understand the molecular mechanism(s) underlying the apparently spontaneous reversion of the FRDA mutation, and to devise methods to accelerate this process in somatic cells.
Here, we show that (GAATTC)n repeats are more unstable when GAA is the template for lagging strand synthesis during plasmid replication in E.coli, with a marked contraction bias. Ohshima et al. (6) had previously noted increased smearing of their cloned inserts upon multiple rounds of replication, indicative of a contraction bias in plasmids propagated in E.coli when GAA was the template for lagging strand synthesis. During the preparation of this manuscript, a paper was published reporting similar results in a yeast model system (37). The instability of the (GAATTC)n sequence was orientation-dependent, with increased instability and a contraction bias when GAA was the template for lagging strand synthesis. These data support our conclusion that (GAATTC)n repeat instability is replication-mediated, and also indicate that the mechanism of the strand-specific, differential instability is conserved in prokaryotes and eukaryotes, as is seen for instability of CNG repeats in bacteria and yeast (1726). Our present analysis of individual replication events using the colony PCR method allowed us to precisely measure the degree and type of instability in the E.coli model system. Furthermore, it allowed us to accurately visualize the spectrum of mutations that occur after individual replication events. Our analysis revealed that it is mainly large contractions that increase in frequency when GAA is the template for lagging strand synthesis. Large contractions frequently resulted in complete reversion to the normal size range. Moreover, in peripheral leukocytes derived from human carriers of similar sized alleles, we identified the same spectrum of mutations. Somatic mutations in vivo consisted mainly of small slippage events and large contractions into the non-disease size range. The similarity of the mutational spectra observed in multiple independent systems, especially the observation of large contractions, suggests that a common mechanism may underlie the instability of (GAATTC)n repeats.
The mechanism(s) responsible for GAA triplet repeat instability are poorly understood. We propose that at least two distinct mechanisms underlie the mutations we observed. Small length changes of the repeat tract (<10% variation) may occur as a result of slippage and mispairing during replication of the triplet repeat. The equal frequency and magnitude of small length changes in both GAA and TTC orientations is consistent with the expectation that slippage is equally likely to occur during leading or lagging strand synthesis, and with the observation that GAA and TTC strands undergo approximately the same degree of slippage during rolling circle replication (51). However, our results implicate erroneous lagging strand synthesis, when GAA is the template strand, as the likely mechanism for the generation of large contractions of the (GAATTC)n repeat (Figure 7). The incremental accumulation of large contractions was associated with DNA replication when GAA was the template for lagging strand synthesis. Our in vivo data show that the threshold length for the initiation of somatic instability is between 40 and 44 uninterrupted triplets. Therefore, somatic instability in vivo may initiate when the length of the (GAATTC)n repeat exceeds the length of a eukaryotic Okazaki fragment. If the single-stranded lagging strand template were to adopt a stable or metastable secondary structure(s), the replication machinery could bypass a variable number of repeats in the nascent strand, resulting in contraction of the repeat (Figure 7). The 50-fold lower affinity of RPA for polypurine sequences (34) and the selective ability of GAA sequences to adopt intrastrand structures (17,35,36) lend further support to this mechanism. The frequent observation of large contractions and paucity of intermediate-sized contractions could stem from a minimum length requirement for GAA repeats to maintain a stable structure in order to be bypassed in the nascent strand. These data have important implications for understanding the molecular basis of somatic instability at the FRDA locus.
|
It is likely that a structural transition may underlie the initiation of somatic instability in vivo. The minimum length for the formation of sticky DNA (59 triplets) (30) is greater than the 4044 triplet repeat threshold for the initiation of somatic instability. However, Potaman et al. (32) showed that the (GAATTC)n repeat undergoes a structural transition from a stable intramolecular triplex at 923 triplets to an intramolecular bi-triplex structure at 42 triplets, coinciding in length with the threshold for the initiation of somatic instability in human cells. Interestingly, the (GAA)114* hexanucleotide interrupted allele was stable in somatic cells, despite containing a pure repeat tract of 76 triplets. Pure (GAA)111 and (TTC)108 sequences were also significantly more unstable than interrupted (GAA)101* and (TTC)95* sequences, respectively, when propagated in E.coli. However, the interruption appeared to have no effect on the slowed bacterial growth. The mechanism by which the hexanucleotide interruption confers stability is unknown, but it may be due to the inability of the impure GAA template strand to adopt a stable secondary structure. Previous work has shown that a repeat tract comprised entirely of (GAGGAA)n repetitive sequence (50% non-GAA) does not form sticky DNA, unlike pure (GAA)n repeat sequences (44). Interestingly, in the plasmid replication model the hexanucleotide interruption resulted in significantly enhanced stability of the adjacent repeat when GAA was the lagging strand template, despite interfering with efficient plasmid replication. This dissociation of replication blockade and repeat instability suggests that while bacterial growth and replication may be slowed by the homopurinehomopyrimidine nature of the sequence, purity of the triplet repeat tract is required for GAA instability.
Furthermore, it seems that (GAATTC)n repeats are quantitatively more unstable than (CTGCAG)n and (CGGCCG)n repeats in somatic cells in vivo. SP-PCR analysis of similar-sized alleles of the other triplet repeats in leukocytes revealed very low mutation frequencies compared with the mutation loads we observed with (GAATTC)n alleles (50,52). The reason for these differences is not known, but it is interesting to note that during the evolution of the human genome, (GAATTC)n repeats have undergone significant instability as reflected by the development of a wide range of allele lengths in comparison with (CTGCAG)n and (CGGCCG)n sequences (53).
In conclusion, we have shown that the (GAATTC)n repeat that causes Friedreich ataxia is destabilized during DNA replication, undergoing large contractions when GAA serves as the lagging strand template. In somatic cells in vivo, the (GAATTC)n repeat at the FRDA locus initiates instability between 40 and 44 triplets, displays significant mutation load, and the mutational spectrum includes large contractions that result in reversion to non-disease alleles. These data indicate that the FRDA mutation is reversible.
| ACKNOWLEDGEMENTS |
|---|
We are grateful to the patients and their families for participating in this study. We would like to thank Dr Gillian Dalgliesh for critically reviewing this manuscript. This research was supported in part by grants from the NIH/NINDS (NS047596), American Diabetes Association and OCAST to S.I.B. S.S. was supported by a fellowship from the Presbyterian Health Foundation.
| REFERENCES |
|---|
|
|
|---|
- Campuzano,V., Montermini,L., Moltó,M.D., Pianese,L., Cossée,M., Cavalcanti,E., Monrós,F., Rodius,F., Duclos,F., Monticelli,A. et al. ( (1996) ) Friedreich's ataxia: autosomal recessive disease caused by an intronic GAA triplet repeat expansion. Science, , 271, , 14231427.[Abstract]
- Bidichandani,S.I. and Ashizawa,T. ( (2002) ) Friedreich ataxia, GeneReviews: Genetic Disease Online Reviews of GeneTests-GeneClinics. Copyright University of Washington, Seattle, WA, USA.
- Montermini,L., Andermann,E., Labuda,M., Richter,A., Pandolfo,M., Cavalcanti,F., Pianese,L., Iodice,L., Farina,G., Monticelli,A. et al. ( (1997) ) The Friedreich ataxia GAA triplet repeat: premutation and normal alleles. Hum. Mol. Genet., , 6, , 12611266.
[Abstract/Free Full Text] - Cossée,M., Schmitt,M., Campuzano,V., Reutenauer,L., Moutou,C., Mandel,J.L. and Koenig,M. ( (1997) ) Evolution of the Friedreich's ataxia trinucleotide repeat expansion: founder effect and premutations. Proc. Natl Acad. Sci., , 94, , 74527457.
[Abstract/Free Full Text] - Bidichandani,S.I., Ashizawa,T. and Patel,P.I. ( (1998) ) The GAA triplet-repeat expansion in Friedreich ataxia interferes with transcription and may be associated with an unusual DNA structure. Am. J. Hum. Genet., , 62, , 111121.[CrossRef][Web of Science][Medline]
- Ohshima,K., Montermini,L., Wells,R.D. and Pandolfo,M. ( (1998) ) Inhibitory effects of expanded GAATTC triplet repeats from intron 1 of the Friedreich ataxia gene on transcription and replication in vivo. J. Biol. Chem., , 273, , 1458814595.
[Abstract/Free Full Text] - Grabczyk,E. and Usdin, K ( (2000) ) The GAATTC triplet repeat expanded in Friedreich's ataxia impedes transcription elongation by T7 RNA polymerase in a length and supercoil dependent manner. Nucleic Acids Res., , 28, , 28152822.
[Abstract/Free Full Text] - Campuzano,V., Montermini,L., Lutz,Y., Cova,L., Hindelang,C., Jiralespong,S., Trottier,Y., Kish,S.J., Faucheux,B., Trouillas,P. et al. ( (1997) ) Frataxin is reduced in Friedreich ataxia patients and is associated with mitochondrial membranes. Hum. Mol. Genet., , 6, , 17711780.
[Abstract/Free Full Text] - Sakamoto,N., Ohshima,K., Montermini,L., Pandolfo,M. and Wells,R.D. ( (2001) ) Sticky DNA, a self-associated complex formed at long GAATTC repeats in intron 1 of the frataxin gene, inhibits transcription. J. Biol. Chem., , 276, , 2717127177.
[Abstract/Free Full Text] - De Michele,G., Cavalcanti,F., Criscuolo,C., Pianese,L., Monticelli,A., Filla,A. and Cocozza,S. ( (1998) ) Parental gender, age at birth and expansion length influence GAA repeat intergenerational instability in the X25 gene: pedigree studies and analysis of sperm from patients with Friedreich's ataxia. Hum. Mol. Genet., , 7, , 19011906.
[Abstract/Free Full Text] - Monrós,E., Moltó,M.D., Martinez,F., Cañizares,J., Blanca,J., Vilchez,J.J., Prieto,F., de Frutos,R. and Palau,F. ( (1997) ) Phenotype correlation and intergenerational dynamics of the Friedreich ataxia GAA trinucleotide repeat. Am. J. Hum. Genet., , 61, , 101110.[Web of Science][Medline]
- Epplen,C., Epplen,J.T., Frank,G., Miterski,B., Santos,E.J. and Schols,L. ( (1997) ) Differential stability of the (GAA)n tract in the Friedreich ataxia (STM7) gene. Hum. Genet., , 99, , 834836.[CrossRef][Web of Science][Medline]
- Delatycki,M.B., Paris,D., Gardner,R.J., Forshaw,K., Nicholson,G.A., Nassif,N., Williamson,R. and Forrest,S.M. ( (1998) ) Sperm DNA analysis in a Friedreich ataxia premutation carrier suggests both meiotic and mitotic expansion in the FRDA gene. J. Med. Genet., , 35, , 713716.
[Abstract/Free Full Text] - Sharma,R., Bhatti,S., Gómez,M., Clark,R.M., Murray,C., Ashizawa,T. and Bidichandani,S.I. ( (2002) ) The GAA triplet-repeat shows a high level of somatic instability in vivo, with a significant predilection for large contractions. Hum. Mol. Genet., , 11, , 21752187.
[Abstract/Free Full Text] - Cummings,C.J. and Zoghbi,H.Y. ( (2000) ) Fourteen and counting: unraveling trinucleotide repeat diseases. Hum. Mol. Genet., 9, , 909916.
[Abstract/Free Full Text] - Bowater,R.P. and Wells,R.D. ( (2001) ) The intrinsically unstable life of DNA triplet repeats associated with human hereditary disorders. Prog. Nucleic Acid Res. Mol. Biol., , 66, , 159202.[Web of Science][Medline]
- Kang,S., Jaworski,A., Ohshima,K. and Wells,R.D. ( (1995) ) Expansion and deletion of CTG repeats from human disease genes are determined by the direction of replication in E.coli. Nature Genet., , 10, , 213218.[Web of Science][Medline]
- Samadashwily,S.M., Raca,G. and Mirkin,S.M. ( (1997) ) Trinucleotide repeats affect DNA replication in vivo. Nature Genet., , 17, , 298304.[Web of Science][Medline]
- Sarkar,P.S., Chang,H.C., Boudi,F.B. and Reddy,S. ( (1998) ) CTG repeats show bimodal amplification in E.coli. Cell, , 95, , 531540.[CrossRef][Web of Science][Medline]
- Freudenreich,C.H., Stavenhagen,J.B. and Zakian,V.A. ( (1997) ) Stability of a CTG/CAG trinucleotide repeat in yeast is dependent on its orientation in the genome. Mol. Cell. Biol., , 17, , 20902098.[Abstract]
- Miret,J.J., Pessoa-Brandão,L. and Lahue,R.S. ( (1998) ) Orientation-dependent and sequence-specific expansions of CTG/CAG trinucleotide repeats in Saccharomyces cerevisiae. Proc. Natl Acad. Sci. USA, , 95, , 1243812443.
[Abstract/Free Full Text] - Maurer,D.J., O'Callaghan,B.L. and Livingston,D.M. ( (1996) ) Orientation dependence of trinucleotide CAG repeat instability in Saccharomyces cerevisiae. Mol. Cell. Biol., , 16, , 66176622.[Abstract]
- Shimizu,M., Gellibolian,R., Oostra,B.A. and Wells,R.D. ( (1996) ) Cloning, characterization, and properties of plasmids containing CGG triplet repeats from the FMR1 gene. J. Mol. Biol., , 258, , 614626.[CrossRef][Web of Science][Medline]
- Hirst,M.C. and White,P.J. ( (1998) ) Cloned human FMR1 trinucleotide repeats exhibit a length- and orientation-dependent instability suggestive of in vivo lagging strand secondary structure. Nucleic Acids Res., , 26, , 23532358.
[Abstract/Free Full Text] - White,P.J., Borts,R.H. and Hirst,M.C. ( (1999) ) Stability of the human fragile X (CGG)(n) triplet repeat array in Saccharomyces cerevisiae deficient in aspects of DNA metabolism. Mol. Cell. Biol., , 19, , 567584.
[Abstract/Free Full Text] - Balakumaran,B.S., Freudenreich,C.H. and Zakian,V.A. ( (2000) ) CGG/CCG repeats exhibit orientation-dependent instability and orientation-independent fragility in Saccharomyces cerevisiae. Hum. Mol. Genet., , 9, , 93100.
[Abstract/Free Full Text] - Heidenfelder,B.L., Makhov,A.M. and Topal,M.D. ( (2003) ) Hairpin formation in Friedreich's ataxia triplet repeat expansion. J. Biol. Chem., , 278, , 24252431.
[Abstract/Free Full Text] - Gacy,A.M., Goellner,G., Juranic,N., Macura,S. and McMurray,C.T. ( (1995) ) Trinucleotide repeats that expand in human disease form hairpin structures in vitro. Cell, , 81, , 533540.[CrossRef][Web of Science][Medline]
- Gacy,A.M., Goellner,G.M., Spiro,C., Chen,X., Gupta,G., Bradbury,E.M., Dyer,R.B., Mikesell,M.J., Yao,J.Z., Johnson,A.J. et al. ( (1998) ) GAA instability in Friedreich's ataxia shares a common, DNA-directed and intraallelic mechanism with other trinucleotide diseases. Mol. Cell, , 1, , 583593.[CrossRef][Web of Science][Medline]
- Sakamoto,N., Chastain,P.D., Parniewski,P., Ohshima,K., Pandolfo,M., Griffith,J.D. and Wells,R.D. ( (1999) ) Sticky DNA: self-association properties of long GAATTC repeats in RRY triplex structures from Friedreich's ataxia. Mol. Cell., , 3, , 465475.[CrossRef][Web of Science][Medline]
- Mariappan,S.V.S., Catasti,P., Silks,L.A.,III, Bradbury,E.M. and Gupta,G. ( (1999) ) The high-resolution structure of the triplex formed by the GAA/TTC triplet repeat associated with Friedreich's ataxia. J. Mol. Biol., , 285, , 20352052.[CrossRef][Web of Science][Medline]
- Potaman,V.N., Oussatcheva,E.A., Lyubchenko,Y.L., Shlyakhtenko,L.S., Bidichandani,S.I., Ashizawa,T. and Sinden,R.R. ( (2004) ) Length-dependent structure formation in Friedreich ataxia (GAA)n(TTC)n repeats. Nucleic Acids Res., , 32, , 12241231.
[Abstract/Free Full Text] - Grabczyk,E. and Usdin,K. ( (2000) ) Alleviating transcript insufficiency caused by Friedreich's ataxia triplet repeats. Nucleic Acids Res., , 28, , 49304937.
[Abstract/Free Full Text] - Kim,C., Snyder,R.O. and Wold,M.S. ( (1992) ) Binding properties of replication protein A from human and yeast cells. Mol. Cell. Biol., , 12, , 30503059.
[Abstract/Free Full Text] - Suen,I., Rhodes,J.N., Christy,M., McEwen,B., Gray,D.M. and Mitas,M. ( (1999) ) Structural properties of Friedreich's ataxia d(GAA) repeats. Biochim. Biophys. Acta, , 1444, , 1424.[Medline]
- LeProust,E.M., Pearson,C.E., Sinden,R.R. and Gao,X. ( (2000) ) Unexpected formation of parallel duplex in GAA and TTC trinucleotide repeats of Friedreich's ataxia. J. Mol. Biol., , 302, , 10631080.[CrossRef][Web of Science][Medline]
- Krasilnikova,M.M. and Mirkin,S.M. ( (2004) ) Replication stalling at Friedreich's ataxia (GAA)n repeats in vivo. Mol. Cell. Biol., , 24, , 22862295.
[Abstract/Free Full Text] - Burhans,W.C., Vassilev,L.T., Caddle,M.S., Heintz,N.H. and DePamphilis,M.L. ( (1990) ) Identification of an origin of bidirectional DNA replication in mammalian chromosomes. Cell, , 62, , 955965.[CrossRef][Web of Science][Medline]
- Anderson,S. and DePamphilis,M.L. ( (1979) ) Metabolism of Okazaki fragments during simian virus 40 DNA replication. J. Biol. Chem., , 254, , 1149511504.
[Abstract/Free Full Text] - Filla,A., De Michele,G., Cavalcanti,F., Pianese,L., Monticelli,A., Campanella,G. and Cocozza,S. ( (1996) ) The relationship between trinucleotide (GAA) repeat length and clinical features in Friedreich ataxia. Am. J. Hum. Genet., , 59, , 554560.[Web of Science][Medline]
- Hashem,V.I., Klysik,E.A., Rosche,W.A. and Sinden,R.R. ( (2002) ) Instability of repeated DNAs during transformation in Escherichia coli. Mutat. Res., , 502, , 3946.[Web of Science][Medline]
- Gomes-Pereira,M., Bidichandani,S.I. and Monckton,D.G. ( (2004) ) Analysis of unstable triplet repeats using small pool polymerase chain reaction. In Kohwi,Y. (ed.), Methods in Molecular Biology, Humana Press, Totowa, NJ, USA, Vol. 227, pp. 6176.
- Jeffreys,A.J., Tamaki,K., Mac Leon,A., Monckton,D.G., Neil,D.L. and Armour,J.A.L. ( (1994) ) Complex gene conversion events in germline mutation at human minisatellites. Nature Genet., , 6, , 136145.[CrossRef][Web of Science][Medline]
- Sakamoto,N., Larsson,J.E., Iyer,R.R., Montermini,L., Pandolfo,M. and Wells,R.D. ( (2001) ) GGATCC-interrupted triplets in long GAATTC repeats inhibit the formation of triplex and sticky DNA structures, alleviate transcription inhibition, and reduce genetic instabilities. J. Biol. Chem., , 276, , 2717827187.
[Abstract/Free Full Text] - Monckton,D.G., Wong,L.C., Ashizawa,T. and Caskey,T. ( (1995) ) Somatic mosaicism, germline expansions, germline reversions and intergenerational reductions in myotonic dystrophy males: small pool PCR analyses. Hum. Mol. Genet., , 4, , 18.[Web of Science][Medline]
- Ashizawa,T., Mockton,D.G., Vaishnav,S., Patel,B.J., Voskova,A. and Caskey,T. ( (1996) ) Instability of the expanded (CTG)n repeats in the myotonin protein kinase gene in cultured lymphoblastoid cell lines from patients with myotonic dystrophy. Genomics, , 36, , 4753.[CrossRef][Web of Science][Medline]
- Martorell,L., Monckton,D.G., Gamez,J., Johnson,K.J., Gich,I., Lopez de Munain,A. and Baiget,M. ( (1998) ) Progression of somatic CTG repeat length heterogeneity in the blood cells of myotonic dystrophy patients. Hum. Mol. Genet., , 7, , 307312.
[Abstract/Free Full Text] - Fortune,M.T., Vassilopoulos,C., Coolbaugh,M.I., Siciliano,M.J. and Monckton,D.G. ( (2000) ) Dramatic, expansion-biased, age-dependent, tissue-specific somatic mosaicism in a transgenic mouse model of triplet repeat instability. Hum. Mol. Genet., , 9, , 439445.
[Abstract/Free Full Text] - Gomes-Pereira,M., Fortune,M.T. and Monckton,D.G. ( (2001) ) Mouse tissue culture models of unstable triplet repeats: in vitro selection for larger alleles, mutational expansion bias and tissue specific, but no association with cell division rates. Hum. Mol. Genet., , 10, , 845854.
[Abstract/Free Full Text] - Martorell,L., Monckton,D.G., Sanchez,A., Lopez de Munain,A. and Baiget,M. ( (2001) ) Frequency and stability of the myotonic dystrophy type 1 premutation. Neurology., , 56, , 328335.
[Free Full Text] - Iyer,R.R. and Wells,R.D. ( (1999) ) Expansion and deletion of triplet repeat sequences in Escherichia coli on the leading strand of DNA replication. J. Biol. Chem., , 274, , 38653877.
[Abstract/Free Full Text] - Mornet,E., Chateau,C., Hirst,M.C., Thepot,F., Taillandier,A., Cibois,O. and Serre,J.L. ( (1996) ) Analysis of germline variation at the FMR1 CGG repeat shows variation in the normal-premutated borderline range. Hum. Mol. Genet., , 5, , 821825.
[Abstract/Free Full Text] - Clark,R.M., Dalgliesh,G.L., Endres,D., Gómez,M., Taylor,J. and Bidichandani,S.I. ( (2004) ) Expansion of GAA triplet repeats in the human genome: unique origin of the FRDA mutation at the center of an Alu. Genomics, , 83, , 373383.[CrossRef][Web of Science][Medline]
This article has been cited by other articles:
![]() |
E. Soragni, D. Herman, S. Y. R. Dent, J. M. Gottesfeld, R. D. Wells, and M. Napierala Long intronic GAA*TTC repeats induce epigenetic changes and reporter gene silencing in a molecular model of Friedreich ataxia Nucleic Acids Res., November 1, 2008; 36(19): 6056 - 6065. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Wells DNA triplexes and Friedreich ataxia FASEB J, June 1, 2008; 22(6): 1625 - 1634. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. M. Pollard, R. L. Bourn, and S. I. Bidichandani Repair of DNA double-strand breaks within the (GAA*TTC)n sequence results in frequent deletion of the triplet-repeat sequence Nucleic Acids Res., February 2, 2008; 36(2): 489 - 500. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. M. Pollard, Y. K. Chutake, P. M. Rindler, and S. I. Bidichandani Deficiency of RecA-dependent RecFOR and RecBCD pathways causes increased instability of the (GAA{middle dot}TTC)n sequence when GAA is the lagging strand template Nucleic Acids Res., November 29, 2007; 35(20): 6884 - 6894. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. M. Krasilnikova, M. L. Kireeva, V. Petrovic, N. Knijnikova, M. Kashlev, and S. M. Mirkin Effects of Friedreich's ataxia (GAA)n{middle dot}(TTC)n repeats on RNA synthesis and stability Nucleic Acids Res., February 28, 2007; 35(4): 1075 - 1084. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Singh, L. Zheng, V. Chavez, J. Qiu, and B. Shen Concerted Action of Exonuclease and Gap-dependent Endonuclease Activities of FEN-1 Contributes to the Resolution of Triplet Repeat Sequences (CTG)n- and (GAA)n-derived Secondary Structures Formed during Maturation of Okazaki Fragments J. Biol. Chem., February 9, 2007; 282(6): 3465 - 3477. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. M. Rindler, R. M. Clark, L. M. Pollard, I. De Biase, and S. I. Bidichandani Replication in mammalian cells recapitulates the locus-specific differences in somatic instability of genomic GAA triplet-repeats Nucleic Acids Res., December 4, 2006; 34(21): 6352 - 6361. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||







(large rectangle) replicates the leading strand at the advancing fork. Pol

