| Nucleic Acids Research | Pages |
Conserved DNA sequences adjacent to chromosome fragmentation and telomere addition sites in Euplotes crassus
Introduction
Materials And Methods
Cell culture and DNA isolation
Southern hybridizations and other molecular biological methods
DNA sequencing
Analysis of DNA sequences
Polymerase chain reaction
Results
Analysis of subtelomeric regions of macronuclear DNA
Micronuclear chromosome fragmentation sites
A second macronuclear-destined sequence overlaps the micronuclear TelRNA gene
Discussion
A refined model of chromosome fragmentation/telomere addition
Implications for telomere addition
Acknowledgements
References
Conserved DNA sequences adjacent to chromosome fragmentation and telomere addition sites in Euplotes crassus
DDBJ/EMBL/GenBank accession nos§
ABSTRACT
INTRODUCTION
All ciliated protozoa that have been examined at the molecular level have been found to undergo developmentally controlled chromosome fragmentation. This is possible, in part, because of the nuclear dimorphism displayed by this group of organisms (reviewed in 1-3), in which cells contain both micronuclei and macronuclei. The micronucleus contains conventional chromosomes, but is transcriptionally inert during asexual reproduction. The second nucleus, the macronucleus, is responsible for transcription during vegetative growth. It possesses a polyploid genome of subchromosomal DNA molecules that represent a subset of the micronuclear genome. In the hypotrich group (e.g. Euplotes, Oxytricha and Stylonychia) the macronuclear DNA molecules are linear, very short (~2 kb on average) and generally contain single coding regions. The ends of the macronuclear DNA molecules are capped by telomeres, which consist of 5[prime]-CCCCAAAA-3[prime]/3[prime]-GGGGTTTT-5[prime] (C4A4 or G4T4) repeats in the hypotrichs (4).
The macronucleus is destroyed during sexual reproduction and regenerated from a copy of the micronuclear genome. The process begins with cell pairing (conjugation), followed by micronuclear meiosis. Haploid products are exchanged between the paired cells and fuse with a resident haploid nucleus to form the zygotic nucleus. The zygotic nucleus then undergoes one or more mitotic divisions. Some division products are retained as micronuclei, while others undergo a DNA rearrangement process that culminates in the formation of a new macronucleus. In hypotrichs, macronuclear development begins with multiple rounds of DNA replication that result in the formation of polytene chromosomes. During this period, thousands of interstitial DNA segments, termed IESs (internal eliminated sequences), are excised from the chromosomes with rejoining of flanking DNA. IESs consist of both short (<600 bp) unique DNA sequences and large families of transposable elements (Tec elements) in Euplotes crassus (2,5). Once the polytene chromosomes have fully formed, they are fragmented to generate the individual macronuclear DNA molecules. Telomeric repeat sequences are added to the ends of the liberated DNA molecules by telomerase (6). In E.crassus, the telomeres are initially synthesized in an extended form (7-9) and subsequently trimmed later in development (10). The final stages of macronuclear development involve additional rounds of DNA replication that result in a mature macronucleus.
The chromosome fragmentation/telomere addition process has been studied at the molecular level in only a few ciliate species. It is generally a very reproducible process, but most ciliates that have been examined display some heterogeneity with regard to the exact position at which telomeric repeats are added (reviewed in 1). The sole known exception is E.crassus, where chromosome fragmentation/telomere addition reproducibly occurs at the same nucleotide position (11). Sequence elements controlling chromosome fragmentation/telomere addition have been best characterized in Tetrahymena thermophila. A 15 bp sequence termed the Cbs (chromosome breakage sequence) resides in the developmentally eliminated DNA flanking macronuclear-destined sequences (12) and has been shown to be necessary and sufficient to direct chromosome fragmentation/telomere addition (13). However, the Cbs element is found only in Tetrahymena and closely related species (14). Candidate cis-acting sequences have not been identified for most other ciliates, perhaps because of the observed heterogeneity in the process.
For E.crassus, previous studies have suggested a candidate sequence element that may direct chromosome fragmentation/telomere addition. First, bulk sequencing of total macronuclear DNA termini using the chemical sequencing method has provided evidence that many macronuclear DNA molecules share the sequence element 5[prime]-TTGAA-3[prime] beginning at base 18 following the telomeric repeats (4,15). Sequencing of individual macronuclear DNA molecules has confirmed this, but has also shown that a significant number of termini lack the sequence element (11,16,17). However, sequencing of a limited number of micronuclear chromosome fragmentation sites (i.e. the junctions between macronuclear-destined and micronuclear-limited DNA) suggests that if the 5[prime]-TTGAA-3[prime] element is absent in the macronuclear-destined DNA, it is then found in the flanking micronuclear-limited DNA (11). In these instances, the sequence element is in an inverted orientation and located 12 bp from the position at which telomeric sequences will ultimately be added. A model was proposed to reconcile the different spacing of the sequence element when in micronuclear-limited as opposed to macronuclear-destined DNA (11; Fig.
In the current study we more thoroughly examine sequences near chromosome fragmentation sites. A large sample of macronuclear DNA termini has been analyzed and a more extensive conserved sequence element containing the 5[prime]-TTGAA-3[prime] core has been identified. Additional micronuclear fragmentation sites have also been examined to test the hypothesis that the sequence element may be positioned in either macronuclear-destined or flanking micronuclear-limited DNA. The results of these analyses provide new insights into the chromosome fragmentation/telomere addition process in E.crassus and lead us to suggest a modified model of the process.
MATERIALS AND METHODS
Cell culture and DNA isolation
Euplotes crassus strains X1, X2, ST9 and ST11 were grown in artificial seawater prepared using Reef Crystals (Aquarium Systems, Mentor, OH) and supplemented as described in Roth et al. (19), except that vitamin B12 was omitted. Either the alga Dunaliella salina or the bacterium Escherichia coli were used as the food source. Matings were carried out at 24°C using strains X1 and X2 as previously described (8,19).
Euplotes crassus vegetative total cellular DNA was isolated using established procedures (4). Micronuclear DNA used in PCR reactions was obtained by subjecting total vegetative cellular DNA to electrophoresis in a low melting point agarose gel (Gibco BRL Life Technologies, Gaithersburg, MD). Limit mobility DNA was excised from the gel and used directly for PCR.
Developing macronuclei were isolated and DNA purified as previously described (8,19), except that 3 mM CaCl2 and 50 mM NaCl were used in place of spermidine in nuclear isolation buffers.
Southern hybridizations and other molecular biological methods
Southern blots of agarose gels were prepared using GeneScreen Plus membrane filters (NEN Research Products, Boston, MA) and the alkaline blotting method (20). Hybridizations and washes were carried out as previously described (21,22) using restriction fragments or PCR products labeled with 32P by the random hexamer priming method (23) as probes.
Random clones of macronuclear DNA molecules were isolated from previously described plasmid recombinant libraries (10). Isolation of the micronuclear EC1 clone and identification of the Mac C and Mac J macronuclear-destined sequences has been described (24). Restriction fragments derived from EC1 were used to screen a recombinant macronuclear DNA library (24) in order to isolate clones of the Mac C and Mac J macronuclear DNA molecules.
Other molecular biological methods were performed using established procedures (20).
DNA sequencing
For sequencing PCR products, the reactions were first run on low melting point agarose gels. The specific PCR products were excised from the gels and the DNA purified as described (25). The DNAs were then directly sequenced using the same primers employed in PCR. Sequencing was carried out by the University of Connecticut Health Center Molecular Core facility using the Taq Dideoxy Termination Cycle Sequencing Kit (Perkin Elmer Cetus, Norwalk, CT).
Sequencing of the cloned DNA segments was carried out using Sequenase kits (US Biochemicals, Cleveland, OH). For sequencing of the EC1 micronuclear clone and the Mac J and Mac C macronuclear clones, selected restriction fragments were subcloned into pBS or pKS plasmids (Stratagene, La Jolla, CA). Synthetic oligonucleotide primers (Gibco BRL Life Technologies Inc.) were then used in some cases to extend sequences determined from the vector priming sites.
Analysis of DNA sequences
Conserved sequence elements in macronuclear DNA termini and micronuclear junctions were defined using a previously described statistical method (26). Briefly, sequences in the sample were aligned relative to a reference point, which in this case was either the first subtelomeric base or the chromosome fragmentation/telomere addition site. The overall frequencies of each base (fA, fT, fC and fG), as well as the overall frequencies of each combination of three bases (fATC, fTCG, etc.), were then calculated for the entire sample. The average expected occurrence (expected mean = M) for each base or combination of three bases at each position was next calculated as nfB, where n is the number of sequences in the sample. The expected standard deviation was calculated as the square root of nfB(1 - fB). Positions where the observed occurrence of a base or a combination of three bases exceeded M + 2 SD were considered to be significant.
The sample of macronuclear DNA termini analyzed included the newly sequenced macronuclear DNA ends noted above. In addition, the sample included sequence data from the following previously sequenced termini of macronuclear DNA molecules, most of which contain genes of known function, that were available in GenBank (accession nos are given in parentheses): telomerase RNA (M33461); TLEE (M73025), phosphoglycerate kinase (U97355); V3 (U47679), conZA8 (AF061334); actin (J04533); MACET3 (M63336); [gamma]-tubulin 1 (X85234); [gamma]-tubulin 2 (X85235); ribosomal protein L29 (U13207); histone H3 (U75429); histone H4 (U75430); telomere binding protein (M96818); telomere binding protein homolog (M96819); V2 (M28500); [beta]-tubulin (J04534); V1 (M28500); D7 (M28498); D8 (M28498 and M28499); conF5 (AF063084); conZB1 (U65646). A compilation of the sequence data used in the analysis is available on request.
Using the above method, some single positions displayed non-random base composition for both a single base and a combination of three bases. For the consensus sequences, we have chosen to display significant single bases in the consensus when their frequency is >0.60 in the sample and the three base combination when the frequency of the single base is <0.60. Also included in the consensus are positions within or adjacent to conserved clusters, where a single base was present at a frequency of >0.5. These positions are indicated in lower case.
Polymerase chain reaction
PCR was carried out using either a mixture of Taq polymerase (Gibco BRL Life Technologies Inc.) and Vent polymerase (New England Biolabs Inc., Beverly, MA) or KlenTaq polymerase (Sigma, St Louis, MO), using buffers and reaction conditions supplied by the manufacturers. All reactions were carried out in a MJ Research PTC-150 MiniCycler (Cambridge, MA).
For inverse PCR (27) reactions to isolate sequences flanking the micronuclear TelRNA and RPL29 genes, DNA isolated from developing macronuclei 40 h after mating was used as a substrate. The 40 h DNA was digested with HindIII and then ligated at a concentration of 2.5 ng/µl to foster intramolecular circle formation. Aliquots of 12.5 ng ligated DNA were then used as the substrate in 50 µl PCR reactions.
For inverse PCR of the TelRNA gene, the first step PCR reaction used primers ETEL-1 (5[prime]-CATTTTGAATCCCTGACCTC-3[prime]) and ETEL-2 (5[prime]-CTAAACGCACCTATTGATCG-3[prime]). Five microliters of this PCR reaction were then used as the substrate for a second round of PCR using the ETEL-3 (5[prime]-CCTGACCTCCATTTTTAAGG-3[prime]) and ETEL-4 (5[prime]-TTGTTTGACAGATTTGACGG-3[prime]) oligonucleotides as primers. For the RPL29 gene the first round of PCR employed oligonucleotides r29-1 (5[prime]-TGTCTCATACCGTGCTTTCC-3[prime]) and r29-2 (5[prime]-AGTTCCTCTCGTTGTCAAGG-3[prime]) as primers and the second round used oligonucleotides r29-3 (5[prime]-GTGACCTCTGAGTTTCCTCG-3[prime]) and r29-4 (5[prime]-AGGTGCCTGTGTATTGAGAG-3[prime]). In all cases, 25 cycles of amplification were carried out, with a cycle consisting of 95°C for 1 min, 45 or 46°C for 75 s and 72°C for 135 s.
For PCR amplification of the macronuclear LTEL terminus, 1.5 µg E.crassus strain X1 total cellular DNA were run on a 1.2% low melting point agarose gel and the region of the gel containing DNA molecules ~1.4 kb in size was excised for use as the substrate. The first round of PCR was carried out using oligonucleotides LTEL1 (5[prime]-GGCTTTAAGGTGAATCTGAG-3[prime]) and C4A4 28mer (5[prime]-CCCCAAAACCCCAAAACCCCAAAACCCC-3[prime]) as primers. The second round of PCR employed LTEL2 (5[prime]-CATCATTCTCTCAAGGTAAG-3[prime]) and C4A4 28mer as the primers. Each round of PCR consisted of 25 cycles of 95°C for 1 min, 50°C for 75 s and 72°C for 1 min.
Some of the above two-step PCR strategies resulted in a weak specific product band with additional background PCR products. To enhance specificity and product yield, once candidate PCR products were identified, first round PCR reactions were subjected to electrophoresis on low melting point agarose gels. The region of the gel containing DNA fragments of the size predicted for first round PCR products (usually weak or not visible) was excised and this material was used as the substrate DNA for the second round PCR reactions.
RESULTS
Analysis of subtelomeric regions of macronuclear DNA
The analysis of macronuclear subtelomeric sequences examined a total sample of 100 macronuclear DNA termini. Forty two of the terminal sequences were from previously characterized macronuclear DNA molecules that are available in GenBank (Materials and Methods). To increase the sample size, one or both termini of an additional 33 randomly selected clones of E.crassus macronuclear DNA molecules were sequenced. These new terminal sequences have been deposited in GenBank under accession nos AF068575-AF068614, AF069914-AF069917, AF069919-AF069931 and AF077371.
A previously described statistical method (26) was employed to identify conserved sequence elements near the ends of macronuclear DNA molecules. This involved aligning the first 50 subtelomeric bases of the 100 macronuclear DNA ends and then calculating the overall frequency of each base for the entire sample. The first subtelomeric base is operationally defined as the one following the last block of four C residues in the 5[prime]-C4A4-3[prime] telomeric repeats. Based on the overall base composition, an expected mean occurrence and standard deviation for each of the bases at any given position, as well as for all possible combinations of three different bases (ATG, TGC, GCA and CAT), was calculated (analyses of all possible combinations of two bases were also performed, but provided little additional information and will not be discussed). Subtelomeric positions where the actual (observed) frequency of a particular base or a combination of three bases exceeded M + 2 SD were considered to have significant deviations from random base composition. The analysis of three base combinations, in effect, provides a means of assessing whether single bases are under-represented at a particular position. It should also be noted that this statistical approach compensates for the overall AT-richness of the subtelomeric sequences (the sample of 100 termini had the following base composition: A = 0.40,T = 0.35, C = 0.10 and G = 0.15).
Figure
Figure 1. Analysis of the subtelomeric sequences of macronuclear DNA molecules. Histograms display the results of statistical analyses of base composition at each subtelomeric position for the entire data set of macronuclear termini (A) (n = 100; Materials and Methods) and for the selected macronuclear DNA termini (B) (n = 62). Black bars are shown only at those positions that have statistically significant deviations from random base composition for single bases (A, C, G or T). Open bars are shown at positions where a combination of three bases (ATC = H, TCG = B, CGA = V or GAT = D) displayed a statistically significant deviation from random base composition, except for those positions where a statistically significant single base whose frequency was >0.60 was also present. The overall base composition for the 100 termini analyzed in (A) was: A = 0.40, T = 0.35, C = 0.10, G = 0.15. Bases or combinations of three bases had to be present at a particular position in excess of the following values to be considered significant: A > 49, T > 44, C > 16, G > 21, H > 92, B > 69, V > 74 and D > 95. For the sequences analyzed in (B), the overall base composition was A = 0.40, T = 0.36, C = 0.10, G = 0.14. In this case, bases or combinations of three bases had to be present at a particular position in excess of the following values to be considered significant: A > 32, T > 29, C > 11, G > 14, H > 58, B > 45, V > 47 and D > 60. A second cluster of positions that has significant deviations from random base composition is positioned adjacent to the telomere. The first subtelomeric base is rarely C (fC at position1 = 0.02) and positions 2-4 all show a moderate bias for T. The third cluster of significant positions extends from subtelomeric position 16 to 24. It centers on the 5[prime]-TTGAA-3[prime] motif previously identified as a conserved element at the ends of macronuclear DNA molecules (4,15). The overall consensus sequence that can be deduced is 5[prime]-a0.53V0.79T0.85T0.65G0.72-A0.73A0.63NH0.93-3[prime] (V = ACG, H = ACT and N = ACGT; subscripts denote the observed frequency of the base or combination of bases at the given position in the sample). Inspection of individual sequences for strong matches to the well-conserved core 5[prime]-TTGAA-3[prime] sequence (i.e. matches at four of the five positions) produced two intriguing observations. First, 62 of the 100 macronuclear termini contained strong matches to the core beginning at subtelomeric position 18. A significant number of the remaining macronuclear DNA termini (16 of 38) that lack a strong match to the E-Cbs core beginning at position 18 displayed a strong match to the core sequence positioned 1-3 bp closer to the chromosome fragmentation/telomere addition site. Only two termini had a strong match to the core sequence located 1-3 bp more distal of the usual placement. This suggests that the sequence element can sometimes be positioned closer to the telomere of macronuclear DNA molecules. Second, for the macronuclear DNA molecules where the subtelomeric sequences of both ends were known, one could find instances where the core was present at both ends, neither end, only the end 5[prime] of the coding region or only the end 3[prime] of the coding region. This is significant, as the conserved sequence element could potentially play a role in transcription, translation, DNA replication or chromosome fragmentation/telomere addition. The fact that the sequence element exists at the 5[prime]- and/or 3[prime]-ends of coding regions lessens the likelihood that it is involved in transcriptional or translational control. While it remains formally possible that the sequence element plays a role in DNA replication, we consider it more likely to play a role in directing chromosome fragmentation/telomere addition and will refer to the element as the Euplotes chromosome breakage sequence (E-Cbs). The above considerations indicated that a mixed sample was being considered in the original analysis of the 100 macronuclear DNA termini, i.e. the sample contained ends with the E-Cbs core beginning at position 18, as well as ends lacking the E-Cbs or having it in a slightly different position. As a result, we carried out the same type of analysis on the 62 macronuclear termini possessing a strong match to the E-Cbs core beginning at subtelomeric position 18 to determine if a more robust consensus could be derived from this selected sample (Fig.
Micronuclear chromosome fragmentation sites
Even allowing for some variation in the position of the E-Cbs, there are numerous macronuclear DNA termini that lack reasonable matches to the consensus. It was previously suggested (11) that for these cases, the same sequence element exists in the flanking DNA in the micronucleus in inverted orientation, but is positioned six bases closer to the site of telomere addition. To investigate this further, we sequenced eight new micronuclear regions containing chromosome fragmentation/telomere addition sites (junction sequences), as well as compiled previously determined junction sequences available in GenBank.
The first approach to obtaining new junctions involved sequencing portions of a previously characterized cloned segment of micronuclear DNA (clone EC1; 24), as well as the termini of two macronuclear DNA molecules (Mac J and Mac C) that are derived from this region of the micronuclear genome. The sequencing analysis is outlined in Figure
Figure 2. Restriction maps of a portion of micronuclear clone EC1 and two macronuclear DNA molecules (Mac J and Mac C) derived from this region of the genome. Sequenced regions are indicated by stippled rectangles and the position of an IES is indicated by a black box. S, SstI; E, EcoRI; H, HindIII; X, XbaI. In the second approach, we specifically sought to characterize micronuclear junctions where the corresponding macronuclear DNA molecule was known to lack the E-Cbs at one end. This was accomplished using inverse PCR (27) to determine the micronuclear junctions for the two previously characterized macronuclear chromosomes that carry the telomerase RNA (TelRNA; 29) and ribosomal protein L29 (RPL29; 17) genes. The inverse PCR strategy for the TelRNA gene is outlined in Figure We obtained a 1.6 kb PCR product for the TelRNA gene from the two-step inverse PCR procedure (Fig. Figure Figure 3. Inverse PCR of the chromosomal copy of the TelRNA gene. (A) The inverse PCR strategy is diagrammed, beginning with a representation of the micronuclear TelRNA gene (open rectangle; any IESs present are not shown) and flanking micronuclear DNA (lines). Positions of oligonucleotides used as PCR primers are indicated below the maps, with the arrows denoting the direction in which they prime DNA synthesis. (B) Agarose gel displaying the PCR product obtained from the TelRNA inverse PCR procedure (lane PCR), along with size markers (lane M). Sizes of selected marker DNAs are indicated in kb. (C) Restriction map of the 1.6 kb PCR product obtained using the inverse PCR strategy for the TelRNA gene. TelRNA macronuclear-destined sequences are indicated as open boxes. The location of the LTEL macronuclear-destined sequences are also indicated (shaded box), along with locations of PCR primers (LTEL1 and LTEL2) used in later analyses. A statistical analysis of the micronuclear junction sequences using the same procedure employed for the macronuclear DNA termini was also carried out (data not shown). In this case we aligned all the junction sequences relative to the 5[prime]-TTGAA-3[prime] core (i.e. junctions with the E-Cbs in flanking micronuclear DNA were inverted for the analysis and junctions where the core began at position 17 were shifted by 1 bp). The results again indicated that there is a cluster of positions with significant deviationsfrom random base composition with the consensus V0.96T0.87T0.87G1.00A0.91A0.83NA0.61. This consensus sequence is quite similar to that obtained for the samples of total and selected macronuclear DNA ends (Fig. Figure 4. Micronuclear junction sequences. The first eight junction sequences shown are those determined in this study, while the remainder were published previously. The positions of the chromosome fragmentation/telomere sites are indicated and macronuclear-destined sequences are generally shown to the right. Matches to the 5[prime]-TTGAA-3[prime] E-Cbs core are double underlined and flags denote cases where the E-Cbs core begins at position 17, as opposed to the more typical position 18, when positioned in macronuclear-destined DNA. In the sequence names, -5[prime] and -3[prime] denote the junctions upstream and downstream of coding regions in macronuclear-destined sequences, where known. In cases where coding regions have not been identified, -l and -r denote the left and right junctions of macronuclear destined sequences relative to previously published maps. For the two junctions where overlapping macronuclear-destined sequences are present, the name of the second macronuclear DNA molecule is given in parentheses. The full names of the genes present in the macronuclear-destined sequences of the junctions and their GenBank accession numbers or appropriate reference sources are as follows: Mac C (AF072706 and AF072708); Mac J (AF072707 and AF072708); TelRNA (telomerase RNA, AF061108 and AF061109); RPL29 (ribosomal protein L29, AF061111 and AF061112); V2 and V1 (M28500); V3 (protein kinase, M28500); EFA1 (elongation factor-1[alpha]-1; 41); EFA2 (elongation factor-1[alpha]-2; 41); Ec2 (M73022 and M73023); D8 (M28498 and M28499); D7 (M28498); conZA8 (AF063083). Finally, we would like to note two additional aspects of the known micronuclear junctions. First, the V2-r junction contains the poorest match to the E-Cbs (4/10 positions overall and only 3/5 positions in the core; Fig.
A second macronuclear-destined sequence overlaps the micronuclear TelRNA gene
In the micronuclear genome, adjacent macronuclear-destined sequences are typically separated by a developmentally eliminated spacer DNA segment (18; this study). However, in E.crassus, one pair of adjacent macronuclear-destined sequences has been found to overlap by 6 bp (18). To determine if other macronuclear-destined DNA sequences were located adjacent to the TelRNA and RPL29 genes in the micronucleus, Southern hybridizations to macronuclear DNA were carried out using restriction fragments from the inverse PCR products as probes. A 310 bp NdeI restriction fragment from the TelRNA inverse PCR product (Fig.
Figure 5. Analyses of the LTEL macronuclear-destined sequence. (A) Autoradiographs of Southern hybridizations of the 0.3 kb NdeI and 0.5 kb HindIII fragments of the 1.6 kb inverse PCR product from the micronuclear TelRNA locus to E.crassus whole cell DNA. (B) Agarose gel displaying the product obtained from ~1.4 kb E.crassus macronuclear DNA after two cycles of PCR using primers LTEL1 + C4A4 28mer and LTEL2 + C4A4 28mer (lane Mac-end PCR), along with DNA size markers (lane M). (C) Agarose gel displaying the product obtained from E.crassus micronuclear DNA after two cycles of PCR using primers ETEL2 + LTEL1 and ETEL4 + LTEL2 (lane Mic PCR), along with DNA size markers (lane M). Sizes of selected marker DNA fragments, hybridizing DNA fragments and PCR products are shown in kb. (D) Comparison of the micronuclear sequence of the TelRNA gene and its left-flanking region (Mic.) with the termini of the TelRNA and LTEL macronuclear DNA molecules (TelRNA Mac. and LTEL Mac., respectively). The telomeric repeats at the 5[prime]- and 3[prime]-ends of the macronuclear DNA molecules are abbreviated as C4A4 and G4T4, respectively, and the region matching the E-Cbs core is underlined. The sequence of the macronuclear LTEL terminus is deposited in GenBank under accession no. AF061110. Figure 6. (A) Model of chromosome fragmentation and de novo telomere formation in E.crassus. A stretch of micronuclear DNA containing overlapping macronuclear-destined sequences (Mac A and Mac B) is shown at the top in association with proteins that interact with the E-Cbs (blue oval) and carry out chromosome fragmentation (yellow circle). Mac A and Mac B macronuclear-destined sequences are shown in blue and red, respectively, except for the 6 bp shared between them (black) and the core of the E-Cbs is underlined. Bases synthesized during the chromosome fragmentation/telomere addition process are shown in green. See text for additional details of the model. (B) De novo telomere addition by telomerase. Telomerase is represented as a green oval with a portion of telomerase RNA shown in red with the telomeric repeat template region (5[prime]-CCCCAAAA-3[prime]) highlighted. At the left, telomerase is shown interacting with a newly generated terminus of a macronuclear-destined sequence (blue lettering) where the 3[prime]-terminus does not have a G residue (H). De novo telomere synthesis initiates by adding four G residues followed by four T residues. At the right, the 3[prime]-end of the macronuclear-destined DNA has a G residue that aligns with part of the template of telomerase RNA. In this situation, telomere synthesis begins by adding three G residues followed by four T residues. To determine the position of the chromosome fragmentation/telomere addition site that generates LTEL, we employed a procedure termed MAC-end PCR (33) to isolate a terminus of the LTEL macronuclear DNA molecule. MAC-end PCR takes advantage of the fact that all macronuclear DNA molecules have telomeric repeats at their ends, so that a C4A4 oligonucleotide can serve as a universal PCR primer. A PCR reaction using gel-purified 1.4 kb macronuclear DNA molecules from E.crassus strain X1 as substrate was first carried out using LTEL1 (Fig. Because the `micronuclear' sequence of the TelRNA/LTEL region was obtained from a PCR product generated from 40 h developing macronuclear DNA, it remained formally possible that an IES separated the TelRNA and LTEL macronuclear-destined sequences. To test for this possibility, a step-in PCR procedure was used to PCR amplify across the TelRNA/LTEL junction directly from E.crassus strain X1 micronuclear DNA. The first round of PCR employed oligonucleotides ETEL2 and LTEL1 as the primers, while the second round used oligonucleotides ETEL4 and LTEL2 (Fig.
See Corrigendum.
DISCUSSION
In this study we have examined a large number of macronuclear DNA termini, as well as junction sequences in micronuclear DNA. Our results provide a number of new insights into the E.crassus chromosome fragmentation/telomere addition process, as well as substantiate and clarify previous observations. First, through analyzing 100 macronuclear DNA termini, we have presented strong evidence for a conserved DNA element which we term the E-Cbs. The E-Cbs is found at one or both ends of most macronuclear DNA molecules, most typically with the highly conserved 5[prime]-TTGAA-3[prime] core beginning at subtelomeric position 18. By selecting and analyzing the subset of macronuclear DNA termini that show a strong match to the core, we have deduced an E-Cbs consensus sequence of H0.95A0.69T0.95T0.90G0.98A0.95-A0.76a0.50H0.97H0.95 for subtelomeric positions 16-25. This sequence is generally similar to a previously derived consensus (11), but the current statistically based analysis of a much larger sample indicates to us that the newly derived consensus provides the better representation of the E-Cbs.
Second, although most macronuclear DNA termini possess the E-Cbs, there is a significant minority that lacks it. By analyzing micronuclear junction sequences, we have found that strong matches to the E-Cbs can be found in the flanking micronuclear DNA in all cases where an E-Cbs is absent from the terminus of the macronuclear-destined sequence (Fig.
The E-Cbs sequence is intriguing in that a very similar conserved sequence element resides within the terminal inverted repeat of Tec transposon IESs and at one end of the short IESs of E.crassus (5 and references therein). This has led to the suggestion that there may be a link between chromosome fragmentation and IES excision (e.g. a shared interacting protein; 5 includes a discussion of this issue). With regard to protein recognition, studies of both macronuclear chromatin and naked macronuclear DNA indicate that the 5[prime]-TTGAA-3[prime] core of the E-Cbs contains DNase I-hypersensitive sites and is particularly sensitive to DMS methylation (15). This suggests that the element has an alternative DNA structure, which might aid in protein recognition.
It should be emphasized, however, that even the extended E-Cbs consensus does not appear to contain sufficient information to restrict chromosome fragmentation to all appropriate positions in the genome during macronuclear development. We have carried out computer searches for the E-Cbs and have found strong matches at internal regions of mature macronuclear DNA molecules. Since these sites are not subject to fragmentation, this implies that other features must be involved in specifying chromosome fragmentation. This could include a second sequence element that was not identified in our analyses. Alternatively, the sequence context of the E-Cbs may be important, and the overall AT-richness of the subtelomeric regions is noteworthy in this regard. Also, recent studies suggest that alternative chromatin structures are involved in the DNA rearrangement processes of macronuclear development in both E.crassus (34) and Tetrahymena (35). It is possible that some E-Cbs-like sequences in the genome are masked from the fragmentation machinery by virtue of their chromatin structures.
Finally, the E-Cbs shares little sequence similarity with the 15 bp Cbs element (5[prime]-AAAGAGGTTGGTTTA-3[prime]) in Tetrahymena that has been shown to be required for chromosome fragmentation/telomere addition (reviewed in 1). Moreover, in contrast to the E-Cbs, the Tetrahymena Cbs always resides within flanking, eliminated micronuclear DNA and appears to work in an orientation-independent manner. This suggests that there may be significant differences in the chromosome fragmentation processes of these two distantly related ciliates. With regard to other hypotrichs, our results are probably relevant to other Euplotes species, as many macronuclear DNA termini in E.aediculatus, E.octocarinatus and E.eurystomus possess at least the core of the E-Cbs at the same position as E.crassus (4; GenBank entries). For more distantly related hypotrichous ciliates, such as Stylonychia and Oxytricha spp., one can often find reasonable matches to the E-Cbs core near fragmentation sites and some of these are positioned as in E.crassus. Moreover, there is one case of overlapping macronuclear-destined sequences in Oxytricha nova, where the overlap can be viewed as being 6 bp (36). However, the observed heterogeneity in the fragmentation/telomere addition process in these more distantly related hypotrichs (reviewed in 1) greatly complicates the interpretation of these data and makes it unclear if the E-Cbs is a relevant feature for hypotrichs other than Euplotes.
A refined model of chromosome fragmentation/telomere addition
Our results generally support a previously proposed model of chromosome fragmentation/telomere addition in E.crassus (11). A refined version of the model is presented in Figure
A key virtue of the model is that the proposed 6 bp staggered cut explains the two observed positions of the E-Cbs, relative to the telomere addition site, in macronuclear-destined as opposed to flanking micronuclear DNA. As can be seen in Figure
Finally, while the model shown in Figure
Implications for telomere addition
The analysis of macronuclear DNA termini also revealed a non-random base composition at the first four subtelomeric positions: the first position is almost never a C residue, while positions 2-4 have a significant bias for T residues (Fig.
The strong bias against having a C at the first subtelomeric position, which corresponds to an absence of G on the opposite strand, is likely a reflection of the mode of action of the E.crassus telomerase. In vitro, telomerase from both vegetative cells and cells undergoing macronuclear development efficiently adds telomeric repeats to the 3[prime]-ends of oligonucleotides consisting of the telomeric repeat sequence (39,40). The developmental, but not the vegetative, telomerase will also add telomeric repeats to single-stranded oligonucleotides that have a single telomeric repeat at the 5[prime]-end but non-telomeric DNA at the 3[prime]-end (39,40). The latter situation somewhat mimics the type of de novo telomeric repeat addition that must occur on the ends generated by chromosome fragmentation. Melek et al. (40) have shown that the developmental telomerase usually begins telomere addition onto non-telomeric 3[prime]-ends by adding four G residues, followed by four T residues (Fig.
This type of model also helps explain the subset of macronuclear DNA termini containing the E-Cbs sequence located 1-3 bp closer to the telomere. In our analysis of micronuclear junctions, four junctions had the E-Cbs shifted 1 bp closer to the telomere (Fig.
ACKNOWLEDGEMENTS
We thank Tom Doak and Dorothy Shippen for their insightful comments on models of chromosome fragmentation/telomere addition and Michelle Kelley for technical assistance. This work was supported by grant 4737 from The Council for Tobacco Research to L.A.K., National Science Foundation grant MCB-9319009 to C.L.J. and National Institutes of Health grant GM41803 to C.M.P.
REFERENCES
This article has been cited by other articles:
This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals
Comments and feedback: www-admin{at}oup.co.uk
Last modification: 29 Aug 1998
Copyright©Oxford University Press, 1998.
![]()
CiteULike
Connotea
Del.icio.us What's this?
![]()
![]()

![]()
![]()
![]()
A. Le Mouel, A. Butler, F. Caron, and E. Meyer
Developmentally Regulated Chromosome Fragmentation Linked to Imprecise Elimination of Repeated Sequences in Paramecia
Eukaryot. Cell,
October 1, 2003;
2(5):
1076 - 1090.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
M. E. Jacobs, A. Sanchez-Blanco, L. A. Katz, and L. A. Klobutcher
Tec3, a New Developmentally Eliminated DNA Element in Euplotes crassus
Eukaryot. Cell,
February 1, 2003;
2(1):
103 - 114.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
S. Ray, Z. Karamysheva, L. Wang, D. E. Shippen, and C. M. Price
Interactions between Telomerase and Primase Physically Link the Telomere and Chromosome Replication Machinery
Mol. Cell. Biol.,
August 15, 2002;
22(16):
5859 - 5868.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
M. Mollenbeck and L. A. Klobutcher
De novo telomere addition to spacer sequences prior to their developmental degradation in Euplotes crassus
Nucleic Acids Res.,
January 15, 2002;
30(2):
523 - 531.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
D. M. Prescott and S. J. Dizick
A unique pattern of intrastrand anomalies in base composition of the DNA in hypotrichs
Nucleic Acids Res.,
December 1, 2000;
28(23):
4679 - 4688.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
N. P. Robinson, N. Burman, S. E. Melville, and J. D. Barry
Predominance of Duplicative VSG Gene Conversion in Antigenic Variation in African Trypanosomes
Mol. Cell. Biol.,
September 1, 1999;
19(9):
5839 - 5846.
[Abstract]
[Full Text]
[PDF]
![]()
This Article ![]()
![]()
Abstract
![]()
Print PDF (392K)
![]()
Alert me when this article is cited
![]()
Alert me if a correction is posted
![]()
Services ![]()
![]()
Email this article to a friend
![]()
Similar articles in this journal
![]()
Similar articles in ISI Web of Science
![]()
Similar articles in PubMed
![]()
Alert me to new issues of the journal
![]()
Add to My Personal Archive
![]()
Download to citation manager
![]()
Search for citing articles in:
ISI Web of Science (27)
![]()
Request Permissions ![]()
Commercial Re-use Guidelines
for Open Access NAR Content
![]()
Google Scholar ![]()
![]()
Articles by Klobutcher, L. A.
![]()
Articles by Jahn, C. L.
![]()
Search for Related Content
![]()
PubMed ![]()
![]()
PubMed Citation
![]()
Articles by Klobutcher, L. A.
![]()
Articles by Jahn, C. L.
![]()
Social Bookmarking ![]()
![]()
What's this?