Enhanced concatemer cloning-a modification to the SAGE (Serial Analysis of Gene Expression) technique Acknowledgement References
Enhanced concatemer cloning-a modification to the SAGE (Serial Analysis of Gene Expression) technique
Enhanced concatemer cloning-a modification to the SAGE (Serial Analysis of Gene Expression) technique
J. Powell*
The Richard Dimbleby Department of Cancer Research, I.C.R.F. Laboratory, Rayne Institute, 4th Floor Lambeth Wing, St Thomas's Hospital, Lambeth Palace Road, London SE1 7EH, UK
Received April 22, 1998;Revised and Accepted June 5, 1998
ABSTRACT
The Serial Analysis of Gene Expression (SAGE) method, described in 1995 by Velculescu et al., represents a powerful means to compare gene expression between two mRNA populations. An improvement to SAGE that removes contaminating linker molecules, which compromise the efficiency of the method, has been developed. This modification utilises biotinylated PCR primers, which generate biotinylated linkers at an early stage in the SAGE protocol, thus allowing removal of the unwanted linkers by binding to streptavidin-coated magnetic beads at a later stage. The application of this modification resulted in the rapid generation of high ditag yields and clones with large average insert sizes.
The Serial Analysis of Gene Expression (SAGE) method (1) generates short sequence tags which are positionally located within the cDNA molecule from which they are derived. This allows specific detection of that cDNA from a large number of different transcripts. The tags are generated as dimers, or ditags, and are ligated together to form concatemers which are then cloned. Sequencing the clones allows over 30 individual tags to be read from each lane of an automated sequencing gel. The abundance of a particular tag relates directly to the expression level of the gene from which it is derived. This serial analysis of many thousands of gene specific tags allows the simultaneous accumulation of information from genes expressed in the tissue of interest and gives rise to an expression profile of that tissue (1-5).
During the generation of the ditags linker molecules persist, despite a gel purification step, which have compatible sticky ends enabling them to ligate to the ditags (Fig. 1). This unwanted linker ligation terminates concatenation of that molecule and the reaction is effectively poisoned. The resulting molecule will not have compatible ends with which to clone into the prepared vector, pZero (Invitrogen). This leads to fewer clones being generated which, in turn, contain very few tags per clone. For SAGE to be efficient and to permit the analysis of large numbers of tags, the number of tags per clone must be as high as possible.
Figure1. Diagram showing how contaminating linkers are removed during the modification to the SAGE method. (a) Shown here are the products resulting from Step 8 of the detailed SAGE protocol (7). These are an unwanted 80 bp linker/linker artefact and the desired product, a 102 bp linker/ditag/linker molecule. (b) Bulk PCR reactions are performed using biotinylated primers A and B. Subsequent NlaIII digestion releases linkers and ditags as shown. The unwanted biotinylated linkers are removed by binding to streptavidin magnetic beads. (c) The remaining ditags are free from contaminating linkers and can ligate to form long clonable concatemers.
I describe a modification to the SAGE method which efficiently removes linkers from the ditag concatenation reaction and generates clones with large inserts and high tag numbers. In order to assess directly the improvement that this modification gives, a side by side comparison was carried out between the method I describe below and an alternative protocol which aims to remove the contaminating linkers using a gel-purification step, described by Velculescu et al. (6).
The modification is described as follows: biotinylated PCR primers are used to prepare bulk PCR reactions, (50 × 100 µl) of SAGE ditags, which have been produced exactly as described as in the detailed protocol (7). The 26 bp ditag sample is isolated without (at this stage) making any attempts to minimise linker contamination. As the linkers are now biotinylated, due to the PCR primers used in their generation, streptavidin coated magnetic beads (Dynabeads M-280 streptavidin, Dynal, Norway) can be used to extract all the biotinylated DNA as follows: the ditag sample is made up to 100 µl with LoTE (3 mM Tris-HCl pH 7.5, 0.2 mM EDTA pH 7.5). An aliquot of 100 µl 2× binding and washing buffer (10 mM Tris-HCl pH 7.5, 1 mM EDTA, 2.0 mM NaCl) is then added, the sample divided in half and each half added to 100 µl streptavidin beads which have been pre-washed with 1× binding and washing buffer. After mixing, the samples are left at room temperature for 15 min with intermittent gentle agitation. A magnet is then used to immobilise the streptavidin beads/contaminating biotinylated linkers and the supernatant reserved. The beads are washed once with 1× binding and washing buffer and once with LoTE; the supernatant is reserved in each case. The supernatants are then ethanol precipitated and the pellets combined and resuspended in 7 µl LoTE. This purified ditag sample is then used in concatemer and clone formation, resuming the detailed SAGE protocol at step 11 (7).
For each method, the starting material consisted of 50 bulk PCR reactions of 100 µl, generated using biotinylated PCR primers (these were found to perform exactly the same as non-biotinylated primers; data not shown). The two methods were then carried out as described on each bulk PCR sample. Concatemers were generated and cloned. Several parameters were then determined for the SAGE libraries resulting from each method. The results are shown in Table 1.
For optimum performance of SAGE, maximum information is required from each clone sequenced in order to minimise the sequencing load per experiment. The amount of ditags generated is also critical to the cloning outcome, with several hundred nanograms of material being required for successful cloning of large concatenated inserts. Table 1 shows that, relative to the recent revisions of the originators of this technique (6), the method described here gave a greater yield of ditags. In addition, the average clone size was longer and the number of tags per clone was 43% greater, which increases the efficiency of the SAGE method.
These results therefore show that the modification described here represents a rapid and effective means of improving the efficiency of the SAGE method. Further SAGE libraries have been constructed in order to assess the reproducibility of the method. The results from these libraries also are presented in Table 1 and show a further improvement in clone size confirming reproducibility.
Ditags were generated using each protocol as described. After concatemer formation and cloning, 100 clones with inserts were analysed from the SAGE library resulting from each method. [The SAGE method was carried out using NlaIII as the anchoring enzyme and BsmFI as the tagging enzyme and the linkers described by Velculescu et al. (7). A clone insert consists of 226 bp of vector plus concatenated ditags, each of which is 26 bp. Each ditag represents two tags. Therefore, a clone of 616 bp equates to 30 SAGE tags, 226 bp of vector sequence plus 15 × 26 bp ditags.] Ditag yield describes the amount of ditags produced using each method, from a starting material of 50 identical bulk PCR reactions. The average clone insert size is an important parameter as large clone inserts are essential to the efficiency of SAGE. A single automated sequencing run can yield 600-1000 bp of readable sequence. Insert sizes approaching this range are therefore desirable. Experiment 1 describes results when SAGE was carried out using the modification described here. Experiment 2 shows the results obtained using the alternative method described by Velculescu et al. (6). Experiments 3-5 describe results from three further SAGE libraries constructed using the modification described here, showing that the technique gives a reproducible increase in clone size due to the effective removal of the contaminating linkers.
6. Velculescu,V.E., Zhang, L., Vogelstein,B. and Kinzler,K.W. (1997) Serial Analysis of Gene Expression: Detailed Protocol (Version 1.0c), September 1997. Available from Johns Hopkins Oncology Centre and Howard Hughes Medical Institute, 424 North Bond Street, Baltimore, MD 21231, USA; Fax: +1 410 955 0548.
7. Velculescu,V.E., Zhang, L., Vogelstein,B. and Kinzler,K.W. (1997) Serial Analysis of Gene Expression: Detailed Protocol (Version 1.0b), November 1995. Available as above.
J. Khattra, A. D. Delaney, Y. Zhao, A. Siddiqui, J. Asano, H. McDonald, P. Pandoh, N. Dhalla, A.-l. Prabhu, K. Ma, et al. Large-scale production of SAGE libraries from microdissected tissues, flow-sorted cells, and cell lines
Genome Res.,
January 1, 2007;
17(1):
108 - 116.
[Abstract][Full Text][PDF]
J. C. Graff, M. Behnke, J. Radke, M. White, and M. A. Jutila A comprehensive SAGE database for the analysis of {gamma}{delta} T cells
Int. Immunol.,
April 1, 2006;
18(4):
613 - 626.
[Abstract][Full Text][PDF]
K. J. Coyne, J. M. Burkholder, R. A. Feldman, D. A. Hutchins, and S. C. Cary Modified Serial Analysis of Gene Expression Method for Construction of Gene Expression Profiles of Microbial Eukaryotic Species
Appl. Envir. Microbiol.,
September 1, 2004;
70(9):
5298 - 5304.
[Abstract][Full Text][PDF]
A. P. So, R. F. B. Turner, and C. A. Haynes Increasing the efficiency of SAGE adaptor ligation bydirected ligation chemistry
Nucleic Acids Res.,
July 6, 2004;
32(12):
e96 - e96.
[Abstract][Full Text][PDF]
M. Gowda, C. Jantasuriyarat, R. A. Dean, and G.-L. Wang Robust-LongSAGE (RL-SAGE): A Substantially Improved LongSAGE Method for Gene Discovery and Transcriptome Analysis
Plant Physiology,
March 1, 2004;
134(3):
890 - 897.
[Abstract][Full Text][PDF]
K. F. Nolan, V. Strong, D. Soler, P. J. Fairchild, S. P. Cobbold, R. Croxton, J.-A. Gonzalo, A. Rubio, M. Wells, and H. Waldmann IL-10-Conditioned Dendritic Cells, Decommissioned for Recruitment of Adaptive Immunity, Elicit Innate Inflammatory Gene Products in Response to Danger Signals
J. Immunol.,
February 15, 2004;
172(4):
2201 - 2209.
[Abstract][Full Text][PDF]
T. Shiraki, S. Kondo, S. Katayama, K. Waki, T. Kasukawa, H. Kawaji, R. Kodzius, A. Watahiki, M. Nakamura, T. Arakawa, et al. Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage
PNAS,
December 23, 2003;
100(26):
15776 - 15781.
[Abstract][Full Text][PDF]
F. Schwartz, A. Duka, E. Triantafyllidi, C. Johns, I. Duka, J. Cui, and H. Gavras Serial analysis of gene expression in mouse kidney following angiotensin II administration
Physiol Genomics,
December 16, 2003;
16(1):
90 - 98.
[Abstract][Full Text][PDF]
C. Vilain, F. Libert, D. Venet, S. Costagliola, and G. Vassart Small amplified RNA-SAGE: an alternative approach to study transcriptome from limiting amount of mRNA
Nucleic Acids Res.,
March 15, 2003;
31(6):
e24 - e24.
[Abstract][Full Text][PDF]
I. Schwering, A. Brauninger, U. Klein, B. Jungnickel, M. Tinguely, V. Diehl, M.-L. Hansmann, R. Dalla-Favera, K. Rajewsky, and R. Kuppers Loss of the B-lineage-specific gene expression program in Hodgkin and Reed-Sternberg cells of Hodgkin lymphoma
Blood,
February 15, 2003;
101(4):
1505 - 1512.
[Abstract][Full Text][PDF]
J. D. Gottsch, G. D. Seitzman, E. H. Margulies, A. L. Bowers, A. J. Michels, S. Saha, A. S. Jun, W. J. Stark, and S. H. Liu Gene Expression in Donor Corneal Endothelium
Arch Ophthalmol,
February 1, 2003;
121(2):
252 - 258.
[Abstract][Full Text][PDF]
N. Meissner, J. Radke, J. F. Hedges, M. White, M. Behnke, S. Bertolino, M. Abrahamsen, and M. A. Jutila Serial Analysis of Gene Expression in Circulating {gamma}{delta} T Cell Subsets Defines Distinct Immunoregulatory Phenotypes and Unexpected Gene Expression Profiles
J. Immunol.,
January 1, 2003;
170(1):
356 - 364.
[Abstract][Full Text][PDF]
J. J. Dunn, S. R. McCorkle, L. A. Praissman, G. Hind, D. van der Lelie, W. F. Bahou, D. V. Gnatenko, and M. K. Krause Genomic Signature Tags (GSTs): A System for Profiling Genomic DNA
Genome Res.,
November 1, 2002;
12(11):
1756 - 1765.
[Abstract][Full Text][PDF]
W. D. Patino, O. Y. Mian, and P. M. Hwang Serial Analysis of Gene Expression: Technical Considerations and Applications to Cardiovascular Biology
Circ. Res.,
October 4, 2002;
91(7):
565 - 569.
[Abstract][Full Text][PDF]
E. H. Margulies, S. L. R. Kardia, and J. W. Innis Identification and prevention of a GC content bias in SAGE libraries
Nucleic Acids Res.,
June 15, 2001;
29(12):
e60 - e60.
[Abstract][Full Text][PDF]
K. Polyak and G. J. Riggins Gene Discovery Using the Serial Analysis of Gene Expression Technique: Implications for Cancer Research
J. Clin. Oncol.,
June 1, 2001;
19(11):
2948 - 2958.
[Abstract][Full Text][PDF]
S. V. Anisimov, E. G. Lakatta, and K. R. Boheler Discovering altered genomic expression patterns in heart: transcriptome determination by serial analysis of gene expression
Eur J Heart Fail,
June 1, 2001;
3(3):
271 - 281.
[Abstract][Full Text][PDF]
A. Waghray, M. Schober, F. Feroze, F. Yao, J. Virgin, and Y. Q. Chen Identification of Differentially Expressed Genes by Serial Analysis of Gene Expression in Human Prostate Cancer
Cancer Res.,
May 1, 2001;
61(10):
4283 - 4286.
[Abstract][Full Text]
M. J. Hayden and P. J. Sharp Sequence-tagged microsatellite profiling (STMP): a rapid technique for developing SSR markers
Nucleic Acids Res.,
April 15, 2001;
29(8):
e43 - e43.
[Abstract][Full Text][PDF]
B W S Robinson, D J Erle, D A Jones, S Shapiro, W J Metzger, S M Albelda, W C Parks, and A Boylan Recent advances in molecular biological techniques and their relevance to pulmonary research
Thorax,
April 1, 2000;
55(4):
329 - 339.
[Full Text]