Nucleic Acids Research, 2001, Vol. 29, No. 12 e60
© 2001 Oxford University Press
Identification and prevention of a GC content bias in SAGE libraries
1Department of Human Genetics and 2Department of Pediatrics and Communicable Diseases, University of Michigan Medical School and 3Department of Epidemiology, University of Michigan School of Public Health, Ann Arbor, MI 48109, USA
Serial Analysis of Gene Expression (SAGE) is becoming a widely used gene expression profiling method for the study of development, cancer and other human diseases. Investigators using SAGE rely heavily on the quantitative aspect of this method for cataloging gene expression and comparing multiple SAGE libraries. We have developed additional computational and statistical tools to assess the quality and reproducibility of a SAGE library. Using these methods, a critical variable in the SAGE protocol was identified that has the potential to bias the Tag distribution relative to the GC content of the 10 bp SAGE Tag DNA sequence. We also detected this bias in a number of publicly available SAGE libraries. It is important to note that the GC content bias went undetected by quality control procedures in the current SAGE protocol and was only identified with the use of these statistical analyses on as few as 750 SAGE Tags. In addition to keeping any solution of free DiTags on ice, an analysis of the GC content should be performed before sequencing large numbers of SAGE Tags to be confident that SAGE libraries are free from experimental bias.
* To whom correspondence should be addressed at: Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109-0618, USA. Tel: +1 734 647 3817; Fax: +1 734 763 3784; Email: innis{at}umich.edu
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. Khattra, A. D. Delaney, Y. Zhao, A. Siddiqui, J. Asano, H. McDonald, P. Pandoh, N. Dhalla, A.-l. Prabhu, K. Ma, et al. Large-scale production of SAGE libraries from microdissected tissues, flow-sorted cells, and cell lines Genome Res., January 1, 2007; 17(1): 108 - 116. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Reverter, A. Ingham, S. A. Lehnert, S.-H. Tan, Y. Wang, A. Ratnakumar, and B. P. Dalrymple Simultaneous identification of differential gene expression and connectivity in inflammation, adipogenesis and cancer Bioinformatics, October 1, 2006; 22(19): 2396 - 2404. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. D. Singh, J. C. Davis, and D. A. Petrov X-Linked Genes Evolve Higher Codon Bias in Drosophila and Caenorhabditis Genetics, September 1, 2005; 171(1): 145 - 155. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. T. Webster, N. G.C. Smith, M. J. Lercher, and H. Ellegren Gene Expression, Synteny, and Local Similarity in Human Noncoding Mutation Rates Mol. Biol. Evol., October 1, 2004; 21(10): 1820 - 1830. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. P. So, R. F. B. Turner, and C. A. Haynes Increasing the efficiency of SAGE adaptor ligation bydirected ligation chemistry Nucleic Acids Res., July 6, 2004; 32(12): e96 - e96. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Lercher, J.-V. Chamary, and L. D. Hurst Genomic Regionality in Rates of Evolution Is Not Explained by Clustering of Genes of Comparable Expression Profile Genome Res., June 1, 2004; 14(6): 1002 - 1013. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Schwartz, A. Duka, E. Triantafyllidi, C. Johns, I. Duka, J. Cui, and H. Gavras Serial analysis of gene expression in mouse kidney following angiotensin II administration Physiol Genomics, December 16, 2003; 16(1): 90 - 98. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Lercher, A. O. Urrutia, A. Pavlicek, and L. D. Hurst A unification of mosaic structures in the human genome Hum. Mol. Genet., October 1, 2003; 12(19): 2411 - 2415. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. O. Urrutia and L. D. Hurst The Signature of Selection Mediated by Expression on Human Genes Genome Res., October 1, 2003; 13(10): 2260 - 2264. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. D. Patino, O. Y. Mian, and P. M. Hwang Serial Analysis of Gene Expression: Technical Considerations and Applications to Cardiovascular Biology Circ. Res., October 4, 2002; 91(7): 565 - 569. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. H. Margulies, S. L.R. Kardia, and J. W. Innis A Comparative Molecular Analysis of Developing Mouse Forelimbs and Hindlimbs Using Serial Analysis of Gene Expression (SAGE) Genome Res., October 1, 2001; 11(10): 1686 - 1698. [Abstract] [Full Text] [PDF] |
||||







