Nucleic Acids Research, 2002, Vol. 30, No. 1 328-331
© 2002 Oxford University Press
DBTSS: DataBase of human Transcriptional Start Sites and full-length cDNAs
Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokane-dai, Minato-ku, Tokyo 108-8639, Japan and 1Taisho Laboratory of Functional Genomics, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma-shi, Nara 630-0101, Japan
Although the information of cDNAs is indispensable for analyzing gene function, most of the cDNA sequences stored in current databases are imperfect in the sense that they lack the precise information of 5' end termini. To overcome this difficulty, we have developed the oligo-capping method to obtain full-length cDNAs, the information of which has been partly deposited in public databases. In this study, we further constructed human cDNA libraries enriched in clones containing the cap structure to systematically explore the 5' end structure of expressed genes. Of approximately 217 402 5' end sequences obtained, 111 382 have been matched to cDNA sequences of known genes (7889 genes) and are presented in our new database, DataBase of Transcriptional Start Sites (DBTSS; http://elmo.ims.u-tokyo.ac.jp/dbtss/). Sequence comparison between our entries and those of a reference sequence database, RefSeq, revealed that 4683 (34%) of RefSeq sequences should be extended towards the 5' ends. We also mapped each sequence on the human draft genome sequence to identify its transcriptional start site, which provides us with more detailed information on distribution patterns of transcriptional start sites and adjacent regulatory regions.
* To whom correspondence should be addressed. Tel: +81 3 5449 5343; Fax: +81 3 5449 5416; Email: ysuzuki{at}manage.ims.u-tokyo.ac.jp
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
N. Murakami, T. Hashidate, T. Harayama, T. Yokomizo, T. Shimizu, and M. Nakamura Transcriptional regulation of human G2A in monocytes/ macrophages: involvement of c/EBPs, Runx and Pu.1 Genes Cells, December 1, 2009; 14(12): 1441 - 1455. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Megraw, F. Pereira, S. T. Jensen, U. Ohler, and A. G. Hatzigeorgiou A transcription factor affinity-based code for mammalian transcription initiation Genome Res., April 1, 2009; 19(4): 644 - 656. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. M.R. Petit, H. Lindskog, E. Larsson, P. Wasteson, E. Athley, S. Breuer, M. Angstenberger, D. Hertfelder, E. Mattsson, A. Nordheim, et al. Smooth Muscle Expression of Lipoma Preferred Partner Is Mediated by an Alternative Intronic Promoter That Is Regulated by Serum Response Factor/Myocardin Circ. Res., July 3, 2008; 103(1): 61 - 69. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Yamashita, Y. Suzuki, N. Takeuchi, H. Wakaguri, T. Ueda, S. Sugano, and K. Nakai Comprehensive detection of human terminal oligo-pyrimidine (TOP) genes and analysis of their characteristics Nucleic Acids Res., June 1, 2008; 36(11): 3707 - 3715. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Tharakaraman, O. Bodenreider, D. Landsman, J. L. Spouge, and L. Marino-Ramirez The biological function of some human transcription factor binding motifs varies with position relative to the transcription start site Nucleic Acids Res., May 1, 2008; 36(8): 2777 - 2786. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. O. Hoque, M. S. Kim, K. L. Ostrow, J. Liu, G. B. A. Wisman, H. L. Park, M. L. Poeta, C. Jeronimo, R. Henrique, A. Lendvai, et al. Genome-Wide Promoter Analysis Uncovers Portions of the Cancer Methylome Cancer Res., April 15, 2008; 68(8): 2661 - 2670. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Goulet, G. Gauvin, S. Boisvenue, and J. Cote Alternative Splicing Yields Protein Arginine Methyltransferase 1 Isoforms with Distinct Activity, Substrate Specificity, and Subcellular Localization J. Biol. Chem., November 9, 2007; 282(45): 33009 - 33021. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Myslinski, M.-A. Gerard, A. Krol, and P. Carbon Transcription of the human cell cycle regulated BUB1B gene requires hStaf/ZNF143 Nucleic Acids Res., May 11, 2007; 35(10): 3453 - 3464. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Husain, X. Zhang, M. A. Doll, J. C. States, D. F. Barker, and D. W. Hein Identification of N-Acetyltransferase 2 (NAT2) Transcription Start Sites and Quantitation of NAT2-Specific mRNA in Human Tissues Drug Metab. Dispos., May 1, 2007; 35(5): 721 - 727. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Baek, C. Davis, B. Ewing, D. Gordon, and P. Green Characterization and predictive discovery of evolutionarily conserved mammalian alternative promoters Genome Res., February 1, 2007; 17(2): 145 - 155. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Myslinski, M.-A. Gerard, A. Krol, and P. Carbon A Genome Scale Location Analysis of Human Staf/ZNF143-binding Sites Suggests a Widespread Role for Human Staf/ZNF143 in Mammalian Promoters J. Biol. Chem., December 29, 2006; 281(52): 39953 - 39962. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Elnitski, V. X. Jin, P. J. Farnham, and S. J.M. Jones Locating mammalian transcription factor binding sites: A survey of computational and experimental techniques Genome Res., December 1, 2006; 16(12): 1455 - 1464. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. X. Jin, A. Rabinovich, S. L. Squazzo, R. Green, and P. J. Farnham A computational genomics approach to identify cis-regulatory modules from chromatin immunoprecipitation microarray data--A case study using E2F1 Genome Res., December 1, 2006; 16(12): 1585 - 1595. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Xie, S. Wu, K.-M. Lam, and H. Yan PromoterExplorer: an effective promoter identification method based on the AdaBoost algorithm Bioinformatics, November 15, 2006; 22(22): 2722 - 2728. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Swan, S. A. Richards, N. P. Duroudier, I. Sayers, and I. P. Hall Alternative Promoter Use and Splice Variation in the Human Histamine H1 Receptor Gene Am. J. Respir. Cell Mol. Biol., July 1, 2006; 35(1): 118 - 126. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Agrawal and G. D. Stormo Using mRNAs lengths to accurately predict the alternatively spliced gene products in Caenorhabditis elegans Bioinformatics, May 15, 2006; 22(10): 1239 - 1244. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Cooper, N. D. Trinklein, E. D. Anton, L. Nguyen, and R. M. Myers Comprehensive analysis of transcriptional promoter structure and function in 1% of the human genome Genome Res., January 1, 2006; 16(1): 1 - 10. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Yamashita, Y. Suzuki, H. Wakaguri, K. Tsuritani, K. Nakai, and S. Sugano DBTSS: DataBase of Human Transcription Start Sites, progress report 2006 Nucleic Acids Res., January 1, 2006; 34(suppl_1): D86 - D89. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Sun, S. K. Palaniswamy, T. T. Pohar, V. X. Jin, T. H.-M. Huang, and R. V. Davuluri MPromDb: an integrated resource for annotation and visualization of mammalian gene promoters and ChIP-chip experimental data Nucleic Acids Res., January 1, 2006; 34(suppl_1): D98 - D103. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Sun, L. D. Hurst, G. G. Carmichael, and J. Chen Evidence for a preferential targeting of 3'-UTRs by cis-encoded natural antisense transcripts Nucleic Acids Res., October 4, 2005; 33(17): 5533 - 5543. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Florquin, Y. Saeys, S. Degroeve, P. Rouze, and Y. Van de Peer Large-scale structural analysis of the core promoter in mammalian and plant genomes Nucleic Acids Res., July 27, 2005; 33(13): 4255 - 4264. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. V. Vishnevsky and N. A. Kolchanov ARGO: a web system for the detection of degenerate motifs and large-scale recognition of eukaryotic promoters Nucleic Acids Res., July 1, 2005; 33(suppl_2): W417 - W422. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Qian, N. Esumi, Y. Chen, Q. Wang, I. Chowers, and D. J. Zack Identification of regulatory targets of tissue-specific transcription factors: application to retina-specific gene regulation Nucleic Acids Res., June 20, 2005; 33(11): 3479 - 3491. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kamalakaran, S. K. Radhakrishnan, and W. T. Beck Identification of Estrogen-responsive Genes Using a Genome-wide Analysis of Promoter Elements for Transcription Factor Binding Sites J. Biol. Chem., June 3, 2005; 280(22): 21491 - 21497. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Ho Sui, J. R. Mortimer, D. J. Arenillas, J. Brumm, C. J. Walsh, B. P. Kennedy, and W. W. Wasserman oPOSSUM: identification of over-represented transcription factor binding sites in co-expressed genes Nucleic Acids Res., June 2, 2005; 33(10): 3154 - 3164. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. H. Brown, S. S. Gross, and M. R. Brent Begin at the beginning: Predicting genes with 5' UTRs Genome Res., May 1, 2005; 15(5): 742 - 747. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Zukunft, T. Lang, T. Richter, K. I. Hirsch-Ernst, A. K. Nussler, K. Klein, M. Schwab, M. Eichelbaum, and U. M. Zanger A Natural CYP2B6 TATA Box Polymorphism (-82T-> C) Leading to Enhanced Transcription and Relocation of the Transcriptional Start Site Mol. Pharmacol., May 1, 2005; 67(5): 1772 - 1782. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. I. Gershenzon, G. D. Stormo, and I. P. Ioshikhes Computational technique for improvement of the position-weight matrices for the DNA/protein binding sites Nucleic Acids Res., April 22, 2005; 33(7): 2290 - 2301. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. K. Palaniswamy, V. X. Jin, H. Sun, and R. V. Davuluri OMGProm: a database of orthologous mammalian gene promoters Bioinformatics, March 15, 2005; 21(6): 835 - 836. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. L. Shang and S. C. Dudley Jr. Tandem Promoters and Developmentally Regulated 5'- and 3'-mRNA Untranslated Regions of the Mouse Scn5a Cardiac Sodium Channel J. Biol. Chem., January 14, 2005; 280(2): 933 - 940. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Sumazin, G. Chen, N. Hata, A. D. Smith, T. Zhang, and M. Q. Zhang DWE: Discriminating Word Enumerator Bioinformatics, January 1, 2005; 21(1): 31 - 38. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Nanjo, N. Futamura, M. Nishiguchi, T. Igasaki, K. Shinozaki, and K. Shinohara Characterization of Full-length Enriched Expressed Sequence Tags of Stress-treated Poplar Leaves Plant Cell Physiol., December 15, 2004; 45(12): 1738 - 1748. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Oyama, C. Itagaki, H. Hata, Y. Suzuki, T. Izumi, T. Natsume, T. Isobe, and S. Sugano Analysis of Small Human Proteins Reveals the Translation of Upstream Open Reading Frames of mRNAs Genome Res., October 1, 2004; 14(10b): 2048 - 2052. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. V. Sun, D. R. Boverhof, L. D. Burgoon, M. R. Fielden, and T. R. Zacharewski Comparative analysis of dioxin response elements in human, mouse and rat genomic sequences Nucleic Acids Res., August 24, 2004; 32(15): 4512 - 4523. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. C. FitzGerald, A. Shlyakhtenko, A. A. Mir, and C. Vinson Clustering of DNA Sequences in Human Promoters Genome Res., August 1, 2004; 14(8): 1562 - 1574. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. Halees and Z. Weng PromoSer: improvements to the algorithm, visualization and accessibility Nucleic Acids Res., July 1, 2004; 32(suppl_2): W191 - W194. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. B. Vega, D. K. Bangarusamy, L. D. Miller, E. T. Liu, and C.-Y. Lin BEARR: Batch Extraction and Analysis of cis-Regulatory Regions Nucleic Acids Res., July 1, 2004; 32(suppl_2): W257 - W260. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. Tullai, M. E. Schaffer, S. Mullenbrock, S. Kasif, and G. M. Cooper Identification of Transcription Factor Binding Sites Upstream of Human Genes Regulated by the Phosphatidylinositol 3-Kinase and MEK/ERK Signaling Pathways J. Biol. Chem., May 7, 2004; 279(19): 20167 - 20177. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Sekiya, S. Adachi, K. Kohu, T. Yamada, O. Higuchi, Y. Furukawa, Y. Nakamura, T. Nakamura, K. Tashiro, S. Kuhara, et al. Identification of BMP and Activin Membrane-bound Inhibitor (BAMBI), an Inhibitor of Transforming Growth Factor-{beta} Signaling, as a Target of the {beta}-Catenin Pathway in Colorectal Tumor Cells J. Biol. Chem., February 20, 2004; 279(8): 6840 - 6846. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Marino-Ramirez, J. L. Spouge, G. C. Kanga, and D. Landsman Statistical analysis of over-represented words in human promoter sequences Nucleic Acids Res., February 12, 2004; 32(3): 949 - 958. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Suzuki, R. Yamashita, S. Sugano, and K. Nakai DBTSS, DataBase of Transcriptional Start Sites: progress report 2004 Nucleic Acids Res., January 1, 2004; 32(90001): D78 - 81. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. D. Schmid, V. Praz, M. Delorenzi, R. Perier, and P. Bucher The Eukaryotic Promoter Database EPD: the impact of in silico primer extension Nucleic Acids Res., January 1, 2004; 32(90001): D82 - 85. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. T. Pohar, H. Sun, and R. V. Davuluri HemoPDB: Hematopoiesis Promoter Database, an information resource of transcriptional regulation in blood cell development Nucleic Acids Res., January 1, 2004; 32(90001): D86 - 90. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. S. Clark, Y. J.K. Edwards, D. Peterson, S. W. Clifton, A. J. Thompson, M. Sasaki, Y. Suzuki, K. Kikuchi, S. Watabe, K. Kawakami, et al. Fugu ESTs: New Resources for Transcription Analysis and Genome Annotation Genome Res., December 1, 2003; 13(12): 2747 - 2753. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. B. Bajic and S. H. Seah Dragon Gene Start Finder: An Advanced System for Finding Approximate Locations of the Start of Gene Transcriptional Units Genome Res., August 1, 2003; 13(8): 1923 - 1929. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Chong, G. Zhang, and V. B. Bajic FIE2: a program for the extraction of genomic DNA sequences around the start and translation initiation site of human genes Nucleic Acids Res., July 1, 2003; 31(13): 3546 - 3553. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. Halees, D. Leyfer, and Z. Weng PromoSer: a large-scale mammalian promoter and transcription start site identification service Nucleic Acids Res., July 1, 2003; 31(13): 3554 - 3559. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. B. Bajic and S. H. Seah Dragon Gene Start Finder identifies approximate locations of the 5' ends of genes Nucleic Acids Res., July 1, 2003; 31(13): 3560 - 3563. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Nishiyama, T. Fujita, T. Shin-I, M. Seki, H. Nishide, I. Uchiyama, A. Kamiya, P. Carninci, Y. Hayashizaki, K. Shinozaki, et al. Comparative genomics of Physcomitrella patens gametophytic transcriptome and Arabidopsis thaliana: Implication for land plant evolution PNAS, June 24, 2003; 100(13): 8007 - 8012. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. D. Trinklein, S. J. F. Aldred, A. J. Saldanha, and R. M. Myers Identification and Functional Analysis of Human Transcriptional Promoters Genome Res., February 1, 2003; 13(2): 308 - 312. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Majewski and J. Ott Distribution and Characterization of Regulatory Elements in the Human Genome Genome Res., December 1, 2002; 12(12): 1827 - 1836. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. S. Halfon and A. M. Michelson Exploring genetic regulatory networks in metazoan development: methods and models Physiol Genomics, September 3, 2002; 10(3): 131 - 143. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Frith, J. L. Spouge, U. Hansen, and Z. Weng Statistical significance of clusters of motifs represented by position specific scoring matrices in nucleotide sequences Nucleic Acids Res., July 15, 2002; 30(14): 3214 - 3224. [Abstract] [Full Text] [PDF] |
||||












