Nucleic Acids Research, 2002, Vol. 30, No. 1 328-331
© 2002 Oxford University Press
DBTSS: DataBase of human Transcriptional Start Sites and full-length cDNAs
Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokane-dai, Minato-ku, Tokyo 108-8639, Japan and 1Taisho Laboratory of Functional Genomics, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma-shi, Nara 630-0101, Japan
Received September 17, 2001; Accepted September 28, 2001.
| ABSTRACT |
|---|
|
|
|---|
Although the information of cDNAs is indispensable for analyzing gene function, most of the cDNA sequences stored in current databases are imperfect in the sense that they lack the precise information of 5' end termini. To overcome this difficulty, we have developed the oligo-capping method to obtain full-length cDNAs, the information of which has been partly deposited in public databases. In this study, we further constructed human cDNA libraries enriched in clones containing the cap structure to systematically explore the 5' end structure of expressed genes. Of approximately 217 402 5' end sequences obtained, 111 382 have been matched to cDNA sequences of known genes (7889 genes) and are presented in our new database, DataBase of Transcriptional Start Sites (DBTSS; http://elmo.ims.u-tokyo.ac.jp/dbtss/). Sequence comparison between our entries and those of a reference sequence database, RefSeq, revealed that 4683 (34%) of RefSeq sequences should be extended towards the 5' ends. We also mapped each sequence on the human draft genome sequence to identify its transcriptional start site, which provides us with more detailed information on distribution patterns of transcriptional start sites and adjacent regulatory regions.
| INTRODUCTION |
|---|
|
|
|---|
Even after the human genome was sequenced as a rough draft (1,2), the need for its cDNA analyses is still unchanged to locate the position of genes and to examine their expression status. Since conventional methods for determining exact transcriptional start sites (TSS), such as 5' RACE (3,4) or primer extension (5), are laborious, the 5' end positions of most cDNA sequences deposited so far in public databases are incomplete. In addition, computational methods to predict the location of promoters are still immature (6). Nevertheless, positional data of TSS is a rich source of biological information. First, precise determination of TSS is indispensable to identifying the promoter region, which is located just proximal to TSS in many cases. Secondly, when multiple TSS are observed, it enables us to examine the dynamic nature of the transcriptional initiation events. Loose specification of TSS may reflect slippery interaction between the promoter and the transcription machinery, whereas tight specification may reflect the rigid interaction (7). In any case, it would be intriguing to find correlations between distribution patterns of TSS and their upstream sequences. Finally, since TSS marks the 5' end limit of the cDNA, the amino acid sequence of the exact N-terminus of the encoding protein could be deduced, which is essential to examine the presence of protein sorting signals.
To obtain the full-length cDNAs, we have developed the oligo-capping method (8,9). In this paper, we present a novel database containing the result of systematic 5' end sequencing of human full-length cDNAs [the DataBase of human Transcriptional Start Sites and full-length cDNAs (DBTSS)]. Our database will be useful not only as a full-length version of the RefSeq database (10,11) but also as a resource for analyzing regulatory information, such as differential TSS and promoter uses.
| CONSTRUCTION OF FULL-LENGTH cDNA LIBRARIES |
|---|
|
|
|---|
Here we briefly explain the oligo-capping method (reviewed in 8,9). With this method, the cap structure of mRNA is replaced with the synthetic oligonucleotide enzymatically in the following three steps: bacterial alkaline phosphatase hydrolyses the phosphate of truncated mRNA 5' ends whose cap structures have been broken down. Tobacco acid pyrophosphatase removes the cap structure, leaving the phosphate at the 5' end. T4 RNA ligase, which requires a phosphate at the 5' end as its substrate, selectively ligates the 5' oligo to the 5' end that originally had the cap structure. Each mRNA product of the oligo-capping contains the sequence tags at both ends, which are poly(A) at the 3' end and the cap-replaced oligo at the 5' end. With the oligo-capped mRNA as a starting material, a new system is developed to selectively clone the cDNAs that contain both of the sequence tags at the respective ends. Thus, cDNA libraries were constructed in which the content of full-length cDNA is significantly enriched (full-length-enriched cDNA library). We obtained 217 402 sequences derived from 132 libraries. More information of the used libraries is listed in the database. We will continue to produce 5' end sequences by the oligo-capping method and will update DBTSS accordingly.
| SEQUENCE DATA PROCESSING |
|---|
|
|
|---|
Each sequence produced by the oligo-capping method was first processed to trim its vector site and its low quality parts. Then, they were compared with human reference sequences (RefSeq) using the BLAST program (12). If a homolog was found with >95% identity and less than 10100 in e-value, it was regarded as identical to the RefSeq sequence. In total, 142 021 sequences were matched to the RefSeq NM (i.e. curated mRNA) human sequences. In order to identify precise TSS information, sequences that have multiple homologs in RefSeq were discarded. Besides, we removed sequences that were not mapped on the human genome working draft sequence (Golden Path: http://genome.ucsc.edu/) database. Using the sim4 program (13), 111 382 sequences were mapped to the human genome sequence. These 111 382 clones were classified into 7889 RefSeq NM entries and are stored in our database. Sequence comparison between these data indicated that 4683 (34%) of the 7889 RefSeq sequences could be extended towards the 5' ends. Figure 1A shows the length distribution of added sequences. On average, our data extended the RefSeq sequence by 87 bases. The predicted full-length sequences which are the RefSeq sequences elongated with our data are called Ref-Full and are obtainable via FTP.
|
Figure 1B represents the distribution of the distance between the 5' ends of Ref-Full and RefSeq on genomic sequence. In many genes, the distance between newly identified TSS as the 5' end of Ref-Full and the 5' end of pre-existing RefSeq was so large that our data should be indispensable to identifying the promoter region, which is located adjacent to the TSS.
| USAGE OF DBTSS |
|---|
|
|
|---|
From our DBTSS web page at http://elmo.ims.u-tokyo.ac.jp/dbtss/, users can retrieve the TSS information of a specified gene in several ways. For example, users can use a gene name search or can directly enter the RefSeq IDs (such as NM_005718), LocusLink IDs, UniGene IDs or gene symbols. Another way of entry specification is to enter a partial mRNA sequence. Then, entries hit by a BLAST search will be displayed.
Figure 2 shows an example of NM_005718 (actin related protein 2/3 complex, subunit 4). One of the major contents of the entry is Ref-Full sequence, i.e. representative full-length cDNA sequence (Fig. 2C). While the RefSeq entry NM_005718 starts at position 11 780 511 of chromosome 3, our clone, HRC00655, starts at 11 775 385. Then, we added 175 bases of the 5' end region from HRC00655 to NM_005718, defining a Ref-Full sequence of NM_005718. Although the 5' end sequence extended to the RefSeq sequence is short, this part is essential to identify the first exon and adjacent promoter region which are separated from the other exons by a large intron.
|
Another feature of DBTSS is that it enables us to see the exonintron structures and the distribution patterns of TSS with various clones. For example, Figure 2A shows that there are two types of alternative first exons in this gene. Figure 1B, shows that even in either of the exons, the exact positions of the TSS are highly variable, which is consistent with our recent observations (7). Our database should provide versatile information on the diverse nature of the human gene transcripts.
| ACKNOWLEDGEMENTS |
|---|
We thank H. Hata and every member of the HGC-IMSUT sequencing team for their excellent sequencing work. We are also thankful to T. Hasui and J. M. Sugano for helpful discussions, and to Y. Makita for technical support in database construction. This work was supported by a Grant-in-Aid for Scientific Research on Priority Areas and by special coordination funds for promoting science and technology (SCF), both from the Ministry of Education, Culture, Sports, Science and Technology in Japan.
| FOOTNOTES |
|---|
* To whom correspondence should be addressed. Tel: +81 3 5449 5343; Fax: +81 3 5449 5416; Email: ysuzuki{at}manage.ims.u-tokyo.ac.jp
| REFERENCES |
|---|
|
|
|---|
-
1 Lander,E.S., Linton,L.M., Birren,B., Nusbaum,C., Zody,M.C., Baldwin,J., Devon,K., Dewar,K., Doyle,M., FitzHugh,H. et al. (2001) Initial sequencing and analysis of the human genome. Nature, 409, 860921.[Medline]
2 Venter,J.C., Adams,M.D., Myers,E.W., Li,P.W., Mural,R.J., Sutton,G.G., Smith,H.O., Yandell,M., Evans,C.A., Holt,R.A. et al. (2001) The sequence of the human genome. Science, 291, 13041351.
3 Frohman,M.A., Dush,M.K. and Martin,G.R. (1988) Rapid production of full-length cDNAs from rare transcripts: amplification using a single gene-specific oligonucleotide primer. Proc. Natl Acad. Sci. USA, 85, 89989002.
4 Schaefer,B.C. (1995) Revolutions in rapid amplification of cDNA ends: new strategies for polymerase chain reaction cloning of full-length cDNA ends. Anal. Biochem., 227, 255273.[Web of Science][Medline]
5 McKnight,S.L. and Kingsbury,R. (1982) Transcriptional control signals of a eukaryotic protein-coding gene. Science, 217, 316324.
6 Ohler,U. and Niemann,H. (2001) Identification and analysis of eukaryotic promoters: recent computational approaches. Trends Genet., 17, 5660.[Web of Science][Medline]
7 Suzuki,Y., Taira,H., Tsunoda,T., Mizushima-Sugano,J., Sese,J., Hata,H., Ota,T., Isogai,T., Tanaka,T., Morishita,S. et al. (2001) Diverse transcriptional initiation revealed by fine, large-scale mapping of mRNA start sites. EMBO Rep., 2, 388393.[Web of Science][Medline]
8 Maruyama,K. and Sugano,S. (1994) Oligo-capping: a simple method to replace the cap structure of eukaryotic mRNAs with oligoribonucleotides. Gene, 138, 171174.[Web of Science][Medline]
9 Suzuki,Y., Yoshitomo-Nakagawa,K., Maruyama,K., Suyama,A. and Sugano,S. (1997) Construction and characterization of a full length-enriched and a 5'-end-enriched cDNA library. Gene, 200, 149156.[Web of Science][Medline]
10 Pruitt,K.D., Katz,K.S., Sicotte,H. and Maglott,D.R. (2000) Introducing RefSeq and LocusLink: curated human genome resources at the NCBI. Trends Genet., 16, 4447.[Web of Science][Medline]
11 Pruitt,K.D. and Maglott,D.R. (2001) RefSeq and LocusLink: NCBI gene-centered resources. Nucleic Acids Res., 29, 137140.
12 Altschul,S.F., Madden,T.L., Schaffer,A.A., Zhang,J., Zhang,Z., Miller,W. and Lipman,D.J. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res., 17, 33893402.
13 Florea,L., Hartzell,G., Zhang,Z., Rubin,G.M. and Miller,W. (1998) A computer program for aligning a cDNA sequence with a genomic DNA sequence. Genome Res., 8, 967974.
This article has been cited by other articles:
![]() |
M. Megraw, F. Pereira, S. T. Jensen, U. Ohler, and A. G. Hatzigeorgiou A transcription factor affinity-based code for mammalian transcription initiation Genome Res., April 1, 2009; 19(4): 644 - 656. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. M.R. Petit, H. Lindskog, E. Larsson, P. Wasteson, E. Athley, S. Breuer, M. Angstenberger, D. Hertfelder, E. Mattsson, A. Nordheim, et al. Smooth Muscle Expression of Lipoma Preferred Partner Is Mediated by an Alternative Intronic Promoter That Is Regulated by Serum Response Factor/Myocardin Circ. Res., July 3, 2008; 103(1): 61 - 69. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Yamashita, Y. Suzuki, N. Takeuchi, H. Wakaguri, T. Ueda, S. Sugano, and K. Nakai Comprehensive detection of human terminal oligo-pyrimidine (TOP) genes and analysis of their characteristics Nucleic Acids Res., June 1, 2008; 36(11): 3707 - 3715. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Tharakaraman, O. Bodenreider, D. Landsman, J. L. Spouge, and L. Marino-Ramirez The biological function of some human transcription factor binding motifs varies with position relative to the transcription start site Nucleic Acids Res., May 1, 2008; 36(8): 2777 - 2786. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. O. Hoque, M. S. Kim, K. L. Ostrow, J. Liu, G. B. A. Wisman, H. L. Park, M. L. Poeta, C. Jeronimo, R. Henrique, A. Lendvai, et al. Genome-Wide Promoter Analysis Uncovers Portions of the Cancer Methylome Cancer Res., April 15, 2008; 68(8): 2661 - 2670. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Goulet, G. Gauvin, S. Boisvenue, and J. Cote Alternative Splicing Yields Protein Arginine Methyltransferase 1 Isoforms with Distinct Activity, Substrate Specificity, and Subcellular Localization J. Biol. Chem., November 9, 2007; 282(45): 33009 - 33021. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Myslinski, M.-A. Gerard, A. Krol, and P. Carbon Transcription of the human cell cycle regulated BUB1B gene requires hStaf/ZNF143 Nucleic Acids Res., May 11, 2007; 35(10): 3453 - 3464. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Husain, X. Zhang, M. A. Doll, J. C. States, D. F. Barker, and D. W. Hein Identification of N-Acetyltransferase 2 (NAT2) Transcription Start Sites and Quantitation of NAT2-Specific mRNA in Human Tissues Drug Metab. Dispos., May 1, 2007; 35(5): 721 - 727. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Baek, C. Davis, B. Ewing, D. Gordon, and P. Green Characterization and predictive discovery of evolutionarily conserved mammalian alternative promoters Genome Res., February 1, 2007; 17(2): 145 - 155. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Myslinski, M.-A. Gerard, A. Krol, and P. Carbon A Genome Scale Location Analysis of Human Staf/ZNF143-binding Sites Suggests a Widespread Role for Human Staf/ZNF143 in Mammalian Promoters J. Biol. Chem., December 29, 2006; 281(52): 39953 - 39962. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Elnitski, V. X. Jin, P. J. Farnham, and S. J.M. Jones Locating mammalian transcription factor binding sites: A survey of computational and experimental techniques Genome Res., December 1, 2006; 16(12): 1455 - 1464. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. X. Jin, A. Rabinovich, S. L. Squazzo, R. Green, and P. J. Farnham A computational genomics approach to identify cis-regulatory modules from chromatin immunoprecipitation microarray data--A case study using E2F1 Genome Res., December 1, 2006; 16(12): 1585 - 1595. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Xie, S. Wu, K.-M. Lam, and H. Yan PromoterExplorer: an effective promoter identification method based on the AdaBoost algorithm Bioinformatics, November 15, 2006; 22(22): 2722 - 2728. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Swan, S. A. Richards, N. P. Duroudier, I. Sayers, and I. P. Hall Alternative Promoter Use and Splice Variation in the Human Histamine H1 Receptor Gene Am. J. Respir. Cell Mol. Biol., July 1, 2006; 35(1): 118 - 126. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Agrawal and G. D. Stormo Using mRNAs lengths to accurately predict the alternatively spliced gene products in Caenorhabditis elegans Bioinformatics, May 15, 2006; 22(10): 1239 - 1244. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Cooper, N. D. Trinklein, E. D. Anton, L. Nguyen, and R. M. Myers Comprehensive analysis of transcriptional promoter structure and function in 1% of the human genome Genome Res., January 1, 2006; 16(1): 1 - 10. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Yamashita, Y. Suzuki, H. Wakaguri, K. Tsuritani, K. Nakai, and S. Sugano DBTSS: DataBase of Human Transcription Start Sites, progress report 2006 Nucleic Acids Res., January 1, 2006; 34(suppl_1): D86 - D89. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Sun, S. K. Palaniswamy, T. T. Pohar, V. X. Jin, T. H.-M. Huang, and R. V. Davuluri MPromDb: an integrated resource for annotation and visualization of mammalian gene promoters and ChIP-chip experimental data Nucleic Acids Res., January 1, 2006; 34(suppl_1): D98 - D103. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Sun, L. D. Hurst, G. G. Carmichael, and J. Chen Evidence for a preferential targeting of 3'-UTRs by cis-encoded natural antisense transcripts Nucleic Acids Res., October 4, 2005; 33(17): 5533 - 5543. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Florquin, Y. Saeys, S. Degroeve, P. Rouze, and Y. Van de Peer Large-scale structural analysis of the core promoter in mammalian and plant genomes Nucleic Acids Res., July 27, 2005; 33(13): 4255 - 4264. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. V. Vishnevsky and N. A. Kolchanov ARGO: a web system for the detection of degenerate motifs and large-scale recognition of eukaryotic promoters Nucleic Acids Res., July 1, 2005; 33(suppl_2): W417 - W422. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Qian, N. Esumi, Y. Chen, Q. Wang, I. Chowers, and D. J. Zack Identification of regulatory targets of tissue-specific transcription factors: application to retina-specific gene regulation Nucleic Acids Res., June 20, 2005; 33(11): 3479 - 3491. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kamalakaran, S. K. Radhakrishnan, and W. T. Beck Identification of Estrogen-responsive Genes Using a Genome-wide Analysis of Promoter Elements for Transcription Factor Binding Sites J. Biol. Chem., June 3, 2005; 280(22): 21491 - 21497. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Ho Sui, J. R. Mortimer, D. J. Arenillas, J. Brumm, C. J. Walsh, B. P. Kennedy, and W. W. Wasserman oPOSSUM: identification of over-represented transcription factor binding sites in co-expressed genes Nucleic Acids Res., June 2, 2005; 33(10): 3154 - 3164. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. H. Brown, S. S. Gross, and M. R. Brent Begin at the beginning: Predicting genes with 5' UTRs Genome Res., May 1, 2005; 15(5): 742 - 747. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Zukunft, T. Lang, T. Richter, K. I. Hirsch-Ernst, A. K. Nussler, K. Klein, M. Schwab, M. Eichelbaum, and U. M. Zanger A Natural CYP2B6 TATA Box Polymorphism (-82T-> C) Leading to Enhanced Transcription and Relocation of the Transcriptional Start Site Mol. Pharmacol., May 1, 2005; 67(5): 1772 - 1782. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. I. Gershenzon, G. D. Stormo, and I. P. Ioshikhes Computational technique for improvement of the position-weight matrices for the DNA/protein binding sites Nucleic Acids Res., April 22, 2005; 33(7): 2290 - 2301. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. K. Palaniswamy, V. X. Jin, H. Sun, and R. V. Davuluri OMGProm: a database of orthologous mammalian gene promoters Bioinformatics, March 15, 2005; 21(6): 835 - 836. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. L. Shang and S. C. Dudley Jr. Tandem Promoters and Developmentally Regulated 5'- and 3'-mRNA Untranslated Regions of the Mouse Scn5a Cardiac Sodium Channel J. Biol. Chem., January 14, 2005; 280(2): 933 - 940. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Sumazin, G. Chen, N. Hata, A. D. Smith, T. Zhang, and M. Q. Zhang DWE: Discriminating Word Enumerator Bioinformatics, January 1, 2005; 21(1): 31 - 38. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Nanjo, N. Futamura, M. Nishiguchi, T. Igasaki, K. Shinozaki, and K. Shinohara Characterization of Full-length Enriched Expressed Sequence Tags of Stress-treated Poplar Leaves Plant Cell Physiol., December 15, 2004; 45(12): 1738 - 1748. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Oyama, C. Itagaki, H. Hata, Y. Suzuki, T. Izumi, T. Natsume, T. Isobe, and S. Sugano Analysis of Small Human Proteins Reveals the Translation of Upstream Open Reading Frames of mRNAs Genome Res., October 1, 2004; 14(10b): 2048 - 2052. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. V. Sun, D. R. Boverhof, L. D. Burgoon, M. R. Fielden, and T. R. Zacharewski Comparative analysis of dioxin response elements in human, mouse and rat genomic sequences Nucleic Acids Res., August 24, 2004; 32(15): 4512 - 4523. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. C. FitzGerald, A. Shlyakhtenko, A. A. Mir, and C. Vinson Clustering of DNA Sequences in Human Promoters Genome Res., August 1, 2004; 14(8): 1562 - 1574. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. Halees and Z. Weng PromoSer: improvements to the algorithm, visualization and accessibility Nucleic Acids Res., July 1, 2004; 32(suppl_2): W191 - W194. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. B. Vega, D. K. Bangarusamy, L. D. Miller, E. T. Liu, and C.-Y. Lin BEARR: Batch Extraction and Analysis of cis-Regulatory Regions Nucleic Acids Res., July 1, 2004; 32(suppl_2): W257 - W260. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. Tullai, M. E. Schaffer, S. Mullenbrock, S. Kasif, and G. M. Cooper Identification of Transcription Factor Binding Sites Upstream of Human Genes Regulated by the Phosphatidylinositol 3-Kinase and MEK/ERK Signaling Pathways J. Biol. Chem., May 7, 2004; 279(19): 20167 - 20177. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Sekiya, S. Adachi, K. Kohu, T. Yamada, O. Higuchi, Y. Furukawa, Y. Nakamura, T. Nakamura, K. Tashiro, S. Kuhara, et al. Identification of BMP and Activin Membrane-bound Inhibitor (BAMBI), an Inhibitor of Transforming Growth Factor-{beta} Signaling, as a Target of the {beta}-Catenin Pathway in Colorectal Tumor Cells J. Biol. Chem., February 20, 2004; 279(8): 6840 - 6846. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Marino-Ramirez, J. L. Spouge, G. C. Kanga, and D. Landsman Statistical analysis of over-represented words in human promoter sequences Nucleic Acids Res., February 12, 2004; 32(3): 949 - 958. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Suzuki, R. Yamashita, S. Sugano, and K. Nakai DBTSS, DataBase of Transcriptional Start Sites: progress report 2004 Nucleic Acids Res., January 1, 2004; 32(90001): D78 - 81. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. D. Schmid, V. Praz, M. Delorenzi, R. Perier, and P. Bucher The Eukaryotic Promoter Database EPD: the impact of in silico primer extension Nucleic Acids Res., January 1, 2004; 32(90001): D82 - 85. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. T. Pohar, H. Sun, and R. V. Davuluri HemoPDB: Hematopoiesis Promoter Database, an information resource of transcriptional regulation in blood cell development Nucleic Acids Res., January 1, 2004; 32(90001): D86 - 90. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. S. Clark, Y. J.K. Edwards, D. Peterson, S. W. Clifton, A. J. Thompson, M. Sasaki, Y. Suzuki, K. Kikuchi, S. Watabe, K. Kawakami, et al. Fugu ESTs: New Resources for Transcription Analysis and Genome Annotation Genome Res., December 1, 2003; 13(12): 2747 - 2753. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. B. Bajic and S. H. Seah Dragon Gene Start Finder: An Advanced System for Finding Approximate Locations of the Start of Gene Transcriptional Units Genome Res., August 1, 2003; 13(8): 1923 - 1929. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Chong, G. Zhang, and V. B. Bajic FIE2: a program for the extraction of genomic DNA sequences around the start and translation initiation site of human genes Nucleic Acids Res., July 1, 2003; 31(13): 3546 - 3553. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. Halees, D. Leyfer, and Z. Weng PromoSer: a large-scale mammalian promoter and transcription start site identification service Nucleic Acids Res., July 1, 2003; 31(13): 3554 - 3559. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. B. Bajic and S. H. Seah Dragon Gene Start Finder identifies approximate locations of the 5' ends of genes Nucleic Acids Res., July 1, 2003; 31(13): 3560 - 3563. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Nishiyama, T. Fujita, T. Shin-I, M. Seki, H. Nishide, I. Uchiyama, A. Kamiya, P. Carninci, Y. Hayashizaki, K. Shinozaki, et al. Comparative genomics of Physcomitrella patens gametophytic transcriptome and Arabidopsis thaliana: Implication for land plant evolution PNAS, June 24, 2003; 100(13): 8007 - 8012. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. D. Trinklein, S. J. F. Aldred, A. J. Saldanha, and R. M. Myers Identification and Functional Analysis of Human Transcriptional Promoters Genome Res., February 1, 2003; 13(2): 308 - 312. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Majewski and J. Ott Distribution and Characterization of Regulatory Elements in the Human Genome Genome Res., December 1, 2002; 12(12): 1827 - 1836. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. S. Halfon and A. M. Michelson Exploring genetic regulatory networks in metazoan development: methods and models Physiol Genomics, September 3, 2002; 10(3): 131 - 143. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Frith, J. L. Spouge, U. Hansen, and Z. Weng Statistical significance of clusters of motifs represented by position specific scoring matrices in nucleotide sequences Nucleic Acids Res., July 15, 2002; 30(14): 3214 - 3224. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||













