Nucleic Acids Research, 2003, Vol. 31, No. 13 3576-3579
© 2003 Oxford University Press
MATCHTM: a tool for searching transcription factor binding sites in DNA sequences
1 BIOBASE GmbH, Halchtersche Str. 33, D-38304 Wolfenbüttel, Germany 2 Institute of Cytology and Genetics, Pr. Lavrentyeva 10, 360090, Novosibirsk, Russia 3 Department of Bioinformatics, UKG, University of Göttingen, Goldschmidtstr. 1, D-37077 Göttingen, Germany
*To whom correpondence should be addressed at BIOBASE GmbH, Halchtersche Strasse 33, D-38304 Wolfenbüttel, Germany. Tel: +49 5331 858441; Fax: +49 5331 858470; Email: ake{at}biobase.de
Received February 17, 2003; Revised and Accepted April 3, 2003
| ABSTRACT |
|---|
|
|
|---|
MatchTM is a weight matrix-based tool for searching putative transcription factor binding sites in DNA sequences. MatchTM is closely interconnected and distributed together with the TRANSFAC® database. In particular, MatchTM uses the matrix library collected in TRANSFAC® and therefore provides the possibility to search for a great variety of different transcription factor binding sites. Several sets of optimised matrix cut-off values are built in the system to provide a variety of search modes of different stringency. The user may construct and save his/her specific user profiles which are selected subsets of matrices including default or user-defined cut-off values. Furthermore a number of tissue-specific profiles are provided that were compiled by the TRANSFAC® team. A public version of the MatchTM tool is available at: http://www.gene-regulation.com/pub/programs.html#match. The same program with a different web interface can be found at http://compel.bionet.nsc.ru/Match/Match.html. An advanced version of the tool called MatchTM Professional is available at http://www.biobase.de.
| INTRODUCTION |
|---|
|
|
|---|
Regulation of gene expression on the level of transcription is a very complex process especially in multicellular eukaryotic organisms. Each cell type or tissue, at a specific developmental stage or under influence of an extracellular signal expresses a characteristic pattern of activated transcription factors (TF). These transcription-regulating nuclear proteins bind to specific binding sites in the regulatory regions (e.g. promoters, enhancers) of the genes thus providing their activation or repression. Computational methods of predicting TF binding sites in DNA are very important for understanding the molecular mechanisms of gene regulation. We have developed a new tool called MatchTM which is a weight matrix-based tool for searching putative transcription factor binding sites in DNA sequences. MatchTM uses a library of position weight matrices (PWMs) collected in the TRANSFAC® database (1) and therefore provides the possibility to search for a great variety of different transcription factor binding sites. There are several similar software tools available on the web that use weight matrices for predicting TF binding sites. SIGNAL SCAN (2), MATRIX SEARCH (3) and TESS (see in 4) are best known. MatchTM differs from them by providing the most up-to-date library of matrices. Moreover, MatchTM uses different algorithms for calculating score by applying the so called information vector (see below). In that way it is similar to MatInspector (5), where the information vector is also used. However, MatchTM's particular strength is in the extensive use of optimised, predefined by experts, function-specific sets of matrices and other parameters of the search (see below).
| ALGORITHM |
|---|
|
|
|---|
MatchTM takes DNA sequences as input, searches for potential TF binding sites using a library of PWMs and outputs a list of found potential sites and a visual representation of their locations in the sequence. The search algorithm uses two score values: the matrix similarity score (MSS) and the core similarity score (CSS). These two scores measure the quality of a match between the sequence and the matrix, which ranges from 0.0 to 1.0, where 1.0 denotes an exact match. The core of each matrix is defined as the first five most conserved consecutive positions of a matrix. Both scores, MSS and CSS, are calculated using the same formula (see below). Whereas MSS is calculated using all positions of the matrix, CSS is calculated using the core positions only. Two cut-offs for two corresponding scores are defined for every matrix (pre-defined by the MatchTM team or set up by the user). The algorithm reports only those matches of a matrix that have got both scores higher than the two corresponding cut-offs. To speed up the algorithm a hash table is constructed for all pentanucleotides in the sequence under study. The core similarity score is calculated for all pentanucleotides and the program puts the corresponding values into the hash table. For each entry of the hash table with the CSS higher than the cut-off each occurrence of this pentanucleotide is looked up in the sequence and is prolonged at both ends, so that it fits the matrix length. Then the matrix similarity score is calculated. Only those matches for which the matrix similarity score is higher than a certain cut-off are given in the program output.
The matrix similarity score mSS (as well as the core similarity score) for a subsequence x of the length L is calculated in the following way:
![]() |
![]() |
{A, T, G, C})
![]() |
![]() |
The information vector
![]() |
Matrix similarity cut-off estimation
In order to find putative TF binding sites with the help of MatchTM it is very important to choose appropriate values of the cut-off for core and matrix similarity. Selection of a cut-off value largely depends on the user's objectives. We have pre-calculated three different cut-offs for each matrix presented in the TRANSFAC® database (as in 7): (i) to minimise false positive (over-prediction error) rate; (ii) to minimise false negative (under-prediction error) rate; (iii) to minimise the sum of both errors.
Cut-offs minimising false negative rate (minFN)
We used actual weight matrices to calculate the probability of nucleotides occurring at each position of the matrix. Based on these probabilities for every weight matrix, we have generated a sample of oligonucleotides and applied our algorithm to this sample without using any cut-offs. Then we set the cut-offs to a value that provides recognition of at least 90% of the generated oligonucleotides. We decided to tolerate an error rate of 10%, taking into account that the set of oligonucleotides might contain weak representatives. We call this set of cut-offs minFN cut-offs.
Cut-offs minimising false positive rate (minFP)
We have applied the algorithm described above to the sequences of the second exons (
6x106 bp) because these sequences are presumed to contain no biologically relevant TF binding sites. For every matrix the lowest cut-off for which no match is found in the set of exon sites is considered to be the minFP cut-off. When a minFP cut-off is applied for searching a DNA sequence being studied, the algorithm will find a relatively low number of matches per nucleotide. In the output the user will only find putative sites with a good similarity to the weight matrix; however, some known genomic binding sites could not be recognised. This kind of cut-off is useful, for example, for searching the most promising potential binding sites in extended genomic DNA sequences. Since the selection of background sequences can influence the cut-off selection we are going to evaluate the use of other genomic sequences of distinct function as control which may give rise to alternative minFP cut-off estimates.
Cut-offs minimising the sum of both errors (minSum)
We compute a sum of both error rates to find cut-offs that give an optimal number of false positives and false negatives. For that, we compute the number of matches found in the exon sequences for each matrix using minFN cut-offs. This number is defined as 100% of false positives. The sum of corresponding percentages for false positives and false negatives is then computed for every cut-off ranging from minFN to minFP. We refer to the cut-off that gives the minimum sum as minSum cut-off.
The algorithm of MatchTM is quite similar to that of MatInspector (5), but with some differences. First of all, the calculation of the mSS score (see Equation 1) is slightly different which makes Match more discriminative when using certain matrices. From the user perspective, the most important difference is the extensive usage of the concept of profiles. This introduces additional flexibility in parametrising the search for any specific need.
| IMPLEMENTATION |
|---|
|
|
|---|
The algorithm is implemented in C and the program is wrapped by a Perl script to maintain a user friendly web interface (8,9). A public version of the MatchTM tool is available at http://www.gene-regulation.com/pub/programs.html#match. The same program under a different web interface can be found at http://compel.bionet.nsc.ru/Match/Match.html. An advanced version of the tool called MatchTM Professional is available at http://www.biobase.de. It makes use of the whole TRANSFAC Professional matrix library, while the publicly available version of Match has only access to the matrices of the TRANSFAC public version. This public library is comparatively small because it does not contain most of the matrices generated by the TRANSFAC team. MatchTM Professional contains a number of tissue-specific profiles that are not included in the public version. In addition, so called best_selection profiles are accessible only with MatchTM Professional. These profiles contain selections of the most reliable matrices and the cut-offs are optimised using well prepared sets of real binding sites from TRANSFAC (in contrast to the default profiles where cut-offs are optimised using an oligonucleotide generation approach, see minFN above). MatchTM Professional provides an additional tool for matrix construction which is not included in the public version of Match. This tool allows users to construct their own matrices from a set of aligned sequences.
The MatchTM user interface is shown in Figure 1. It has been designed so that the user has all necessary parameters available on one screen. The left panel is used to paste the sequence (or several sequences) and to specify the name of the search. The right panel contains three major sections: matrix selection, cut-off selection and profile selection.
|
The matrix selection section provides the possibility to select the taxa (vertebrate, insects, plant, fungi or all). High quality selection tag enables to use the high quality matrices only. These are approximately 70% of TRANSFAC® matrices that are characterised by the lowest false positive rate. We have selected these matrices using the following criteria. When using a matrix with a cut-off which allows a false negative rate of 50%, the frequency of matches found in exon2 sequences (false positive rate) must drop below 1 match per 1 kb. The choice of three cut-off sets (minFN, minFP and minSUM) is also provided in the matrix selection section. Alternatively, the user can select some uniform MSS and CSS cut-offs (e.g. 0.7 and 0.75) that will be applied to all matrices.
The profile selection section is the alternative way of defining parameters of the search. A profile is a subset of matrices with defined cut-offs. The user can choose one of the predefined profiles (created by the MatchTM team) or build his/her own profile using the associated web tool called Profiler. In the Profiler the user can flexibly select different matrices from the whole TRANSFAC® matrix library and define cut-offs individually or simultaneously to all matrices in the selection and save the profile under a new name. The user can also modify some of the existing profiles. A number of useful predefined profiles are provided by MatchTM including a small number of best matrices called best selection and several tissue-/cell type-specific (liver, muscle, immune-cells) or process specific (cell cycle) profiles. To build such profiles groups of transcription factors known to be active in a particular tissue or a process have been collected for each profile with the help of information from the TRANSFAC® database. Matrices linked to these transcription factors in TRANSFAC® were then retrieved. When more than one matrix was linked to a transcription factor, we chose the matrix that had the lowest false positive rate.
After submitting the form to the server, the MatchTM program makes the search of the TF binding sites according to the given parameters. The output of the MatchTM program is shown in Figure 2. Every match found by the program is shown in a separate line in the results table. It contains: matrix ID, position of the match, strand [(+) or (-), that indicate the matrix orientation in the match], two scores of the match, corresponding subsequence and names of transcription factors associated with the matrix. It must be mentioned that the position of the match is always given according to the (+) strand of the sequence. A simple visual representation of locations of the found matches is generated after pressing the graphic button (Fig. 2B). Sites are shown above the sequence and the orientation of the > sign corresponds to the (+) or (-) location of the sites. The name of the matrix is given as well. In Figure 2 we show the results of a MatchTM search in the promoter of the human gene for IL-12 using the predefined immune cell-specific profile. Three sites that are known in this promoter (see TRANSFAC® database) were found by MatchTM (shadowed in Fig. 2) along with a number of new sites. The relatively low number of known sites among numerous predicted sites can be explained first of all by the very limited knowledge obtained so far about real functional sites in genomes. Taking into account the whole complexity of regulatory functions maintained by promoters of genes that have to be encoded in their structure by a system of TF site combinations (10), we can speculate that many more TF sites will be revealed experimentally in the near future. All predictions obtained by MatchTM search can be considered as a source of well supported hypotheses for further experimental verification.
|
| ACKNOWLEDGEMENTS |
|---|
This work was mainly funded by BIOBASE GmbH (Wolfenbüttel, Germany). Part of this work was supported by Siberian Branch of Russian Academy of Sciences and by a grant of the European Commission (BIO4-95-0226).
| REFERENCES |
|---|
|
|
|---|
- Wingender,E., Chen,X., Fricke,E., Geffers,R., Hehl,R., Liebich,I., Krull,M., Matys,V., Michael,H., Ohnhaeuser,R. et al. (2001) The TRANSFAC system on gene expression regulation. Nucleic Acids Res., 29, 281283.
[Abstract/Free Full Text] - Prestridge,D.S. (1996) SIGNAL SCAN 4.0: additional databases and sequence formats. Comput. Appl. Biosci., 12, 157160.
[Abstract/Free Full Text] - Chen,Q.K., Hertz,G.Z. and Stormo,G.D. (1995) MATRIX SEARCH 1.0: a computer program that scans DNA sequences for transcriptional elements using a database of weight matrices. Comput. Appl. Biosci., 11, 563566.
[Abstract/Free Full Text] - Stoeckert,C.J.,Jr, Salas,F., Brunk,B. and Overton,G.C. (1999) EpoDB: a prototype database for the analysis of genes expressed during vertebrate erythropoiesis. Nucleic Acids Res., 27, 200203.
[Abstract/Free Full Text] - Quandt,K., Frech,K., Karas,H., Wingender,E. and Werner,T. (1995) MatInd and MatInspector: new fast and versatile tools for detection of consensus matches in nucleotide sequence data. Nucleic Acids Res., 23, 48784884.
[Abstract/Free Full Text] - Kel,A., Kel-Margoulis,O., Babenko,V. and Wingender,E. (1999) Recognition of NFATp/AP-1 composite elements within genes induced upon the activation of immune cells J. Mol. Biol., 288, 353376.[CrossRef][Web of Science][Medline]
- Pickert,L., Reuter,I., Klawonn,F. and Wingender,E. (1998) Transcription regulatory region analysis using signal detection and fuzzy clustering. Bioinformatics, 14, 244251.
[Abstract/Free Full Text] - Kel,A.E., Kondrakhin,Y.V., Kolpakov,Ph.A., Kel,O.V., Romashenko,A.G., Wingender,E., Milanesi,L. and Kolchanov,N.A. (1995) Computer tool FUNSITE for analysis of eukaryotic regulatory genomic sequences. Proc. Int. Conf. Intell. Syst. Mol. Biol., 3, 197205.[Medline]
- Gößling,E., Kel-Margoulis,O.V., Kel,A.E. and Wingender,E. (2001) MATCHTMa tool for searching transcription factor binding sites in DNA sequences. Application for the analysis of human chromosomes. Proceedings of the German Conference on Bioinformatics GCB'01. Braunschweig, Germany, October 710, pp. 158161.
- Fessele,S., Maier,H., Zischek,C., Nelson,P.J. and Werner,T. (2002) Regulatory context is a crucial part of gene function. Trends Genet., 18, 6063.[CrossRef][Web of Science][Medline]
This article has been cited by other articles:
![]() |
Z. Xu and J. A. Taylor SNPinfo: integrating GWAS and candidate gene information into functional SNP selection for genetic association studies Nucleic Acids Res., July 1, 2009; 37(suppl_2): W600 - W605. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Colecchia, D. Kottwitz, M. Wagner, C. V. Pfenninger, G. Thiel, I. Tamm, C. Peterson, and U. A. Nuber Tissue-specific regulatory network extractor (TS-REX): a database and software resource for the tissue and cell type-specific investigation of transcription factor-gene networks Nucleic Acids Res., June 1, 2009; 37(11): e82 - e82. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Thomas and C. A. Erickson FOXD3 regulates the lineage switch between neural crest-derived glial cells and pigment cells by repressing MITF through a non-canonical mechanism Development, June 1, 2009; 136(11): 1849 - 1858. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Tokovenko, R. Golda, O. Protas, M. Obolenskaya, and A. El'skaya COTRASIF: conservation-aided transcription-factor-binding site finder Nucleic Acids Res., April 1, 2009; 37(7): e49 - e49. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Xia, M. E. Lemieux, W. Li, J. S. Carroll, M. Brown, X. S. Liu, and A. L. Kung Integrative analysis of HIF binding and transactivation reveals its role in maintaining histone methylation homeostasis PNAS, March 17, 2009; 106(11): 4260 - 4265. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Tomaru, M. Nakanishi, H. Miura, Y. Kimura, H. Ohkawa, Y. Ohta, Y. Hayashizaki, and M. Suzuki Identification of an inter-transcription factor regulatory network in human hepatoma cells by Matrix RNAi Nucleic Acids Res., March 1, 2009; 37(4): 1049 - 1060. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Chen, K. Wang, J. Chen, J. Guo, Y. Yin, X. Cai, X. Guo, G. Wang, R. Yang, L. Zhu, et al. In Vitro Evidence Suggests That miR-133a-mediated Regulation of Uncoupling Protein 2 (UCP2) Is an Indispensable Step in Myogenic Differentiation J. Biol. Chem., February 20, 2009; 284(8): 5362 - 5369. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Miura, Y. Tomaru, M. Nakanishi, S. Kondo, Y. Hayashizaki, and M. Suzuki Identification of DNA regions and a set of transcriptional regulatory factors involved in transcriptional regulation of several human liver-enriched transcription factor genes Nucleic Acids Res., February 1, 2009; 37(3): 778 - 792. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. D. Scharer, C. D. McCabe, M. Ali-Seyed, M. F. Berger, M. L. Bulyk, and C. S. Moreno Genome-Wide Promoter Analysis of the SOX4 Transcriptional Network in Prostate Cancer Cells Cancer Res., January 15, 2009; 69(2): 709 - 717. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Kasperczyk, B. Baumann, K.-M. Debatin, and S. Fulda Characterization of sonic hedgehog as a novel NF-{kappa}B target gene that promotes NF-{kappa}B-mediated apoptosis resistance and tumor growth in vivo FASEB J, January 1, 2009; 23(1): 21 - 33. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Kaur, A. Radovanovic, M. Essack, U. Schaefer, M. Maqungo, T. Kibler, S. Schmeier, A. Christoffels, K. Narasimhan, M. Choolani, et al. Database for exploration of functional context of genes implicated in ovarian cancer Nucleic Acids Res., January 1, 2009; 37(suppl_1): D820 - D823. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Koudritsky and E. Domany Positional distribution of human transcription factor binding sites Nucleic Acids Res., December 1, 2008; 36(21): 6795 - 6805. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Turner, L. P. L. Pelascini, J. A. Macedo, and C. P. Muller Highly individual methylation patterns of alternative glucocorticoid receptor promoters suggest individualized epigenetic regulatory mechanisms Nucleic Acids Res., December 1, 2008; 36(22): 7207 - 7218. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. B. Riggins, J. P-J. Lan, U. Klimach, A. Zwart, L. R. Cavalli, B. R. Haddad, L. Chen, T. Gong, J. Xuan, S. P. Ethier, et al. ERR{gamma} Mediates Tamoxifen Resistance in Novel Models of Invasive Lobular Breast Cancer Cancer Res., November 1, 2008; 68(21): 8908 - 8917. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. M. Dupre, D. W. Burt, R. Talbot, A. Downing, D. Mouzaki, D. Waddington, B. Malpaux, J. R. E. Davis, G. A. Lincoln, and A. S. I. Loudon Identification of Melatonin-Regulated Genes in the Ovine Pituitary Pars Tuberalis, a Target Site for Seasonal Hormone Control Endocrinology, November 1, 2008; 149(11): 5527 - 5539. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Lange, B. Kaynak, U. B. Forster, M. Tonjes, J. J. Fischer, C. Grimm, J. Schlesinger, S. Just, I. Dunkel, T. Krueger, et al. Regulation of muscle development by DPF3, a novel histone acetylation and methylation reader of the BAF chromatin remodeling complex Genes & Dev., September 1, 2008; 22(17): 2370 - 2384. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Li, H.-i. H. Paik, C. Balch, Y. Kim, L. Li, T. H-M. Huang, K. P. Nephew, and S. Kim Enriched transcription factor binding sites in hypermethylated gene promoters in drug resistant cancer cells Bioinformatics, August 15, 2008; 24(16): 1745 - 1748. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Torkamani and N. J. Schork Predicting functional regulatory polymorphisms Bioinformatics, August 15, 2008; 24(16): 1787 - 1792. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. D. Wederell, M. Bilenky, R. Cullum, N. Thiessen, M. Dagpinar, A. Delaney, R. Varhol, Y. Zhao, T. Zeng, B. Bernier, et al. Global analysis of in vivo Foxa2-binding sites in mouse adult liver using massively parallel sequencing Nucleic Acids Res., August 1, 2008; 36(14): 4549 - 4564. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Wingender The TRANSFAC project as an example of framework technology that supports the analysis of genomic regulation Brief Bioinform, July 1, 2008; 9(4): 326 - 332. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Nakanishi, Y. Tomaru, H. Miura, Y. Hayashizaki, and M. Suzuki Identification of transcriptional regulatory cascades in retinoic acid-induced growth arrest of HepG2 cells Nucleic Acids Res., June 1, 2008; 36(10): 3443 - 3454. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Smeenk, S. J. van Heeringen, M. Koeppel, M. A. van Driel, S. J. J. Bartels, R. C. Akkers, S. Denissov, H. G. Stunnenberg, and M. Lohrum Characterization of genome-wide p53-binding sites upon stress response Nucleic Acids Res., June 1, 2008; 36(11): 3639 - 3654. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Della Gatta, M. Bansal, A. Ambesi-Impiombato, D. Antonini, C. Missero, and D. di Bernardo Direct targets of the TRP63 transcription factor revealed by a combination of gene expression profiling and reverse engineering Genome Res., June 1, 2008; 18(6): 939 - 948. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. M. Whittington, A. T. Papenfuss, P. Bansal, A. M. Torres, E. S.W. Wong, J. E. Deakin, T. Graves, A. Alsop, K. Schatzkamer, C. Kremitzki, et al. Defensins and the convergent evolution of platypus and reptile venom genes Genome Res., June 1, 2008; 18(6): 986 - 994. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Nahle, M. Hsieh, T. Pietka, C. T. Coburn, P. A. Grimaldi, M. Q. Zhang, D. Das, and N. A. Abumrad CD36-dependent Regulation of Muscle FoxO1 and PDK4 in the PPAR{delta}/{beta}-mediated Adaptation to Metabolic Stress J. Biol. Chem., May 23, 2008; 283(21): 14317 - 14326. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Liu, Y. Zhao, D. Nie, J. Shi, L. Fu, Y. Li, D. Yu, and J. Lu Association of a Functional Cytochrome P450 4F2 Haplotype with Urinary 20-HETE and Hypertension J. Am. Soc. Nephrol., April 1, 2008; 19(4): 714 - 721. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. R. Sedaghat, J. German, T. M. Teslovich, J. Cofrancesco Jr., C. C. Jie, C. C. Talbot Jr., and R. F. Siliciano Chronic CD4+ T-Cell Activation and Depletion in Human Immunodeficiency Virus Type 1 Infection: Type I Interferon-Mediated Disruption of T-Cell Dynamics J. Virol., February 15, 2008; 82(4): 1870 - 1883. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Levy, D. Tatomer, C. B. Herber, X. Zhao, H. Tang, T. Sargeant, L. J. Ball, J. Summers, T. P. Speed, and D. C. Leitman Differential Regulation of Native Estrogen Receptor-Regulatory Elements by Estradiol, Tamoxifen, and Raloxifene Mol. Endocrinol., February 1, 2008; 22(2): 287 - 303. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Xie, A. Mosig, X. Qi, Y. Li, P. F. Stadler, and J. J.-L. Chen Structure and Function of the Smallest Vertebrate Telomerase RNA from Teleost Fish J. Biol. Chem., January 25, 2008; 283(4): 2049 - 2059. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hertel, I. L. Hofacker, and P. F. Stadler SnoReport: computational identification of snoRNAs with unknown targets Bioinformatics, January 15, 2008; 24(2): 158 - 164. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Li, Y. Li, Y. Li, H. Liu, Z. Sun, J. Lu, and Y. Zhao Glucocorticoid repression of human with-no-lysine (K) kinase-4 gene expression is mediated by the negative response elements in the promoter J. Mol. Endocrinol., January 1, 2008; 40(1): 3 - 12. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Xu, Y. Zhao, and R. Simon Gene Set Expression Comparison kit for BRB-ArrayTools Bioinformatics, January 1, 2008; 24(1): 137 - 139. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. G Lemay, A. M Zivkovic, and J B. German Building the bridges to bioinformatics in nutrition research Am. J. Clinical Nutrition, November 1, 2007; 86(5): 1261 - 1269. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Jiang, M. Q. Zhang, and X. Zhang OSCAR: One-class SVM for accurate recognition of cis-elements Bioinformatics, November 1, 2007; 23(21): 2823 - 2828. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Lapointe, C. Li, C. P. Giacomini, K. Salari, S. Huang, P. Wang, M. Ferrari, T. Hernandez-Boussard, J. D. Brooks, and J. R. Pollack Genomic Profiling Reveals Alternative Genetic Pathways of Prostate Tumorigenesis Cancer Res., September 15, 2007; 67(18): 8504 - 8510. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. Tullai, M. E. Schaffer, S. Mullenbrock, G. Sholder, S. Kasif, and G. M. Cooper Immediate-Early and Delayed Primary Response Genes Are Distinct in Function and Genomic Architecture J. Biol. Chem., August 17, 2007; 282(33): 23981 - 23995. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. S. Baker, L. A. Eory, H. Yakhnin, J. Mercante, T. Romeo, and P. Babitzke CsrA Inhibits Translation Initiation of Escherichia coli hfq by Binding to a Single Site Overlapping the Shine-Dalgarno Sequence J. Bacteriol., August 1, 2007; 189(15): 5472 - 5481. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Hertzberg, S. Izraeli, and E. Domany STOP: searching for transcription factor motifs using gene expression Bioinformatics, July 15, 2007; 23(14): 1737 - 1743. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Reimand, M. Kull, H. Peterson, J. Hansen, and J. Vilo g:Profiler--a web-based toolset for functional profiling of gene lists from large-scale experiments Nucleic Acids Res., July 13, 2007; 35(suppl_2): W193 - W200. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Brudno, A. Poliakov, S. Minovitsky, I. Ratnere, and I. Dubchak Multiple whole genome alignments and novel biomedical applications at the VISTA portal Nucleic Acids Res., July 13, 2007; 35(suppl_2): W669 - W674. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. A. Kolchanov, T. I. Merkulova, E. V. Ignatieva, E. A. Ananko, D. Yu. Oshchepkov, V. G. Levitsky, G. V. Vasiliev, N. V. Klimova, V. M. Merkulov, and T. C. Hodgman Combined experimental and computational approaches to study the regulatory elements in eukaryotic genes Brief Bioinform, July 12, 2007; (2007) bbm027v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Das, T. A. Clark, A. Schweitzer, M. Yamamoto, H. Marr, J. Arribere, S. Minovitsky, A. Poliakov, I. Dubchak, J. E. Blume, et al. A correlation with exon expression approach to identify cis-regulatory elements for tissue-specific alternative splicing Nucleic Acids Res., July 10, 2007; (2007) gkm485v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. R. Kiela, N. Kuscuoglu, A. J. Midura, M. T. Midura-Kiela, C. B. Larmonier, M. Lipko, and F. K. Ghishan Molecular mechanism of rat NHE3 gene promoter regulation by sodium butyrate Am J Physiol Cell Physiol, July 1, 2007; 293(1): C64 - C74. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. X. Jin, H. O'Geen, S. Iyengar, R. Green, and P. J. Farnham Identification of an OCT4 and SRY regulatory module using integrated computational and experimental genomics approaches Genome Res., June 1, 2007; 17(6): 807 - 817. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. O'Lone, K. Knorr, I. Z. Jaffe, M. E. Schaffer, P. G. V. Martini, R. H. Karas, J. Bienkowska, M. E. Mendelsohn, and U. Hansen Estrogen Receptors {alpha} and {beta} Mediate Distinct Pathways of Vascular Gene Expression, Including Genes Involved in Mitochondrial Electron Transport and Generation of Reactive Oxygen Species Mol. Endocrinol., June 1, 2007; 21(6): 1281 - 1296. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Bao, J. L. Peirce, M. Zhou, H. Li, D. Goldowitz, R. W. Williams, L. Lu, and Y. Cui An integrative genomics strategy for systematic characterization of genetic loci modulating phenotypes Hum. Mol. Genet., June 1, 2007; 16(11): 1381 - 1390. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Li, Y. Liang, and R. L. Bass GAPWM: a genetic algorithm method for optimizing a position weight matrix Bioinformatics, May 15, 2007; 23(10): 1188 - 1194. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Qiu, B. Tian, R. G. Resuello, F. F. Natividad, A. Peppas, Y.-T. Shen, D. E. Vatner, S. F. Vatner, and C. Depre Sex-specific regulation of gene expression in the aging monkey aorta Physiol Genomics, April 24, 2007; 29(2): 169 - 180. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Tomovic and E. J. Oakeley Position dependencies in transcription factor binding sites Bioinformatics, April 15, 2007; 23(8): 933 - 941. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. Tullai, J. Chen, M. E. Schaffer, E. Kamenetsky, S. Kasif, and G. M. Cooper Glycogen Synthase Kinase-3 Represses Cyclic AMP Response Element-binding Protein (CREB)-targeted Immediate Early Genes in Quiescent Cells J. Biol. Chem., March 30, 2007; 282(13): 9482 - 9491. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. J. Jenkins, A. W. Roberts, C. J. Greenhill, M. Najdovska, T. Lundgren-May, L. Robb, D. Grail, and M. Ernst Pathologic consequences of STAT3 hyperactivation by IL-6 and IL-11 during hematopoiesis and lymphopoiesis Blood, March 15, 2007; 109(6): 2380 - 2388. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Finocchiaro, M. S. Carro, S. Francois, P. Parise, V. DiNinni, and H. Muller Localizing hotspots of antisense transcription Nucleic Acids Res., March 12, 2007; 35(5): 1488 - 1500. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Galuschka, M. Schindler, L. Bulow, and R. Hehl AthaMap web tools for the analysis and identification of co-regulated genes Nucleic Acids Res., January 12, 2007; 35(suppl_1): D857 - D862. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Keller, S. Addya, R. Vadigepalli, B. Banini, K. Delgrosso, H. Huang, and S. Surrey Transcriptional regulatory network analysis of developing human erythroid progenitors reveals patterns of coregulation and potential transcriptional regulators Physiol Genomics, December 13, 2006; 28(1): 114 - 128. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. X. Jin, A. Rabinovich, S. L. Squazzo, R. Green, and P. J. Farnham A computational genomics approach to identify cis-regulatory modules from chromatin immunoprecipitation microarray data--A case study using E2F1 Genome Res., December 1, 2006; 16(12): 1585 - 1595. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Bhatti, D. M. Church, J. L. Rutter, J. P. Struewing, and A. J. Sigurdson Candidate Single Nucleotide Polymorphism Selection using Publicly Available Tools: A Guide for Epidemiologists Am. J. Epidemiol., October 15, 2006; 164(8): 794 - 804. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Carayol, J. Chen, F. Yang, T. Jin, L. Jin, D. States, and C.-Y. Wang A Dominant Function of IKK/NF-{kappa}B Signaling in Global Lipopolysaccharide-induced Gene Expression J. Biol. Chem., October 13, 2006; 281(41): 31142 - 31151. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. H. Faraco, L. Appelbaum, W. Marin, S. E. Gaus, P. Mourrain, and E. Mignot Regulation of Hypocretin (Orexin) Expression in Embryonic Zebrafish J. Biol. Chem., October 6, 2006; 281(40): 29753 - 29761. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Fang, S. Fan, X. Zhang, and M. Q. Zhang Predicting methylation status of CpG islands in the human brain Bioinformatics, September 15, 2006; 22(18): 2204 - 2209. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. GuhaThakurta Computational identification of transcriptional regulatory elements in DNA sequence Nucleic Acids Res., July 19, 2006; 34(12): 3585 - 3598. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. G. Lemay and D. H. Hwang Genome-wide identification of peroxisome proliferator response elements using integrated computational genomics J. Lipid Res., July 1, 2006; 47(7): 1583 - 1587. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Al-Shahrour, P. Minguez, J. Tarraga, D. Montaner, E. Alloza, J. M. Vaquerizas, L. Conde, C. Blaschke, J. Vera, and J. Dopazo BABELOMICS: a systems biology perspective in the functional annotation of genome-scale experiments. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W472 - W476. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Grau, I. Ben-Gal, S. Posch, and I. Grosse VOMBAT: prediction of transcription factor binding sites using variable order Bayesian trees. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W529 - W533. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Waleev, D. Shtokalo, T. Konovalova, N. Voss, E. Cheremushkin, P. Stegmaier, O. Kel-Margoulis, E. Wingender, and A. Kel Composite Module Analyst: identification of transcription factor binding site combinations using genetic algorithm. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W541 - W545. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kel, T. Konovalova, T. Waleev, E. Cheremushkin, O. Kel-Margoulis, and E. Wingender Composite Module Analyst: a fitness-based tool for identification of transcription factor binding site combinations Bioinformatics, May 15, 2006; 22(10): 1190 - 1197. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Antonini, B. Rossi, R. Han, A. Minichiello, T. Di Palma, M. Corrado, S. Banfi, M. Zannini, J. L. Brissette, and C. Missero An Autoregulatory Loop Directs the Tissue-Specific Expression of p63 through a Long-Range Evolutionarily Conserved Enhancer Mol. Cell. Biol., April 15, 2006; 26(8): 3308 - 3318. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. A. Ivakine, O. M. Gulban, S. M. Mortin-Toth, E. Wankiewicz, C. Scott, D. Spurrell, A. Canty, and J. S. Danska Molecular genetic analysis of the idd4 locus implicates the IFN response in type 1 diabetes susceptibility in nonobese diabetic mice. J. Immunol., March 1, 2006; 176(5): 2976 - 2990. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Sauer, E. Shelest, and E. Wingender Evaluating phylogenetic footprinting for human-rodent comparisons Bioinformatics, February 15, 2006; 22(4): 430 - 437. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Zhang, Z. Xuan, S. Otto, J. R. Hover, S. R. McCorkle, G. Mandel, and M. Q. Zhang A clustering property of highly-degenerate transcription factor binding sites in the mammalian genome. Nucleic Acids Res., January 1, 2006; 34(8): 2238 - 2246. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. V. Poll, C. E. Pierreux, L. Lokmane, C. Haumaitre, Y. Achouri, P. Jacquemin, G. G. Rousseau, S. Cereghini, and F. P. Lemaigre A vHNF1/TCF2-HNF6 Cascade Regulates the Transcription Factor Network That Controls Generation of Pancreatic Precursor Cells Diabetes, January 1, 2006; 55(1): 61 - 69. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. K. Kummerfeld and S. A. Teichmann DBD: a transcription factor prediction database Nucleic Acids Res., January 1, 2006; 34(suppl_1): D74 - D81. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. A. Sharov, D. B. Dudekula, and M. S. H. Ko CisView: A Browser and Database of cis-regulatory Modules Predicted in the Mouse Genome DNA Res, January 1, 2006; 13(3): 123 - 134. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Matys, O. V. Kel-Margoulis, E. Fricke, I. Liebich, S. Land, A. Barre-Dirrie, I. Reuter, D. Chekmenev, M. Krull, K. Hornischer, et al. TRANSFAC(R) and its module TRANSCompel(R): transcriptional gene regulation in eukaryotes Nucleic Acids Res., January 1, 2006; 34(suppl_1): D108 - D110. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. R. Bogwitz, H. Chung, L. Magoc, S. Rigby, W. Wong, M. O'Keefe, J. A. McKenzie, P. Batterham, and P. J. Daborn Cyp12a4 confers lufenuron resistance in a natural population of Drosophila melanogaster PNAS, September 6, 2005; 102(36): 12807 - 12812. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Pudimat, E.-G. Schukat-Talamazzini, and R. Backofen A multiple-feature framework for modelling and predicting transcription factor binding sites Bioinformatics, July 15, 2005; 21(14): 3082 - 3088. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Di Cara, K. Schmidt, B. A. Hemmings, and E. J. Oakeley PromoterPlot: a graphical display of promoter similarities by pattern recognition Nucleic Acids Res., July 1, 2005; 33(suppl_2): W423 - W426. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Al-Shahrour, P. Minguez, J. M. Vaquerizas, L. Conde, and J. Dopazo BABELOMICS: a suite of web tools for functional annotation and analysis of groups of genes in high-throughput experiments Nucleic Acids Res., July 1, 2005; 33(suppl_2): W460 - W464. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Conde, J. M. Vaquerizas, C. Ferrer-Costa, X. de la Cruz, M. Orozco, and J. Dopazo PupasView: a visual tool for selecting suitable SNPs, with putative pathological effect in genes, for genotyping purposes Nucleic Acids Res., July 1, 2005; 33(suppl_2): W501 - W505. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Vaquerizas, L. Conde, P. Yankilevich, A. Cabezon, P. Minguez, R. Diaz-Uriarte, F. Al-Shahrour, J. Herrero, and J. Dopazo GEPAS, an experiment-oriented pipeline for the analysis of microarray gene expression data Nucleic Acids Res., July 1, 2005; 33(suppl_2): W616 - W620. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||


































