Article |
MEME: discovering and analyzing DNA and protein sequence motifs
Institute of Molecular Bioscience, The University of Queensland St Lucia, QLD 4072, Australia 1 SDSC, UCSD, La Jolla CA, USA
*To whom correspondence should be addressed. Tel: +61 7 3346 2614; Fax: +61 7 3346 2101; Email: t.bailey{at}imb.uq.edu.au
Received February 14, 2006. Revised March 21, 2006. Accepted March 21, 2006.
| ABSTRACT |
|---|
|
|
|---|
MEME (Multiple EM for Motif Elicitation) is one of the most widely used tools for searching for novel signals in sets of biological sequences. Applications include the discovery of new transcription factor binding sites and protein domains. MEME works by searching for repeated, ungapped sequence patterns that occur in the DNA or protein sequences provided by the user. Users can perform MEME searches via the web server hosted by the National Biomedical Computation Resource (http://meme.nbcr.net) and several mirror sites. Through the same web server, users can also access the Motif Alignment and Search Tool to search sequence databases for matches to motifs encoded in several popular formats. By clicking on buttons in the MEME output, users can compare the motifs discovered in their input sequences with databases of known motifs, search sequence databases for matches to the motifs and display the motifs in various formats. This article describes the freely accessible web server and its architecture, and discusses ways to use MEME effectively to find new sequence patterns in biological sequences and analyze their significance.
| INTRODUCTION |
|---|
|
|
|---|
The purpose of MEME (Multiple EM For Motif Elicitation) (rhymes with team) (1,2) is to allow users to discover signals (called motifs) in DNA or protein sequences. The user of MEME inputs a set of sequences believed to share some (unknown) sequence signal(s). For example, some or all of a set of promoters from co-expressed and/or orthologous genes may contain binding sites (the signal) for the same transcription factor (3). Similarly, a set of proteins that interact with a single host protein may do so via similar domains (the signal) (4). Both types of sequence signals can often be represented as motifs-ungapped, approximate sequence patterns. Using a process akin to gapless, local, multiple sequence alignment, MEME searches for statistically significant motifs in the input sequence set. In this way, MEME can discover the binding sites for the shared transcription factor in the set of promoters or the common proteinprotein binding domains in the set of proteins. MEME can also be used to discover motifs describing many other types of DNA or protein signals besides transcription factor binding sites and proteinprotein interaction domains.
To use MEME via the website, the user provides a set of sequences in the FASTA format by either uploading a file or by cut-and-paste. The only other required input is an email address where the results will be sent. (A planned future version will remove this requirement by providing temporary storage of the results on the web server for a preset period of time.) By default, MEME looks for up to three motifs, each of which may be present in some or all of the input sequences. MEME chooses the width and number of occurrences of each motif automatically in order to minimize the E-value of the motifthe probability of finding an equally well-conserved pattern in random sequences. By default, only motif widths between 6 and 50 are considered, but the user may change this as well as several other aspects of the search for motifs.
The MEME output is HTML and shows the motifs as local multiple alignments of (subsets of) the input sequences, as well as in several other formats (Figure 1). Block diagrams show the relative positions of the motifs in each of the input sequences. Buttons on the MEME HTML output allow one or all of the motifs to be forwarded for analysis by other web-based programs. Clicking on a button allows all of the motifs to be sent to the MAST web server where various sequence databases (or uploaded sequences) can be searched for sequences matching the motifs. This is useful in cases, for example, where the user would like to find whether the motif of interest is also present in other genes or genomes.
|
MAST is a web-based tool that can be used to search for sequences that match one or more motifs. It can be used to look for sequences that contain motifs found by MEME, by other motif discovery tools or that are taken from a motif database. The MAST website, reached via the same URL as the MEME website, provides numerous nucleotide and protein databases for searching. MAST queries may contain any number of motifs, and it scores each sequence in the selected database using all of the motifs. In the first example above, MAST can search DNA sequences for matches to the putative transcription factor binding site (TFBS) motifs found by MEME in a set of promoter sequences. MAST can search for matches in protein sequences to the putative proteinprotein interaction motifs found in the second MEME example.
Users of MEME via the website or locally installed versions are asked to cite this article as well as the primary reference for MEME (5). Users of MAST are asked to cite this article and Ref. (6).
| MOTIF DISCOVERY STRATEGIES |
|---|
|
|
|---|
Motif discovery can be viewed as a needle in a haystack problem. The motif discovery algorithm is looking for a set of similar short sequences (the needle) in a set of much longer sequences (the haystack). The problem is easier when the motif instances are long and very similar to each other. It gets much harder when the motif instances are short and/or degenerate, or the input sequences are very long.
Discovering TFBS motifs in a set of DNA sequences (e.g. genomic regions upstream of genes) is a difficult task owing to the tendency of binding sites to be short and degenerate, and owing to the fact that promoter regions are often difficult to identify precisely. The problem tends to be worse in eukaryotes than in prokaryotes and yeast because eukaryotic TFBS tend to be shorter and more variable (7).
To successfully discover TFBS motifs with MEME, it is necessary to choose and prepare the input sequences carefully. Candidate sequences can be the promoters of genes believed to be co-regulated based on the evidence from expression microarray experiments, or sequences appearing to bind to a transcription factor based on chromatin immunoprecipitation experiments. The sequences should be as short as possible and contain as few noise sequences (sequences not containing any motif) as possible. Ideally, the sequences should be <1000 bp long (8). Including more than 40 motif-containing sequences generally does not improve TFBS motif discovery with MEME and similar algorithms (9). If the sequences contain low-information segments that do not contain motifs of interest, it can be helpful to remove them using the DUST program (R. L. Tatusov and D. J. Lipman, unpublished NCBI/Toolkit), which is available for downloading at http://blast.wustl.edu/pub/dust/. Repetitive DNA elements should also be removed from the sequences input to MEME using the RepeatMasker program (A. Smit, R. Hubley and P. Green, unpublished data), which can be accessed via the Web (http://www.repeatmasker.org/cgi-bin/WEBRepeatMasker).
It should be noted that MEME is not suited to whole-genome TFBS motif discovery. Owing to their shortness and degeneracy, TFBS motifs become statistically invisible in the context of a whole genome. The sensitivity of the search for TFBS motifs can be improved by using a higher-order background sequence model, but this option is only available currently when users download the MEME source code and install it locally. Instructions for the installation are available at the MEME website (http://meme.nbcr.net/meme/website/meme-download.html) by clicking on View MEME man page; see the documentation for the -bfile switch there.
Protein motifs are generally easier to discover owing to the length of the protein alphabet and the chemical similarity among groups of amino acids. This allows shorter motifs to be more statistically significant and makes it easier to distinguish functional motifs from statistical artifacts. To use MEME to discover protein motifs, the same basic guidelines apply as with DNA motifskeep the sequences as short as possible and include as few sequences that are not likely to contain the motif as possible in the input to MEME. Low-complexity regions can be removed from the protein input sequences using the SEG program (10).
| ANALYZING MOTIFS USING THE MEME OUTPUT HYPERLINKS |
|---|
|
|
|---|
The MEME HTML output contains buttons making it easy to analyze the motifs it discovers. By clicking on the button labeled Compare PSPM to known motifs in JASPAR database following each motif, the DNA motif can be compared to each of the motifs in the JASPAR database (11) of known TFBS motifs. Similarly, protein motifs may be compared with protein motifs in the BLOCKS database of protein motifs (12) by clicking on the submit BLOCK button following each motif on the MEME form. This takes the user to the BLOCKS server where clicking on LAMA will compare the motif with those in the BLOCKS database. The BLOCKS server also allows users to display protein motifs in many different ways, including LOGOS (13) or phylogenetic trees, by clicking on the corresponding buttons on the BLOCKS server form. By clicking on one of the file output formats under Logos, the user is able to obtain a LOGOS diagram similar to that shown in Figure 2.
|
To search sequences for matches to the motifs found by MEME, users can click on the MAST button at the top of the MEME output form. This will take the user to the MAST website where they can select the database to search. Since MAST is sequence-oriented, TFBS motifs should only be used to search promoter regions. These are listed in the MAST database pull-down menu as Upstream Sequence Databases. Currently, only a few organisms are supported. However, users can upload their own database of promoter sequences for searching using MAST. Protein motifs can be used to search any of the sequence databases provided by the MAST website since MAST can search either protein or nucleotide databases with protein motifs. The MAST database are updated weekly.
| WEB SERVER AND USER SUPPORT |
|---|
|
|
|---|
As of MEME version 3.5, the configuration and installation of MEME (including the web server) is significantly simplified by using Autoconf (http://www.gnu.org/software/autoconf/autoconf.html) and Automake (http://www.gnu.org/software/automake/automake.html) from the GNU Build System. An installation session for MEME and MAST web server may be as simple as follows:
cd meme_3.5.2
./configure --prefix=$HOME/meme --with-url=http://www.nbcr.net/
meme --enable-web
make
make test
make install
Supported platforms now include Linux, Solaris, MacOS X, Cygwin and Irix.
The MEME web server hosted by NBCR is queried by about 800 different users (based on unique email addresses) each month. Usage has been growing steadily since the service was first introduced in 1996. Figure 3 shows usage growth at the NBCR server since 2000.
|
To meet the growing user demand and take advantage of the emerging grid-computing resources (14), we have made MEME available for the installation on Linux clusters using either the RPM package manager or Rocks. The RPM package manager is a tool for managing software installation on computers running many versions of the Linux operating system. Rocks (http://www.rocksclusters.org) is a highly customized toolkit for computational biologists and engineers to build and maintain Linux clusters. The current NBCR MEME web server cluster is built using the MEME roll for Rocks and requires minimal maintenance effort.
MEME and MAST can be downloaded and installed free of charge by academic users via the website: (http://meme.nbcr.net/meme/website/meme-download.html). Approximately 300 users download the MEME/MAST software each month. The MEME support team offers assistance to the MEME and MAST user community through the forum (http://nbcr.net/forum/viewforum.php?f=5) or the mailing list (meme{at}nbcr.net). Institutes interested in setting up MEME mirror sites are encouraged to contact us for any assistance.
| FUTURE DIRECTIONS |
|---|
|
|
|---|
To increase the sensitivity of MEME searches, we will add an option in the web server to let the user upload a background sequence model to MEME. We hope to add algorithms for removing low-complexity regions (SEG and DUST) and repeated elements (RepeatMasker) in the MEME website as a convenience to users. These services will also be exposed as web services and are integrated using workflow tools developed by using NBCR.
We have also planned to add buttons to the MEME output to allow TFBS motifs to be used in searching for cis-regulatory modules via algorithms such as MCAST (15). MCAST will be configured to be able to search the same DNA databases as MAST. In conjunction with this, we will add databases of upstream sequences for many additional organisms to the MAST/MCAST websites to facilitate the analysis of TFBS motifs discovered by using MEME.
NBCR has developed a set of tools built on top of the open source software that allows bioinformatics applications to be deployed as Web Services easily (S. Krishnan, B. Stearn, K. Bhatia, W. W. Li and P. Arzberger, manuscript submitted) and leverage the Cyberinfrastructure components transparently (14). A prototype has been deployed using MEME as a scientific driver (16) that offers a user with a dynamic pool of distributed compute resource, workflow management console and a friendly user interface. This portal will be deployed to the production web server in the future.
| ACKNOWLEDGEMENTS |
|---|
The authors acknowledge NBCR award from NCRR, NIH P41 RR08605, for support of the MEME and MAST website. TLB acknowledges grant from NIH, R01 RR021692-01, for support of continuing development of the MEME and related sequence analysis tools. T.L.B. also acknowledges the ARC Centre for Bioinformatics (ACB) (ARC CE0348221) for infrastructure support for the MEME mirror site at the ACB. Funding to pay the Open Access publication charges for this article was provided by the NIH.
Conflict of interest statement. None declared.
| REFERENCES |
|---|
|
|
|---|
- Bailey, T.L. and Elkan, C. (1995) Unsupervised Learning of Multiple Motifs In Biopolymers Using EM Mach. Learn, 21, 5180 .
- Bailey, T.L. and Elkan, C. (1995) The value of prior knowledge in discovering motifs with MEME Proceedings of the Third International Conference on Intelligent Systems for Molecular biology, July In Rawlings, C., Clark, D., Altman, R., Hunter, L., Lengauer, T., Wodak, S. (Eds.). Menlo Park, CA AAAI Press pp. 2129 .
- Lyons, T.J., Gasch, A.P., Alex Gaither, L., Botstein, D., Brown, P.O., Eide, D.J. (2000) Genome-wide characterization of the Zap1p zinc-responsive regulon in yeast Proc. Natl Acad. Sci. USA, 97, 79577962
[Abstract/Free Full Text] . - Fang, J., Haasl, R.J., Dong, Y., Lushington, G.H. (2005) Discover protein sequence signatures from protein-protein interaction data BMC Bioinformatics, 6, 18
[Free Full Text] . - Bailey, T.L. and Elkan, C. (1994) Fitting a mixture model by expectation maximization to discover motifs in biopolymers Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, August In Altman, R.B., Brutlag, D.L., Karp, P.D., Lathrop, R.H., Searls, D.B. (Eds.). Menlo Park, CA AAAI Press pp. 2836 .
- Bailey, T.L. and Gribskov, M. (1998) 'Combining evidence using P-values: application to sequence homology searches Bioinformatics, 14, 4854
[Abstract/Free Full Text] . - Tompa, M., Li, N., Bailey, T.L., Church, G.M., De Moor, B., Eskin, E., Favorov, A.V., Frith, M.C., Fu, Y., Kent, W.J., et al. (2005) Assessing Computational Tools for the Discovery of Transcription Factor Binding Sites Nat. Biotechnol, . 23, 137147[CrossRef][Web of Science][Medline] .
- Pevzner, P.A. and Sze, S.H. (2000) Combinatorial approaches to finding subtle signals in DNA sequences Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology, August. In Bourne, P.E., Gribskov, M., Altman, R.B., Jensen, N., Hope, D., Lengauer, T., Mitchell, J.C., Scheeff, E.D., Smith, C., Strande, S., Weissig, H. (Eds.). Menlo Park, CA AAAI Press pp. 269278 .
- Hu, J., Li, B., Kihara, D. (2005) Limitations and potentials of current motif discovery algorithms Nucleic Acids Res, . 33, 48994913
[Abstract/Free Full Text] . - Wootton, J.C. and Federhen, S. (1966) Analysis of compositionally biased regions in sequence databases Methods Enzymol, 266, 554571 .
- Sandelin, A., Alkema, W., Engström, P., Wasserman, W.W., Lenhard, B. (2004) JASPAR: an open-access database for eukaryotic transcription factor binding profiles Nucleic Acids Res, 32, D91D94
[Abstract/Free Full Text] . - Henikoff, J.G., Pietrokovski, S., Henikoff, S. (1997) Recent enhancements to the blocks database servers Nucleic Acids Res, . 25, 222225
[Abstract/Free Full Text] . - Schneider, T.D. and Stephens, R.M. (1990) Sequence logos: a new way to display consensus sequences Nucleic Acids Res, . 18, 60976100
[Abstract/Free Full Text] . - Foster, I. and Kesselman, C. The Grid 2: Blueprint for a New Computing Infrastructure, (2004) 2nd edn San Francisco, CA Morgan Kaufmann Publishers, Inc .
- Bailey, T.L. and Noble, W.S. (2003) Searching for statistically significant regulatory modules Bioinformatics, 19, Suppl 2, II16II25[Medline] .
- Li, W.W., Krishnan, S., Mueller, K., Misleh, C., Arzberger, P. (2006) Building cyberinfrastructure for bioinformatics using service oriented architecture Proceedings of the IEEE International Symposium on Cluster Computing and the Grid, May In Bu Sung, F.L., Abramson, D., Cai, W., Graupner, S., Jin, H., Sloot, P. (Eds.). USA IEEE Press (in press)
.
This article has been cited by other articles:
![]() |
M. Mihara, T. Itoh, and T. Izawa SALAD database: a motif-based database of protein annotations for plant comparative genomics Nucleic Acids Res., October 23, 2009; (2009) gkp831v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Bolognani, T. Contente-Cuomo, and N. I. Perrone-Bizzozero Novel recognition motifs and biological functions of the RNA-binding protein HuD revealed by genome-wide identification of its targets Nucleic Acids Res., October 21, 2009; (2009) gkp863v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Zhang and G. J. Olsen Messenger RNA processing in Methanocaldococcus (Methanococcus) jannaschii RNA, October 1, 2009; 15(10): 1909 - 1916. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. A. Sharov and M. S.H. Ko Exhaustive Search for Over-represented DNA Sequence Motifs with CisFinder DNA Res, October 1, 2009; 16(5): 261 - 273. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. J. Stollar, B. Garcia, P. A. Chong, A. Rath, H. Lin, J. D. Forman-Kay, and A. R. Davidson Structural, Functional, and Bioinformatic Studies Demonstrate the Crucial Role of an Extended Peptide Binding Site for the SH3 Domain of Yeast Abp1p J. Biol. Chem., September 25, 2009; 284(39): 26918 - 26927. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. F. T. van Hijum, M. H. Medema, and O. P. Kuipers Mechanisms and Evolution of Control Logic in Prokaryotic Transcriptional Regulation Microbiol. Mol. Biol. Rev., September 1, 2009; 73(3): 481 - 509. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. F. Stadler, J. J.-L. Chen, J. Hackermuller, S. Hoffmann, F. Horn, P. Khaitovich, A. K. Kretzschmar, A. Mosig, S. J. Prohaska, X. Qi, et al. Evolution of Vault RNAs Mol. Biol. Evol., September 1, 2009; 26(9): 1975 - 1991. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Narsai, K. A. Howell, A. Carroll, A. Ivanova, A. H. Millar, and J. Whelan Defining Core Metabolic and Transcriptomic Responses to Oxygen Availability in Rice Embryos and Young Seedlings Plant Physiology, September 1, 2009; 151(1): 306 - 322. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Waterland, R. Kellermayer, M.-T. Rached, N. Tatevian, M. V. Gomes, J. Zhang, L. Zhang, A. Chakravarty, W. Zhu, E. Laritsky, et al. Epigenomic profiling indicates a role for DNA methylation in early postnatal liver development Hum. Mol. Genet., August 15, 2009; 18(16): 3026 - 3038. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Sinha, A. D. S. Cameron, and R. J. Redfield Sxy Induces a CRP-S Regulon in Escherichia coli J. Bacteriol., August 15, 2009; 191(16): 5180 - 5195. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Fan, P. B. Bitterman, and O. Larsson Regulatory element identification in subsets of transcripts: Comparison and integration of current computational methods RNA, August 1, 2009; 15(8): 1469 - 1482. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Bonham, T. Neumann, M. Tirrell, and N. O. Reich Tracking transcription factor complexes on DNA using total internal reflectance fluorescence protein binding microarrays Nucleic Acids Res., July 1, 2009; 37(13): e94 - e94. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. C. Petrossian and S. G. Clarke Multiple Motif Scanning to Identify Methyltransferases from the Yeast Proteome Mol. Cell. Proteomics, July 1, 2009; 8(7): 1516 - 1526. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Byrne, R. Grzela, A. Lartigue, S. Audic, S. Chenivesse, S. Encinas, J.-M. Claverie, and C. Abergel The polyadenylation site of Mimivirus transcripts obeys a stringent 'hairpin rule' Genome Res., July 1, 2009; 19(7): 1233 - 1242. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Elhai, A. Taton, J. Massar, J. K. Myers, M. Travers, J. Casey, M. Slupesky, and J. Shrager BioBIKE: A Web-based, programmable, integrated biological knowledge base Nucleic Acids Res., July 1, 2009; 37(suppl_2): W28 - W32. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Achcar, J.-M. Camadro, and D. Mestivier AutoClass@IJM: a powerful tool for Bayesian classification of heterogeneous data in biology Nucleic Acids Res., July 1, 2009; 37(suppl_2): W63 - W67. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. L. Bailey, M. Boden, F. A. Buske, M. Frith, C. E. Grant, L. Clementi, J. Ren, W. W. Li, and W. S. Noble MEME SUITE: tools for motif discovery and searching Nucleic Acids Res., July 1, 2009; 37(suppl_2): W202 - W208. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Sharma, D. Mohanty, and A. Surolia RegAnalyst: a web interface for the analysis of regulatory motifs, networks and pathways Nucleic Acids Res., July 1, 2009; 37(suppl_2): W193 - W201. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Mitsuda and M. Ohme-Takagi Functional Analysis of Transcription Factors in Arabidopsis Plant Cell Physiol., July 1, 2009; 50(7): 1232 - 1248. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Li, G. Liu, J. Yu, W. Cao, V. G. Lobo, and J. Xie In Vivo Selection of Kinase-responsive RNA Elements Controlling Alternative Splicing J. Biol. Chem., June 12, 2009; 284(24): 16191 - 16201. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Zhang, E. Li, and G. J. Olsen Protein-coding gene promoters in Methanocaldococcus (Methanococcus) jannaschii Nucleic Acids Res., June 1, 2009; 37(11): 3588 - 3601. [Abstract] [Full Text] [PDF] |
||||
![]() |
F.-O. Desmet, D. Hamroun, M. Lalande, G. Collod-Beroud, M. Claustres, and C. Beroud Human Splicing Finder: an online bioinformatics tool to predict splicing signals Nucleic Acids Res., May 1, 2009; 37(9): e67 - e67. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Dassa, N. London, B. L. Stoddard, O. Schueler-Furman, and S. Pietrokovski Fractured genes: a novel genomic arrangement involving new split inteins and a new homing endonuclease family Nucleic Acids Res., May 1, 2009; 37(8): 2560 - 2573. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Yoshida, H. Saitoh, S. Fujisawa, H. Kanzaki, H. Matsumura, K. Yoshida, Y. Tosa, I. Chuma, Y. Takano, J. Win, et al. Association Genetics Reveals Three Novel Avirulence Genes from the Rice Blast Fungal Pathogen Magnaporthe oryzae PLANT CELL, May 1, 2009; 21(5): 1573 - 1591. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Gunisova, E. Elboher, J. Nosek, V. Gorkovoy, Y. Brown, J.-F. Lucier, N. Laterreur, R. J. Wellinger, Y. Tzfati, and L. Tomaska Identification and comparative analysis of telomerase RNAs from Candida species reveal conservation of functional elements RNA, April 1, 2009; 15(4): 546 - 559. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. R. Dresang, D. T. Vereide, and B. Sugden Identifying Sites Bound by Epstein-Barr Virus Nuclear Antigen 1 (EBNA1) in the Human Genome: Defining a Position-Weighted Matrix To Predict Sites Bound by EBNA1 in Viral Genomes J. Virol., April 1, 2009; 83(7): 2930 - 2940. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hertel, D. de Jong, M. Marz, D. Rose, H. Tafer, A. Tanzer, B. Schierwater, and P. F. Stadler Non-coding RNA annotation of the genome of Trichoplax adhaerens Nucleic Acids Res., April 1, 2009; 37(5): 1602 - 1615. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Vallania, D. Schiavone, S. Dewilde, E. Pupo, S. Garbay, R. Calogero, M. Pontoglio, P. Provero, and V. Poli Genome-wide discovery of functional transcription factor binding sites by comparative genomics: The case of Stat3 PNAS, March 31, 2009; 106(13): 5117 - 5122. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Xia, M. E. Lemieux, W. Li, J. S. Carroll, M. Brown, X. S. Liu, and A. L. Kung Integrative analysis of HIF binding and transactivation reveals its role in maintaining histone methylation homeostasis PNAS, March 17, 2009; 106(11): 4260 - 4265. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Sleumer, M. Bilenky, A. He, G. Robertson, N. Thiessen, and S. J. M. Jones Caenorhabditis elegans cisRED: a catalogue of conserved genomic elements Nucleic Acids Res., March 1, 2009; 37(4): 1323 - 1334. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. R. Sanford, X. Wang, M. Mort, N. VanDuyn, D. N. Cooper, S. D. Mooney, H. J. Edenberg, and Y. Liu Splicing factor SFRS1 recognizes a functionally diverse landscape of RNA transcripts Genome Res., March 1, 2009; 19(3): 381 - 394. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. J. Loor and W. S. Cohick ASAS Centennial Paper: Lactation biology for the twenty-first century J Anim Sci, February 1, 2009; 87(2): 813 - 824. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. A. Howell, R. Narsai, A. Carroll, A. Ivanova, M. Lohse, B. Usadel, A. H. Millar, and J. Whelan Mapping Metabolic and Transcript Temporal Switches during Germination in Rice Highlights Specific Transcription Factors and the Role of RNA Instability in the Germination Process Plant Physiology, February 1, 2009; 149(2): 961 - 980. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Balsera, T. A. Goetze, E. Kovacs-Bogdan, P. Schurmann, R. Wagner, B. B. Buchanan, J. Soll, and B. Bolter Characterization of Tic110, a Channel-forming Protein at the Inner Envelope Membrane of Chloroplasts, Unveils a Response to Ca2+ and a Stromal Regulatory Disulfide Bridge J. Biol. Chem., January 30, 2009; 284(5): 2603 - 2616. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. M. Viiri, J. Janis, T. Siggers, T. Y. K. Heinonen, J. Valjakka, M. L. Bulyk, M. Maki, and O. Lohi DNA-Binding and -Bending Activities of SAP30L and SAP30 Are Mediated by a Zinc-Dependent Module and Monophosphoinositides Mol. Cell. Biol., January 15, 2009; 29(2): 342 - 356. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Whitington, A. C. Perkins, and T. L. Bailey High-throughput chromatin information enables accurate tissue-specific prediction of transcription factor binding sites Nucleic Acids Res., January 1, 2009; 37(1): 14 - 25. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Baumbach, A. Tauch, and S. Rahmann Towards the integrated analysis, visualization and reconstruction of microbial gene regulatory networks Brief Bioinform, January 1, 2009; 10(1): 75 - 83. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Cuddapah, R. Jothi, D. E. Schones, T.-Y. Roh, K. Cui, and K. Zhao Global analysis of the insulator binding protein CTCF in chromatin barrier regions reveals demarcation of active and repressive domains Genome Res., January 1, 2009; 19(1): 24 - 32. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Klein, R. Munch, I. Biegler, I. Haddad, I. Retter, and D. Jahn Strepto-DB, a database for comparative genomics of group A (GAS) and B (GBS) streptococci, implemented with the novel database platform 'Open Genome Resource' (OGeR) Nucleic Acids Res., January 1, 2009; 37(suppl_1): D494 - D498. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Mao, P. Dam, J. Chou, V. Olman, and Y. Xu DOOR: a database for prokaryotic operons Nucleic Acids Res., January 1, 2009; 37(suppl_1): D459 - D463. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Praz and P. Bucher CleanEx: new data extraction and merging tools based on MeSH term annotation Nucleic Acids Res., January 1, 2009; 37(suppl_1): D880 - D884. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Wozniak, S. Keles, J. J. Lugus, K. H. Young, M. E. Boyer, T. M. Tran, K. Choi, and E. H. Bresnick Molecular Hallmarks of Endogenous Chromatin Complexes Containing Master Regulators of Hematopoiesis Mol. Cell. Biol., November 1, 2008; 28(21): 6681 - 6694. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Zhang, M. Chen, X. Chen, Z. Xu, S. Guan, L.-C. Li, A. Li, J. Guo, L. Mao, and Y. Ma Phylogeny, gene structures, and expression patterns of the ERF gene family in soybean (Glycine max L.) J. Exp. Bot., November 1, 2008; 59(15): 4095 - 4107. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Lemire, N. Figueroa-Bossi, and L. Bossi A Singular Case of Prophage Complementation in Mutational Activation of recET Orthologs in Salmonella enterica Serovar Typhimurium J. Bacteriol., October 15, 2008; 190(20): 6857 - 6866. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Chan and M. B. Roth Anoxia-Induced Suspended Animation in Budding Yeast as an Experimental Paradigm for Studying Oxygen-Regulated Gene Expression Eukaryot. Cell, October 1, 2008; 7(10): 1795 - 1808. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Mihara, T. Itoh, and T. Izawa In Silico Identification of Short Nucleotide Sequences Associated with Gene Expression of Pollen Development in Rice Plant Cell Physiol., October 1, 2008; 49(10): 1451 - 1464. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Jothi, S. Cuddapah, A. Barski, K. Cui, and K. Zhao Genome-wide identification of in vivo protein-DNA binding sites from ChIP-Seq data Nucleic Acids Res., September 1, 2008; 36(16): 5221 - 5231. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. M. Petrykowska, C. M. Vockley, and L. Elnitski Detection and characterization of silencers and enhancer-blockers in the greater CFTR locus Genome Res., August 1, 2008; 18(8): 1238 - 1246. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Melamed, L. Pnueli, and Y. Arava Yeast translational response to high salinity: Global analysis reveals regulation at multiple levels RNA, July 1, 2008; 14(7): 1337 - 1351. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Song, X. Liu, X. S. Liu, and X. He A high-resolution map of nucleosome positioning on a fission yeast centromere Genome Res., July 1, 2008; 18(7): 1064 - 1072. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Thomas-Chollier, O. Sand, J.-V. Turatsinze, R. Janky, M. Defrance, E. Vervisch, S. Brohee, and J. van Helden RSAT: regulatory sequence analysis tools Nucleic Acids Res., July 1, 2008; 36(suppl_2): W119 - W127. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. E. Martinez-Guerrero, R. Ciria, C. Abreu-Goodger, G. Moreno-Hagelsieb, and E. Merino GeConT 2: gene context analysis for orthologous proteins, conserved domains and metabolic pathways Nucleic Acids Res., July 1, 2008; 36(suppl_2): W176 - W180. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. R. Morris, N. Mukherjee, and J. D. Keene Ribonomic Analysis of Human Pum1 Reveals cis-trans Conservation across Species despite Evolution of Diverse mRNA Target Sets Mol. Cell. Biol., June 15, 2008; 28(12): 4093 - 4103. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Lee, X. Li, A. Hechmer, M. Eisen, M. D. Biggin, B. J. Venters, C. Jiang, J. Li, B. F. Pugh, and D. S. Gilmour NELF and GAGA Factor Are Linked to Promoter-Proximal Pausing at Many Genes in Drosophila Mol. Cell. Biol., May 15, 2008; 28(10): 3290 - 3300. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Kerk, G. Templeton, and G. B.G. Moorhead Evolutionary Radiation Pattern of Novel Protein Phosphatases Revealed by Analysis of Protein Data from the Completely Sequenced Genomes of Humans, Green Algae, and Higher Plants Plant Physiology, February 1, 2008; 146(2): 351 - 367. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Davila Lopez and T. Samuelsson Early evolution of histone mRNA 3' end processing RNA, January 1, 2008; 14(1): 1 - 10. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. G. Kann, S. L. Sheetlin, Y. Park, S. H. Bryant, and J. L. Spouge The identification of complete domains within protein sequences using accurate E-values for semi-global alignment Nucleic Acids Res., July 9, 2007; 35(14): 4678 - 4685. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Cooper, N. D. Trinklein, L. Nguyen, and R. M. Myers Serum response factor binding sites differ in three human cell types Genome Res., February 1, 2007; 17(2): 136 - 144. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||






















