Nucleic Acids Research, 2004, Vol. 32, Database issue D41-D44
© 2004 Oxford University Press
MIPS: analysis and annotation of proteins from whole genomes
1 Institute for Bioinformatics (MIPS), GSF National Research Center for Environment and Health, Ingolstaedter Landstrasse 1, D-85764 Neuherberg, Germany and 2 Technische Universität München, Chair of Genome Oriented Bioinformatics, Center of Life and Food Science, D-85350 Freising-Weihenstephan, Germany
*To whom correspondence should be addressed at Institute for Bioinformatics (MIPS), GSF National Research Center for Environment and Health, Ingolstädter Landstraße 1, D-85764 Neuherberg, Germany. Tel: +49 89 3187 3580; Fax: +49 89 3187 3585; Email: w.mewes{at}gsf.de
Received September 15, 2003; Revised and Accepted October 7, 2003
| ABSTRACT |
|---|
|
|
|---|
The Munich Information Center for Protein Sequences (MIPS-GSF), Neuherberg, Germany, provides protein sequence-related information based on whole-genome analysis. The main focus of the work is directed toward the systematic organization of sequence-related attributes as gathered by a variety of algorithms, primary information from experimental data together with information compiled from the scientific literature. MIPS maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the database of complete cDNAs (German Human Genome Project, NGFN), the database of mammalian proteinprotein interactions (MPPI), the database of FASTA homologies (SIMAP), and the interface for the fast retrieval of protein-associated information (QUIPOS). The Arabidopsis thaliana database, the rice database, the plant EST databases (MATDB, MOsDB, SPUTNIK), as well as the databases for the comprehensive set of genomes (PEDANT genomes) are described elsewhere in the 2003 and 2004 NAR database issues, respectively. All databases described, and the detailed descriptions of our projects can be accessed through the MIPS web server (http://mips.gsf.de).
| INTRODUCTION |
|---|
|
|
|---|
MIPS develops and maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences, and provides tools for the comprehensive analysis of protein squences. An overview of the general organization and annotation of genome-related information at MIPS is shown in Figure 1.
|
| FUNGAL MODEL ORGANISMS: THE COMPREHENSIVE YEAST GENOME DATABASE (CYGD) AND THE MIPS Neurospora crassa DATABASE (MNCDB) |
|---|
|
|
|---|
The MIPS yeast genome database is developed and maintained by a group of European databases and yeast laboratories forming a decentralized network of expertise in order to provide detailed information on protein-coding sequences and other genetic elements. Representing the best investigated eukaryote, the database is organized by complementary classifiers with the aim of allowing for the interpretation of functional relations between genes and their corresponding proteins. For instance, the Functional Catalogue (FunCat), providing a systematic classification of protein function is intensively used to project functional data such as expression profiles onto known or probable functional units. Manual FunCat classifications are available not only for yeast, but also for other MIPS-curated genomes such as Arabidopsis thaliana, N.crassa and the human genome (1). In addition, the set of proteins represented in the PEDANT genomes database was assigned to FunCat classes; the complete list of the assignments is accessible (see Table 1). Within each functional class, the sequences have been clustered into disjoint homology-based subsets.
|
The Catalogue of ProteinProtein Interactions, the Protein Complex Catalogue and the Protein Localization Catalogues allow information related to the interaction of proteins in yeast to be obtained. More than 10 600 proteinprotein interaction records (
9100 physical,
1500 genetic) were compiled from published large-scale experiments and the literature. The annotated protein complexes (>1000) can be split into
87 000 putative binary interactions. The vast majority of the records are documented by PubMed reference IDs and by information on the nature of the experimental evidence, which correlates with the confidence of the assignment used in probabilistic computations. Detailed information on transport proteins [Yeast Transport Protein DB (2)], transcription factors and their binding sites [TRANSFAC (3)] and metabolic pathways are either part of the core yeast database or can be retrieved using the BioRS data integration system. To be able to represent complex data of fungal genomes, we use the Genome Research Environment (GenRE) as our annotation data structure. GenRE allows combination of information on different classes of genetic elements and their relations, such as proteinprotein interactions or common regulatory features; it provides annotation as well as flexible data retrieval interfaces.
Related proteins from other species can be retrieved using the precomputed SIMAP database (see below) but also using the integrated SESAM tool [Seed Extraction Sequence Analysis Method (4)]. SESAM was developed to achieve better selectivity and sensitivity for the characterization of proteins on a large scale without being dependent on secondary data collections, such as InterPro (5). The selectivity and sensitivity particularly addresses the challenging twilight zone of <30% overall pairwise sequence identity. SESAM does not require the manual adjustment of parameters and copes well with different cases of highly conserved as well as distantly related homologues. A subsequent clustering step starts from SESAM seed-based alignments and leads to SESAM feature clusters.
In CYGD, manually annotated genomes are interlinked via BioRS with the PEDANT analysis of recently published full genomes as well as the 13 hemiascomycetous yeasts, generated by the Génolevures I project (6). An up-to-date compilation of the Saccharomyces cerevisiae introns and the analysis of introns in seven related species can be accessed through the Hemiascomycetous Yeast Spliceosomal Introns view (7). Comparative analysis of the S.cerevisiae chromosomes is enabled by a graphical display of the fungal orthologues. The integrated complete genomes include: Schizosaccharomyces pombe, Candida albicans, Saccharomyces bayanus, Saccharomyces castellii, Saccharomyces kluyveri, Saccharomyces kudriavzevii, Saccharomyces mikatae, Saccharomyces paradoxus [Whitehead Genome Center (http://www-genome.wi.mit.edu/) and George Washington University, St Louis (http://www.genetics.wustl.edu/)], Candida glabrata, Debaryomyces hansenii, Kluyveromyces lactis, Yarrowia lipolytica (Génolevures II, http://cbi.labri.u-bordeaux.fr/Genolevures), as well as the genomes annotated at MIPS: N.crassa (MNCDB), Magnaporthe grisea, Aspergillus nidulans, Fusarium graminearum (FGDB) and Ustilago maydis. Further genomes will be added to enable a comprehensive comparative fungal data resource.
The recently annotated genome of the filamentous fungus N.crassa is based on data from the German Neurospora Sequencing Project (Chromosomes II and V) (8) and the whole genome sequence, assembled by the Whitehead Genome Center, Cambridge, MA in 2002 (9). In a collaborative effort with the Whitehead group, the MIPS group has annotated the complete genome including manually supervised gene modeling and functional classification of the proteins encoded. The genome of
40 Mb encodes
10 000 proteins automatically predicted by the program FGENESH (http://softberry. com), specifically trained for Neurospora. The manual inspection of the gene models included intrinsic and extrinsic information such as comparison with known proteins and ESTs as well as splicing consensus signals. Protein sequences were subsequently submitted to the comprehensive analysis of the functional and structural attributes. All information is available at the Neurospora project page (Table 1).
| THE MIPS HUMAN cDNA DATABASE |
|---|
|
|
|---|
With the draft sequence of the human genome in hand, attention has focused on the identification of the complete set of its genes and gene function, respectively. In order to complete this task, combinations of ab initio gene predictions, mapping of full length cDNAs with the genomic sequence and comparative genome analysis are widely applied methods.
Since no approach to the prediction of human genes from the genome has returned satisfactory results, sequencing full length cDNAs provides an essential source of information, in particular since it allows elucidation of the structure of alternative splice variants which are thought to be a basis for the complexity of human and other higher eukaryotes. cDNA clones can also represent non-coding RNAs and may contain regulatory elements.
The German cDNA consortium started its work in 1997 as part of the German Human Genome Project (DHGP/NGFN) to release completely sequenced novel cDNAs from various human tissues (10). During this period, several cDNA libraries from so far uncharacterized tissues have been constructed and sequenced. From these libraries 182 543 ESTs (102 396 966 bp) have been generated and analysed from independent clones, and 9380 complete cDNAs (31 187 876 bp) have been identified. At MIPS, the sequence data are automatically analysed and subsequently subjected to several steps of manual annotation and curation, including their functional classification. All sequences are submitted to the public DNA data repositories; the sequences and their annotations are accessible via the MIPS website (Table 1).
Comparative analysis of the human genome data and closely related species, in particular the great apes, additionally improves the quality of predicted genes and allows the discovery of yet unidentified genes. At the beginning of 2003 analysis of the transcriptome of orangutan (Pongo pygmeus) as a model organism was initiated by the German cDNA consortium. So far, 27 813 ESTs (14 439 916 bp) and 578 completely sequenced cDNAs (1 548 893 bp) derived and sequenced from different tissues of P.pygmeus have been stored in the MIPS database. Comparative sequence analysis from at least three primate species (human, chimpanzee and orangutan) can provide insights into human evolution and will help to find the genetic changes in the human lineage that count for unusual traits such as bipedalism and large brain (11).
The German cDNA data set has been made available to the H-Invitational initiative organized by JBIRC (Japan Biological Information Research Center) and DDBJ (DNA Data Bank of Japan) to assemble a single transcriptome database to overcome present inconsistencies between various databases such as diverse nomenclature or insufficient annotation and to remove redundancies from the data set.
| MPPI: A DATABASE OF MAMMALIAN PROTEINPROTEIN INTERACTIONS |
|---|
|
|
|---|
Proteinprotein interactions (PPIs) represent a pivotal aspect of protein function. Almost every cellular process relies on transient or permanent physical binding of two or more proteins in order to accomplish its. The importance of proteinprotein interactions is reflected by the recent popularity of experimental techniques such as co-immunoprecipitation, the yeast two-hybrid system, large-scale co-purification and identification of binding partners by mass spectroscopy. Accordingly, comprehensive databases of PPI in S.cerevisiae (see CYGD above) have proved to be invaluable resources for various predictive methods applied to experimental data (12). Although yeast is a well established model organism, not all interactions in higher eukaryotes have equivalent counterparts in unicellular model systems. Although current databases include some information on PPI in mammals the vast majority of data comes from microorganisms.
In order to fill this critical gap we have started a database of high-quality protein interaction data from mammals. Expert curators are building the database by harvesting experimental evidence about PPI from the publicly available literature. In contrast to high-throughput data the results of carefully performed individual experiments are considered to be more reliable, especially if multiple independent evidence is presented. Currently, our database contains
1600 entries of experimental evidence for PPI, which have been integrated in the mouse database of the PEDANT genome information system. A comprehensive user interface for the database is under development. We are in the process of complementing the manually curated data with data from external sources such as mammalian high-throughput experiments. Despite being at an early stage, this database currently represents, according to our knowledge, the single largest publicly available collection of high-quality PPI data in mammals.
| SIMAP: A DATABASE OF HOMOLOGY SCORES BASED ON PRECALCULATED EXHAUSTIVE SIMILARITY SEARCHES |
|---|
|
|
|---|
Pairwise similarity comparison remains the most powerful tool in genome analysis. Individual searches for homologues do not allow structuring of the sequence universe. Also, since most of the searches are performed with known query sequences, similarity scores stored in an up-to-date all-against-all matrix constitute a very valuable data collection for the systematic analysis of genomes. This set can be used easily to explore interesting genome features such as neighbourhood relations, taxonomic distribution of protein families, etc. Postprocessing steps are easily applied to extract conserved domain patterns or to identify inconsistencies in genome annotation.
SIMAP (SImilarity MAtrix of Protein Sequences) provides a precalculated all-against-all comparison of the protein sets of over 200 fully sequenced genomes. The similarity searches were carried out using the FASTA (13) package. Several tools are available to analyse large sets of sequences, including the generation of subsets (clusters) through iterative queries. For instance, Markov-Random-Field clustering methods such as MCL (14) can be applied to detect protein families. Subclusters can be subjected to SESAM for the generation of conserved sequence patterns or using standard software to generate multiple alignments [POA2 (15)] or to build Hidden Markov Models (HMMER2.3, University of St Louis). Results can be filtered using various parameters such as taxonomic assignments or sequences carrying certain features such as domains of the SCOP database. SIMAP is being incrementally updated.
| QUIPOS: QUERY INTERFACE TO PROTEIN SEQUENCE DATA |
|---|
|
|
|---|
The assumption that most sequence queries are performed with sequences that are already part of the database or very closely related to known and annotated sequences is well justified. QUIPOS was introduced as an interface of information present in MIPS databases as well as a PEDANT-like tool (16) for on-the-fly protein sequence analyses. To transfer information from well characterized proteins to their close relatives, already known and annotated sequences have to be identified. Most strategies to find related proteins employ similarity searches against a database of annotated sequences; however, in most cases such sensitive but time-consuming searches can be skipped as a close relative is found.
QUIPOS has implemented its own Fast Similarity Sequence Search method (F3S). Closely similar sequences are detected and pairwise aligned to the query sequence by F3S in a few seconds. In a further step, selected information coming from MIPS in-house databases, corresponding to the best hit is mapped to the query. In case no closely related sequence is found, a PEDANT session starts a multi-threaded sequence analysis workflow. In its current version, QUIPOS displays primary information such as statistical evaluation of protein properties, best BLAST hits, multiple alignment to homologous sequences and the presence of sequence domains or domain patterns. Additionally, secondary structures and trans-membrane segments are predicted using standard algorithms, and a 3D structure (17) is assigned to the sequence whenever possible.
| ACKNOWLEDGEMENTS |
|---|
This work was supported by the Federal Ministry of Education, Science, Research and Technology (BMBF: BFAM: 031U112C; NGFN: 01KW9710; HNB: 01SF9985), the Deutsche Forschungsgemeinschaft (MNCDB), and the European Commission (BIO4-CT98-0549, QLRI-1999-01333).
| REFERENCES |
|---|
|
|
|---|
- Schueller,C. and Fritz,A. (2002) An enhanced human-genome database. Genet. Eng., 22, 38.
- Van Belle,D. and Andre,B. (2001) A genomic view of yeast membrane transporters. Curr. Opin. Cell Biol., 13, 389398.[CrossRef][Web of Science][Medline]
- Matys,V., Fricke,E., Geffers,R., Gossling,E., Haubrock,M., Hehl,R., Hornischer,K., Karas,D., Kel,A.E., Kel-Margoulis,O.V. et al. (2003) TRANSFAC: transcriptional regulation, from patterns to profiles. Nucleic Acids Res., 31, 374378.
[Abstract/Free Full Text] - Strack,N. and Mewes,H.W. (1999) SESAM: Seed Extraction Sequence Analysis Method. Giegerich, R. and Wingender, E. Proceedings of the German Conference on Bioinformatics GCB 99. Computer Science and Biology, Braunschweig, Bielefeld. 4-6-0099, pp. 5965.
- Mulder,N.J., Apweiler,R., Attwood,T.K., Bairoch,A., Barrell,D., Bateman,A., Binns,D., Biswas,M., Bradley,P., Bork,P. et al. (2003) The InterPro Database, 2003 brings increased coverage and new features. Nucleic Acids Res., 31, 315318.
[Abstract/Free Full Text] - Souciet,J., Aigle,M., Artiguenave,F., Blandin,G., Bolotin-Fukuhara,M., Bon,B., Brottier,P., Casaregola,S., de Montigny,J., Dujon,B. et al. (2000) Genomic exploration of the hemiascomycetous yeasts: 1. A set of yeast species for molecular evolution studies. FEBS Lett., 487, 312.[CrossRef][Web of Science][Medline]
- Bon,E., Casaregola,S., Blandin,G., Llorente,B., Neuveglise,C., Munsterkotter,M., Guldener,U., Mewes,H.W., Van Helden,J., Dujon,B. et al. (2003) Molecular evolution of eukaryotic genomes: hemiascomycetous yeast spliceosomal introns. Nucleic Acids Res., 31, 11211135.
[Abstract/Free Full Text] - Schulte,U., Becker,I., Mewes,H.W. and Mannhaupt,G. (2002) Large scale analysis of sequences from Neurospora crassa. J. Biotechnol., 94, 313.[CrossRef][Web of Science][Medline]
- Galagan,J.E., Calvo,S.E., Borkovich,K.A., Selker,E.U., Read,N.D., Jaffe,D., FitzHugh,W., Ma,L.J., Smirnov,S., Purcell,S. et al. (2003) The genome sequence of the filamentous fungus Neurospora crassa. Nature, 422, 859868.[CrossRef][Medline]
- Wiemann,S., Weil,B., Wellenreuther,R., Gassenhuber,H., Glassl,S., Ansorge,W., Bocher,M., Blöcker,H., Bauersachs,S., Blum,H. et al. (2001) Toward a catalog of human genes and proteins: sequencing and analysis of 500 novel complete protein coding human cDNAs. Genome Res., 11, 422435.
[Abstract/Free Full Text] - Olson,M.V. and Varki,A. (2003) Sequencing the chimpanzee genome: insights into human evolution and disease. Nature Rev. Genet., 4, 2028.[CrossRef][Web of Science][Medline]
- Vazquez,A., Flammini,A., Maritan,A. and Vespignani,A. (2003) Global protein function prediction from proteinprotein interaction networks. Nat. Biotechnol., 21, 697700.[CrossRef][Web of Science][Medline]
- Pearson,W.R. (2000) Flexible sequence similarity searching with the FASTA3 program package. Methods Mol. Biol., 132, 185219.[Medline]
- Enright,A.J., Van Dongen,S. and Ouzounis,C.A. (2002) An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res., 30, 15751584.
[Abstract/Free Full Text] - Lee,C., Grasso,C. and Sharlow,M.F. (2002) Multiple sequence alignment using partial order graphs. Bioinformatics, 18, 452464.
[Abstract/Free Full Text] - Frishman,D., Albermann,K., Hani,J., Heumann,K., Metanomski,A., Zollner,A. and Mewes,H.W. (2001) Functional and structural genomics using PEDANT. Bioinformatics, 17, 144157.
- Westbrook,J., Feng,Z., Jain,S., Bhat,T.N., Thanki,N., Ravichandran,V., Gilliland,G.L., Bluhm,W., Weissig,H., Greer,D.S. et al. (2002) The Protein Data Bank: unifying the archive. Nucleic Acids Res., 30, 245248.
[Abstract/Free Full Text]
This article has been cited by other articles:
![]() |
J. Song and M. Singh How and when should interactome-derived clusters be used to predict functional modules and protein function? Bioinformatics, December 1, 2009; 25(23): 3143 - 3150. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Ali and C. M. Deane Functionally guided alignment of protein interaction networks for module detection Bioinformatics, December 1, 2009; 25(23): 3166 - 3173. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Yosef, M. Kupiec, E. Ruppin, and R. Sharan A complex-centric view of protein network evolution Nucleic Acids Res., July 1, 2009; 37(12): e88 - e88. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. V. Antonov, S. Dietmann, P. Wong, D. Lutter, and H. W. Mewes GeneSet2miRNA: finding the signature of cooperative miRNA activities in the gene lists Nucleic Acids Res., July 1, 2009; 37(suppl_2): W323 - W328. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Pu, J. Wong, B. Turner, E. Cho, and S. J. Wodak Up-to-date catalogues of yeast protein complexes Nucleic Acids Res., February 1, 2009; 37(3): 825 - 831. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Qi, Y. Suhail, Y.-y. Lin, J. D. Boeke, and J. S. Bader Finding friends and enemies in an enemies-only network: A graph diffusion kernel for predicting novel genetic interactions and co-complex membership from yeast genetic interactions Genome Res., December 1, 2008; 18(12): 1991 - 2004. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Greene, G. Cagney, N. Krogan, and P. Cunningham Ensemble non-negative matrix factorization methods for clustering protein-protein interactions Bioinformatics, August 1, 2008; 24(15): 1722 - 1728. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Musso, M. Costanzo, M. Huangfu, A. M. Smith, J. Paw, B.-J. San Luis, C. Boone, G. Giaever, C. Nislow, A. Emili, et al. The extensive and condition-dependent nature of epistasis among whole-genome duplicates in yeast Genome Res., July 1, 2008; 18(7): 1092 - 1099. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Qi, F. Balem, C. Faloutsos, J. Klein-Seetharaman, and Z. Bar-Joseph Protein complex identification by supervised graph local clustering Bioinformatics, July 1, 2008; 24(13): i250 - i268. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. V. Antonov, T. Schmidt, Y. Wang, and H. W. Mewes ProfCom: a web tool for profiling the complex functionality of gene groups identified from high-throughput data Nucleic Acids Res., July 1, 2008; 36(suppl_2): W347 - W351. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Brohee, K. Faust, G. Lima-Mendez, O. Sand, R. Janky, G. Vanderstocken, Y. Deville, and J. van Helden NeAT: a toolbox for the analysis of biological networks, clusters, classes and pathways Nucleic Acids Res., July 1, 2008; 36(suppl_2): W444 - W451. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Guo, X. Wu, D.-Y. Zhang, and K. Lin Genome-wide inference of protein interaction sites: lessons from the yeast high-quality negative protein-protein interaction dataset Nucleic Acids Res., April 1, 2008; 36(6): 2002 - 2011. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Scholtens, T. Chiang, W. Huber, and R. Gentleman Estimating node degree in bait-prey graphs Bioinformatics, January 15, 2008; 24(2): 218 - 224. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Al-Shahrour, P. Minguez, J. Tarraga, I. Medina, E. Alloza, D. Montaner, and J. Dopazo FatiGO +: a functional profiling tool for genomic data. Integration of functional annotation, regulatory motifs and interaction data with microarray experiments Nucleic Acids Res., July 13, 2007; 35(suppl_2): W91 - W96. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. R. Collins, P. Kemmeren, X.-C. Zhao, J. F. Greenblatt, F. Spencer, F. C. P. Holstege, J. S. Weissman, and N. J. Krogan Toward a Comprehensive Atlas of the Physical Interactome of Saccharomyces cerevisiae Mol. Cell. Proteomics, March 1, 2007; 6(3): 439 - 450. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Guan, M. J. Dunham, and O. G. Troyanskaya Functional Analysis of Gene Duplications in Saccharomyces cerevisiae Genetics, February 1, 2007; 175(2): 933 - 943. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Alterovitz, M. Xiang, M. Mohan, and M. F. Ramoni GO PaD: the Gene Ontology Partition Database Nucleic Acids Res., January 12, 2007; 35(suppl_1): D322 - D327. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hollunder, M. Friedel, A. Beyer, C. T. Workman, and T. Wilhelm DASS: efficient discovery and p-value calculation of substructures in unordered data Bioinformatics, January 1, 2007; 23(1): 77 - 83. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Lysoe, S. S. Klemsdal, K. R. Bone, R. J. N. Frandsen, T. Johansen, U. Thrane, and H. Giese The PKS4 Gene of Fusarium graminearum Is Essential for Zearalenone Production. Appl. Envir. Microbiol., June 1, 2006; 72(6): 3924 - 3932. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Huang and W. Pan Incorporating biological knowledge into distance-based clustering analysis of microarray gene expression data Bioinformatics, May 15, 2006; 22(10): 1259 - 1268. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Herrgard, B.-S. Lee, V. Portnoy, and B. O. Palsson Integrated analysis of regulatory and metabolic networks reveals novel regulatory mechanisms in Saccharomyces cerevisiae Genome Res., May 1, 2006; 16(5): 627 - 635. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Pan Incorporating gene functions as priors in model-based clustering of microarray gene expression data Bioinformatics, April 1, 2006; 22(7): 795 - 801. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Titz, S. Thomas, S. V. Rajagopala, T. Chiba, T. Ito, and P. Uetz Transcriptional activators in yeast Nucleic Acids Res., February 7, 2006; 34(3): 955 - 967. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. V. Antonov, I. V. Tetko, and H. W. Mewes A systematic approach to infer biological relevance and biases of gene network structures Nucleic Acids Res., January 10, 2006; 34(1): e6 - e6. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Wu, L. Zhu, J. Guo, D.-Y. Zhang, and K. Lin Prediction of yeast protein-protein interaction network: insights from the Gene Ontology and annotations. Nucleic Acids Res., January 1, 2006; 34(7): 2137 - 2150. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Wu, J. Wang, C. Liu, Y. Zhang, B. Shi, X. Zhu, Z. Zhang, G. Skogerbo, L. Chen, H. Lu, et al. NPInter: the noncoding RNAs and protein related biomacromolecules interaction database Nucleic Acids Res., January 1, 2006; 34(suppl_1): D150 - D152. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. K. L. Leung, L. Trinkle-Mulcahy, Y. W. Lam, J. S. Andersen, M. Mann, and A. I. Lamond NOPdb: Nucleolar Proteome Database Nucleic Acids Res., January 1, 2006; 34(suppl_1): D218 - D220. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Stark, B.-J. Breitkreutz, T. Reguly, L. Boucher, A. Breitkreutz, and M. Tyers BioGRID: a general repository for interaction datasets Nucleic Acids Res., January 1, 2006; 34(suppl_1): D535 - D539. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ruepp, O. N. Doudieu, J. van den Oever, B. Brauner, I. Dunger-Kaltenbach, G. Fobo, G. Frishman, C. Montrone, C. Skornia, S. Wanka, et al. The Mouse Functional Genome Database (MfunGD): functional annotation of proteins in the light of their cellular context Nucleic Acids Res., January 1, 2006; 34(suppl_1): D568 - D571. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. B. Smirnova, J. N. Selley, F. Sanchez-Cabo, K. Carroll, A. A. Eddy, J. E. G. McCarthy, S. J. Hubbard, G. D. Pavitt, C. M. Grant, and M. P. Ashe Global Gene Expression Profiling Reveals Widespread yet Distinctive Translational Responses to Different Eukaryotic Translation Initiation Factor 2B-Targeting Stress Pathways Mol. Cell. Biol., November 1, 2005; 25(21): 9340 - 9349. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. R. Gale, J. D. Bryant, S. Calvo, H. Giese, T. Katan, K. O'Donnell, H. Suga, M. Taga, T. R. Usgaard, T. J. Ward, et al. Chromosome Complement of the Fungal Plant Pathogen Fusarium graminearum Based on Genetic and Physical Mapping and Cytological Observations Genetics, November 1, 2005; 171(3): 985 - 1001. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Espadaler, O. Romero-Isart, R. M. Jackson, and B. Oliva Prediction of protein-protein interactions using distant conservation of sequence patterns and structure relationships Bioinformatics, August 15, 2005; 21(16): 3360 - 3368. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Mintseris and Z. Weng Structure, function, and evolution of transient and obligate protein-protein interactions PNAS, August 2, 2005; 102(31): 10930 - 10935. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Liu, N. Liu, and H. Zhao Inferring protein-protein interactions through high-throughput interaction data from diverse organisms Bioinformatics, August 1, 2005; 21(15): 3279 - 3285. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. L. Saito, J. Sese, Y. Nakatani, F. Sano, M. Yukawa, Y. Ohya, and S. Morishita Data mining tools for the Saccharomyces cerevisiae morphological database Nucleic Acids Res., July 1, 2005; 33(suppl_2): W753 - W757. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. N. Vemuri and A. A. Aristidou Metabolic Engineering in the -omics Era: Elucidating and Modulating Regulatory Networks Microbiol. Mol. Biol. Rev., June 1, 2005; 69(2): 197 - 216. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Espadaler, R. Aragues, N. Eswar, M. A. Marti-Renom, E. Querol, F. X. Aviles, A. Sali, and B. Oliva Detecting remotely related proteins by their interactions and sequence similarity PNAS, May 17, 2005; 102(20): 7151 - 7156. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. V. Tetko, B. Brauner, I. Dunger-Kaltenbach, G. Frishman, C. Montrone, G. Fobo, A. Ruepp, A. V. Antonov, D. Surmeli, and H.-W. Mewes MIPS bacterial genomes functional annotation benchmark dataset Bioinformatics, May 15, 2005; 21(10): 2520 - 2521. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Natter, P. Leitner, A. Faschinger, H. Wolinski, S. McCraith, S. Fields, and S. D. Kohlwein The Spatial Organization of Lipid Synthesis in the Yeast Saccharomyces cerevisiae Derived from Large Scale Green Fluorescent Protein Tagging and High Resolution Microscopy Mol. Cell. Proteomics, May 1, 2005; 4(5): 662 - 672. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Kopka, N. Schauer, S. Krueger, C. Birkemeyer, B. Usadel, E. Bergmuller, P. Dormann, W. Weckwerth, Y. Gibon, M. Stitt, et al. GMD@CSB.DB: the Golm Metabolome Database Bioinformatics, April 15, 2005; 21(8): 1635 - 1638. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. R. Patil and J. Nielsen Uncovering transcriptional regulation of metabolism by using metabolic network topology PNAS, February 22, 2005; 102(8): 2685 - 2689. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Sharan, S. Suthram, R. M. Kelley, T. Kuhn, S. McCuine, P. Uetz, T. Sittler, R. M. Karp, and T. Ideker From the Cover: Conserved patterns of protein interaction in multiple species PNAS, February 8, 2005; 102(6): 1974 - 1979. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kahraman, A. Avramov, L. G. Nashev, D. Popov, R. Ternes, H.-D. Pohlenz, and B. Weiss PhenomicDB: a multi-species genotype/phenotype database for comparative phenomics Bioinformatics, February 1, 2005; 21(3): 418 - 420. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. L. Tai, V. M. Boer, P. Daran-Lapujade, M. C. Walsh, J. H. de Winde, J.-M. Daran, and J. T. Pronk Two-dimensional Transcriptome Analysis in Chemostat Cultures: COMBINATORIAL EFFECTS OF OXYGEN AVAILABILITY AND MACRONUTRIENT LIMITATION IN SACCHAROMYCES CEREVISIAE J. Biol. Chem., January 7, 2005; 280(1): 437 - 447. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Brooksbank, G. Cameron, and J. Thornton The European Bioinformatics Institute's data resources: towards systems biology Nucleic Acids Res., January 1, 2005; 33(suppl_1): D46 - D53. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Hermida, S. Brachat, S. Voegeli, P. Philippsen, and M. Primig The Ashbya Genome Database (AGD)--a tool for the yeast community and genome biologists Nucleic Acids Res., January 1, 2005; 33(suppl_1): D348 - D352. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Stein, R. B. Russell, and P. Aloy 3did: interacting protein domains of known three-dimensional structure Nucleic Acids Res., January 1, 2005; 33(suppl_1): D413 - D417. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Alfarano, C. E. Andrade, K. Anthony, N. Bahroos, M. Bajec, K. Bantoft, D. Betel, B. Bobechko, K. Boutilier, E. Burgess, et al. The Biomolecular Interaction Network Database and related tools 2005 update Nucleic Acids Res., January 1, 2005; 33(suppl_1): D418 - D424. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. von Mering, L. J. Jensen, B. Snel, S. D. Hooper, M. Krupp, M. Foglierini, N. Jouffre, M. A. Huynen, and P. Bork STRING: known and predicted protein-protein associations, integrated and transferred across organisms Nucleic Acids Res., January 1, 2005; 33(suppl_1): D433 - D437. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Rudd, H. Schoof, and K. Mayer PlantMarkers--a database of predicted molecular markers from plants Nucleic Acids Res., January 1, 2005; 33(suppl_1): D628 - D632. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ruepp, A. Zollner, D. Maier, K. Albermann, J. Hani, M. Mokrejs, I. Tetko, U. Guldener, G. Mannhaupt, M. Munsterkotter, et al. The FunCat, a functional annotation scheme for systematic classification of proteins from whole genomes Nucleic Acids Res., October 14, 2004; 32(18): 5539 - 5545. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Lu, X. Zhu, H. Liu, G. Skogerbo, J. Zhang, Y. Zhang, L. Cai, Y. Zhao, S. Sun, J. Xu, et al. The interactome as a tree--an attempt to visualize the protein-protein interaction network in yeast Nucleic Acids Res., September 8, 2004; 32(16): 4804 - 4811. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Tanay, R. Sharan, M. Kupiec, and R. Shamir Revealing modularity and organization in the yeast molecular network by integrated analysis of highly heterogeneous genomewide data PNAS, March 2, 2004; 101(9): 2981 - 2986. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Schoof, R. Ernst, V. Nazarov, L. Pfeifer, H.-W. Mewes, and K. F. X. Mayer MIPS Arabidopsis thaliana Database (MAtDB): an integrated biological knowledge resource for plant genomics Nucleic Acids Res., January 1, 2004; 32(90001): D373 - 376. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||










