Article |
InterProScan: protein domains identifier
European Bioinformatics Institute, Wellcome Trust Genome Campus Hinxton, Cambridge CB10 1SD, UK
*To whom correspondence should be addressed at: EMBL Outstation The European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK. Tel: +44 1223 494423; Fax: +44 1223 494468; Email: rls{at}ebi.ac.uk
Received February 11, 2005. Revised March 30, 2005. Accepted March 30, 2005.
| ABSTRACT |
|---|
|
|
|---|
InterProScan [E. M. Zdobnov and R. Apweiler (2001) Bioinformatics, 17, 847848] is a tool that combines different protein signature recognition methods from the InterPro [N. J. Mulder, R. Apweiler, T. K. Attwood, A. Bairoch, A. Bateman, D. Binns, P. Bradley, P. Bork, P. Bucher, L. Cerutti et al. (2005) Nucleic Acids Res., 33, D201D205] consortium member databases into one resource. At the time of writing there are 10 distinct publicly available databases in the application. Protein as well as DNA sequences can be analysed. A web-based version is accessible for academic and commercial organizations from the EBI (http://www.ebi.ac.uk/InterProScan/). In addition, a standalone Perl version and a SOAP Web Service [J. Snell, D. Tidwell and P. Kulchenko (2001) Programming Web Services with SOAP, 1st edn. O'Reilly Publishers, Sebastopol, CA, http://www.w3.org/TR/soap/] are also available to the users. Various output formats are supported and include text tables, XML documents, as well as various graphs to help interpret the results.
| INTRODUCTION |
|---|
|
|
|---|
When carrying out analysis of protein sequences, the aim is to find out as much information as possible about potential relationships with other sequences as well as characterizing their physiochemical properties. The first step usually involves comparing the protein sequence against a non-redundant protein sequence database by using Blast (1) or Fasta (2), which will reveal which sequence(s) are similar to the query sequence alone. To obtain further information about a protein's specific function, searches against secondary databases (also known as pattern or signature databases) are necessary. When such searches return significant matches or hits, these results help in the assignment of a particular function or functional domain to the query protein. The InterPro (http://www.ebi.ac.uk/interpro) (3) database was created to unite secondary databases that contain overlapping information on protein families, domains and functional sites (Table 1). InterPro entries are divided into groups based on the protein families or the domains that the signatures represent. If the structure and function of a protein family is well characterized, searches of the secondary databases offer a fast track into inferring biological function. Searching individually against each of these databases to get the most information is repetitive, time consuming and labour intensive. To search InterPro with a novel protein sequence, a tool, InterProScan (4), has been developed (http://www.ebi.ac.uk/InterProScan) that combines the protein function recognition methods of the member databases of InterPro into one application. Since its creation, there have been several releases to improve it, adding functionality as well as new databases to the system.
|
| InterProScan TOOL |
|---|
|
|
|---|
Here, we describe the use of the web browser based version of InterProScan available from the EBI at http://www.ebi.ac.uk/InterProScan (Figure 1). This service is free to all academic and commercial organizations and offers interactive as well as email job submission. Direct email submissions should be directed to interproscan{at}ebi.ac.uk. Instructions and documentation are available when sending an email to the above address that contains the word help in the message body. Users requiring high-throughput use of the application or who wish to carry out analysis using other databases can download a standalone version from ftp://ftp.ebi.ac.uk/pub/databases/interpro/iprscan. Users requiring programmatic access to the InterProScan can do so using a SOAP-based Web Service called WSInterProScan (5), which is described at http://www.ebi.ac.uk/Tools/webservices/WSInterProScan.html. All of these make use of a centrally maintained core version of InterProScan version 4.0.
|
The job input form
The first section of the input form consists of the users email address and how the results are to be displayed. The first thing a user needs to decide when using the InterProScan submission form (Figure 1) is how he/she wants to see the results. This is carried out by making a selection on the RESULTS menu. Two options are available: interactive, which will return the results to the browser once the job is completed, and email, which will return the results to the email specified in the YOUR EMAIL text dialog.
The next section has a set of check boxes that either choose all or clears all the methods available. Each method can be ticked on or off, according to the user's requirements. For example, users interested only in signal peptide cleavage sites or the transmembrane domains described in InterPro entries may choose the corresponding methods individually.
The third section of the submission form is specific for DNA as the sequence input. DNA sequences will be translated to protein according to the translation rules specified in the TRANSLATION TABLE menu. The default is the standard code. Each translation will generate peptide sequences in six frames and all will be searched. The minimum length of an open reading frame produced after translation can be specified in the MIN. OPEN READING FRAME SIZE menu. This dictates that only peptides above the selected value will be searched by the methods chosen in the second section.
The fourth section of the input form consists of the sequence input panel. The components of this panel include a selection menu for the molecule type. This one can be DNA or protein. The default is protein. When DNA is selected it enables the TRANSLATION TABLE menu in the third section of the form. Help is available by clicking on the HELP image. This will open a new browser window that contains comprehensive information about InterProScan. There is also an UPLOAD dialog that can be used instead of cutting and pasting sequences into the input window. Finally, there are the Submit and Reset buttons. The sequence input text dialog will accept protein or DNA sequence in any of the standard sequence formats in use today. These include EMBL, SWISS, GenBank, NBRF/PIR, CODATA, Fasta, GCG and RAW text. Primary or secondary identifiers (accession number or identifier) of a protein sequence in the databases can also be used. In this case, the user will type a database name followed by a colon and the identifier. For example, UNIPROT:INSR_HUMAN. It is not possible to submit more than 10 protein input sequences at the same time. Each protein sequence must be at least five amino acids long. Only one nucleic acid sequence may be used at a time and the length for this sequence must be
5000 bases.
InterProScan output
Before InterProScan launches each of the protein sequence analysis applications, it takes advantage of pre-computed results whenever possible. It calculates a checksum (CRC64) for the query sequence and compares it with the checksums of the protein sequences that are present in a database called IPRMATCHES. This is a database that lists all the entries from UniProt/Swiss-Prot and UniProt/TrEMBL that match one or more InterPro entries. If the checksum calculated for the query sequence does not match any checksums found in the IPRMATCHES database, the protein sequence analysis applications are launched in parallel; otherwise the IPRMATCHES entry is returned.
Once a job is completed, the output of each of the applications is individually parsed to produce a merged results file. This file is in the tab-delimited format. A converter is called onto generate, on the fly, an XML document, which is used to generate the HTML output. This consists of two views: a picture or a graphical view (Figure 2) that displays a cartoon of the sequence with highlighted domains or functional sites corresponding to the matches in the InterPro databases. Each match contains hypertext links to the InterPro database main web resource as well as to the individual member databases' websites where the matches are further described. A table view (Figure 3) is also available by clicking on the table view button. This one consists of complete database names, hyperlinked match identifiers, the sequence coordinates (startstop pairs) where the match occurs, E-values and the status of the match in InterPro (e.g. T for true or ? for unknown). Parentchild relationships are displayed if they exist in an InterPro entry. GO annotation is also shown if available. Other options in the HTML results page include the raw output in the tab-delimited format, the XML document and the sequences used as input (original sequences). The results for each job are stored at the EBI for at least 24 h.
|
|
| THE STANDALONE VERSION OF InterProScan |
|---|
|
|
|---|
For users who wish to run their own installation of InterProScan, there is a free standalone version available from the EBI's ftp server (ftp://ftp.ebi.ac.uk/pub/databases/interpro/iprscan/RELEASE/latest). This version can run from the command line or as a CGI through a web interface. Additional features that have been developed for this version include the use of the Perl indexing library, which indexes all data files, input sequences and the applications results for easy querying and retrieval of primary identifiers, names, job results and their status. From the results page, users can access each input sequence individually or the full input file, and also access the original outputs from each application.
The standalone version of InterProScan has been designed to run on either a single machine or a cluster of machines. It supports the use of various queuing systems such as LSF, OpenPBS and SGE.
The standalone version is composed of three different packages:
- Perl core package containing all the scripts and modules to run InterProScan.
- Data package, which contains all the data needed by each application to run (
4 GB unzipped).
- Binary package precompiled for six different platforms (Linux, OSF1, AIX, Sun, IRIX and MacOSX).
| CONCLUSIONS |
|---|
|
|
|---|
We describe here the current state of the EBI InterProScan server, along with much of its unique flexibility, which is freely available to the public. Questions, comments and suggestions from users are encouraged and may be addressed to http://www.ebi.ac.uk/support/.
| ACKNOWLEDGEMENTS |
|---|
InterPro is funded by the award of grant number QLRI-CT-2000-00517 and in part by grant number QLRI-CT-2001000015 from the European Union under the RTD program Quality of Life and Management of Living Resources. InterPro is a member database of the MRC-funded eFamily project. Funding to pay the Open Access publication charges for this article was provided by The European Molecular Biology Laboratory (EMBL).
Conflict of interest statement. None declared.
| REFERENCES |
|---|
|
|
|---|
- Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D.J. (1997) Gapped Blast and Psi-Blast: a new generation of protein database search programs Nucleic Acids Res., 25, 33893402
[Abstract/Free Full Text] . - Pearson, W.R. and Lipman, D.J. (1988) Improved tools for biological sequence analysis Proc. Natl Acad. Sci. USA, 85, 24442448
[Abstract/Free Full Text] . - Mulder, N.J., Apweiler, R., Attwood, T.K., Bairoch, A., Bateman, A., Binns, D., Bradley, P., Bork, P., Bucher, P., Cerutti, L., et al. (2005) InterPro, progress and status in 2005 Nucleic Acids Res., 33, D201D205
[Abstract/Free Full Text] . - Zdobnov, E.M. and Apweiler, R. (2001) InterProScanan integration platform for the signature-recognition methods in InterPro Bioinformatics, 17, 847848
[Abstract/Free Full Text] . - Snell, J., Tidwell, D., Kulchenko, P. Programming Web Services with SOAP, 1st edn, (2001) Sebastopol, CA O'Reilly & Associates .
- Bru, C., Courcelle, E., Carrère, S., Beausse, Y., Dalmar, S., Kahn, D. (2005) The ProDom database of protein domain families: more emphasis on 3D Nucleic Acids Res., 33, D212D215
[Abstract/Free Full Text] . - Attwood, T.K., Bradley, P., Flower, D.R., Gaulton, A., Maudling, N., Mitchell, A.L., Moulton, G., Nordle, A., Paine, K., Taylor, P., et al. (2003) PRINTS and its automatic supplement, prePRINTS Nucleic Acids Res., 31, 400402
[Abstract/Free Full Text] . - Scordis, P., Flower, D.R., Attwood, T.K. (1999) FingerPRINTScan: intelligent searching of the PRINTS motif database Bioinformatics, 15, 799806
[Abstract/Free Full Text] . - Letunic, I., Goodstadt, L., Dickens, N.J., Doerks, T., Schultz, J., Mott, R., Ciccarelli, F., Copley, R.R., Ponting, C.P., Bork, P. (2002) Recent improvements to the SMART domain-based sequence annotation resource Nucleic Acids Res., 30, 242244
[Abstract/Free Full Text] . - Haft, D.H., Selengut, J.D., White, O. (2003) The TIGRFAMs database of protein families Nucleic Acids Res., 31, 371373
[Abstract/Free Full Text] . - Bateman, A., Coin, L., Durbin, R., Finn, R.D., Hollich, V., Griffiths-Jones, S., Khanna, A., Marshall, M., Moxon, S., Sonnhammer, E.L., et al. (2004) The Pfam protein families database Nucleic Acids Res., 32, D138D141
[Abstract/Free Full Text] . - Hulo, N., Sigrist, C.J., Le Saux, V., Langendijk-Genevaux, P.S., Bordoli, L., Gattiker, A., De Castro, E., Bucher, P., Bairoch, A. (2004) Recent improvements to the PROSITE database Nucleic Acids Res., 32, D134D137
[Abstract/Free Full Text] . - Thompson, J.D., Higgins, D.G., Gibson, T.J. (1994) Improved sensitivity of profile searches through the use of sequence weights and gap excision Comput. Appl. Biosci., 10, 1929
[Abstract/Free Full Text] . - Wu, C.H., Nikolskaya, A., Huang, H., Yeh, L.S., Natale, D.A., Vinayaka, C.R., Hu, Z.Z., Mazumder, R., Kumar, S., Kourtesis, P., et al. (2004) PIRSF: family classification system at the Protein Information Resource Nucleic Acids Res., 32, D112D114
[Abstract/Free Full Text] . - Gough, J., Karplus, K., Hughey, R., Chothia, C. (2001) Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure J. Mol. Biol., 313, 903919[CrossRef][Web of Science][Medline] .
- Pearl, F.M., Lee, D., Bray, J.E., Sillitoe, I., Todd, A.E., Harrison, A.P., Thornton, J.M., Orengo, C.A. (2000) Assigning genomic sequences to CATH Nucleic Acids Res., 28, 277282
[Abstract/Free Full Text] . - Mi, H., Lazareva-Ulitsky, B., Loo, R., Kejariwal, A., Vandergriff, J., Rabkin, S., Guo, N., Muruganujan, A., Doremieux, O., Campbell, M.J., et al. (2005) The PANTHER database of protein families, subfamilies, functions and pathways Nucleic Acids Res., 33, D284D288
[Abstract/Free Full Text] . - Bendtsen, J.D., Nielsen, H., von Heijne, G., Brunak, S. (2004) Improved prediction of signal peptides: SignalP 3.0 J. Mol. Biol., 340, 783795[CrossRef][Web of Science][Medline] .
- Sonnhammer, E.L., von Heijne, G., Krogh, A. (1998) A hidden Markov model for predicting transmembrane helices in protein sequences Proc. Int. Conf. Intell. Syst. Mol. Biol., 6, 175182[Medline]
.
This article has been cited by other articles:
![]() |
H. Mcwilliam, F. Valentin, M. Goujon, W. Li, M. Narayanasamy, J. Martin, T. Miyar, and R. Lopez Web services at the European Bioinformatics Institute-2009 Nucleic Acids Res., July 1, 2009; 37(suppl_2): W6 - W10. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Wichadakul, S. Numnark, and S. Ingsriswang d-Omix: a mixer of generic protein domain analysis tools Nucleic Acids Res., July 1, 2009; 37(suppl_2): W417 - W421. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Saito, M. Robert, H. Kochi, G. Matsuo, Y. Kakazu, T. Soga, and M. Tomita Metabolite Profiling Reveals YihU as a Novel Hydroxybutyrate Dehydrogenase for Alternative Succinic Semialdehyde Metabolism in Escherichia coli J. Biol. Chem., June 12, 2009; 284(24): 16442 - 16451. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ghosh, K. Chakrabarti, and D. Chattopadhyay Cloning of feather-degrading minor extracellular protease from Bacillus cereus DCUW: dissection of the structural domains Microbiology, June 1, 2009; 155(6): 2049 - 2057. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Hoshi, K. Oshima, S. Kakizawa, Y. Ishii, J. Ozeki, M. Hashimoto, K. Komatsu, S. Kagiwada, Y. Yamaji, and S. Namba A unique virulence factor for proliferation and dwarfism in plants identified from a phytopathogenic bacterium PNAS, April 14, 2009; 106(15): 6416 - 6421. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. L. Menon, F. L. Poole II, A. Cvetkovic, S. A. Trauger, E. Kalisiak, J. W. Scott, S. Shanmukh, J. Praissman, F. E. Jenney Jr., W. R. Wikoff, et al. Novel Multiprotein Complexes Identified in the Hyperthermophilic Archaeon Pyrococcus furiosus by Non-denaturing Fractionation of the Native Proteome Mol. Cell. Proteomics, April 1, 2009; 8(4): 735 - 751. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Walter, T. Rattei, R. Arnold, U. Guldener, M. Munsterkotter, K. Nenova, G. Kastenmuller, P. Tischler, A. Wolling, A. Volz, et al. PEDANT covers all complete RefSeq genomes Nucleic Acids Res., January 1, 2009; 37(suppl_1): D408 - D411. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Hunter, R. Apweiler, T. K. Attwood, A. Bairoch, A. Bateman, D. Binns, P. Bork, U. Das, L. Daugherty, L. Duquenne, et al. InterPro: the integrative protein signature database Nucleic Acids Res., January 1, 2009; 37(suppl_1): D211 - D215. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Driscoll, M. D. Dyer, T. M. Murali, and B. W. Sobral PIG--the pathogen interaction gateway Nucleic Acids Res., January 1, 2009; 37(suppl_1): D647 - D650. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Goble, R. Stevens, D. Hull, K. Wolstencroft, and R. Lopez Data curation + process curation=data integration + science Brief Bioinform, December 6, 2008; (2008) bbn034v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. L. Franck, W.-S. Chang, J. Qiu, M. Sugawara, M. J. Sadowsky, S. A. Smith, and G. Stacey Whole-Genome Transcriptional Profiling of Bradyrhizobium japonicum during Chemoautotrophic Growth J. Bacteriol., October 15, 2008; 190(20): 6697 - 6705. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Gotz, J. M. Garcia-Gomez, J. Terol, T. D. Williams, S. H. Nagaraj, M. J. Nueda, M. Robles, M. Talon, J. Dopazo, and A. Conesa High-throughput functional annotation and data mining with the Blast2GO suite Nucleic Acids Res., June 1, 2008; 36(10): 3420 - 3435. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Gonzalez-Aguero, L. Pavez, F. Ibanez, I. Pacheco, R. Campos-Vargas, L. A. Meisel, A. Orellana, J. Retamales, H. Silva, M. Gonzalez, et al. Identification of woolliness response genes in peach fruit after post-harvest treatments J. Exp. Bot., May 3, 2008; (2008) ern069v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Stout, E. Romero-Severson, M. O. Ruegger, and C. Chapple Semidominant Mutations in Reduced Epidermal Fluorescence 4 Reduce Phenylpropanoid Content in Arabidopsis Genetics, April 1, 2008; 178(4): 2237 - 2251. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Siezen, M. J. C. Starrenburg, J. Boekhorst, B. Renckens, D. Molenaar, and J. E. T. van Hylckama Vlieg Genome-Scale Genotype-Phenotype Matching of Two Lactococcus lactis Isolates from Plants Identifies Mechanisms of Adaptation to the Plant Niche Appl. Envir. Microbiol., January 15, 2008; 74(2): 424 - 436. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Birzele, J. E. Gewehr, and R. Zimmer AutoPSI: a database for automatic structural classification of protein sequences and structures Nucleic Acids Res., January 11, 2008; 36(suppl_1): D398 - D401. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. L. Hong, R. Balakrishnan, Q. Dong, K. R. Christie, J. Park, G. Binkley, M. C. Costanzo, S. S. Dwight, S. R. Engel, D. G. Fisk, et al. Gene Ontology annotations at SGD: new data sources and annotation methods Nucleic Acids Res., January 11, 2008; 36(suppl_1): D577 - D581. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Botet, L. Mateos, J. L. Revuelta, and M. A. Santos A Chemogenomic Screening of Sulfanilamide-Hypersensitive Saccharomyces cerevisiae Mutants Uncovers ABZ2, the Gene Encoding a Fungal Aminodeoxychorismate Lyase Eukaryot. Cell, November 1, 2007; 6(11): 2102 - 2111. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. K. Ekanayake, M. J. Cipriano, and R. Sabatini Telomeric co-localization of the modified base J and contingency genes in the protozoan parasite Trypanosoma cruzi Nucleic Acids Res., October 8, 2007; 35(19): 6367 - 6377. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Virel and L. Backman A Comparative and Phylogenetic Analysis of the {alpha}-Actinin Rod Domain Mol. Biol. Evol., October 1, 2007; 24(10): 2254 - 2265. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Li, Q. Lin, J. Chen, J. L. Wu, T. K. Lim, S. S. Loh, X. Tang, and C.-L. Hew Shotgun Identification of the Structural Proteome of Shrimp White Spot Syndrome Virus and iTRAQ Differentiation of Envelope and Nucleocapsid Subproteomes Mol. Cell. Proteomics, September 1, 2007; 6(9): 1609 - 1620. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. P. Watkins, T. S. Kroeger, A. M. Cooke, R. E. Williams-Carrier, G. Friso, S. E. Belcher, K. J. van Wijk, and A. Barkan A Ribonuclease III Domain Protein Functions in Group II Intron Splicing in Maize Chloroplasts PLANT CELL, August 1, 2007; 19(8): 2606 - 2623. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Labarga, F. Valentin, M. Anderson, and R. Lopez Web Services at the European Bioinformatics Institute Nucleic Acids Res., July 13, 2007; 35(suppl_2): W6 - W11. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Lee, T. Hong, S. J. Byun, T. Woo, and Y. J. Choi ESTpass: a web-based server for processing and annotating expressed sequence tag (EST) sequences Nucleic Acids Res., July 13, 2007; 35(suppl_2): W159 - W162. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. H. Nagaraj, N. Deshpande, R. B. Gasser, and S. Ranganathan ESTExplorer: an expressed sequence tag (EST) assembly and annotation platform Nucleic Acids Res., July 13, 2007; 35(suppl_2): W143 - W147. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. D. Dyer, T. M. Murali, and B. W. Sobral Computational prediction of host-pathogen protein protein interactions Bioinformatics, July 1, 2007; 23(13): i159 - i166. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Minic, E. Jamet, L. Negroni, P Arsene der Garabedian, M. Zivy, and L. Jouanin A sub-proteome of Arabidopsis thaliana mature stems trapped on Concanavalin A is enriched in cell wall glycoside hydrolases J. Exp. Bot., July 1, 2007; 58(10): 2503 - 2512. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. M. Hufford, P. Canaran, D. H. Ware, M. D. McMullen, and B. S. Gaut Patterns of Selection and Tissue-Specific Expression among Maize Domestication and Crop Improvement Loci Plant Physiology, July 1, 2007; 144(3): 1642 - 1653. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Sulakhe, M. D'Souza, M. Syed, A. Rodriguez, Y. Zhang, E. M. Glass, M. F. Romine, and N. Maltsev GNARE--a grid-based server for the analysis of user submitted genomes Nucleic Acids Res., May 25, 2007; (2007) gkm366v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. E. Gewehr, V. Hintermair, and R. Zimmer AutoSCOP: automated prediction of SCOP classifications using unique pattern-class mappings Bioinformatics, May 15, 2007; 23(10): 1203 - 1210. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q.-H. Zhu, A.-Y. Guo, G. Gao, Y.-F. Zhong, M. Xu, M. Huang, and J. Luo DPTF: a database of poplar transcription factors Bioinformatics, May 15, 2007; 23(10): 1307 - 1308. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Savas, I. W. Taylor, J. L. Wrana, and H. Ozcelik Functional nonsynonymous single nucleotide polymorphisms from the TGF-{beta} protein interaction network Physiol Genomics, April 24, 2007; 29(2): 109 - 117. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Richardt, D. Lang, R. Reski, W. Frank, and S. A. Rensing PlanTAPDB, a Phylogeny-Based Resource of Plant Transcription-Associated Proteins Plant Physiology, April 1, 2007; 143(4): 1452 - 1466. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. J. Poysti, E. D. M. Loewen, Z. Wang, and I. J. Oresnik Sinorhizobium meliloti pSymB carries genes necessary for arabinose transport and catabolism Microbiology, March 1, 2007; 153(3): 727 - 736. [Abstract] [Full Text] [PDF] |
||||
![]() |
K.-H. Ling, M.-A. Rajandream, P. Rivailler, A. Ivens, S.-J. Yap, A. M.B.N. Madeira, K. Mungall, K. Billington, W.-Y. Yee, A. T. Bankier, et al. Sequencing and analysis of chromosome 1 of Eimeria tenella reveals a unique segmental organization Genome Res., March 1, 2007; 17(3): 311 - 319. [Abstract] [Full Text] [PDF] |
||||
![]() |
F.-C. Chen, S.-S. Wang, S.-M. Chaw, Y.-T. Huang, and T.-J. Chuang Plant Gene and Alternatively Spliced Variant Annotator. A Plant Genome Annotation Pipeline for Rice Gene and Alternatively Spliced Variant Identification with Cross-Species Expressed Sequence Tag Conservation from Seven Plant Species Plant Physiology, March 1, 2007; 143(3): 1086 - 1095. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Itoh, T. Tanaka, R. A. Barrero, C. Yamasaki, Y. Fujii, P. B. Hilton, B. A. Antonio, H. Aono, R. Apweiler, R. Bruskiewich, et al. Curated genome annotation of Oryza sativa ssp. japonica and comparative genome analysis with Arabidopsis thaliana Genome Res., February 1, 2007; 17(2): 175 - 183. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. J. Mulder, R. Apweiler, T. K. Attwood, A. Bairoch, A. Bateman, D. Binns, P. Bork, V. Buillard, L. Cerutti, R. Copley, et al. New developments in the InterPro database Nucleic Acids Res., January 12, 2007; 35(suppl_1): D224 - D228. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. S. Walker, N. Stiffler, and A. Barkan POGs/PlantRBP: a resource for comparative genomics in plants Nucleic Acids Res., January 12, 2007; 35(suppl_1): D852 - D856. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Nash, S. Weng, B. Hitz, R. Balakrishnan, K. R. Christie, M. C. Costanzo, S. S. Dwight, S. R. Engel, D. G. Fisk, J. E. Hirschman, et al. Expanded protein information at SGD: new pages and proteome browser Nucleic Acids Res., January 12, 2007; 35(suppl_1): D468 - D471. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Mi, N. Guo, A. Kejariwal, and P. D. Thomas PANTHER version 6: protein sequence and function evolution data with expanded representation of biological pathways Nucleic Acids Res., January 12, 2007; 35(suppl_1): D247 - D252. [Abstract] [Full Text] [PDF] |
||||
![]() |
F.-C. Chen, C.-J. Chen, W.-H. Li, and T.-J. Chuang Human-specific insertions and deletions inferred from mammalian genome sequences Genome Res., January 1, 2007; 17(1): 16 - 22. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Kaplan and M. Linial ProtoBee: Hierarchical classification and annotation of the honey bee proteome Genome Res., November 1, 2006; 16(11): 1431 - 1438. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Hasegawa, S. Fukuda, K. Shimokawa, S. Kondo, N. Maeda, and Y. Hayashizaki A RecA-mediated exon profiling method Nucleic Acids Res., August 8, 2006; 34(13): e97 - e97. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-X. Chou, X.-Y. Wei, D.-S. Chen, and J.-C. Zhou Thirteen nodule-specific or nodule-enhanced genes encoding products homologous to cysteine cluster proteins or plant lipid transfer proteins are identified in Astragalus sinicus L. by suppressive subtractive hybridization J. Exp. Bot., August 1, 2006; 57(11): 2673 - 2685. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Nagamune and L. D. Sibley Comparative Genomic and Phylogenetic Analyses of Calcium ATPases and Calcium-Regulated Proteins in the Apicomplexa Mol. Biol. Evol., August 1, 2006; 23(8): 1613 - 1627. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Gao, Y. Zhong, A. Guo, Q. Zhu, W. Tang, W. Zheng, X. Gu, L. Wei, and J. Luo DRTF: a database of rice transcription factors Bioinformatics, May 15, 2006; 22(10): 1286 - 1287. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. A. Reeves, J. M. Thornton, and the BioSapiens Network of Excellence Integrating biological data through the genome Hum. Mol. Genet., April 15, 2006; 15(suppl_1): R81 - R87. [Abstract] [Full Text] [PDF] |
||||
![]() |
F.-C. Chen, S.-S. Wang, C.-J. Chen, W.-H. Li, and T.-J. Chuang Alternatively and Constitutively Spliced Exons Are Subject to Different Evolutionary Forces Mol. Biol. Evol., March 1, 2006; 23(3): 675 - 682. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Glasner, M. Rusch, P. Liss, G. Plunkett III, E. L. Cabot, A. Darling, B. D. Anderson, P. Infield-Harm, M. C. Gilson, and N. T. Perna ASAP: a resource for annotating, curating, comparing, and disseminating genomic data Nucleic Acids Res., January 1, 2006; 34(suppl_1): D41 - D45. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Heiges, H. Wang, E. Robinson, C. Aurrecoechea, X. Gao, N. Kaluskar, P. Rhodes, S. Wang, C.-Z. He, Y. Su, et al. CryptoDB: a Cryptosporidium bioinformatics resource update Nucleic Acids Res., January 1, 2006; 34(suppl_1): D419 - D422. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Hartmann, D. Lu, J. Phillips, and T. J. Vision Phytome: a platform for plant comparative genomics Nucleic Acids Res., January 1, 2006; 34(suppl_1): D724 - D730. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||




















