Nucleic Acids Research, 2005, Vol. 33, Database issue D284-D288
© 2005, the authors
Nucleic Acids Research, Vol. 33, Database issue © Oxford University Press 2005; all rights reserved
The PANTHER database of protein families, subfamilies, functions and pathways
Computational Biology, Applied Biosystems, 850 Lincoln Center Drive, Foster City, CA 94404, USA and 1 The Systems Biology Institute and ERATO-SORST Kitano Symbiotic Systems Project/Japan Science and Technology Agency, Suite 6A, M31, 6-31-15 Jingumae, Shibuya, Tokyo 150-0001, Japan
* To whom correspondence should be addressed. Tel: +1 650 554 2723; Fax: +1 650 554 2344; Email: paul.thomas{at}appliedbiosystems.com
Received September 15, 2004; Revised and Accepted October 8, 2004
| ABSTRACT |
|---|
|
|
|---|
PANTHER is a large collection of protein families that have been subdivided into functionally related subfamilies, using human expertise. These subfamilies model the divergence of specific functions within protein families, allowing more accurate association with function (ontology terms and pathways), as well as inference of amino acids important for functional specificity. Hidden Markov models (HMMs) are built for each family and subfamily for classifying additional protein sequences. The latest version, 5.0, contains 6683 protein families, divided into 31 705 subfamilies, covering
90% of mammalian protein-coding genes. PANTHER 5.0 includes a number of significant improvements over previous versions, most notably (i) representation of pathways (primarily signaling pathways) and association with subfamilies and individual protein sequences; (ii) an improved methodology for defining the PANTHER families and subfamilies, and for building the HMMs; (iii) resources for scoring sequences against PANTHER HMMs both over the web and locally; and (iv) a number of new web resources to facilitate analysis of large gene lists, including data generated from high-throughput expression experiments. Efforts are underway to add PANTHER to the InterPro suite of databases, and to make PANTHER consistent with the PIRSF database. PANTHER is now publicly available without restriction at http://panther.appliedbiosystems.com. | INTRODUCTION |
|---|
|
|
|---|
The philosophy, as well as the basic methodology, behind the PANTHER database has been described previously (1,2); therefore, we focus here on the recent improvements to the database and to the functionality available on the website. In brief, there are two main parts to PANTHER: PANTHER/LIB, a library of protein families and subfamilies; and PANTHER/X, a set of ontology terms describing protein function. The database's main advantage is in the curator-defined grouping of protein sequences into functional subfamilies, allowing more detailed and accurate association with the ontology terms, and now biological pathways. Each family and subfamily is represented by a phylogenetic tree of training sequences, and a hidden Markov model (HMM) that represents these sequences as a statistical model. The HMM library can be searched to classify new sequences, or to provide a score to predict the likely functional consequence of a mutation (1). PANTHER is quite comprehensive for the annotation of protein sequences encoded by metazoan genomes:
90% of mammalian protein-coding genes, and nearly two-thirds of Drosophila genes, are hit by a PANTHER HMM. The PANTHER database has recently been expanded to include associations between protein sequences and the biological pathways they participate in. Like the molecular function and biological process ontology terms, these pathways are associated with individual protein sequences, and when possible with PANTHER subfamily HMMs, by expert curators.
We have also improved the methodology used to define protein families and subfamilies. These improvements are mainly in two areas: global clustering of protein sequence space to allow definition of family boundaries, and new algorithms that make use of ontology terms to provide a guide for curators to define both families and subfamilies.
There are also a number of significant improvements to the website. Perhaps most importantly for users, the site is now free of the previous restrictions on its use (3). In addition, HMMs can be downloaded, and/or searched interactively using a protein sequence as a query. Pathways can be interactively browsed and queried. Gene lists (e.g. from mRNA expression data) can be uploaded to the site and analyzed relative to molecular functions, biological processes and pathways.
| STATISTICS FOR PANTHER 5.0 |
|---|
|
|
|---|
PANTHER/LIB (library of protein family and subfamily HMMs), version 5.0 contains 256 413 training sequences, grouped into 6683 families. These families were then divided further into 31 705 subfamilies.
PANTHER HMMs have been used to annotate the protein-coding genes annotated in the human, mouse, rat and Drosophila melanogaster genomes. The fractions of these genes that were given a functional annotation by PANTHER 5.0 are shown in Table 1.
|
| PANTHER WEBSITE FUNCTIONALITY |
|---|
|
|
|---|
Several resources are now available at the PANTHER website.
Interactive
- Ontology term browser. The PANTHER Prowler (1) is designed for browsing ontology terms to retrieve associated families, subfamilies or individual proteins.
- Updated: tree and multiple sequence alignment (MSA) viewer. The PANTHER Tree-Attribute Viewer facilitates exploration of each protein family tree. It has been modified recently to allow a user to view either the sequence annotations as described previously (1), or the family MSA. The MSA view includes a number of features such as highlighting subfamily-specific amino acid conservation.
- New: sequence search against PANTHER HMMs. The website now provides interactive scoring of user-submitted sequences against the PANTHER library.
- Classification of whole genomes. Users can browse or query stored PANTHER HMM hits for all protein sequences annotated in the whole genomes of human, mouse, rat [from the LocusLink database (4)] and Drosophila melanogaster [from FlyBase (5)].
- New: pathways. Users can browse or query pathways associated with PANTHER families, subfamilies and training sequences (Figure 1). Pathway diagrams were drawn by expert curators using the CellDesigner software program (6), which supports Systems Biology Markup Language (SBML) standards (7) and uses the process notation of Kitano (8) to represent pathways as a series of reactions. The same curators associated proteins in the diagrams with PANTHER families, subfamilies and training sequences. There are over 60 pathways, primarily signaling pathways, available as of January 2005.
- New: gene expression analysis tools. Users can upload gene lists (e.g. from mRNA expression experiments) and view them in the context of the pathways described above. In addition, users can analyze gene lists to look for statistically significant trends with respect to different groupings of genes: families, molecular functions, biological processes and pathways. In addition to the binomial test described previously (9), the MannWhitney U-test as described in (10) can be performed on uploaded data to look for statistically significant differences in distributions of uploaded values (Figure 2).
- New: pie and bar charts of functions. Graphical representations of the functions of genes or proteins across an entire list can be generated in a single click from any list on the site, either generated at the website or uploaded by a user (Figure 3).
|
|
|
Downloads
- PANTHER HMMs are available in both SAM (11) and HMMER (12) format.
- Modified InterProScan (13) software can be downloaded for scoring sequences locally against PANTHER HMMs.
| INTEGRATION WITH OTHER WEB RESOURCES |
|---|
|
|
|---|
PANTHER has been mapped to existing InterPro (14) entries, and this file is available from http://panther.appliedbiosystems.com/downloads/. PANTHER will be incorporated into the InterPro suite of databases incrementally. PANTHER HMMs have also been mapped to existing PIRSF (15) entries, and a collaboration is currently underway to make PANTHER and PIRSF consistent and cross-referenced.
| NEW METHODS FOR PANTHER 5.0 |
|---|
|
|
|---|
For version 5.0, we implemented a number of improvements to the PANTHER library building procedure as described previously (1). At the end of this process, we evaluated the HMM classifications of a test set of over 10 000 sequences from SWISS-PROT to make sure that the new process did not lower the accuracy of the classifications reported (16). We found that the classification accuracy was nearly identical, and the coverage was slightly improved in 5.0, probably due to the new HMM building process outlined below.
Global UPGMA clustering to define family boundaries
PANTHER version 3.0 (1,2) used seed-based clustering to define protein families. The advantage of this approach was its modularity: new families could be easily added in areas that were inadequately covered in previous versions. However, the seed-based clustering resulted in significant redundancy for a number of large protein families, such as protein kinases and G-protein-coupled receptors, which were covered by a number of families that overlapped to varying degrees.
The current version, PANTHER version 5.0, addresses this issue by implementing a global clustering of proteins. Proteins from PANTHER version 4.0 were clustered using a similarity metric derived from the pairwise BLASTP scores:
![]() | (1) |
This pairwise similarity was used to define single-linkage clusters (maximal clusters in which each protein is connected to at least one other protein in the cluster by a non-zero similarity score). A dendrogram was built for each single-linkage cluster using the UPGMA algorithm (17). The family labels from the PANTHER version 4.0 library were then used to define the optimal cut of each UPGMA dendrogram into family clusters, to maximize the correspondence to previous versions of PANTHER. In the great majority of cases, the PANTHER version 5.0 family was almost identical to the corresponding family in the previous version of the library. Only about 40 subtrees in the UPGMA dendrograms, primarily those that were represented by overlapping clusters in the previous version, had to be broken further into functionally homogeneous clusters using manual curation. Overall, the family clusters identified from the UPGMA dendrograms covered over 96% of the version 4.0 training sequences. The rest of the sequences were either singletons according to Equation 1 (often due to low-complexity masking), or lay outside the family boundaries defined by PANTHER version 4.0 family labels on the UPGMA dendrograms. Each of these leftover sequences (unmasked) was scored against SAM HMMs built for the family clusters, and was brought into the family of the best scoring HMM if the NLL-NULL score was less than 50. Those leftovers not meeting this criterion were added as singleton families if they were from a primate or rodent species; otherwise they were removed from the library.
Simplified HMM building process
The UPGMA-derived family clusters allow us to simplify the HMM-building process detailed previously (1). Rather than building initial and extended HMMs, for PANTHER 5.0, we built the family HMM directly from the UPGMA family cluster in a single step. Because the HMM training sequences are of varying lengths, we pre-set the SAM buildmodel modellength option to be 1.1 times the maximum sequence length in the cluster, and also added the option sw2, to create a local HMM. Similar to previous versions of the library, this temporary HMM was used to create an alignment (using the SAM align2model procedure with the sw2 option) that could be used to estimate the weights of the sequences in the initial HMM. A weighted model was then constructed followed by a weighted alignment.
In PANTHER 5.0, we used a faster version of TIPS (version 2.0, available from the Downloads section of the PANTHER website) to create the phylogenetic trees (18). As in previous versions, the MSA was used as input to the new TIPS2 algorithm, along with the following parameters. -prior uprior.9.com, -score_matrix BLOSUM 62, -cut_using_distance 0.5, -pair_type 1 and -use_are_as_branch_length 0.
Subfamily division guided by ontology terms
Because the subfamily labels and associated ontology terms were expanded and reviewed by curators for both versions 3.0 and 4.0, and shown to have a high rate of accuracy (16), we developed an algorithm for optimally dividing a tree into subfamilies given subfamily labels on each sequence (18). These divisions were then reviewed once again by expert curators, and adjusted if necessary. This methodology will allow regular updates to PANTHER training sequences with minimal curation effort.
Another significant advantage of this approach is that any arbitrary grouping of sequences can be superimposed on our phylogenetic trees to define subfamilies (and associated HMMs). This approach will allow straightforward incorporation of external annotations such as those produced by single protein family databases, or from large ontology association projects such as GOA (19,20).
| Notes |
|---|
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use permissions, please contact journals.permissions{at}oupjournals.org.
| REFERENCES |
|---|
|
|
|---|
- Thomas,P.D., Campbell,M.C., Kejariwal,A., Mi,H., Karlak,B., Daveran,R., Diemer,K., Muruganujan,A. and Narechania,A. ( (2003) ) PANTHER: a library of protein families and subfamilies indexed by function. Genome Res., , 13, , 21292141.
[Abstract/Free Full Text] . - Thomas,P.D., Kejariwal,A., Campbell,M.C., Mi,H., Diemer,K., Guo,N., Ladunga,I., Ulitsky-Lazareva,B., Muruganujan,A. and Rabkin,S. ( (2003) ) PANTHER: a browsable database of gene products organized by biological function, using curated protein family and subfamily classification. Nucleic Acids Res., , 31, , 334341.
[Abstract/Free Full Text] . - Thomas,P.D., Campbell,M.C., Kejariwal,A., Mi,H., Karlak,B., Daveran,R., Diemer,K., Muruganujan,A. and Narechania,A. ( (2003) ) Corrigendum for PANTHER: a library of protein families and subfamilies indexed by function. Nucleic Acids Res., , 31, , 2024.
[Free Full Text] . - Pruitt,K.D. and Maglott,D.R. ( (2001) ) RefSeq and LocusLink: NCBI gene-centered resources. Nucleic Acids Res., , 29, , 137140.
[Abstract/Free Full Text] . - FlyBase Consortium ( (2002) ) The FlyBase database of the Drosophila genome projects and community literature. Nucleic Acids Res., , 30, , 106108.
[Abstract/Free Full Text] . - Funahashi,A., Morohashi,M. and Kitano,H. ( (2003) ) CellDesigner: a process diagram editor for gene-regulatory and biochemical networks. Biosilico, , 1, , 159162.[CrossRef] .
- Hucka,M., Finney,A., Sauro,H.M., Bolouri,H., Doyle,J.C., Kitano,H., Arkin,A.P., Bornstein,B.J., Bray,D., Cornish-Bowden,A. et al. ( (2003) ) The Systems Biology Markup Language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics, , 19, , 524531.
[Abstract/Free Full Text] . - Kitano,H. ( (2003) ) A graphical notation for biochemical networks. Biosilico, , 1, , 169176.[CrossRef] .
- Cho,R.J. and Campbell,M.J. ( (2000) ) Transcription, genomes, function. Trends Genet., , 16, , 409415.[CrossRef][Web of Science][Medline] .
- Clark,A.G., Glanowski,S., Nielsen,R., Thomas,P.D., Kejariwal,A., Todd,M.J., Tanenbaum,D.M., Civello,D., Lu,F., Murphy,B. et al. ( (2003) ) Inferring nonneutral evolution from humanchimpmouse orthologous trios. Science, , 302, , 19601963.
[Abstract/Free Full Text] . - Karplus,K., Barrett,C. and Hughey,R. ( (1998) ) Hidden Markov models for detecting remote protein homologies. Bioinformatics, , 14, , 846856.
[Abstract/Free Full Text] . - Eddy,S.R. ( (1996) ) Hidden Markov models. Curr. Opin. Struct. Biol., , 6, , 361365.[CrossRef][Web of Science][Medline] .
- Zdobnov,E.M. and Apweiler,R. ( (2001) ) InterProScanan integration platform for the signature-recognition methods in InterPro. Bioinformatics, , 17, , 847848.
[Abstract/Free Full Text] . - Mulder,N.J., Apweiler,R., Attwood,T.K., Bairoch,A., Barrell,D., Bateman,A., Binns,D., Biswas,M., Bradley,P., Bork,P. et al. ( (2003) ) The InterPro Database, 2003 brings increased coverage and new features. Nucleic Acids Res., , 31, , 315318.
[Abstract/Free Full Text] . - Wu,C.H., Nikolskaya,A., Huang,H., Yeh,L.S., Natale,D.A., Vinayaka,C.R., Hu,Z.Z., Mazumder,R., Kumar,S., Kourtesis,P. et al. ( (2004) ) PIRSF: family classification system at the Protein Information Resource. Nucleic Acids Res., , 32, , 112114. .
- Mi,H., Vandergriff,J., Campbell,M., Narechania,A., Majoros,W., Lewis,S., Thomas,P.D. and Ashburner,M. ( (2003) ) Assessment of genome-wide protein function classification for Drosophila melanogaster. Genome Res., , 13, , 21182128.
[Abstract/Free Full Text] . - Sokal,R.R. and Michener,C.D. ( (1958) ) A statistical method for evaluation systematic relationships. Univ. Kansas Sci. Bull., , 28, , 14091438. .
- Lazareva-Ulitsky,B. and Thomas,P.D. ( (2005) ) On the quality of tree-based protein classification. Bioinformatics, in press. .
- Gene Ontology Consortium ( (2000) ) Gene Ontology: tool for the unification of biology. Nature Genet., , 25, , 2529.[CrossRef][Web of Science][Medline] .
- Camon,E., Magrane,M., Barrell,D., Binns,D., Fleischmann,W., Kersey,P., Mulder,N., Oinn,T., Maslen,J., Cox,A. et al. ( (2003) ) The Gene Ontology Annotation (GOA) project: implementation of GO in SWISS-PROT, TrEMBL, and InterPro. Genome Res., , 13, , 662672.
[Abstract/Free Full Text] .
This article has been cited by other articles:
![]() |
C. C. Babbitt, O. Fedrigo, A. D. Pfefferle, A. P. Boyle, J. E. Horvath, T. S. Furey, and G. A. Wray Both Noncoding and Protein-Coding RNAs Contribute to Gene Expression Evolution in the Primate Brain Gen Biol Evol, February 8, 2010; 2010(0): 67 - 79. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. K. Kim, H. S. Lillehoj, S. H. Lee, S. I. Jang, and D. Bravo High-throughput gene expression analysis of intestinal intraepithelial lymphocytes after oral feeding of carvacrol, cinnamaldehyde, or Capsicum oleoresin Poult. Sci., January 1, 2010; 89(1): 68 - 81. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Beckstette, R. Homann, R. Giegerich, and S. Kurtz Significant speedup of database searches with HMMs by search space reduction with PSSM family models Bioinformatics, December 15, 2009; 25(24): 3251 - 3258. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Meyer, R. Overbeek, and A. Rodriguez FIGfams: yet another set of protein families Nucleic Acids Res., November 1, 2009; 37(20): 6643 - 6654. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Yuan, J. Han, G. Guo, Y. L. Orlov, M. Huss, Y.-H. Loh, L.-P. Yaw, P. Robson, B. Lim, and H.-H. Ng Eset partners with Oct4 to restrict extraembryonic trophoblast lineage potential in embryonic stem cells Genes & Dev., November 1, 2009; 23(21): 2507 - 2520. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Jaroudi, G. Kakourou, S. Cawood, A. Doshi, D. M. Ranieri, P. Serhal, J. C. Harper, and S. B. SenGupta Expression profiling of DNA repair genes in human oocytes and blastocysts using microarrays Hum. Reprod., October 1, 2009; 24(10): 2649 - 2655. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Lawson and L. Zhang Sexy gene conversions: locating gene conversions on the X-chromosome Nucleic Acids Res., August 1, 2009; 37(14): 4570 - 4579. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. K. Kim, C. H. Kim, S. J. Lamont, C. L. Keeler Jr., and H. S. Lillehoj Gene expression profiles of two B-complex disparate, genetically inbred Fayoumi chicken lines that differ in susceptibility to Eimeria maxima Poult. Sci., August 1, 2009; 88(8): 1565 - 1579. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Han, J. M. Burnette III, and S. R. Wessler TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences Nucleic Acids Res., June 1, 2009; 37(11): e78 - e78. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. KOKENY, J. PAPP, G. WEBER, T. VASZKO, P. CARMONA-SAEZ, and E. OLAH Ribavirin Acts via Multiple Pathways in Inhibition of Leukemic Cell Proliferation Anticancer Res, June 1, 2009; 29(6): 1971 - 1980. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Lorente-Rodriguez, M. Heidtman, and C. Barlowe Multicopy suppressor analysis of thermosensitive YIP1 alleles implicates GOT1 in transport from the ER J. Cell Sci., May 15, 2009; 122(10): 1540 - 1550. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Wilflingseder, A. Kainz, P. Perco, R. Korbely, B. Mayer, and R. Oberbauer Molecular predictors for anaemia after kidney transplantation Nephrol. Dial. Transplant., March 1, 2009; 24(3): 1015 - 1023. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Suss, C. Czupalla, C. Winter, T. Pursche, K.-P. Knoch, M. Schroeder, B. Hoflack, and M. Solimena Rapid Changes of mRNA-binding Protein Levels following Glucose and 3-Isobutyl-1-methylxanthine Stimulation of Insulinoma INS-1 Cells Mol. Cell. Proteomics, March 1, 2009; 8(3): 393 - 408. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Girirajan, L. Chen, T. Graves, T. Marques-Bonet, M. Ventura, C. Fronick, L. Fulton, M. Rocchi, R. S. Fulton, R. K. Wilson, et al. Sequencing human-gibbon breakpoints of synteny reveals mosaic new insertions at rearrangement sites Genome Res., February 1, 2009; 19(2): 178 - 190. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. A. Martin, D. Miranda-Saavedra, and G. J. Barton Kinomer v. 1.0: a database of systematically classified eukaryotic protein kinases Nucleic Acids Res., January 1, 2009; 37(suppl_1): D244 - D250. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Wilson, R. Pethica, Y. Zhou, C. Talbot, C. Vogel, M. Madera, C. Chothia, and J. Gough SUPERFAMILY--sophisticated comparative genomics, data mining, visualization and phylogeny Nucleic Acids Res., January 1, 2009; 37(suppl_1): D380 - D386. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Meng, M. J. Moscou, and R. P. Wise Blufensin1 Negatively Impacts Basal Defense in Response to Barley Powdery Mildew Plant Physiology, January 1, 2009; 149(1): 271 - 285. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Larrouy, P. Barbe, C. Valle, S. Dejean, V. Pelloux, C. Thalamas, J.-P. Bastard, A. Le Bouil, B. Diquet, K. Clement, et al. Gene expression profiling of human skeletal muscle in response to stabilized weight loss Am. J. Clinical Nutrition, July 1, 2008; 88(1): 125 - 132. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. L. Khan, R. Vadigepalli, M. K. McDonald, R. F. Rogers, G. R. Gao, and J. S. Schwaber Dynamic transcriptomic response to acute hypertension in the nucleus tractus solitarius Am J Physiol Regulatory Integrative Comp Physiol, July 1, 2008; 295(1): R15 - R27. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. F. W. Saunders and B. Kobe The Predikin webserver: improved prediction of protein kinase peptide specificity using structural information Nucleic Acids Res., July 1, 2008; 36(suppl_2): W286 - W290. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Wang miRDB: A microRNA target prediction and functional annotation database with a wiki interface RNA, June 1, 2008; 14(6): 1012 - 1017. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Akagi, J. Li, R. M. Stephens, N. Volfovsky, and D. E. Symer Extensive variation between inbred mouse strains due to endogenous L1 retrotransposition Genome Res., June 1, 2008; 18(6): 869 - 880. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Singh, A. Olowoyeye, P. H. Baenziger, J. Dantzer, M. G. Kann, P. Radivojac, R. Heiland, and S. D. Mooney MutDB: update on development of tools for the biochemical analysis of genetic variation Nucleic Acids Res., January 11, 2008; 36(suppl_1): D815 - D819. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Crespi, K. Summers, and S. Dorus Adaptive evolution of genes underlying schizophrenia Proc R Soc B, November 22, 2007; 274(1627): 2801 - 2810. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Alexe, G. S. Dalgin, D. Scanfeld, P. Tamayo, J. P. Mesirov, C. DeLisi, L. Harris, N. Barnard, M. Martel, A. J. Levine, et al. High Expression of Lymphocyte-Associated Genes in Node-Negative HER2+ Breast Cancers Correlates with Lower Recurrence Rates Cancer Res., November 15, 2007; 67(22): 10669 - 10676. [Abstract] [Full Text] [PDF] |
||||
![]() |
T C T M van der Pouw Kraan, C A Wijbrandts, L G M van Baarsen, A E Voskuyl, F Rustenburg, J M Baggen, S M Ibrahim, M Fero, B A C Dijkmans, P P Tak, et al. Rheumatoid arthritis subtypes identified by genomic profiling of peripheral blood cells: assignment of a type I interferon signature in a subpopulation of patients Ann Rheum Dis, August 1, 2007; 66(8): 1008 - 1014. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Kultz, D. Fiol, N. Valkova, S. Gomez-Jimenez, S. Y. Chan, and J. Lee Functional genomics and proteomics of the cellular osmotic stress response in `non-model' organisms J. Exp. Biol., May 1, 2007; 210(9): 1593 - 1601. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Bakewell, P. Shi, and J. Zhang More genes underwent positive selection in chimpanzee evolution than in human evolution PNAS, May 1, 2007; 104(18): 7489 - 7494. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. Chang and M. A. F. Noor The Genetics of Hybrid Male Sterility Between the Allopatric Species Pair Drosophila persimilis and D. pseudoobscura bogotana: Dominant Sterility Alleles in Collinear Autosomal Regions Genetics, May 1, 2007; 176(1): 343 - 349. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Agrawal, W.-K. Hofmann, N. Tidow, M. Ehrich, D. v. d. Boom, S. Koschmieder, W. E. Berdel, H. Serve, and C. Muller-Tidow The C/EBP{delta} tumor suppressor is silenced by hypermethylation in acute myeloid leukemia Blood, May 1, 2007; 109(9): 3895 - 3905. [Abstract] [Full Text] [PDF] |
||||
![]() |
Rhesus Macaque Genome Sequencing and Analysis Cons, R. A. Gibbs, J. Rogers, M. G. Katze, R. Bumgarner, G. M. Weinstock, E. R. Mardis, K. A. Remington, R. L. Strausberg, J. C. Venter, et al. Evolutionary and Biomedical Insights from the Rhesus Macaque Genome Science, April 13, 2007; 316(5822): 222 - 234. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Richardt, D. Lang, R. Reski, W. Frank, and S. A. Rensing PlanTAPDB, a Phylogeny-Based Resource of Plant Transcription-Associated Proteins Plant Physiology, April 1, 2007; 143(4): 1452 - 1466. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. J. Mulder, R. Apweiler, T. K. Attwood, A. Bairoch, A. Bateman, D. Binns, P. Bork, V. Buillard, L. Cerutti, R. Copley, et al. New developments in the InterPro database Nucleic Acids Res., January 12, 2007; 35(suppl_1): D224 - D228. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Mi, N. Guo, A. Kejariwal, and P. D. Thomas PANTHER version 6: protein sequence and function evolution data with expanded representation of biological pathways Nucleic Acids Res., January 12, 2007; 35(suppl_1): D247 - D252. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Voolstra, D. Tautz, P. Farbrother, L. Eichinger, and B. Harr Contrasting evolution of expression differences in the testis between species and subspecies of the house mouse Genome Res., January 1, 2007; 17(1): 42 - 49. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Moehring, K. C. Teeter, and M. A. F. Noor Genome-Wide Patterns of Expression in Drosophila Pure Species and Hybrid Males. II. Examination of Multiple-Species Hybridizations, Platforms, and Life Cycle Stages Mol. Biol. Evol., January 1, 2007; 24(1): 137 - 145. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ng, B. Bursteinas, Q. Gao, E. Mollison, and M. Zvelebil Resources for integrative systems biology: from data through databases to networks and dynamic system models Brief Bioinform, December 1, 2006; 7(4): 318 - 330. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. I. Zeller, X. Zhao, C. W. H. Lee, K. P. Chiu, F. Yao, J. T. Yustein, H. S. Ooi, Y. L. Orlov, A. Shahab, H. C. Yong, et al. Global mapping of c-Myc binding sites and target gene networks in human B cells PNAS, November 21, 2006; 103(47): 17834 - 17839. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. V. Ivshina, J. George, O. Senko, B. Mow, T. C. Putti, J. Smeds, T. Lindahl, Y. Pawitan, P. Hall, H. Nordgren, et al. Genetic Reclassification of Histologic Grade Delineates New Clinical Subtypes of Breast Cancer Cancer Res., November 1, 2006; 66(21): 10292 - 10301. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Friedberg Automated protein function prediction--the genomic challenge Brief Bioinform, September 1, 2006; 7(3): 225 - 242. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Bryson, V. Loux, R. Bossy, P. Nicolas, S. Chaillou, M. van de Guchte, S. Penaud, E. Maguin, M. Hoebeke, P. Bessieres, et al. AGMIAL: implementing an annotation strategy for prokaryote genomes as a distributed system Nucleic Acids Res., July 19, 2006; 34(12): 3533 - 3545. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. P. Duffy, A. M. Young, B. Morin, C. J. Lucarotti, B. F. Koop, and D. B. Levin Sequence Analysis and Organization of the Neodiprion abietis Nucleopolyhedrovirus Genome J. Virol., July 15, 2006; 80(14): 6952 - 6963. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. D. Thomas, A. Kejariwal, N. Guo, H. Mi, M. J. Campbell, A. Muruganujan, and B. Lazareva-Ulitsky Applications for protein sequence-function evolution data: mRNA/protein expression analysis and coding SNP scoring tools. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W645 - W650. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Harr Genomic islands of differentiation between house mouse subspecies Genome Res., June 1, 2006; 16(6): 730 - 737. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Sivakumar, C. Wilton, and L. Holm From sequences to a functional unit Physiol Genomics, March 13, 2006; 25(1): 1 - 8. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Li, A. Coghlan, J. Ruan, L. J. Coin, J.-K. Heriche, L. Osmotherly, R. Li, T. Liu, Z. Zhang, L. Bolund, et al. TreeFam: a curated database of phylogenetic trees of animal gene families Nucleic Acids Res., January 1, 2006; 34(suppl_1): D572 - D580. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Quevillon, V. Silventoinen, S. Pillai, N. Harte, N. Mulder, R. Apweiler, and R. Lopez InterProScan: protein domains identifier Nucleic Acids Res., July 1, 2005; 33(suppl_2): W116 - W120. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||






























