Nucleic Acids Research, 2005, Vol. 33, Database issue D192-D196
© 2005, the authors
Nucleic Acids Research, Vol. 33, Database issue © Oxford University Press 2005; all rights reserved
CDD: a Conserved Domain Database for protein classification
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38 A, Room 8N805, 8600 Rockville Pike, Bethesda, MD 20894, USA
* To whom correspondence should be addressed. Tel: +1 301 435 4919; Fax: +1 301 480 9241; Email: bauer{at}ncbi.nlm.nih.gov
Received September 22, 2004; Accepted October 5, 2004
| ABSTRACT |
|---|
|
|
|---|
The Conserved Domain Database (CDD) is the protein classification component of NCBI's Entrez query and retrieval system. CDD is linked to other Entrez databases such as Proteins, Taxonomy and PubMed®, and can be accessed at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=cdd. CD-Search, which is available at http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi, is a fast, interactive tool to identify conserved domains in new protein sequences. CD-Search results for protein sequences in Entrez are pre-computed to provide links between proteins and domain models, and computational annotation visible upon request. Proteinprotein queries submitted to NCBI's BLAST search service at http://www.ncbi.nlm.nih.gov/BLAST are scanned for the presence of conserved domains by default. While CDD started out as essentially a mirror of publicly available domain alignment collections, such as SMART, Pfam and COG, we have continued an effort to update, and in some cases replace these models with domain hierarchies curated at the NCBI. Here, we report on the progress of the curation effort and associated improvements in the functionality of the CDD information retrieval system.
| INTRODUCTION |
|---|
|
|
|---|
Protein domains are distinct units of molecular evolution, usually associated with particular aspects of molecular function such as catalysis or binding. In general, they represent discrete units of three-dimensional (3D) structure. The identification of functionally characterized domains in protein sequences may give the first clues as to their molecular and cellular function.
Protein domains come in families. A dazzling array of functional diversity, and a large number of clusters grouped by obvious sequence similarity, can be reduced to anywhere between several hundred and a few thousand domain superfamilies, depending on how aggressively one groups clusters based on 3D-structural and/or functional similarities. In many cases, a single or a few search models are sufficient to uniquely identify all members of a large, diverse superfamily in a sequence database. In fact, it is possible to identify and label domains in more than two-thirds of the known protein sequences with only a few thousand domain models, as exemplified by the comprehensive collection Pfam (1). However, even a compact collection such as Pfam cannot help but create separate models for what are truly homologous families. Overlapping regions in protein sequences will sometimes be annotated by more than one model. The Conserved Domain Database (CDD) also mirrors other collections, which are largely redundant with Pfam: SMART (2) and COG (3). This, of course, aggravates the annotation problem. Users of the CD-Search resource (4) may face multiple overlapping annotations, sometimes with very similar scores but distinct functional association. This often-confusing redundancy is a necessary, but not desired property of a multiple-source collection such as CDD.
One can take certain obvious steps to reduce the redundancy, and this is what we have begun to do in CDD version v2.00. Search models are clustered based on overlapping hits in the protein database. Members of a cluster that do not significantly add to the cluster's total coverage are removed from Entrez's default CDD collection. We have also removed search models, which annotated very few or no sequences, and search models that seem to be specific for proteins and/or domains found only in narrow phylogenetic lineages.
However, redundancy can be a good thing, if it provides more specific functional annotations, and the relationships between related models are clear and well explained to the user. There are practical limits to subdividing domain superfamilies: a large number of domain models will affect the database search time, and experimentally backed functional annotation is sparse in many cases. For CDD, we have adopted a principle of creating subfamilies only for ancient conserved domains, present in diverse organisms. We create subfamilies only when the phylogenetic distribution of member sequences suggests an origin of a domain orthology group by gene duplication occurring
0.5 Byr in the past or earlier. This principle helps us to maintain what we hope will be a uniform and understandable level of granularity. In subfamilies, we attempt to identify function from the sequence annotation and the published literature. Alignment models are kept consistent throughout superfamily hierarchies. The core model in a subfamily alignment can be mapped onto the often less extensive alignment in the parent model, greatly facilitating updates to include novel representative structures and sequences.
To identify ancient subfamilies for splitting out individual search models, we perform phylogenetic analysis on the multiple sequence alignments and construct sequence trees. This procedure requires fairly accurate alignments, and frequently we do revise alignment models imported from outside sources. In alignment curation, we consider information from 3D structure and structure superposition, when possible, to define structurally conserved cores, accurately delineate domain boundaries and resolve conflicts between sequence-based alignment methods and structure superposition (5). Alignments curated at the NCBI conform to a simple block-structure, with uniformly aligned, gap-less, structurally conserved blocks separated by unaligned regions, which capture length variation.
Alignment models from both curated and imported sets are converted into position-specific scoring matrices, and the latter are assembled into search databases for use with RPS-BLAST (6).
| CDD CONTENTS AND ACCESS |
|---|
|
|
|---|
CDD is accessible through the Entrez data retrieval system (7), and can be queried as Entrez's Domains database. Domain names and terms found in functional descriptions are indexed, and additional search capabilities are provided through reciprocal links to other Entrez resources, such as the NCBI Taxonomy Database, PubMed® and Entrez's protein database. Pre-calculated CD-Searches for proteins are recorded in the CDART database (8), which provides summaries of domain architecture for all proteins in Entrez. Pre-calculated search results are readily accessible, and provide data for proteindomain links, proteinprotein links based on similar domain architecture and domaindomain links based on overlapping hit-lists.
Most of the domain models in CDD have been imported from two outside sources, Pfam and COG. CDD also contains models from SMART, and several hundred NCBI-curated domain models, identifiable by accessions starting with cd. While CD-Search continues to mirror Pfam version 11.0, SMART version 4.0 and COG as individual search sets, the default non-redundant CDD v2.01, as available in Entrez, currently retains only 5252 of 7255 Pfam models, 575 of 663 SMART models and 4101 of 4873 COG models. The remainder has been removed as redundant, ineffective or lineage-specific.
Search models for use with local RPS-BLAST installations, as well as CDD alignments are available at ftp://ftp.ncbi.nlm.nih.gov/pub/mmdb/cdd/. The source code for RPS-BLAST is part of the NCBI toolkit distribution, accessible at ftp://ftp.ncbi.nlm.nih.gov/toolbox.
| FINDING DOMAINS IN ENTREZ |
|---|
|
|
|---|
When protein queries are submitted for proteinprotein BLAST® searches, they are submitted to CD-Search by default, and the resulting domain annotation is displayed graphically on the intermediate BLAST® results page. One may launch a browser window with the detailed results. Pre-calculated CD-Search results are also readily available for proteins in Entrez, following the [Domains] links.
One might, for example, study a family of plant kinesins, exemplified by gi|10130006 from Zea mays. CD-Search produces a graphical display as shown in the upper half of Figure 1. Two regions of the query protein receive multiple and seemingly redundant hits. The central coiled-coil region scores well with a variety of coiled-coil models contained in the uncurated subset of CDD. The C-terminal motor domain scores well with several models curated at the NCBI.
|
One may follow the link to the best-scoring match, cd01366 or KISc_C_terminal, to see the query sequence embedded in its multiple sequence alignment. What will become evident is that cd01366 is one terminal node in a larger hierarchy of related domain models, summarizing kinesin and myosin motor domains in this example. The second-best, third-best scoring hits, and so on, for the C-terminal region of gi|10130006 are to other nodes in this hierarchy. One may want to compare scores and E-values to understand whether the query sequence scores significantly better with one particular subgroup or not.
At the level of each individual subgroup, the similarity of the query sequence to other members of that family may be examined. We record conserved features in CD alignment models, such as active sites or binding interfaces, and their locations and residue conservation patterns may be examined in the context of the query. We provide additional annotation, such as links to literature in PubMed and links to textbooks in Entrez, so that the user can learn more about the biology of the respective families.
Building the domain family hierarchies and recording conserved features are major goals of NCBI's curation effort. We record conserved features together with evidence, such as structure evidence, particular 3D complexes that exemplify binding, for example, or literature citations. We also record the sequence trees used in making decisions about subfamily splits, as an evidence for the domain family hierarchy.
| FUTURE DEVELOPMENTS |
|---|
|
|
|---|
Beginning in 2005, we plan to distribute the software used to build and maintain these hierarchies, to serve as a helper application for the web-browser, enabling users to visualize CD family hierarchies, sequence trees and taxonomic diversity across nodes in sequence trees. Figure 2 displays a sequence-tree calculated for the Myosin/Kinesin motor domain family, which was used as an example in Figure 1.
|
Having access to the alignment data and analysis algorithms used by the NCBI curators should make the hierarchy editing process more transparent, should users want to investigate. Interested users of the hierarchy editor and of Cn3D (9), the associated structure-based alignment editor, will be able to import additional sequences and examine their behavior in phylogenetic clustering.
| ACKNOWLEDGEMENTS |
|---|
We thank the authors of Pfam, SMART and COG, for creating invaluable resources and for helping with access to data. We also thank the NIH Intramural Research Program for support. We are grateful to the NCBI BLAST group for developing RPS-BLAST and for continuous support. Comments, suggestions and questions are welcome and should be directed to info{at}ncbi.nlm.nih.gov.
| Notes |
|---|
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use permissions, please contact journals.permissions{at}oupjournals.org.
| REFERENCES |
|---|
|
|
|---|
- Bateman,A., Coin,L., Durbin,R., Finn,R.D., Hollich,V., Griffiths-Jones,S., Khanna,A., Marshall,M., Moxon,S., Sonnhammer,E.L., Studholme,D.J., Yeats,C. and Eddy,S.R. ( (2004) ) The Pfam protein families database. Nucleic Acids Res., , 32, , 138141. .
- Letunic,I., Copley,R.R., Schmidt,S., Ciccarelli,F.D., Doerks,T., Schultz,J., Ponting,C.P. and Bork,P. ( (2004) ) SMART 4.0: towards genomic data integration. Nucleic Acids Res., , 32, , 142144. .
- Tatusov,R.L., Fedorova,N.D., Jackson,J.D., Jacobs,A.R., Kiryutin,B., Koonin,E.V., Krylov,D.M., Mazumder,R., Mekhedov,S.L., Nikolskaya,A.N., Rao,B.S., Smirnov,S., Sverdlov,A.V., Vasudevan,S., Wolf,Y.I., Yin,J.J. and Natale,D.A. ( (2003) ) The COG database: and updated version includes eukaryotes. BMC Bioinformatics, , 4, , 41.[CrossRef][Medline] .
- Marchler-Bauer,A. and Bryant,S.H. ( (2004) ) CD-Search: protein domain annotations on the fly. Nucleic Acids Res., , 32, , W327W331.
[Abstract/Free Full Text] . - Marchler-Bauer,A., Panchenko,A.R., Ariel,N. and Bryant,S.H. ( (2002) ) Comparison of sequence and structure alignments for protein domains. Proteins, , 48, , 439446.[CrossRef][Web of Science][Medline] .
- Marchler-Bauer,A., Panchenko,A.R., Shoemaker,B.A., Thiessen,P.A., Geer,L.Y. and Bryant,S.H. ( (2002) ) CDD: a database of conserved domain alignments with links to domain three-dimensional structure. Nucleic Acids Res., , 30, , 281283.
[Abstract/Free Full Text] . - Wheeler,D.L., Church,D.M., Edgar,R., Federhen,S., Helmberg,W., Madden,T.L., Pontius,J.U., Schuler,G.D., Schriml,L.M., Sequeira,E., Suzek,T.O., Tatusova,T.A. and Wagner,L. ( (2004) ) Database resources of the National Center for Biotechnology Information: update. Nucleic Acids Res., , 32, , 3540.
[Abstract/Free Full Text] . - Geer,L.Y., Domrachev,M., Lipman,D.J. and Bryant,S.H. ( (2002) ) CDART: protein homology by domain architecture. Genome Res., , 12, , 16191623.
[Abstract/Free Full Text] . - Wang,Y., Geer,L.Y., Chappey,C., Kans,J.A. and Bryant,S.H. ( (2000) ) Cn3D: sequence and structure views for Entrez. Trends Biochem. Sci., , 25, , 300302.[CrossRef][Web of Science][Medline]
.
This article has been cited by other articles:
![]() |
R. M. Presti, G. Zhao, W. L. Beatty, K. A. Mihindukulasuriya, A. P. A. Travassos da Rosa, V. L. Popov, R. B. Tesh, H. W. Virgin, and D. Wang Quaranfil, Johnston Atoll, and Lake Chad Viruses Are Novel Members of the Family Orthomyxoviridae J. Virol., November 15, 2009; 83(22): 11599 - 11606. [Abstract] [Full Text] [PDF] |
||||
![]() |
B.-C. Lee and D. Kim A new method for revealing correlated mutations under the structural and functional constraints in proteins Bioinformatics, October 1, 2009; 25(19): 2506 - 2513. [Abstract] [Full Text] [PDF] |
||||
![]() |
D.-D. Wu, G.-D. Wang, D. M Irwin, and Y.-P. Zhang A Profound Role for the Expansion of Trypsin-Like Serine Protease Family in the Evolution of Hematophagy in Mosquito Mol. Biol. Evol., October 1, 2009; 26(10): 2333 - 2341. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Meyer, R. Overbeek, and A. Rodriguez FIGfams: yet another set of protein families Nucleic Acids Res., September 17, 2009; (2009) gkp698v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
T.-H. Lee, Y.-K. Kim, T. T. M. Pham, S. I. Song, J.-K. Kim, K. Y. Kang, G. An, K.-H. Jung, D. W. Galbraith, M. Kim, et al. RiceArrayNet: A Database for Correlating Gene Expression from Transcriptome Profiling, and Its Application to the Analysis of Coexpressed Genes in Rice Plant Physiology, September 1, 2009; 151(1): 16 - 33. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. A. Richards, D. M. Soanes, P. G. Foster, G. Leonard, C. R. Thornton, and N. J. Talbot Phylogenomic Analysis Demonstrates a Pattern of Rare and Ancient Horizontal Gene Transfer between Plants and Fungi PLANT CELL, July 1, 2009; 21(7): 1897 - 1911. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. B. Gammon and D. H. Evans The 3'-to-5' Exonuclease Activity of Vaccinia Virus DNA Polymerase Is Essential and Plays a Role in Promoting Virus Genetic Recombination J. Virol., May 1, 2009; 83(9): 4236 - 4250. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. P. Riley, T. Bykowski, A. E. Cooley, L. H. Burns, K. Babb, C. A. Brissette, A. Bowman, M. Rotondi, M. C. Miller, E. DeMoll, et al. Borrelia burgdorferi EbfC defines a newly-identified, widespread family of bacterial DNA-binding proteins Nucleic Acids Res., April 1, 2009; 37(6): 1973 - 1983. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Cuthbertson, I. L. Mainprize, J. H. Naismith, and C. Whitfield Pivotal Roles of the Outer Membrane Polysaccharide Export and Polysaccharide Copolymerase Protein Families in Export of Extracellular Polysaccharides in Gram-Negative Bacteria Microbiol. Mol. Biol. Rev., March 1, 2009; 73(1): 155 - 177. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. J. Bialas, P. N. Inglis, C. Li, J. F. Robinson, J. D. K. Parker, M. P. Healey, E. E. Davis, C. D. Inglis, T. Toivonen, D. C. Cottell, et al. Functional interactions between the ciliopathy-associated Meckel syndrome 1 (MKS1) protein and two novel MKS1-related (MKSR) proteins J. Cell Sci., March 1, 2009; 122(5): 611 - 624. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Schuette, A. J. Wood, M. Geisler, J. Geisler-Lee, R. Ligrone, and K. S. Renzaglia Novel localization of callose in the spores of Physcomitrella patens and phylogenomics of the callose synthase gene family Ann. Bot., March 1, 2009; 103(5): 749 - 756. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. E. Cozen, M. T. Weirauch, K. S. Pollard, D. L. Bernick, J. M. Stuart, and T. M. Lowe Transcriptional Map of Respiratory Versatility in the Hyperthermophilic Crenarchaeon Pyrobaculum aerophilum J. Bacteriol., February 1, 2009; 191(3): 782 - 794. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. H. Hur, K. Van Doninck, M. L. Mandigo, and M. Meselson Degenerate Tetraploidy Was Established Before Bdelloid Rotifer Families Diverged Mol. Biol. Evol., February 1, 2009; 26(2): 375 - 383. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Chaban, S. M. Logan, J. F. Kelly, and K. F. Jarrell AglC and AglK Are Involved in Biosynthesis and Attachment of Diacetylated Glucuronic Acid to the N-Glycan in Methanococcus voltae J. Bacteriol., January 1, 2009; 191(1): 187 - 195. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Liu and M. Pop ARDB--Antibiotic Resistance Genes Database Nucleic Acids Res., January 1, 2009; 37(suppl_1): D443 - D447. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Marchler-Bauer, J. B. Anderson, F. Chitsaz, M. K. Derbyshire, C. DeWeese-Scott, J. H. Fong, L. Y. Geer, R. C. Geer, N. R. Gonzales, M. Gwadz, et al. CDD: specific functional annotation with the Conserved Domain Database Nucleic Acids Res., January 1, 2009; 37(suppl_1): D205 - D210. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Zhang, J. Ju, H. Peng, F. Gao, C. Zhou, Y. Zeng, Y. Xue, Y. Li, B. Henrissat, G. F. Gao, et al. Biochemical and Structural Characterization of the Intracellular Mannanase AaManA of Alicyclobacillus acidocaldarius Reveals a Novel Glycoside Hydrolase Family Belonging to Clan GH-A J. Biol. Chem., November 14, 2008; 283(46): 31551 - 31558. [Abstract] [Full Text] [PDF] |
||||
![]() |
K.-H. Kim, H.-W. Chang, Y.-D. Nam, S. W. Roh, M.-S. Kim, Y. Sung, C. O. Jeon, H.-M. Oh, and J.-W. Bae Amplification of Uncultured Single-Stranded DNA Viruses from Rice Paddy Soil Appl. Envir. Microbiol., October 1, 2008; 74(19): 5975 - 5985. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. S. Chang, Y. Hong, K. D. Ko, G. Bhardwaj, E. C. Holmes, R. L. Patterson, and D. B. van Rossum Phylogenetic profiles reveal evolutionary relationships within the "twilight zone" of sequence similarity PNAS, September 9, 2008; 105(36): 13474 - 13479. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Eudes, G. Mouille, J. Thevenin, A. Goyallon, Z. Minic, and L. Jouanin Purification, Cloning and Functional Characterization of an Endogenous beta-Glucuronidase in Arabidopsis thaliana Plant Cell Physiol., September 1, 2008; 49(9): 1331 - 1341. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. C. Almeida and R. DeSalle Evidence of Adaptive Evolution of Accessory Gland Proteins in Closely Related Species of the Drosophila repleta Group Mol. Biol. Evol., September 1, 2008; 25(9): 2043 - 2053. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Lambert, R. J. Siezen, W. M. de Vos, and M. Kleerebezem Improved annotation of conjugated bile acid hydrolase superfamily members in Gram-positive bacteria Microbiology, August 1, 2008; 154(8): 2492 - 2500. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Schilde, K. Lutter, R. Kissmehl, and H. Plattner Molecular Identification of a SNAP-25-Like SNARE Protein in Paramecium Eukaryot. Cell, August 1, 2008; 7(8): 1387 - 1402. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Xu, K. F. Chater, Z. Deng, and M. Tao A Cellulose Synthase-Like Protein Involved in Hyphal Tip Growth and Morphological Differentiation in Streptomyces J. Bacteriol., July 15, 2008; 190(14): 4971 - 4978. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. M. Overton, C. A. J. van Niekerk, L. G. Carter, A. Dawson, D. M. A. Martin, S. Cameron, S. A. McMahon, M. F. White, W. N. Hunter, J. H. Naismith, et al. TarO: a target optimisation system for structural biology Nucleic Acids Res., July 1, 2008; 36(suppl_2): W190 - W196. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Cai Unicellular Ca2+ Signaling 'Toolkit' at the Origin of Metazoa Mol. Biol. Evol., July 1, 2008; 25(7): 1357 - 1361. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. T. T. Tran-Nguyen, M. Kube, B. Schneider, R. Reinhardt, and K. S. Gibb Comparative Genome Analysis of "Candidatus Phytoplasma australiense" (Subgroup tuf-Australia I; rp-A) and "Ca. Phytoplasma asteris" Strains OY-M and AY-WB J. Bacteriol., June 1, 2008; 190(11): 3979 - 3991. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Orlowski and J. M. Bujnicki Structural and evolutionary classification of Type II restriction enzymes based on theoretical and experimental analyses Nucleic Acids Res., June 1, 2008; 36(11): 3552 - 3569. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Jasinski, D. Sudre, G. Schansker, M. Schellenberg, S. Constant, E. Martinoia, and L. Bovet AtOSA1, a Member of the Abc1-Like Family, as a New Factor in Cadmium and Oxidative Stress Response Plant Physiology, June 1, 2008; 147(2): 719 - 731. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Pilhofer, K. Rappl, C. Eckl, A. P. Bauer, W. Ludwig, K.-H. Schleifer, and G. Petroni Characterization and Evolution of Cell Division and Cell Wall Synthesis Genes in the Bacterial Phyla Verrucomicrobia, Lentisphaerae, Chlamydiae, and Planctomycetes and Phylogenetic Comparison with rRNA Genes J. Bacteriol., May 1, 2008; 190(9): 3192 - 3202. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Onaga and T. Taira A new type of plant chitinase containing LysM domains from a fern (Pteris ryukyuensis): Roles of LysM domains in chitin binding and antifungal activity Glycobiology, May 1, 2008; 18(5): 414 - 423. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Delaney, J. Zhang, G. Carlson, J. Schmidt, B. Stagg, B. Comstock, A. Babb, C. Finlay, R. F. Cressman, G. Ladics, et al. A Gene-Shuffled Glyphosate Acetyltransferase Protein from Bacillus licheniformis (GAT4601) Shows No Evidence of Allergenicity or Toxicity Toxicol. Sci., April 1, 2008; 102(2): 425 - 432. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Naponelli, A. Noiriel, M. J. Ziemak, S. M. Beverley, L.-F. Lye, A. M. Plume, J. R. Botella, K. Loizeau, S. Ravanel, F. Rebeille, et al. Phylogenomic and Functional Analysis of Pterin-4a-Carbinolamine Dehydratase Family (COG2154) Proteins in Plants and Microorganisms Plant Physiology, April 1, 2008; 146(4): 1515 - 1527. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. T. Edwards, N. K. Fry, and T. G. Harrison Clonal population structure of Legionella pneumophila inferred from allelic profiling Microbiology, March 1, 2008; 154(3): 852 - 864. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Youds, L. J. Barber, J. D. Ward, S. J. Collis, N. J. O'Neil, S. J. Boulton, and A. M. Rose DOG-1 Is the Caenorhabditis elegans BRIP1/FANCJ Homologue and Functions in Interstrand Cross-Link Repair Mol. Cell. Biol., March 1, 2008; 28(5): 1470 - 1479. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. King, E. F. Mulrooney, E. Vinogradov, B. Kneidinger, K. Mead, and J. S. Lam lfnA from Pseudomonas aeruginosa O12 and wbuX from Escherichia coli O145 Encode Membrane-Associated Proteins and Are Required for Expression of 2,6-Dideoxy-2-Acetamidino-L-Galactose in Lipopolysaccharide O Antigen J. Bacteriol., March 1, 2008; 190(5): 1671 - 1679. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Kosaka, S. Kato, T. Shimoyama, S. Ishii, T. Abe, and K. Watanabe The genome of Pelotomaculum thermopropionicum reveals niche-associated evolution in anaerobic microbiota Genome Res., March 1, 2008; 18(3): 442 - 448. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. K. Basu, L. Carmel, I. B. Rogozin, and E. V. Koonin Evolution of protein domain promiscuity in eukaryotes Genome Res., March 1, 2008; 18(3): 449 - 461. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Marzo, M. Puig, and A. Ruiz The Foldback-like element Galileo belongs to the P superfamily of DNA transposons and is widespread within the Drosophila genus PNAS, February 26, 2008; 105(8): 2957 - 2962. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. C. Hoopman, W. Wang, C. A. Brautigam, J. L. Sedillo, T. J. Reilly, and E. J. Hansen Moraxella catarrhalis Synthesizes an Autotransporter That Is an Acid Phosphatase J. Bacteriol., February 15, 2008; 190(4): 1459 - 1472. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Comfort, C.-J. Chou, S. B. Conners, A. L. VanFossen, and R. M. Kelly Functional-Genomics-Based Identification and Characterization of Open Reading Frames Encoding {alpha}-Glucoside-Processing Enzymes in the Hyperthermophilic Archaeon Pyrococcus furiosus Appl. Envir. Microbiol., February 15, 2008; 74(4): 1281 - 1283. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Boyer, J. Haurat, S. Samain, B. Segurens, F. Gavory, V. Gonzalez, P. Mavingui, R. Rohr, R. Bally, and F. Wisniewski-Dye Bacteriophage Prevalence in the Genus Azospirillum and Analysis of the First Genome Sequence of an Azospirillum brasilense Integrative Phage Appl. Envir. Microbiol., February 1, 2008; 74(3): 861 - 874. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Miyata, J. Begun, E. R. Troemel, and F. M. Ausubel DAF-16-Dependent Suppression of Immunity During Reproduction in Caenorhabditis elegans Genetics, February 1, 2008; 178(2): 903 - 918. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Nord and B.-M. Sjoberg Unconventional GIY-YIG homing endonuclease encoded in group I introns in closely related strains of the Bacillus cereus group Nucleic Acids Res., January 17, 2008; 36(1): 300 - 310. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Brilli, R. Fani, and P. Lio Current trends in the bioinformatic sequence analysis of metabolic pathways in prokaryotes Brief Bioinform, January 1, 2008; 9(1): 34 - 45. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Tomita, B. Meehan, N. Wongkattiya, J. Malmo, G. Pullinger, J. Leigh, and M. Deighton Identification of Streptococcus uberis Multilocus Sequence Types Highly Associated with Mastitis Appl. Envir. Microbiol., January 1, 2008; 74(1): 114 - 124. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-P. Nuccio and A. J. Baumler Evolution of the Chaperone/Usher Assembly Pathway: Fimbrial Classification Goes Greek Microbiol. Mol. Biol. Rev., December 1, 2007; 71(4): 551 - 575. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Y. Lee, D. A. Weber, O. Laur, E. A. Severson, I. McCall, R. P. Jen, A. C. Chin, T. Wu, K. M. Gernet, and C. A. Parkos Novel Structural Determinants on SIRP{alpha} that Mediate Binding to CD47 J. Immunol., December 1, 2007; 179(11): 7741 - 7750. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. W. Jones, C. Nielsen-Leroux, Y. Yang, Z. Yuan, V. F. Dumas, R. Gomes Monnerat, and C. Berry A new Cry toxin with a unique two-component dependency from Bacillus sphaericus FASEB J, December 1, 2007; 21(14): 4112 - 4120. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. Samen, B. J. Eikmanns, D. J. Reinscheid, and F. Borges The Surface Protein Srr-1 of Streptococcus agalactiae Binds Human Keratin 4 and Promotes Adherence to Epithelial HEp-2 Cells Infect. Immun., November 1, 2007; 75(11): 5405 - 5414. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Doungudomdacha, A. Volgina, and J. M. DiRienzo Evidence that the cytolethal distending toxin locus was once part of a genomic island in the periodontal pathogen Aggregatibacter (Actinobacillus) actinomycetemcomitans strain Y4 J. Med. Microbiol., November 1, 2007; 56(11): 1519 - 1527. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Blanc, H. Ogata, C. Robert, S. Audic, J.-M. Claverie, and D. Raoult Lateral gene transfer between obligate intracellular bacteria: Evidence from the Rickettsia massiliae genome Genome Res., November 1, 2007; 17(11): 1657 - 1664. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. K. Hane, R. G.T. Lowe, P. S. Solomon, K.-C. Tan, C. L. Schoch, J. W. Spatafora, P. W. Crous, C. Kodira, B. W. Birren, J. E. Galagan, et al. Dothideomycete Plant Interactions Illuminated by Genome Sequencing and EST Analysis of the Wheat Pathogen Stagonospora nodorum PLANT CELL, November 1, 2007; 19(11): 3347 - 3368. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Rojas-Cartagena, P. Ortiz-Pineda, F. Ramirez-Gomez, E. C. Suarez-Castillo, V. Matos-Cruz, C. Rodriguez, H. Ortiz-Zuazaga, and J. E. Garcia-Arraras Distinct profiles of expressed sequence tags during intestinal regeneration in the sea cucumber Holothuria glaberrima Physiol Genomics, October 19, 2007; 31(2): 203 - 215. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Lee and A. D. Grossman Identification of the Origin of Transfer (oriT) and DNA Relaxase Required for Conjugation of the Integrative and Conjugative Element ICEBs1 of Bacillus subtilis J. Bacteriol., October 15, 2007; 189(20): 7254 - 7261. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. V. Lynch, L. Dixon, M. R. Benoit, E. L. Brodie, M. Keyhan, P. Hu, D. F. Ackerley, G. L. Andersen, and A. Matin Role of the rapA Gene in Controlling Antibiotic Resistance of Escherichia coli Biofilms Antimicrob. Agents Chemother., October 1, 2007; 51(10): 3650 - 3658. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Kapteyn, A. V. Qualley, Z. Xie, E. Fridman, N. Dudareva, and D. R. Gang Evolution of Cinnamate/p-Coumarate Carboxyl Methyltransferases and Their Role in the Biosynthesis of Methylcinnamate PLANT CELL, October 1, 2007; 19(10): 3212 - 3229. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. S. Sagaram, B. D. Shaw, and W.-B. Shim Fusarium verticillioides GAP1, a gene encoding a putative glycolipid-anchored surface protein, participates in conidiation and cell wall structure but not virulence Microbiology, September 1, 2007; 153(9): 2850 - 2861. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. I. Montero, M. R. Johnson, C.-J. Chou, S. B. Conners, S. G. Geouge, S. Tachdjian, J. D. Nichols, and R. M. Kelly Responses of Wild-Type and Resistant Strains of the Hyperthermophilic Bacterium Thermotoga maritima to Chloramphenicol Challenge Appl. Envir. Microbiol., August 1, 2007; 73(15): 5058 - 5065. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. S. Gentry, R. H. Dowen III, C. A. Worby, S. Mattoo, J. R. Ecker, and J. E. Dixon The phosphatase laforin crosses evolutionary boundaries and links carbohydrate metabolism to neuronal disease J. Cell Biol., July 24, 2007; 178(3): 477 - 488. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Nord, E. Torrents, and B.-M. Sjoberg A Functional Homing Endonuclease in the Bacillus anthracis nrdE Group I Intron J. Bacteriol., July 15, 2007; 189(14): 5293 - 5301. [Abstract] [Full Text] [PDF] |
||||
![]() |
X.-L. Chen, B.-B. Xie, J.-T. Lu, H.-L. He, and Y. Zhang A novel type of subtilase from the psychrotolerant bacterium Pseudoalteromonas sp. SM9913: catalytic and structural properties of deseasin MCP-01 Microbiology, July 1, 2007; 153(7): 2116 - 2125. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Movahedi and D. J. Hampson Distribution of the clpX gene in Brachyspira species and reactivity of recombinant Brachyspira pilosicoli ClpX with sera from mice and humans J. Med. Microbiol., July 1, 2007; 56(7): 930 - 936. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. W. Brown, R. A. E. Butchko, M. Busman, and R. H. Proctor The Fusarium verticillioides FUM Gene Cluster Encodes a Zn(II)2Cys6 Protein That Affects FUM Gene Expression and Fumonisin Production Eukaryot. Cell, July 1, 2007; 6(7): 1210 - 1218. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. I. H. Godinho, E. S. Maywood, L. Shaw, V. Tucci, A. R. Barnard, L. Busino, M. Pagano, R. Kendall, M. M. Quwailid, M. R. Romero, et al. The After-Hours Mutant Reveals a Role for Fbxl3 in Determining Mammalian Circadian Period Science, May 11, 2007; 316(5826): 897 - 900. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. I. Pottosin and G. Schonknecht Vacuolar calcium channels J. Exp. Bot., May 1, 2007; 58(7): 1559 - 1569. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Papadopoulos and R. Agarwala COBALT: constraint-based alignment tool for multiple protein sequences Bioinformatics, May 1, 2007; 23(9): 1073 - 1079. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. Chandler, M. Cole, A. Flier, B. Grewe, and W. Werr The AP2 transcription factors DORNROSCHEN and DORNROSCHEN-LIKE redundantly control Arabidopsis embryo patterning via interaction with PHAVOLUTA Development, May 1, 2007; 134(9): 1653 - 1662. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. E. Gentle, A. J. Perry, F. H. Alcock, V. A. Likic, P. Dolezal, E. T. Ng, A. W. Purcell, M. McConnville, T. Naderer, A.-L. Chanez, et al. Conserved Motifs Reveal Details of Ancestry and Structure in the Small TIM Chaperones of the Mitochondrial Intermembrane Space Mol. Biol. Evol., May 1, 2007; 24(5): 1149 - 1160. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. Wegmann, M. O'Connell-Motherway, A. Zomer, G. Buist, C. Shearman, C. Canchaya, M. Ventura, A. Goesmann, M. J. Gasson, O. P. Kuipers, et al. Complete Genome Sequence of the Prototype Lactic Acid Bacterium Lactococcus lactis subsp. cremoris MG1363 J. Bacteriol., April 15, 2007; 189(8): 3256 - 3270. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Asgari, J. Davis, D. Wood, P. Wilson, and A. McGrath Sequence and organization of the Heliothis virescens ascovirus genome J. Gen. Virol., April 1, 2007; 88(4): 1120 - 1132. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Welz and R. R. Breaker Ligand binding and gene control characteristics of tandem riboswitches in Bacillus anthracis RNA, April 1, 2007; 13(4): 573 - 582. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Przybylski and B. Rost Consensus sequences improve PSI-BLAST through mimicking profile-profile alignments Nucleic Acids Res., April 1, 2007; 35(7): 2238 - 2246. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Sukackaite, A. Lagunavicius, K. Stankevicius, C. Urbanke, C. Venclovas, and V. Siksnys Restriction endonuclease BpuJI specific for the 5'-CCCGT sequence is related to the archaeal Holliday junction resolvase family Nucleic Acids Res., April 1, 2007; 35(7): 2377 - 2389. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Schult, K. Meierhoff, S. Paradies, T. Toller, P. Wolff, and P. Westhoff The Nuclear-Encoded Factor HCF173 Is Involved in the Initiation of Translation of the psbA mRNA in Arabidopsis thaliana PLANT CELL, April 1, 2007; 19(4): 1329 - 1346. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Rodionov, O. V. Kurnasov, B. Stec, Y. Wang, M. F. Roberts, and A. L. Osterman Genomic identification and in vitro reconstitution of a complete biosynthetic pathway for the osmolyte di-myo-inositol-phosphate PNAS, March 13, 2007; 104(11): 4279 - 4284. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Shindou, D. Hishikawa, H. Nakanishi, T. Harayama, S. Ishii, R. Taguchi, and T. Shimizu A Single Enzyme Catalyzes Both Platelet-activating Factor Production and Membrane Biogenesis of Inflammatory Cells: CLONING AND CHARACTERIZATION OF ACETYL-CoA:LYSO-PAF ACETYLTRANSFERASE J. Biol. Chem., March 2, 2007; 282(9): 6532 - 6539. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. M. W. Smolenaars, O. Madsen, K. W. Rodenburg, and D. J. Van der Horst Molecular diversity and evolution of the large lipid transfer protein superfamily J. Lipid Res., March 1, 2007; 48(3): 489 - 502. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Nakagawa, T. Katagiri, K. Shinozaki, Z. Qi, H. Tatsumi, T. Furuichi, A. Kishigami, M. Sokabe, I. Kojima, S. Sato, et al. Arabidopsis plasma membrane protein crucial for Ca2+ influx and touch sensing in roots PNAS, February 27, 2007; 104(9): 3639 - 3644. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Shiels, J. M. King, D. S. Mackay, and S. Bassnett Refractive Defects and Cataracts in Mice Lacking Lens Intrinsic Membrane Protein-2 Invest. Ophthalmol. Vis. Sci., February 1, 2007; 48(2): 500 - 508. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. E. Snow, M. Mateyak, J. Paderova, A. Wakeham, C. Iorio, V. Zakian, J. Squire, and L. Harrington Murine Pif1 Interacts with Telomerase and Is Dispensable for Telomere Function In Vivo Mol. Cell. Biol., February 1, 2007; 27(3): 1017 - 1026. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Fresno, N. Jimenez, R. Canals, S. Merino, M. M. Corsaro, R. Lanzetta, M. Parrilli, G. Pieretti, M. Regue, and J. M. Tomas A Second Galacturonic Acid Transferase Is Required for Core Lipopolysaccharide Biosynthesis and Complete Capsule Association with the Cell Surface in Klebsiella pneumoniae J. Bacteriol., February 1, 2007; 189(3): 1128 - 1137. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Komatsuda, M. Pourkheirandish, C. He, P. Azhaguvel, H. Kanamori, D. Perovic, N. Stein, A. Graner, T. Wicker, A. Tagiri, et al. Six-rowed barley originated from a mutation in a homeodomain-leucine zipper I-class homeobox gene PNAS, January 23, 2007; 104(4): 1424 - 1429. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. L. Wheeler, T. Barrett, D. A. Benson, S. H. Bryant, K. Canese, V. Chetvernin, D. M. Church, M. DiCuccio, R. Edgar, S. Federhen, et al. Database resources of the National Center for Biotechnology Information Nucleic Acids Res., January 12, 2007; 35(suppl_1): D5 - D12. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. G. Jegga, S. Gowrisankar, J. Chen, and B. J. Aronow PolyDoms: a whole genome database for the identification of non-synonymous coding SNPs with the potential to impact disease Nucleic Acids Res., January 12, 2007; 35(suppl_1): D700 - D706. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Marchler-Bauer, J. B. Anderson, M. K. Derbyshire, C. DeWeese-Scott, N. R. Gonzales, M. Gwadz, L. Hao, S. He, D. I. Hurwitz, J. D. Jackson, et al. CDD: a conserved domain database for interactive domain family analysis Nucleic Acids Res., January 12, 2007; 35(suppl_1): D237 - D240. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||





































