Skip Navigation


Nucleic Acids Research Advance Access originally published online on November 29, 2006
Nucleic Acids Research 2007 35(Database issue):D237-D240; doi:10.1093/nar/gkl951
This Article
Right arrow Abstract Freely available
Right arrow Print PDF (1474K) Freely available
Right arrow Screen PDF (510K) Freely available
Right arrowOA All Versions of this Article:
35/suppl_1/D237    most recent
gkl951v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Marchler-Bauer, A.
Right arrow Articles by Bryant, S. H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Marchler-Bauer, A.
Right arrow Articles by Bryant, S. H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2007, Vol. 35, Database issue D237-D240
Published by Oxford University Press 2006
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.


Articles

CDD: a conserved domain database for interactive domain family analysis

Aron Marchler-Bauer*, John B. Anderson, Myra K. Derbyshire, Carol DeWeese-Scott, Noreen R. Gonzales, Marc Gwadz, Luning Hao, Siqian He, David I. Hurwitz, John D. Jackson, Zhaoxi Ke, Dmitri Krylov, Christopher J. Lanczycki, Cynthia A. Liebert, Chunlei Liu, Fu Lu, Shennan Lu, Gabriele H. Marchler, Mikhail Mullokandov, James S. Song, Narmada Thanki, Roxanne A. Yamashita, Jodie J. Yin, Dachuan Zhang and Stephen H. Bryant

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health Building 38 A, Room 8N805, 8600 Rockville Pike, Bethesda, MD 20894, USA

*To whom correspondence should be addressed. Tel: +1 301 435 4919; Fax: +1 301 435 7793; Email: bauer{at}ncbi.nlm.nih.gov

Received September 16, 2006. Revised October 19, 2006. Accepted October 20, 2006.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 ASSESSING DOMAIN FAMILY...
 CDD CONTENTS AND AVAILABILITY
 REFERENCES
 
The conserved domain database (CDD) is part of NCBI's Entrez database system and serves as a primary resource for the annotation of conserved domain footprints on protein sequences in Entrez. Entrez's global query interface can be accessed at http://www.ncbi.nlm.nih.gov/Entrez and will search CDD and many other databases. Domain annotation for proteins in Entrez has been pre-computed and is readily available in the form of ‘Conserved Domain’ links. Novel protein sequences can be scanned against CDD using the CD-Search service; this service searches databases of CDD-derived profile models with protein sequence queries using BLAST heuristics, at http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi. Protein query sequences submitted to NCBI's protein BLAST search service are scanned for conserved domain signatures by default. The CDD collection contains models imported from Pfam, SMART and COG, as well as domain models curated at NCBI. NCBI curated models are organized into hierarchies of domains related by common descent. Here we report on the status of the curation effort and present a novel helper application, CDTree, which enables users of the CDD resource to examine curated hierarchies. More importantly, CDD and CDTree used in concert, serve as a powerful tool in protein classification, as they allow users to analyze protein sequences in the context of domain family hierarchies.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 ASSESSING DOMAIN FAMILY...
 CDD CONTENTS AND AVAILABILITY
 REFERENCES
 
The annotation of conserved domain footprints on protein sequences often serves as the first step toward characterizing protein function in silico. Protein domains may be viewed as units in the molecular evolution of proteins and can be organized into an evolutionary classification. The set of protein domains characterized so far appears to describe no more than a few thousand superfamilies, where members of each superfamily are related to each other by common descent. NCBI's conserved domain database (CDD) attempts to collate that set and to organize related domain models in a hierarchical fashion, meant to reflect major ancient gene duplication events and subsequent functional diversification.

Computational annotation of protein function is generally obtained via sequence similarity: once a close neighbor with known function has been identified, its annotation is copied to the sequence with unknown function. This strategy may work very well in functionally homogeneous families and when applied only for very close neighbors or suspected orthologs, but it is doomed to fail often when domain or protein families are sufficiently diverse and when no close neighbors with known function are available.

To this end, the CDD (1) provides a strategy toward a more accurate assessment of such neighbor relationships, similar to approaches termed ‘phylogenomic inference’ (2). CDD acknowledges that protein domain families may be very diverse and that they may contain sets of related subfamilies. Of these, only few may have been characterized experimentally, and within this set function may have diverged considerably. While it may be possible, and certainly efficient, to represent such a set of subfamilies with just a single family model, that model could only provide very generic annotation. In CDD curation, we attempt to detect evidence for duplication and functional divergence in domain families by means of phylogenetic analysis. We record the resulting subfamily structure as a set of explicit models, but limit the analysis to ancient duplication events—several hundred million years in the past, as judged by the taxonomic distribution of protein sequences with particular domain subfamily footprints.

CDD provides a search tool employing reverse position-specific BLAST (RPS–BLAST), where query sequences are compared to databases of position-specific score matrices (PSSMs), and E-values are obtained in much the same way as in the widely used PSI-BLAST application (3). When CDD is scanned with protein query sequences, a region on a query may pick up more than one overlapping footprint from a set of related models. One of those models provides the best score or lowest E-value, but that alone may not be sufficient to indicate that the query sequence is a bona fide member of the corresponding subfamily. Since the CDD collection also contains imported models, which have not been curated at NCBI, search results may present a mixture of curated models (accessions starting with ‘cd.’) and un-curated models (accessions starting with ‘pfam’, ‘smart’ or ‘COG’). By default, overlapping domain hits are sorted by E-value, but curated models are listed first, if their E-values exceed a secondary significance threshold of 1e-05. Default displays are presented in a concise fashion, where domain hits that overlap with the top-ranked domain hits are hidden.

We have started to distribute CDTree, a helper application for the web browser. CDTree allows users to examine the results of simple phylogenetic analysis on the sequences from a curated domain hierarchy, and view their query sequence in the context of such a phylogenetic sequence tree.


    ASSESSING DOMAIN FAMILY MEMBERSHIP
 TOP
 ABSTRACT
 INTRODUCTION
 ASSESSING DOMAIN FAMILY...
 CDD CONTENTS AND AVAILABILITY
 REFERENCES
 
Figures 1 and 2 demonstrate how one might use a web browser, the curated CDD resource and the CDTree application to obtain confidence for the transfer of annotation from a domain model to a particular protein sequence. A cartoon depicting domain model footprints on a protein sequence can be obtained by following the ‘Conserved Domains’ link from an Entrez protein search results page, or by submitting a live CD-search (4) request (Figure 1a). A particular domain annotation is examined in detail by clicking on the corresponding item in the graphical display (Figure 1b). This generates a CD summary page (Figure 1c) which lists descriptive information and a multiple sequence alignment including the user query sequence (not shown). The summary page contains a description of the particular family. If the domain model has been curated at NCBI (CDD accessions starting with ‘cd.’), it also lists conserved features that have been recorded by NCBI curators, displays a sequence cluster tree diagram for the particular family and indicates its position in a hierarchy of related domains. A button labeled ‘Interactive Display with CDTree’ (Figure 1d) launches CDTree.


Figure 1
View larger version (30K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 1 (a) Pre-computed domain annotation is retrieved for a protein sequence in the Entrez database, gi|47123187 from Xenopus laevis. The graphical annotation can be obtained by following the ‘Conserved Domains’ link on the Entrez document summary for gi|47123187. By default, graphical domain summaries hide redundant information from the user. By clicking on the red balloon (b), representing a conserved domain footprint for the model cd02907, the user launches a summary view of that domain model, which also preserves information about the (query) sequence of interest (c). The CD summary page displays details about the actual model and its hierarchy. A section labeled ‘Links’ (data not shown), for example, provides links to all protein sequences in Entrez that match the current domain model, to references in PubMed and Entrez Books, and to the original source of the curated family, which may be a model imported from outside databases such as Pfam. Clicking on the button labeled ‘Interactive Display with CDTree’ (d) launches CDTree on the user's computer as a local application, which retrieves its data via the web-browser. The CDTree view corresponding to this example is shown in Figure 2. CDTree launching is not enabled for alignment models imported from outside sources.

 


Figure 2
View larger version (71K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 2 CDTree default display, as launched from a web browser, showing a curated domain hierarchy with an embedded user query sequence. A protein sequence found in NCBI's Entrez database was inspected for the presence of conserved domains. The user has followed links from one of the domain footprints annotated on the model, and inspected a particular domain model, cd02907. From the CD summary page, as shown in Figure 1, CDTree was launched as a helper application. The main window (a) presents the organization of the conserved domain hierarchy, as already visible on the CD summary page (Figure 1). In this case, cd02749 or ‘Macro’ is a generic ‘parent’ model, which has been split up into several more specific ‘children’. The sequence tree shown in panel (c) provides evidence for this particular subfamily structure. Groups of branches rendered in the same color correspond to alignment rows that have been assigned to a particular subgroup. Sequence trees are always calculated from the curated CD alignments; in this particular example, the distance data have been obtained from pair-wise alignment scores. Aligned residue pairs are scored with the BLOSUM62 matrix. Pair-wise scores are subtracted from the highest observed pair-wise score to yield distances, and the distance units plotted here correspond to BLOSUM62 scores. By default, a taxonomy viewer window is opened as well (b). Users may select and highlight whole branches in the sequence tree view and examine corresponding highlights in the taxonomy viewer, to understand the taxonomic scope of particular subfamilies, or select/highlight taxa in the taxonomy viewer and examine their distribution in the sequence tree. In this example, a user query sequence has been added to one of the models, cd02907. cd02907 gave the best scoring hit in a database search for a particular region of the user's query sequence. In the sequence tree display, the user query is highlighted by default (d). It appears that the user query is a typical member of this particular subfamily, as it clusters tightly with all the other members, and therefore transfer of annotation from the model to the sequence - or functional inference - may be appropriate.

 
CDTree is a helper application for the web browser and must be downloaded and installed on the user's computer. Instructions for installing CDTree are found at http://www.ncbi.nlm.nih.gov/Structure/cdtree/cdtree.shtml. CDTree functions as a viewer for curated protein domain hierarchies; it retrieves data for display via the web browser. CDTree is a combined domain hierarchy viewer and editor. It uses a separate program, Cn3D (5), to view 3D structure and to display and edit multiple alignments of protein structure and sequence. Cn3D is distributed, installed and configured along with CDTree. CDTree requires a recent version of Cn3D, version v4.2, which is contained in the CDTree installation package. The installation package also contains a stand-alone application, ‘fa2cd’, which can be used to convert FASTA-formatted multiple sequence alignments into models stored in the ‘CD’ format, so that they can be imported into CDTree. CDTree also allows the de novo buildup of alignment models starting from single protein sequences. More details can be found on the CDTree home page (see Table 1). A manuscript detailing CDTree and its various uses is in preparation.

When the user launches CDTree from a CD summary page, CDTree displays the contents of the curated domain or domain hierarchy, and serves as a viewer for the evidence supporting a particular subfamily structure. By default, CDTree displays a sequence tree view, a taxonomy view and a hierarchy overview. The sequence tree has been pre-computed and is contained in the data sent by the server. When a query sequence has been submitted to CD-Search and a matching CDD model has been identified, launching CDTree from its CD summary page will cause the user's query sequence to be added to the model and CDTree will recalculate a sequence tree. This tree will now contain the query sequence as well (Figure 2c).

Potentially, this allows users to distinguish between several cases: (i) Query sequences may be bona fide members of clusters which curators have explicitly declared subfamilies, and which may carry specific functional annotation, as in the case illustrated in Figures 1 and 2; (ii) query sequences may be bona fide members of subfamilies which do not (yet) carry specific functional annotation; (iii) query sequences may be members of clusters, which curators have not declared subfamilies, as they may not have seen enough evidence for a subfamily at the time of curation; (iv) query sequences may be outliers and not cluster with any particular group of sequences in the curated hierarchy. Only the first scenario listed above may allow the transfer of specific functional annotation from the model to the query. In all the other cases, annotation transfer from the hierarchy's generic ‘parent’ model may be more appropriate, assuming that the ‘parent’ model provides annotation that is valid for all or the majority of members in a superfamily.


    CDD CONTENTS AND AVAILABILITY
 TOP
 ABSTRACT
 INTRODUCTION
 ASSESSING DOMAIN FAMILY...
 CDD CONTENTS AND AVAILABILITY
 REFERENCES
 
CDD can be searched directly as part of NCBI's Entrez database and query system (6), where it is listed as the ‘Domains’ database. Entries in CDD are cross-linked reciprocally to NCBI taxonomy, citations in PubMed®, and to protein sequences in Entrez. Links to protein sequences reflect the results of pre-computed RPS–BLAST searches and are updated on a daily basis and stored in the CDART database (7).

The current version of CDD, version v2.09, contains a total of 12 422 models, of which 2494 have been curated at NCBI. Of these curated models, <300 are solitary domain models, while the rest are organized into hierarchies. The largest hierarchies contain well over 100 individual models each. 5252 models have been obtained from Pfam (version 11) (8), 575 have been obtained from SMART (9) and 4101 have been derived from the COG collection (10). Together, these models cover about 69% of non-identical protein sequences in NCBI's Entrez protein database. The full set of models as imported from Pfam, SMART, COG and KOG(9), are available in separate search databases, although not all of them have been indexed in Entrez, since lineage-specific models with limited taxonomic scope, as well as largely redundant models, have been filtered out.

The size of the CDD model collection, details with respect to versions of external databases mirrored in CDD, and the control of redundancy may change over time, as we attempt to provide a resource that is more comprehensive as well as efficient. Expert curation of CDD is an ongoing effort, and we plan to eventually replace a majority of imported models with hierarchies curated at NCBI. Table 1 lists URLs and FTP site addresses for tools and services mentioned above.


View this table:
[in this window]
[in a new window]

 
Table 1 URLs and FTP-site addresses for CDD and CDD-related services

 

    ACKNOWLEDGEMENTS
 
We thank the authors of the Pfam, SMART and COG resources. Development of CDTree would not have been possible without Paul Thiessen and his work on Cn3D, Lewis Geer, Jane He, Naigong Zhang and Praveen Cherukuri and their work on the CDART resource, the NCBI BLAST group, the NCBI IEB and the NCBI C++ toolkit developers. This work was supported by the Intramural Research Program of the National Library of Medicine at National Institutes of Health/DHHS. Comments, suggestions and questions are welcome and should be directed to: info{at}ncbi.nlm.nih.gov. Funding to pay the Open Access publication charges for this article was provided by Intramural Research Program of the National Library of Medicine at National Institutes of Health/DHHS.

Conflict of interest statement. None declared.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 ASSESSING DOMAIN FAMILY...
 CDD CONTENTS AND AVAILABILITY
 REFERENCES
 

  1. Marchler-Bauer, A., Anderson, J.B., Cherukuri, P.F., DeWeese-Scott, C., Geer, L.Y., Gwadz, M., He, S., Hurwitz, D.I., Jackson, J.D., Ke, Z., et al. (2005) CDD: a Conserved Domain Database for protein classification Nucleic Acids Res, . 33, D192–D196[Abstract/Free Full Text] .

  2. Brown, D. and Sjolander, K. (2006) Functional classification using phylogenomic inference PLoS Comput. Biol, . 2, e77[CrossRef][Medline] .

  3. Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D.J. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs Nucleic Acids Res, . 25, 3389–3402[Abstract/Free Full Text] .

  4. Marchler-Bauer, A. and Bryant, S.H. (2004) CD-Search: protein domain annotations on the fly Nucleic Acids Res, . 32, 327–331[CrossRef] .

  5. Wang, Y., Geer, L.Y., Chappey, C., Kans, J.A., Bryant, S.H. (2000) Cn3D: sequence and structure views for Entrez Trends Biochem. Sci, . 25, 300–302[CrossRef][Web of Science][Medline] .

  6. Wheeler, D.L., Barrett, T., Benson, D.A., Bryant, S.H., Canese, K., Chetvernin, V., Church, D.M., DiCuccio, M., Edgar, R., Federhen, S., et al. (2006) Database resources of the National Center of Biotechnology Information Nucleic Acids Res, . 34, D173–D180[Abstract/Free Full Text] .

  7. Geer, L.Y., Domrachev, M., Lipman, D.J., Bryant, S.H. (2002) CDART: protein homology by domain architecture Genome Res, . 12, 1619–1623[Abstract/Free Full Text] .

  8. Finn, R.D., Mistry, I., Schuster-Bockler, B., Griffiths-Jones, S., Hollich, V., Lassmann, T., Moxon, S., Marshall, M., Khanna, A., Durbin, R., et al. (2006) Pfam: clans, web tools and services Nucleic Acids Res, . 34, D247–D251[Abstract/Free Full Text] .

  9. Letunic, I., Copley, R.R., Pils, B., Pinkert, S., Schultz, J., Bork, P. (2006) SMART 5: domains in the context of genomes and networks Nucleic Acids Res, . 34, D257–D260[Abstract/Free Full Text] .

  10. Tatusov, R.L., Fedorova, N.D., Jackson, J.D., Jacobs, A.R., Kiryutin, B., Koonin, E.V., Krylov, D.M., Mazumder, R., Mekhedov, S.L., Nikolskaya, A.N., et al. (2003) The COG database: and updated version includes eukaryotes BMC Bioinformatics, 4, 41[CrossRef][Medline] .


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
JDRHome page
Y. Ding, M.R.P. Estrella, Y.Y. Hu, H.L. Chan, H.D. Zhang, J.-W. Kim, J.P. Simmer, and J.C.-C. Hu
Fam83h is Associated with Intracellular Vesicles and ADHCAI
Journal of Dental Research, November 1, 2009; 88(11): 991 - 996.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
Y.-F. Ma, Y. Zhang, J.-Y. Zhang, D.-W. Chen, Y. Zhu, H. Zheng, S.-Y. Wang, C.-Y. Jiang, G.-P. Zhao, and S.-J. Liu
The Complete Genome of Comamonas testosteroni Reveals Its Genetic Adaptations to Changing Environments
Appl. Envir. Microbiol., November 1, 2009; 75(21): 6812 - 6819.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
R. R. Nayak, M. Kearns, R. S. Spielman, and V. G. Cheung
Coexpression network based on natural variation in human gene expression reveals gene interactions and functions
Genome Res., November 1, 2009; 19(11): 1953 - 1962.
[Abstract] [Full Text] [PDF]


Home page
J. Gen. Virol.Home page
R. A. Valverde and S. Sabanadzovic
A novel plant virus with unique properties infecting Japanese holly fern
J. Gen. Virol., October 1, 2009; 90(10): 2542 - 2549.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
P. Vasudevan, J. McElligott, C. Attkisson, M. Betteken, and D. L. Popham
Homologues of the Bacillus subtilis SpoVB Protein Are Involved in Cell Wall Metabolism
J. Bacteriol., October 1, 2009; 191(19): 6012 - 6019.
[Abstract] [Full Text] [PDF]


Home page
Innate ImmunityHome page
J. D. King, D. Kocincova, E. L. Westman, and J. S. Lam
Review: Lipopolysaccharide biosynthesis in Pseudomonas aeruginosa
Innate Immunity, October 1, 2009; 15(5): 261 - 312.
[Abstract] [PDF]


Home page
J. Clin. Microbiol.Home page
S. C. Leao, E. Tortoli, C. Viana-Niero, S. Y. M. Ueki, K. V. B. Lima, M. L. Lopes, J. Yubero, M. C. Menendez, and M. J. Garcia
Characterization of Mycobacteria from a Major Brazilian Outbreak Suggests that Revision of the Taxonomic Status of Members of the Mycobacterium chelonae-M. abscessus Group Is Needed
J. Clin. Microbiol., September 1, 2009; 47(9): 2691 - 2698.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
M. Fauvart, N. Verstraeten, B. Dombrecht, R. Venmans, S. Beullens, C. Heusdens, and J. Michiels
Rhizobium etli HrpW is a pectin-degrading enzyme and differs from phytopathogenic homologues in enzymically crucial tryptophan and glycine residues
Microbiology, September 1, 2009; 155(9): 3045 - 3054.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Schwede, T. Manful, B. A. Jha, C. Helbig, N. Bercovich, M. Stewart, and C. Clayton
The role of deadenylation in the degradation of unstable mRNAs in trypanosomes
Nucleic Acids Res., September 1, 2009; 37(16): 5511 - 5528.
[Abstract] [Full Text] [PDF]


Home page
JCBHome page
R. Zaidel-Bar
Evolution of complexity in the integrin adhesome
J. Cell Biol., August 10, 2009; 186(3): 317 - 321.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
J. Trevino, N. Perez, E. Ramirez-Pena, Z. Liu, S. A. Shelburne III, J. M. Musser, and P. Sumby
CovS Simultaneously Activates and Inhibits the CovR-Mediated Repression of Distinct Subsets of Group A Streptococcus Virulence Factor-Encoding Genes
Infect. Immun., August 1, 2009; 77(8): 3141 - 3149.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
S. Herold, J. C. Paton, and A. W. Paton
Sab, a Novel Autotransporter of Locus of Enterocyte Effacement-Negative Shiga-Toxigenic Escherichia coli O113:H21, Contributes to Adherence and Biofilm Formation
Infect. Immun., August 1, 2009; 77(8): 3234 - 3243.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Fang, W.-M. Zeisberg, C. Condon, V. Ogryzko, A. Danchin, and U. Mechold
Degradation of nanoRNA is performed by multiple redundant RNases in Bacillus subtilis
Nucleic Acids Res., August 1, 2009; 37(15): 5114 - 5125.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. D. Morgan, E. A. Dwinell, T. K. Bhatia, E. M. Lang, and Y. A. Luyten
The MmeI family: type II restriction-modification enzymes that employ single-strand modification for host protection
Nucleic Acids Res., August 1, 2009; 37(15): 5208 - 5221.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
P. A. Novick, H. Basta, M. Floumanhaft, M. A. McClure, and S. Boissinot
The Evolutionary Dynamics of Autonomous Non-LTR Retrotransposons in the Lizard Anolis Carolinensis Shows More Similarity to Fish Than Mammals
Mol. Biol. Evol., August 1, 2009; 26(8): 1811 - 1822.
[Abstract] [Full Text] [PDF]


Home page
J HeredHome page
K. B. Walters, P. Grant, and D. L. E. Johnson
Evolution of the GST Omega Gene Family in 12 Drosophila Species
J. Hered., July 16, 2009; (2009) esp043v1.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. J. Smith, F. Antonacci, E. E. Eichler, and C. T. Amemiya
Programmed loss of millions of base pairs from a vertebrate genome
PNAS, July 7, 2009; 106(27): 11212 - 11217.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
R. C. Bertossa, L. van de Zande, and L. W. Beukeboom
The Fruitless Gene in Nasonia Displays Complex Sex-Specific Splicing and Contains New Zinc Finger Domains
Mol. Biol. Evol., July 1, 2009; 26(7): 1557 - 1569.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
C. Sanchez-Porro, E. Mellado, A. P. Pugsley, O. Francetic, and A. Ventosa
The Haloprotease CPI Produced by the Moderately Halophilic Bacterium Pseudoalteromonas ruthenica Is Secreted by the Type II Secretion Pathway
Appl. Envir. Microbiol., June 15, 2009; 75(12): 4197 - 4201.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
X. Guo and A. J. Hartemink
Domain-oriented edge-based alignment of protein interaction networks
Bioinformatics, June 15, 2009; 25(12): i240 - 1246.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
H. Thompson, K. A. Homer, S. Rao, V. Booth, and A. H. F. Hosie
An Orthologue of Bacteroides fragilis NanH Is the Principal Sialidase in Tannerella forsythia
J. Bacteriol., June 1, 2009; 191(11): 3623 - 3628.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. Maitra and M. J. Sadofsky
A WW-like module in the RAG1 N-terminal domain contributes to previously unidentified protein-protein interactions
Nucleic Acids Res., June 1, 2009; 37(10): 3301 - 3309.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
R. D. Barabote, G. Xie, D. H. Leu, P. Normand, A. Necsulea, V. Daubin, C. Medigue, W. S. Adney, X. C. Xu, A. Lapidus, et al.
Complete genome of the cellulolytic thermophile Acidothermus cellulolyticus 11B provides insights into its ecophysiological and evolutionary adaptations
Genome Res., June 1, 2009; 19(6): 1033 - 1043.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
T. Inagawa, T. Yamada-Inagawa, T. Eydmann, I. S. Mian, T. S. Wang, and J. Z. Dalgaard
Schizosaccharomyces pombe Rtf2 mediates site-specific replication termination by inhibiting replication restart
PNAS, May 12, 2009; 106(19): 7927 - 7932.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
M. K. Basu, E. Poliakov, and I. B. Rogozin
Domain mobility in proteins: functional and evolutionary implications
Brief Bioinform, May 1, 2009; 10(3): 205 - 216.
[Abstract] [Full Text] [PDF]


Home page
RNAHome page
C. Bousquet-Antonelli and J.-M. Deragon
A comprehensive analysis of the La-motif protein superfamily
RNA, May 1, 2009; 15(5): 750 - 764.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
C. Simon, J. Herath, S. Rockstroh, and R. Daniel
Rapid Identification of Genes Encoding DNA Polymerases by Function-Based Screening of Metagenomic Libraries Derived from Glacial Ice
Appl. Envir. Microbiol., May 1, 2009; 75(9): 2964 - 2968.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
A. Wakeel, J. A. Kuriakose, and J. W. McBride
An Ehrlichia chaffeensis Tandem Repeat Protein Interacts with Multiple Host Targets Involved in Cell Signaling, Transcriptional Regulation, and Vesicle Trafficking
Infect. Immun., May 1, 2009; 77(5): 1734 - 1745.
[Abstract] [Full Text] [PDF]


Home page
Eukaryot CellHome page
J. Luo, Y. Matsuo, G. Gulis, H. Hinz, J. Patton-Vogt, and S. Marcus
Phosphatidylethanolamine Is Required for Normal Cell Morphology and Cytokinesis in the Fission Yeast Schizosaccharomyces pombe
Eukaryot. Cell, May 1, 2009; 8(5): 790 - 799.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
T. Waschkowitz, S. Rockstroh, and R. Daniel
Isolation and Characterization of Metalloproteases with a Novel Domain Structure by Construction and Screening of Metagenomic Libraries
Appl. Envir. Microbiol., April 15, 2009; 75(8): 2506 - 2516.
[Abstract] [Full Text] [PDF]


Home page
J Exp BotHome page
M. S. Kalamaki, D. Alexandrou, D. Lazari, G. Merkouropoulos, V. Fotopoulos, I. Pateraki, A. Aggelis, A. Carrillo-Lopez, M. J. Rubio-Cabetas, and A. K. Kanellis
Over-expression of a tomato N-acetyl-L-glutamate synthase gene (SlNAGS1) in Arabidopsis thaliana results in high ornithine levels and increased tolerance in salt and drought stresses
J. Exp. Bot., April 8, 2009; (2009) erp072v1.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
B.-B. Xie, F. Bian, X.-L. Chen, H.-L. He, J. Guo, X. Gao, Y.-X. Zeng, B. Chen, B.-C. Zhou, and Y.-Z. Zhang
Cold Adaptation of Zinc Metalloproteases in the Thermolysin Family from Deep Sea and Arctic Sea Ice Bacteria Revealed by Catalytic and Structural Properties and Molecular Dynamics: NEW INSIGHTS INTO RELATIONSHIP BETWEEN CONFORMATIONAL FLEXIBILITY AND HYDROGEN BONDING
J. Biol. Chem., April 3, 2009; 284(14): 9257 - 9269.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
C. Marion, D. H. Limoli, G. S. Bobulsky, J. L. Abraham, A. M. Burnaugh, and S. J. King
Identification of a Pneumococcal Glycosidase That Modifies O-Linked Glycans
Infect. Immun., April 1, 2009; 77(4): 1389 - 1396.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
J. Shanks, M. N. Burtnick, P. J. Brett, D. M. Waag, K. B. Spurgers, W. J. Ribot, M. A. Schell, R. G. Panchal, F. C. Gherardini, K. D. Wilkinson, et al.
Burkholderia mallei tssM Encodes a Putative Deubiquitinase That Is Secreted and Expressed inside Infected RAW 264.7 Murine Macrophages
Infect. Immun., April 1, 2009; 77(4): 1636 - 1648.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. D. Hooper, I. J. Anderson, A. Pati, D. Dalevi, K. Mavromatis, and N. C. Kyrpides
Integration of phenotypic metadata and protein similarity in Archaea using a spectral bipartitioning approach
Nucleic Acids Res., April 1, 2009; 37(7): 2096 - 2104.
[Abstract] [Full Text] [PDF]


Home page
GlycobiologyHome page
S. Banerjee, P. W Robbins, and J. Samuelson
Molecular characterization of nucleocytosolic O-GlcNAc transferases of Giardia lamblia and Cryptosporidium parvum
Glycobiology, April 1, 2009; 19(4): 331 - 336.
[Abstract] [Full Text] [PDF]


Home page
J Exp BotHome page
W.-Q. Song, Y.-M. Qin, M. Saito, T. Shirai, F. M. Pujol, A. J. Kastaniotis, J. K. Hiltunen, and Y.-X. Zhu
Characterization of two cotton cDNAs encoding trans-2-enoyl-CoA reductase reveals a putative novel NADPH-binding motif
J. Exp. Bot., April 1, 2009; 60(6): 1839 - 1848.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
O. Rossier, J. Dao, and N. P. Cianciotto
A type II secreted RNase of Legionella pneumophila facilitates optimal intracellular infection of Hartmannella vermiformis
Microbiology, March 1, 2009; 155(3): 882 - 890.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
S. D. Pope, L.-L. Chen, and V. Stewart
Purine Utilization by Klebsiella oxytoca M5al: Genes for Ring-Oxidizing and -Opening Enzymes
J. Bacteriol., February 1, 2009; 191(3): 1006 - 1017.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
K. Shibuya, T. Yamada, T. Suzuki, K. Shimizu, and K. Ichimura
InPSR26, a Putative Membrane Protein, Regulates Programmed Cell Death during Petal Senescence in Japanese Morning Glory
Plant Physiology, February 1, 2009; 149(2): 816 - 824.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
L. Zhang, L.-H. Tian, J.-F. Zhao, Y. Song, C.-J. Zhang, and Y. Guo
Identification of an Apoplastic Protein Involved in the Initial Phase of Salt Stress Response in Rice Root by Two-Dimensional Electrophoresis
Plant Physiology, February 1, 2009; 149(2): 916 - 928.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
K. A. Howell, R. Narsai, A. Carroll, A. Ivanova, M. Lohse, B. Usadel, A. H. Millar, and J. Whelan
Mapping Metabolic and Transcript Temporal Switches during Germination in Rice Highlights Specific Transcription Factors and the Role of RNA Instability in the Germination Process
Plant Physiology, February 1, 2009; 149(2): 961 - 980.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
T. A. Springer
Structural basis for selectin mechanochemistry
PNAS, January 6, 2009; 106(1): 91 - 96.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
W. Klimke, R. Agarwala, A. Badretdin, S. Chetvernin, S. Ciufo, B. Fedorov, B. Kiryutin, K. O'Neill, W. Resch, S. Resenchuk, et al.
The National Center for Biotechnology Information's Protein Clusters Database
Nucleic Acids Res., January 1, 2009; 37(suppl_1): D216 - D223.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Marchler-Bauer, J. B. Anderson, F. Chitsaz, M. K. Derbyshire, C. DeWeese-Scott, J. H. Fong, L. Y. Geer, R. C. Geer, N. R. Gonzales, M. Gwadz, et al.
CDD: specific functional annotation with the Conserved Domain Database
Nucleic Acids Res., January 1, 2009; 37(suppl_1): D205 - D210.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
E. A. Lambert and D. L. Popham
The Bacillus anthracis SleL (YaaH) Protein Is an N-Acetylglucosaminidase Involved in Spore Cortex Depolymerization
J. Bacteriol., December 1, 2008; 190(23): 7601 - 7607.
[Abstract] [Full Text] [PDF]


Home page
Eukaryot CellHome page
J. M. Palmer, R. M. Perrin, T. R. T. Dagenais, and N. P. Keller
H3K9 Methylation Regulates Growth and Development in Aspergillus fumigatus
Eukaryot. Cell, December 1, 2008; 7(12): 2052 - 2060.
[Abstract] [Full Text] [PDF]


Home page
J. Lipid Res.Home page
R. E. Gimeno and J. Cao
Thematic Review Series: Glycerolipids. Mammalian glycerol-3-phosphate acyltransferases: new genes for an old activity
J. Lipid Res., October 1, 2008; 49(10): 2079 - 2088.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. A. Larrea, I. M. Pedroso, A. Malhotra, and R. S. Myers
Identification of two conserved aspartic acid residues required for DNA digestion by a novel thermophilic Exonuclease VII in Thermotoga maritima
Nucleic Acids Res., October 1, 2008; 36(18): 5992 - 6003.
[Abstract] [Full Text] [PDF]


Home page
Arch OphthalmolHome page
J. H. Fingert, K. Oh, M. Chung, T. E. Scheetz, J. L. Andorf, R. M. Johnson, V. C. Sheffield, and E. M. Stone
Association of a Novel Mutation in the Retinol Dehydrogenase 12 (RDH12) Gene With Autosomal Dominant Retinitis Pigmentosa
Arch Ophthalmol, September 1, 2008; 126(9): 1301 - 1307.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. J. Rigden and M. Y. Galperin
Sequence analysis of GerM and SpoVS, uncharacterized bacterial 'sporulation' proteins with widespread phylogenetic distribution
Bioinformatics, August 15, 2008; 24(16): 1793 - 1797.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
G. Shen, H. S. Leonard, W. M. Schluchter, and D. A. Bryant
CpcM Posttranslationally Methylates Asparagine-71/72 of Phycobiliprotein Beta Subunits in Synechococcus sp. Strain PCC 7002 and Synechocystis sp. Strain PCC 6803
J. Bacteriol., July 15, 2008; 190(14): 4808 - 4817.
[Abstract] [Full Text] [PDF]


Home page
RNAHome page
K. Nykamp, M.-H. Lee, and J. Kimble
C. elegans La-related protein, LARP-1, localizes to germline P bodies and attenuates Ras-MAPK signaling during oogenesis
RNA, July 1, 2008; 14(7): 1378 - 1389.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
M. Simockova, R. Holic, D. Tahotna, J. Patton-Vogt, and P. Griac
Yeast Pgc1p (YPL206c) Controls the Amount of Phosphatidylglycerol via a Phospholipase C-type Degradation Mechanism
J. Biol. Chem., June 20, 2008; 283(25): 17107 - 17115.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
Y. Zhang, S. Feng, F. Chen, H. Chen, J. Wang, C. McCall, Y. Xiong, and X. W. Deng
Arabidopsis DDB1-CUL4 ASSOCIATED FACTOR1 Forms a Nuclear E3 Ubiquitin Ligase with DDB1 and CUL4 That Is Involved in Multiple Plant Developmental Processes
PLANT CELL, June 1, 2008; 20(6): 1437 - 1455.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. Csuros, I. B. Rogozin, and E. V. Koonin
Extremely Intron-Rich Genes in the Alveolate Ancestors Inferred with a Flexible Maximum-Likelihood Approach
Mol. Biol. Evol., May 1, 2008; 25(5): 903 - 911.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
J. C. Gauntlett, S. Gebhard, S. Keis, J. M. Manson, K. M. Pos, and G. M. Cook
Molecular Analysis of BcrR, a Membrane-bound Bacitracin Sensor and DNA-binding Protein from Enterococcus faecalis
J. Biol. Chem., March 28, 2008; 283(13): 8591 - 8600.
[Abstract] [Full Text] [PDF]


Home page
Bioscience HorizonsHome page
J. J. van Aartsen
The Klebsiella pheV tRNA locus: a hotspot for integration of alien genomic islands
Bioscience Horizons, March 1, 2008; 1(1): 51 - 60.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
S. S.-W. Cot, A. K.-C. So, and G. S. Espie
A Multiprotein Bicarbonate Dehydration Complex Essential to Carboxysome Function in Cyanobacteria
J. Bacteriol., February 1, 2008; 190(3): 936 - 945.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
M. L. Madsen, S. Puttamreddy, E. L. Thacker, M. D. Carruthers, and F. C. Minion
Transcriptome Changes in Mycoplasma hyopneumoniae during Infection
Infect. Immun., February 1, 2008; 76(2): 658 - 663.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
D. Kerk, G. Templeton, and G. B.G. Moorhead
Evolutionary Radiation Pattern of Novel Protein Phosphatases Revealed by Analysis of Protein Data from the Completely Sequenced Genomes of Humans, Green Algae, and Higher Plants
Plant Physiology, February 1, 2008; 146(2): 351 - 367.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. M.C. Robb, E. Ross, and A. S. Alvarado
SmedGD: the Schmidtea mediterranea genome database
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D599 - D606.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. M. R. Davila, P. N. Mendes, G. Wagner, D. A. Tschoeke, R. R. C. Cuadrat, F. Liberman, L. Matos, T. Satake, K. A. C. S. Ocana, O. Triana, et al.
ProtozoaDB: dynamic visualization and exploration of protozoan genomes
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D547 - D552.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
I. Pedroso, G. Rivera, F. Lazo, M. Chacon, F. Ossandon, F. A. Veloso, and D. S. Holmes
AlterORF: a database of alternate open reading frames
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D517 - D518.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
K. Denger, S. Weinitschke, T. H. M. Smits, D. Schleheck, and A. M. Cook
Bacterial sulfite dehydrogenases in organotrophic metabolism: separation and identification in Cupriavidus necator H16 and in Delftia acidovorans SPH-1
Microbiology, January 1, 2008; 154(1): 256 - 263.
[Abstract] [Full Text] [PDF]


Home page
MycologiaHome page
E. Chew, Y. Aweiss, C.-y. Lu, and F. Banuett
Fuz1, a MYND domain protein, is required for cell morphogenesis in Ustilago maydis
Mycologia, January 1, 2008; 100(1): 31 - 46.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
N. L. Hiller, B. Janto, J. S. Hogg, R. Boissy, S. Yu, E. Powell, R. Keefe, N. E. Ehrlich, K. Shen, J. Hayes, et al.
Comparative Genomic Analyses of Seventeen Streptococcus pneumoniae Strains: Insights into the Pneumococcal Supragenome
J. Bacteriol., November 15, 2007; 189(22): 8186 - 8195.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
H. Zhang, J. P. Herman, H. Bolton Jr., Z. Zhang, S. Clark, and L. Xun
Evidence that Bacterial ABC-Type Transporter Imports Free EDTA for Metabolism
J. Bacteriol., November 15, 2007; 189(22): 7991 - 7997.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
S. M. Tallent, T. B. Langston, R. G. Moran, and G. E. Christie
Transducing Particles of Staphylococcus aureus Pathogenicity Island SaPI1 Are Comprised of Helper Phage-Encoded Proteins
J. Bacteriol., October 15, 2007; 189(20): 7520 - 7524.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
B. S. Srinivasan, N. H. Shah, J. A. Flannick, E. Abeliuk, A. F. Novak, and S. Batzoglou
Current progress in network research: toward reference networks for key model organisms
Brief Bioinform, September 1, 2007; 8(5): 318 - 332.
[Abstract] [Full Text] [PDF]


Home page
DevelopmentHome page
C. C. Baker and M. T. Fuller
Translational control of meiotic cell cycle progression and spermatid differentiation in male germ cells by a novel eIF4G homolog
Development, August 1, 2007; 134(15): 2863 - 2869.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Smits, J. A. M. Smeitink, L. P. van den Heuvel, M. A. Huynen, and T. J. G. Ettema
Reconstructing the evolution of the mitochondrial ribosomal proteome
Nucleic Acids Res., July 9, 2007; 35(14): 4686 - 4703.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. Csuros, J. A. Holey, and I. B. Rogozin
In search of lost introns
Bioinformatics, July 1, 2007; 23(13): i87 - i96.
[Abstract] [Full Text] [PDF]


Home page
Cold Spring Harb Symp Quant BiolHome page
E. Garbarino-Pico and C. B. Green
Posttranscriptional Regulation of Mammalian Circadian Clock Output
Cold Spring Harb Symp Quant Biol, January 1, 2007; 72(0): 145 - 156.
[Abstract] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (1474K) Freely available
Right arrow Screen PDF (510K) Freely available
Right arrowOA All Versions of this Article:
35/suppl_1/D237    most recent
gkl951v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Marchler-Bauer, A.
Right arrow Articles by Bryant, S. H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Marchler-Bauer, A.
Right arrow Articles by Bryant, S. H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?