Nucleic Acids Research, 2003, Vol. 31, No. 1 383-387
© 2003 Oxford University Press
CDD: a curated Entrez database of conserved domain alignments
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, Room 8N805, 8600 Rockville Pike, Bethesda, MD 20894, USA
*To whom correspondence should be addressed. Tel: +1 301 435 4919; Fax: +1 301 480 9241; Email: bauer@ncbi.nlm.nih.gov
Received September 30, 2002; Accepted October 2, 2002
ABSTRACT
The Conserved Domain Database (CDD) is now indexed as a separate database within the Entrez system and linked to other Entrez databases such as MEDLINE®. This allows users to search for domain types by name, for example, or to view the domain architecture of any protein in Entrez's sequence database. CDD can be accessed on the World Wide Web at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=cdd. Users may also employ the CD-Search service to identify conserved domains in new sequences, at http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi. CD-Search results, and pre-computed links from Entrez's protein database, are calculated using the RPS-BLAST algorithm and Position Specific Score Matrices (PSSMs) derived from CDD alignments. CD-Searches are also run by default for protein–protein queries submitted to BLAST® at http://www.ncbi.nlm.nih.gov/BLAST.
CDD mirrors the publicly available domain alignment collections SMART and Pfam, and now also contains alignment models curated at NCBI. Structure information is used to identify the core substructure likely to be present in all family members, and to produce sequence alignments consistent with structure conservation. This alignment model allows NCBI curators to annotate columns corresponding to functional sites conserved among family members.
INTRODUCTION
Protein domains are distinct units of protein three-dimensional structure, which also carry function. Proteins can be composed of single or multiple domains. As units of divergent molecular evolution, domains offer a rational level at which the protein universe may be studied. A few thousand conserved domain models are sufficient to cover more than two-thirds of known protein sequences, greatly reducing the redundancy encountered in analyses involving sequence databases. Thus the annotation of protein sequence data with the locations and extents of conserved domains has become an indispensable tool in the comparative analysis of genes and genomes. Pre-calculated assignments of functional and structural domains on protein sequences may provide valuable insights into the molecular evolution of single- and multiple-domain proteins, as well as help to validate other annotation.
We have continued to mirror two public collections of conserved domain models, Pfam (1) and SMART (2), and to convert their alignment models into searchable databases of Position Specific Score Matrices (PSSMs) (3). The current version of the CDD contains about 5000 such un-curated models, with sequence alignments imported from outside sources. In addition to this set, a first batch of several hundred curated alignment models is being offered.
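To make the conversion from alignment model to PSSM more concrete, the following is a minimal sketch of how a multiple alignment can be turned into a log-odds scoring matrix. It is not the procedure used to build CDD: the uniform background frequency, the add-one pseudocount, and the function names `build_pssm` and `score_segment` are all simplifying assumptions chosen for illustration.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_pssm(alignment, pseudocount=1.0):
    """Return one {residue: log-odds score} dict per alignment column.

    Assumes a uniform background (1/20) and add-`pseudocount` smoothing;
    both are illustrative choices, not the CDD procedure.
    """
    if not alignment:
        raise ValueError("alignment is empty")
    width = len(alignment[0])
    if any(len(row) != width for row in alignment):
        raise ValueError("all rows must have the same length")
    background = 1.0 / len(AMINO_ACIDS)
    pssm = []
    for col in range(width):
        # Gaps and non-standard characters are ignored when counting.
        counts = Counter(row[col] for row in alignment if row[col] in AMINO_ACIDS)
        total = sum(counts.values()) + pseudocount * len(AMINO_ACIDS)
        scores = {}
        for aa in AMINO_ACIDS:
            freq = (counts[aa] + pseudocount) / total
            scores[aa] = math.log2(freq / background)
        pssm.append(scores)
    return pssm


def score_segment(pssm, segment):
    """Sum the column scores for an ungapped segment as long as the PSSM."""
    if len(segment) != len(pssm):
        raise ValueError("segment length must equal PSSM width")
    return sum(column[aa] for column, aa in zip(pssm, segment))
```

Residues that are frequent in a column receive positive scores and rare ones negative scores, so a segment resembling the aligned family scores higher than an unrelated one.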
NCBI-curated alignments are meant to give an accurate representation of the conserved core of a protein domain family, and can be used to instantiate approximate three-dimensional models for aligned sequences, assuming that the 3D structure of at least one family member is known. In some cases we need to resolve conflicts between imported sequence alignments and 3D structure information (4), such as the actual extents of conserved domains, the location and extents of conserved core blocks, and particular alignment details. Curated alignments are also meant to record conserved functional features, if applicable, in a way that assists visualization and permits computational transfer of such features across the family.
CDD CONTENTS
Access
CDD is an integral part of the Entrez data retrieval system (5), and can be accessed by querying or linking to Entrez's Domains database. This allows retrieval by domain names and keywords found in functional descriptions. Conserved Domains are linked to the NCBI Taxonomy Database, PubMed®, and Entrez's protein database, which provides additional search mechanisms. A query of Entrez's PubMed Database, for example, may identify citations referring to a particular type of domain. Links from these citations to the Domains database might help find the corresponding domain in CDD, as abstracts in PubMed often contain additional search terms not found in terse domain descriptions, and most entries in CDD are linked to relevant, carefully chosen citations.
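Retrieval by domain name or keyword can also be scripted. The sketch below only builds a query URL and does not contact any server. It assumes the present-day NCBI E-utilities `esearch` endpoint with `db=cdd`, which is not described in this article; the helper name `cdd_search_url` is hypothetical.

```python
from urllib.parse import urlencode

# Assumption: the E-utilities esearch endpoint, not the interactive
# Entrez page cited in the article.
EUTILS_ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def cdd_search_url(term, retmax=20):
    """Build an esearch URL that looks up `term` in the Entrez CDD database."""
    params = {"db": "cdd", "term": term, "retmax": retmax}
    return f"{EUTILS_ESEARCH}?{urlencode(params)}"


if __name__ == "__main__":
    print(cdd_search_url("DNA ligase"))
```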
We make extensive use of pre-calculated CD-Searches for proteins in Entrez. CDART, which stores that information, can be invoked from within Entrez to visualize domain architectures (6). Proteins in Entrez can now be neighboured by similar domain architecture, in addition to sequence similarity as detected by BLAST (7). Conserved Domains are neighboured to others by similarity, highlighting evolutionary relationships between families as well as the redundancy in the dataset, and by co-occurrence, highlighting domains that are found next to each other in a set of protein sequences.
Data sources
Most of the domain models in CDD have been imported from two outside sources, Pfam and SMART. CDD also contains a small set of models labelled LOAD, and several hundred curated domain models, most of which are originally based on imported SMART and Pfam families. Some of the curated models have been generated de novo, to increase CDD coverage with respect to three-dimensional structures in MMDB (8). New SMART and Pfam distributions are imported on a regular basis, typically with a delay of several weeks. Upon import, we identify sequence fragments used in the alignments so that they can be linked to corresponding protein entries in Entrez. We also identify closely related three-dimensional structures in MMDB, so that alignment rows can be replaced with sequences corresponding to those structures. This allows us to present integrated sequence/structure/alignment views using Cn3D (9) as a helper application.
Links and Neighbours
Conserved Domains in Entrez are linked to PubMed citations, nodes in NCBI's taxonomy tree, and Entrez protein entries. PubMed identifiers are supplied by CDD's source databases, and those links are subject to change in curated CDs. For each domain alignment model, the set of representative sequences defines a common node in NCBI's taxonomy tree. Links to these common nodes are recorded in the database. The CDART database is the source both for links between CDs and proteins and for CD neighbour data (6). CDART is populated with results from CD-Searches comparing all of the proteins in Entrez to the current set of conserved domain models. CD–protein links are recorded for significant hits from these database searches, those yielding E-values of 1e-2 or less. We record two types of CD–CD neighbour relationships. Two CDs are defined as similar if they hit overlapping intervals on a set of protein sequences. Two CDs are defined as co-occurring if they hit non-overlapping intervals on sets of protein sequences.
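The common taxonomy node of a set of representative sequences can be read as the deepest node shared by all of their lineages. The sketch below illustrates that idea on a child-to-parent mapping. The toy tree `TOY_PARENT` and the function names are hypothetical, and the actual NCBI implementation is not described in the text.

```python
# Hypothetical, heavily abbreviated taxonomy: child -> parent.
TOY_PARENT = {
    "Homo sapiens": "Eukaryota",
    "Saccharomyces cerevisiae": "Eukaryota",
    "Escherichia coli": "Bacteria",
    "Methanocaldococcus jannaschii": "Archaea",
    "Eukaryota": "cellular organisms",
    "Bacteria": "cellular organisms",
    "Archaea": "cellular organisms",
}


def lineage(taxon, parent):
    """Return the path from `taxon` up to the root, taxon first."""
    path = [taxon]
    seen = {taxon}
    while taxon in parent:
        taxon = parent[taxon]
        if taxon in seen:
            raise ValueError("cycle in taxonomy")
        seen.add(taxon)
        path.append(taxon)
    return path


def common_node(taxa, parent):
    """Return the deepest node shared by the lineages of all `taxa`."""
    taxa = list(taxa)
    if not taxa:
        raise ValueError("no taxa given")
    first = lineage(taxa[0], parent)
    shared = set(first)
    for taxon in taxa[1:]:
        shared &= set(lineage(taxon, parent))
    for node in first:  # ordered from leaf to root, so the first match is deepest
        if node in shared:
            return node
    raise ValueError("taxa do not share a root")
```

A family represented only by human and yeast sequences would map to Eukaryota in this toy tree, while adding a bacterial member moves the common node up to the root.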
USING CDD TO FIND DOMAINS IN ENTREZ
Users of NCBI's services are likely to encounter CDD in two ways. (i) When protein query sequences are submitted for BLAST searches against protein databases, the queries will be submitted to CD-Search by default, and the results, if any, will be displayed graphically on the intermediate BLAST results page. Clicking on the image will launch a browser window with the detailed results, which allow further analysis. (ii) Pre-calculated CD-Search results exist for proteins in Entrez, and are readily available by following the [Domains] link associated with protein records and document summaries. One might, for example, study a hypothetical protein from a complete genome sequence, say gi|2495965 from Methanocaldococcus jannaschii. Following the [Domains] link and expanding the summary to show more details will produce a graphical display, as shown in Figure 1. While the protein maps to a conserved family of unknown function (DUF135/pfam02003), the sequence also produces hits to two models for DNA ligases (pfam01068 and LOAD_ligase). In fact these three are grouped together with other domains as related in the CDART database, as displayed on each member's conserved domain summary page. This larger group of related domains comprises ATP- and NAD-dependent DNA ligases, whose adenylation domains are known to share a well-conserved core structure around the active site (10). The representative model from the LOAD set, LOAD_ligase, aligns very diverse members from a large superfamily, also including RNA ligases and mRNA capping enzymes (11).
These and other interesting family relationships are recorded implicitly in CDD and CDART. Related domains share subsets of sequences for which overlapping intervals hit both domains with significant E-values in CD-Searches. The pre-recorded relationships help understand the redundancy in the imported and curated collections.
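How such relationships could be derived from CD-Search hits is sketched below. Hits above the E-value cutoff of 1e-2 quoted for CD–protein links are discarded, and each remaining pair of different domains on the same protein is labelled `similar` when their intervals overlap and `co-occurring` otherwise. The `Hit` record, the function names, and the choice to record a relation from a single protein are assumptions; the text does not state how many proteins are required or how much overlap counts.

```python
from collections import defaultdict
from itertools import combinations
from typing import NamedTuple

E_VALUE_CUTOFF = 1e-2  # significance threshold quoted for CD-protein links


class Hit(NamedTuple):
    protein: str
    domain: str
    start: int   # inclusive residue position
    end: int     # inclusive residue position
    evalue: float


def overlaps(a, b):
    """True if the two hit intervals share at least one residue."""
    return a.start <= b.end and b.start <= a.end


def neighbour_relations(hits, cutoff=E_VALUE_CUTOFF):
    """Map each unordered domain pair to the set of relations observed."""
    by_protein = defaultdict(list)
    for hit in hits:
        if hit.evalue <= cutoff:  # keep significant hits only
            by_protein[hit.protein].append(hit)
    relations = defaultdict(set)
    for protein_hits in by_protein.values():
        for a, b in combinations(protein_hits, 2):
            if a.domain == b.domain:
                continue
            pair = tuple(sorted((a.domain, b.domain)))
            relations[pair].add("similar" if overlaps(a, b) else "co-occurring")
    return dict(relations)
```

With invented coordinates, two ligase models hitting residues 10-200 and 15-190 of one protein would be reported as similar, while a third model hitting residues 250-300 would be co-occurring with both.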
But how do we know whether these relationships are indicative of common molecular function? Multiple alignments are readily available for inspection, with the ability to colour by conservation. If a three-dimensional structure has been linked to the domain model, Entrez's structure viewer Cn3D can be used to interactively visualize structure and sequence data for a family. With these tools, and by exploring relevant literature, starting from CD-linked citations, the user may understand that it is in fact the catalytic core that is preserved among these families, and that they are likely to share a common enzymatic mechanism. However, if the location of functionally relevant residues had been recorded in the alignment models, it might have been easier to arrive at that conclusion.
MANUAL CURATION OF DOMAIN ALIGNMENT MODELS
Recording conserved features is one of the major tasks of expert domain alignment curation undertaken at NCBI. Alignment displays will highlight selected features, so that users can examine their agreement with residue conservation patterns, in particular when examining alignments with queries merged in according to CD-Search results. This may help, for example, to resolve the significance of CD-Search hits when the reported scores and E-values are not convincing.
For selected features we record structure evidence, which can be visualized with Cn3D. Most commonly structure evidence is used with the annotation of sites involved in binding of cofactors, substrates, and other biopolymers, to indicate that we know about actual three-dimensional data sets demonstrating such molecular complexes. Figure 2 shows an example of such structure evidence.
Conserved features can be recorded only for residues aligned consistently across the family model. We find it necessary to re-evaluate and often change imported alignments to ensure this consistency. We also attempt to define the conserved core structure when curating alignments of diverse families, in agreement with data from comparative analysis of 3D structure (4).
We plan to update the contents of curated CDs periodically. The update process mines the CDART database for additional members of the domain family that are sufficiently dissimilar to the rows already aligned to add new information. Updates are curated, so that family membership derived from the automated procedure is validated. In the process of curating updates, new family members may suggest changes to the existing core model, for example, or provide additional data for feature annotation and feature evidence.
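One simple way to decide whether a candidate adds something new to an alignment is to cap its identity to every existing row. The sketch below illustrates that filter; the 80% cap, the use of percent identity as the dissimilarity measure, and the function names are assumptions, since the text does not specify the criterion used in the update process.

```python
def percent_identity(a, b):
    """Percent identity over columns where both aligned rows have a residue."""
    if len(a) != len(b):
        raise ValueError("rows must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    return 100.0 * sum(x == y for x, y in pairs) / len(pairs)


def novel_candidates(existing_rows, candidates, max_identity=80.0):
    """Keep candidates whose best identity to any existing row is below the cap.

    `candidates` maps a sequence name to its row in the same alignment frame.
    The default cap is a hypothetical value chosen for illustration.
    """
    kept = []
    for name, row in candidates.items():
        best = max((percent_identity(row, old) for old in existing_rows), default=0.0)
        if best < max_identity:
            kept.append(name)
    return kept
```

Candidates that pass such a filter would still need the curator review described above before being added to a model.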
FUTURE DEVELOPMENTS
With data imported from a variety of sources, and with the addition of curated versions of many models to the collection, the set of conserved domains in CDD has become redundant. Relationships between conserved domains, as mentioned in the example above, may be interesting and lead to discovery, but they may equally just point out duplication of data. We intend to explicitly record relationships between curated domain models in CDD, by curating hierarchies of conserved domain models. Diverse families will be represented by parent alignments with many divergent members, and very often it will be desirable to also represent more specific sub-families, for more precise functional annotation. If the resulting family relationships are recorded and clearly presented when visualizing results, users will be able to focus on the interesting aspects of data redundancy.
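A hierarchy of this kind can be pictured as a tree in which a parent model covers a diverse family and child models cover more specific sub-families. The sketch below shows one possible representation and a helper that reports the most specific model among a set of hits. The class `DomainModel`, the function `most_specific`, and the accessions used with them are hypothetical; the planned CDD data model is not described in the text.

```python
from dataclasses import dataclass, field


@dataclass
class DomainModel:
    accession: str
    children: list = field(default_factory=list)

    def add_child(self, child):
        """Attach a sub-family model and return it for chaining."""
        self.children.append(child)
        return child


def most_specific(root, hit_accessions):
    """Return the deepest model in the hierarchy whose accession was hit.

    Returns None if no model in the tree was hit. Ties between models at
    the same depth are broken arbitrarily in this sketch.
    """
    best, best_depth = None, -1
    stack = [(root, 0)]
    while stack:
        node, depth = stack.pop()
        if node.accession in hit_accessions and depth > best_depth:
            best, best_depth = node.accession, depth
        stack.extend((child, depth + 1) for child in node.children)
    return best
```

Reporting only the most specific hit is one way redundant parent and child hits could be collapsed when results are displayed.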
ACKNOWLEDGEMENTS
We thank the NIH Intramural Research Program for support. We thank the authors of Pfam, SMART and LOAD for creating invaluable resources and for helping with access to data. We are grateful to the NCBI BLAST group for developing RPS-BLAST and for their continued support. Comments, suggestions, and questions are welcome and should be directed to info@ncbi.nlm.nih.gov.
REFERENCES
1. Bateman,A., Birney,E., Cerruti,L., Durbin,R., Etwiller,L., Eddy,S.R., Griffiths-Jones,S., Howe,K.L., Marshall,M. and Sonnhammer,E.L. (2002) The Pfam protein families database. Nucleic Acids Res., 30, 276–280.
2. Letunic,I., Goodstadt,L., Dickens,N.J., Doerks,T., Schultz,J., Mott,R., Ciccarelli,F., Copley,R.R., Ponting,C.P. and Bork,P. (2002) Recent improvements to the SMART domain-based sequence annotation resource. Nucleic Acids Res., 30, 242–244.
3. Marchler-Bauer,A., Panchenko,A.R., Shoemaker,B.A., Thiessen,P.A., Geer,L.Y. and Bryant,S.H. (2002) CDD: a database of conserved domain alignments with links to domain three-dimensional structure. Nucleic Acids Res., 30, 281–283.
4. Marchler-Bauer,A., Panchenko,A.R., Ariel,N. and Bryant,S.H. (2002) Comparison of sequence and structure alignments for protein domains. Proteins, 48, 439–446.
5. Wheeler,D.L., Church,D.M., Lash,A.E., Leipe,D.D., Madden,T.L., Pontius,J.U., Schuler,G.D., Schriml,L.M., Tatusova,T.A., Wagner,L. and Rapp,B.A. (2002) Database resources of the National Center for Biotechnology Information: 2002 update. Nucleic Acids Res., 30, 13–16.
6. Geer,L.Y., Domrachev,M., Lipman,D.J. and Bryant,S.H. (2002) CDART: Protein Homology by Domain Architecture. Genome Res., 12, 1619–1623.
7. Altschul,S.F., Madden,T.L., Schäffer,A.A., Zhang,J., Zhang,Z., Miller,W. and Lipman,D.J. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res., 25, 3389–3402.
8. Chen,J., Anderson,J.B., DeWeese-Scott,C., Fedorova,N.D., Geer,L.Y., He,S., Hurwitz,D.I., Jackson,J.D., Jacobs,A.R., Lanczycki,C.J., Liebert,C.A., Madej,T., Marchler-Bauer,A., Marchler,G.H., Mazumder,R., Nikolskaya,A.N., Rao,B.S., Panchenko,A.R., Shoemaker,B.A., Song,J.S., Thiessen,P.A., Vasudevan,S., Wang,Y., Yamashita,R.A., Yin,J.J. and Bryant,S.H. (2003) MMDB: Entrez's 3D-Structure Database. Nucleic Acids Res., 31, 474–477.
9. http://www.ncbi.nlm.nih.gov/Structure/CN3D/cn3d.shtml.
10. Singleton,M.R., Hakannson,K., Timson,D.J. and Wigley,D.B. (1999) Structure of the adenylation domain of an NAD+-dependent DNA ligase. Struct. Fold. Des., 15, 35–42.
11. Aravind,L. and Koonin,E.V. (1999) Gleaning non-trivial structural, functional and evolutionary information about proteins by iterative database searches. J. Mol. Biol., 287, 1023–1040.