Skip Navigation


Nucleic Acids Research Advance Access originally published online on November 26, 2007
Nucleic Acids Research 2008 36(Database issue):D281-D288; doi:10.1093/nar/gkm960
This Article
Right arrow Abstract Freely available
Right arrow Print PDF (8891K) Freely available
Right arrow Screen PDF (1177K) Freely available
Right arrowOA All Versions of this Article:
36/suppl_1/D281    most recent
gkm960v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Finn, R. D.
Right arrow Articles by Bateman, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Finn, R. D.
Right arrow Articles by Bateman, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2008, Vol. 36, Database issue D281-D288
© 2007 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article appears in the following Nucleic Acids Research issue: Database issue [View the issue table of contents]

Articles

The Pfam protein families database

Robert D. Finn1,*, John Tate1, Jaina Mistry1, Penny C. Coggill1, Stephen John Sammut1, Hans-Rudolf Hotz1, Goran Ceric2, Kristoffer Forslund3, Sean R. Eddy2, Erik L. L. Sonnhammer3 and Alex Bateman1

1Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton Hall, Hinxton, Cambridgeshire, CB10 1SA, UK, 2Howard Hughes Medical Institute Janelia Farm Research Campus, 19700 Helix Drive, Ashburn, VA 20147, USA and 3Stockholm Bioinformatics Center, Albanova, Stockholm University, SE-10691 Stockholm, Sweden

*To whom correspondence should be addressed. Tel: +44 1223 495330; Fax: +44 1223 494919; Email: rdf{at}sanger.ac.uk

Received September 15, 2007. Revised October 10, 2007. Accepted October 16, 2007.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 COVERAGE UPDATE
 IMPROVING ACCESS TO PFAM
 WIDENING THE SCOPE OF...
 SUMMARY
 AVAILABILITY
 REFERENCES
 
Pfam is a comprehensive collection of protein domains and families, represented as multiple sequence alignments and as profile hidden Markov models. The current release of Pfam (22.0) contains 9318 protein families. Pfam is now based not only on the UniProtKB sequence database, but also on NCBI GenPept and on sequences from selected metagenomics projects. Pfam is available on the web from the consortium members using a new, consistent and improved website design in the UK (http://pfam.sanger.ac.uk/), the USA (http://pfam.janelia.org/) and Sweden (http://pfam.sbc.su.se/), as well as from mirror sites in France (http://pfam.jouy.inra.fr/) and South Korea (http://pfam.ccbb.re.kr/).


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 COVERAGE UPDATE
 IMPROVING ACCESS TO PFAM
 WIDENING THE SCOPE OF...
 SUMMARY
 AVAILABILITY
 REFERENCES
 
Pfam is designed to be a comprehensive and accurate collection of protein domains and families (1,2). Pfam families are divided into two categories, Pfam-A and Pfam-B. Each Pfam-A family consists of a curated seed alignment containing a small set of representative members of the family, profile hidden Markov models (profile HMMs) built from the seed alignment and an automatically generated full alignment which contains all detectable protein sequences belonging to the family, as defined by profile HMM searches of primary sequence databases. Pfam-B entries are automatically generated from the ProDom database (3), and are represented by a single alignment. The use of representative seed alignments for Pfam-A families allows efficient and sustainable manual curation of alignments and annotation, while the automatic generation of full alignments and Pfam-B clusters ensures that Pfam is a comprehensive classification of protein families that scales effectively with the growth of the sequence databases. Pfam data are freely accessible via the web and are available for download in a variety of forms (see availability section).


    COVERAGE UPDATE
 TOP
 ABSTRACT
 INTRODUCTION
 COVERAGE UPDATE
 IMPROVING ACCESS TO PFAM
 WIDENING THE SCOPE OF...
 SUMMARY
 AVAILABILITY
 REFERENCES
 
We quantify the completeness of the Pfam classification of sequence space using two measures of coverage. ‘Sequence coverage’ is the fraction of protein sequences listed in UniProtKB (4) that has at least one Pfam domain, whilst ‘residue coverage’ is the fraction of protein residues that fall within Pfam domains, as defined by the sub-sequences included in Pfam-A full alignments. The current Pfam release (version 22.0) contains a total of 9318 Pfam-A families, which cover 73.23% of sequences and 50.79% of residues found in UniProtKB version 9.7. Since the last Pfam publication (release 18.0), we have added 1335 families (a 17% increase) and maintained approximately the same coverage of UniProtKB, despite a 100% increase in the number of UniProtKB sequences.

Coverage of protein structures
The availability of 3D protein structures has been essential for finding distant evolutionary relationships and understanding protein function at the molecular level. Sequences with a known structure usually have a clear domain organization and ideally each domain should be represented in Pfam. We looked at a non-redundant set of sequences whose structures were deposited in the Protein Data Bank (PDB), and found more than 1000 structures, which did not have any corresponding Pfam-A domains. We also found that residue coverage of some PDB structures was <50%. In order to improve this situation, we built over 500 new Pfam-A families for PDB sequences and SCOP (Structural Classification of Proteins) database entries (6), which were not previously covered. As measured on the current database of non-redundant sequences of known structure (3 August 2007), our sequence and residue coverages are now 94.7% and 77.5%, respectively. We have developed a protocol to ensure that this level of coverage is maintained as the number of protein structures increases. As novel structures are identified through structural genomics programs so their sequences will be made a priority for curation into Pfam.

Expansion of Pfam clans
The concept of clans was introduced into Pfam in 2005 (2). Briefly, a clan is a collection of Pfam-A entries that are judged likely to be homologous. Clans are built manually, based on a wide variety of information sources: the primary literature, known structures, profile–profile comparisons and other databases such as SCOP (6). We try to represent the relationships between member families graphically by providing clan alignments. Clans thus define a simple hierarchical classification of Pfam entries, allowing better transfer of structural and/or functional information between families, and better predictions of function and structure for families of unknown function.

To infer relationships between families, we now use three different tools. In addition to the profile–profile comparison tool PRC (http://supfam.mrc-lmb.cam.ac.uk/PRC/), we now use another profile–profile comparisons tool HHsearch (7), and the simple comparison of outputs program, SCOOP (8). We use three different computational methods because each is sensitive to a slightly different set of relationships and, more importantly, the combination of the three tools reinforces independently detected relationships. Notably, SCOOP has allowed us to infer many novel relationships that were not detected using either of the profile–profile comparison tools. Pfam version 18.0 (the first major release of Pfam with clans) contained 172 clans, comprising 1181 Pfam-A families. As new families are added to Pfam and new relationships are discovered, we build new clans and add families to existing ones. In the current release of Pfam (version 22.0) there are now 283 clans, comprising a total of 1808 Pfam-A families, an increase of 53% since release 18.0. The proportion of Pfam domain hits that fall within a clan has increased from 31% in release 18.0 to 43% in release 22.0. This shows that many families in Pfam are related and that, to date, many of the largest Pfam-A families have been assigned into clans. We expect the clan classification to grow still further, since many automatically detected relationships still need to be manually verified.


    IMPROVING ACCESS TO PFAM
 TOP
 ABSTRACT
 INTRODUCTION
 COVERAGE UPDATE
 IMPROVING ACCESS TO PFAM
 WIDENING THE SCOPE OF...
 SUMMARY
 AVAILABILITY
 REFERENCES
 
Pfam website development
Although Pfam data have always been centrally maintained and curated, historically each member of the Pfam consortium has run a separate website to serve the same data. The three primary mirror sites are based in the UK, Sweden and the USA, with a further two recognized mirror sites in France and South Korea. Each of the primary consortium sites has tended to adopt a different look and feel and, although all sites have provided the same set of core services, each has also provided some additional tools and services that are unique to that particular site. This has lead to an entirely different user experience at each Pfam site, and has led to users’ confusion as to which site provides which services. The development of three main websites also caused a significant duplication of effort for the Pfam consortium.

A new Pfam website has been developed, with the goal of providing a single, unified website for Pfam data and services, that combines the best features of the separate sites in a single, common interface. In re-designing the website, we have been able not only to improve the navigation and architecture of the website itself, but also to design a more easily extensible and maintainable code-base for the future. This new code-base will be common to and developed by all members of the Pfam consortium. Furthermore, the new website code has been written with portability in mind and has been made publicly available, so that users may install and run the website locally if desired.

We have improved the organization and presentation of Pfam data. Everything related to, for example, a Pfam-A family, is collected into a single page, which is sub-divided into tab-panes that the user can easily switch between. Figure 1 shows a typical page for a Pfam-A family. We have similar tab-layout pages for data related to protein sequences, Pfam-B families, Pfam clans, proteome data from completed genomes and 3D protein structures. Each type of page represents a different route into the Pfam data, and each tabbed page provides links that allow the user to navigate easily between these different sections of Pfam. Additionally, users can browse lists of Pfam families or clans and can jump quickly between any type of entry in the site via a ‘jump to’ box found on most pages.


Figure 1
View larger version (79K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 1. The Pfam family page from the new website. This page shows the summary information for the Piwi domain. The tabs on the left allow users to browse the different types of associated information and beneath the tabs is the ‘jump to’ box, a tool which can direct the user to the page for any other entry in the site, given any type of accession or identifier. The panel at the top right gives a summary of the number of protein architectures, sequences, interactions, species and structures available. The same page layout and navigational tools are common to all of the different types of data in the Pfam website.

 
A common feature of every type of page is a summary box, providing the salient details of every entry in a single glance. The five summarized features of the entry are: the number of architectures associated with the entry; the number of protein sequences; the number of interactions [as determined by iPfam (9)]; the number of species; and the number of 3D structures. The exact meaning of each value is context-dependent, so that in the Pfam family page, for example, the structure icon shows the number of structures associated with that family, whilst in a protein sequence page the structure icon shows the number of structures, which map to that sequence. The link for each icon is also context-dependent, taking the user to the most appropriate section of the page for the icon clicked.

Previously, it has been difficult to search Pfam by species or taxonomic division. In addition to the species tree found on each family page, which provides a breakdown of the species found in that family, we have implemented a new taxonomy search tool. As with the taxonomy search tool in the old Pfam website, the new tool returns a list of Pfam domains that match a Boolean query expression. For example, the query ‘Caenorhabditis elegans AND NOT Homo sapiens’ will return all Pfam domains found in C. elegans, but are not found in H. sapiens. As well as being less error prone and significantly quicker than the version in the old Pfam website, the new taxonomy search tool also provides a feedback mechanism that suggests organism names as the user enters them. This reduces the likelihood of typographical or spelling errors in queries, since incorrectly entered species terms are immediately highlighted in the interface, as well as providing an insight into the organisms that are found in the database.

A commonly requested capability for the Pfam site is the ability to find Pfam domains that are unique to a given taxonomic division or species. This feature is now available. For example, searching for unique ‘Metazoa’ families returns a list of domains that are found only in Metazoans will be returned.

In addition to the standard features of the old Pfam websites, such as search tools for quickly finding Pfam domains on a protein sequence or for locating sequences with a specified domain architecture, we have also introduced several new features in the new site, many of which use the Distributed Annotation System (DAS) (10) to aggregate multiple data sources in a single display.

The Distributed Annotation System
We have improved access to Pfam by providing data through the DAS. DAS is a system for disseminating annotations and alignments of DNA or protein sequences through a simple, web-based protocol. Three types of Pfam data are now available via DAS (11): domain annotations for both Pfam-A and Pfam-B families; sequence features such as active sites (12) and transmembrane region predictions; and seed and full alignments for Pfam-A families. The availability of Pfam data via DAS enables users to access specific parts of the database as a web service, without the need to download and install it in its entirety.

We have also been able to incorporate other data sources that are accessible through DAS, in order to enrich our own display of Pfam data. One feature of the new website is a DAS-based viewer for sequence annotations (Figure 2). This allows the user to view annotations of protein sequences from a wide range of third-party databases alongside information from Pfam itself. The viewer presents the standard Pfam domain structure image, showing the arrangement of Pfam domains on the sequence in question, and allows users to add or hide annotations from any of the available DAS sources. As the user moves their mouse over each feature, a tool-tip gives detailed information about it. If provided by the external DAS source, a link to further information is also given.


Figure 2
View larger version (57K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 2. The Pfam protein sequence page, showing the DAS annotation viewer. Various tracks can be selected using the check boxes beneath the annotation view, allowing the user to view features derived from any of the listed DAS sources. For instance, in the example displayed, the membrane topology calculated by Phobius (18) can be viewed alongside the Pfam domain annotations and those from a variety of different domain databases. The figure shows a tool-tip for one feature, which gives the details of the annotation and, in this case, highlights the link to further information.

 
Another use of DAS within the new website is in the Pfam sequence alignment viewer. Pfam provides two alignments for every family: the seed alignment is a manually curated alignment of related sequences and generally contains a relatively small number of sequences; the full alignment is generated by searching the sequence database using the HMM for the family and may contain a very large number of sequences (the largest alignment, that of GP120, currently contains over 68 000 sequences). Historically, it has been difficult, if not impossible, to view the largest sequence alignments in a web browser, due simply to the size of the resulting web page. We have implemented a DAS-based sequence alignment viewer (shown in Figure 3) that is able to present even the largest alignments in manageable portions, by retrieving only the required section of the alignment and rendering it as HTML. This allows the user to scroll through wide alignments (those with long sequences) or to page through long alignments (those with a large number of sequences), without having to load the entire alignment into their browser. Alignments are coloured according to a pre-calculated consensus sequence, which is also retrieved via DAS, and in this way even alignment fragments can be marked-up using the properties of the whole alignment.


Figure 3
View larger version (86K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 3. The DAS-based Pfam sequence alignment viewer. A portion of the Piwi domain full alignment is shown with residue conservation highlighted in a ClustalX (19) style colour scheme.

 

    WIDENING THE SCOPE OF PFAM ANNOTATIONS
 TOP
 ABSTRACT
 INTRODUCTION
 COVERAGE UPDATE
 IMPROVING ACCESS TO PFAM
 WIDENING THE SCOPE OF...
 SUMMARY
 AVAILABILITY
 REFERENCES
 
NCBI GenPept sequences
The two main public repositories of protein sequence data are UniProtKB (4) and the GenPept database from NCBI (13). These two resources are independent of each other and consequently contain two separate, though often overlapping, sets of sequences, which are referred to by entirely separate sets of accessions. Historically, Pfam has been based on a sequence database termed ‘pfamseq’, which is a frozen version of the UniProtKB database that we update at each major Pfam release. This has caused problems for users wanting to retrieve Pfam data for a sequence for which they have only a GenPept accession or the NCBI GI number.

To make Pfam sequence annotation accessible via both UniProtKB and GenPept accessions, we now provide Pfam domain assignments for members of both sequence databases, each as a separate section of Pfam. Where possible, we transfer the annotation from UniProtKB sequences to the equivalent sequence in GenPept, by using the EMBL/GenBank cross-references (13,14) in the UniProtKB entry and ensuring that the CRC64 checksums of the sequences from the two databases are the same. For example, of the five listed protein identifiers cross-referenced by the UniProtKB accession P51003 [GenBank] (AAH00927 [GenBank] .1, AAH36014 [GenBank] .1, CAD62628 [GenBank] .1, CAD66560 [GenBank] .1, CAD61935 [GenBank] .1), only one GenPept protein (AAH36014 [GenBank] .1) has the same CRC64 checksum as the UniProtKB entry. As a quality control procedure, we make use of the UniProtKB/Swiss-Prot mapping provided by GenPept to perform the reverse mapping. Overall, this mapping procedure provides a UniProtKB equivalent for around 75% of sequences in GenPept. We search the remaining GenPept sequences against the library of Pfam HMMs and use the pre-defined, curated thresholds to determine which sequences should be included in the full alignment for a family. Two or more families belonging to the same clan may match the same sequence region. We resolve such overlaps using the same method as we use when resolving overlapping matches between clan families that have been searched against the UniProtKB sequence database (2). This ensures that there are no overlapping sequences between families that belong to the same clan.

The Pfam domain annotations and alignments for GenPept (release 158) are available for download in a flat-file format (Pfam-A.full.ncbi), as an ASCII representation of the domains matches on each sequence (similar to the swisspfam file) and in the Pfam MySQL database. We also provide access to this data via the websites, where Genbank identifiers (GI numbers) or GenPept protein identifiers can be entered into the ‘jump to’ box. Searching in this way will take users to a page that is similar to the protein pages produced for UniProtKB sequences. One caveat when using this data is that we do not resolve overlaps between domains for GenPept sequences. However, since we curate Pfam-A domain thresholds in a conservative manner to ensure high specificity (at the expense of some sensitivity), we expect the number of domain overlaps for the GenPept data to be low. Furthermore, three-quarters of the sequences in GenPept are identical to a UniProtKB entry, which are guaranteed to be non-overlapping.

Metagenomic samples
Current microbiological culturing techniques are inadequate for studying the vast majority of microorganisms (15). Consequently, many organisms remain under-represented in the main sequence databases. An emerging field in biology is metagenomics, the analysis of genomic material from environmental samples. Recently, with the advent of better sequencing technologies, large samples from environments such as the sea have been sequenced directly, thereby avoiding the need for culturing. Sequencing using this approach gives rise to many sequences from a diverse set of organisms, albeit at low read coverage and with no knowledge of the source organisms. In addition, when compared with proteins in UniProtKB, the sequences from metagenomic samples are more fragmentary.

Within Pfam we have collected together several available metagenomic datasets and amalgamated them into a single sequence collection (listed on the ftp site), which we have termed ‘metaseq’. Currently, the only centralized public repository for such sequences that we are aware of is the UniProt Metagenomic and Environmental Sequence database (UniMes). However, UniMes currently only contains data from the Global Ocean Sampling (GOS) Expedition (16), the largest and most publicized of such environmental sampling projects. When the UniMes database becomes more comprehensive, we will use this as the underlying sequence database.

There are currently more than 6.6 million sequences in the ‘metaseq’ database, making it significantly larger than the current version of ‘pfamseq’, which currently contains around 4.5 million sequences. We have searched the sequences in ‘metaseq’ against the library of Pfam HMMs. All sequence regions that score above the pre-defined curated thresholds have been recorded, and for each family, the significant matches aligned, in the same manner as we generate our full alignments. These domain annotations and alignments are available both in flat-file formats (as with the NCBI GenPept dataset) and in the MySQL database. The metagenomics domain annotations can also be retrieved via the website. Similar to the NCBI dataset, ‘metaseq’ accessions and identifiers can be used to retrieve a graphical representation of the sequence and Pfam domains (if any have been found). As with the GenPept data, we have not resolved any overlaps, but, unlike the situation with the GenPept data, we have not ‘competed’ the overlapping sequence hits for families within clans, which means that there will inevitably be some overlapping hits between families that belong to the same clan.

The metagenomics dataset contains many novel protein sequences, which are currently unannotated. This section within Pfam enables the community to assess our current understanding of the domain composition found in such environmental datasets. Uniquely, users will also be able to access domain alignments that can be compared to those historically found in Pfam. It was estimated that there are over 1000 new protein families that are not currently represented by Pfam in the GOS dataset alone (17). Thus, this dataset will provide a potential source of new Pfam families and/or allow verification of families where there are few representatives found in UniProtKB.


    SUMMARY
 TOP
 ABSTRACT
 INTRODUCTION
 COVERAGE UPDATE
 IMPROVING ACCESS TO PFAM
 WIDENING THE SCOPE OF...
 SUMMARY
 AVAILABILITY
 REFERENCES
 
In the last 2 years the Pfam database has continued to grow, improving both coverage and quality of families. In particular, we have widened the scope of Pfam to include sequences from the GenPept database, as well as providing matches to new metagenomics sequence data. We have also developed a new website that provides a unified view for the primary Pfam consortium sites. Pfam has been curating protein families for over 10 years, but there is still much to be done to provide a complete and accurate classification of proteins.


    AVAILABILITY
 TOP
 ABSTRACT
 INTRODUCTION
 COVERAGE UPDATE
 IMPROVING ACCESS TO PFAM
 WIDENING THE SCOPE OF...
 SUMMARY
 AVAILABILITY
 REFERENCES
 
Pfam data can be downloaded directly from the WTSI FTP site (ftp://ftp.sanger.ac.uk/pub/databases/Pfam), either as flat files or in the form of MySQL table dumps. You can visit new Pfam websites at WTSI (http://pfam.sanger.ac.uk/), Stockholm Bioinformatics Center (http://pfam.sbc.su.se/) and Janelia Farm (http://pfam.janelia.org/). The source code for the website can be retrieved by CVS from the WTSI CVS repository, that can be browsed at http://cvs.sanger.ac.uk/cgi-bin/viewcvs.cgi/?root=PfamWeb. Instructions for downloading the code directly from CVS are available at http://cvs.sanger.ac.uk/cvs.users.shtml.


    ACKNOWLEDGEMENTS
 
The authors would like to thank Roger Pettett and Jody Clements for useful discussions on the design and implementation of the new website. We are grateful for the infrastructure support provided by Guy Coates, Tim Cutts and Andy Bryant at Wellcome Trust Sanger Institute (WTSI). Finally, we would to like to thank all of the users of Pfam who have submitted new families and/or annotation updates for existing entries. R.D.F., J.T., J.M., P.C.C., S.J.S., H.-R.H. and A.B. are funded by The Wellcome Trust, R.D.F. and J.M. were funded partly by an MRC (UK) E-science grant (G0100305). S.R.E. and G.C. are supported by the Howard Hughes Medical Institute. K.F. and E.L.L.S. are funded by Stockholm University, Royal Institute of Technology and the Swedish Natural Sciences Research Council. Funding to pay the Open Access publication charges for this article was provided by The Wellcome Trust.

Conflict of interest statement. None declared.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 COVERAGE UPDATE
 IMPROVING ACCESS TO PFAM
 WIDENING THE SCOPE OF...
 SUMMARY
 AVAILABILITY
 REFERENCES
 

  1. Sonnhammer ELL, Eddy SR, Durbin R. Pfam: a comprehensive database of protein domain families based on seed alignments. Proteins (1997) 28:405–420.[CrossRef][Web of Science][Medline]

  2. Finn RD, Mistry J, Schuster-Bockler B, Griffiths-Jones S, Hollich V, Lassmann T, Moxon S, Marshall M, Khanna A, et al. Pfam: clans, web tools and services. Nucleic Acids Res. (2006) 34:D247–D251.[Abstract/Free Full Text]

  3. Bru C, Courcelle E, Carrere S, Beausse Y, Dalmar S, Kahn D. The ProDom database of protein domain families: more emphasis on 3D. Nucleic Acids Res. (2005) 33:D212–D215.[Abstract/Free Full Text]

  4. Bairoch A, Apweiler R, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, et al. The Universal Protein Resource (UniProt). Nucleic Acids Res. (2007) 35:D193–D197.[Abstract/Free Full Text]

  5. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. The Protein Data Bank. Nucleic Acids Res. (2000) 28:235–242.[Abstract/Free Full Text]

  6. Andreeva A, Howorth D, Brenner SE, Hubbard TJ, Chothia C, Murzin AG. SCOP database in 2004: refinements integrate structure and sequence family data. Nucleic Acids Res. (2004) 32:D226–D229.[Abstract/Free Full Text]

  7. Soding J. Protein homology detection by HMM-HMM comparison. Bioinformatics (2005) 21:951–960.[Abstract/Free Full Text]

  8. Bateman A, Finn RD. SCOOP: a simple method for identification of novel protein superfamily relationships. Bioinformatics (2007) 23:809–814.[Abstract/Free Full Text]

  9. Finn RD, Marshall M, Bateman A. iPfam: visualization of protein–protein interactions in PDB at domain and amino acid resolutions. Bioinformatics (2005) 21:410–412.[Abstract/Free Full Text]

  10. Dowell RD, Jokerst RM, Day A, Eddy SR, Stein L. The distributed annotation system. BMC Bioinformatics (2001) 2:7.[CrossRef][Medline]

  11. Finn RD, Stalker JW, Jackson DK, Kulesha E, Clements J, Pettett R. ProServer: a simple, extensible Perl DAS server. Bioinformatics (2007) 23:1568–1570.[Abstract/Free Full Text]

  12. Mistry J, Bateman A, Finn RD. Predicting active site residue annotations in the Pfam Database. BMC Bioinformatics (2007) 8:298.[CrossRef][Medline]

  13. Wheeler DL, Barrett T, Benson DA, Bryant SH, Canese K, Chetvernin V, Church DM, DiCuccio M, Edgar R, et al. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. (2007) 35:D5–D12.[Abstract/Free Full Text]

  14. Kulikova T, Akhtar R, Aldebert P, Althorpe N, Andersson M, Baldwin A, Bates K, Bhattacharyya S, Bower L, et al. EMBL Nucleotide Sequence Database in 2006. Nucleic Acids Res. (2007) 35:D16–D20.[Abstract/Free Full Text]

  15. Schloss PD, Handelsman J. Metagenomics for studying unculturable microorganisms: cutting the Gordian knot. Genome Biol. (2005) 6:229.[CrossRef][Medline]

  16. Rusch DB, Halpern AL, Sutton G, Heidelberg KB, Williamson S, Yooseph S, Wu D, Eisen JA, Hoffman JM, et al. The Sorcerer II Global Ocean Sampling expedition: northwest Atlantic through eastern tropical Pacific. PLoS Biol. (2007) 5:e77.[CrossRef][Medline]

  17. Yooseph S, Sutton G, Rusch DB, Halpern AL, Williamson SJ, Remington K, Eisen JA, Heidelberg KB, Manning G, et al. The Sorcerer II Global Ocean Sampling expedition: expanding the universe of protein families. PLoS Biol. (2007) 5:e16.[CrossRef][Medline]

  18. Kall L, Krogh A, Sonnhammer EL. A combined transmembrane topology and signal peptide prediction method. J. Mol. Biol. (2004) 338:1027–1036.[CrossRef][Web of Science][Medline]

  19. Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, et al. ClustalW and ClustalX version 2.0. Bioinformatics (2007) 23:2947–2948.[Abstract/Free Full Text]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J. Bacteriol.Home page
M. Gomelsky
Cyclic-di-GMP-Binding CRP-Like Protein: a Spectacular New Role for a Veteran Signal Transduction Actor
J. Bacteriol., November 15, 2009; 191(22): 6785 - 6787.
[Full Text] [PDF]


Home page
J. Bacteriol.Home page
V. N. Kouvelis, E. Saunders, T. S. Brettin, D. Bruce, C. Detter, C. Han, M. A. Typas, and K. M. Pappas
Complete Genome Sequence of the Ethanol Producer Zymomonas mobilis NCIMB 11163
J. Bacteriol., November 15, 2009; 191(22): 7140 - 7141.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. Li, Y. He, G. Ding, C. Wang, L. Xie, and Y. Li
dbDEPC: a database of Differentially Expressed Proteins in human Cancers
Nucleic Acids Res., November 9, 2009; (2009) gkp933v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. E. Ulrich and I. B. Zhulin
The MiST2 database: a comprehensive genomics resource on microbial signal transduction
Nucleic Acids Res., November 9, 2009; (2009) gkp940v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Muller, D. Szklarczyk, P. Julien, I. Letunic, A. Roth, M. Kuhn, S. Powell, C. von Mering, T. Doerks, L. J. Jensen, et al.
eggNOG v2.0: extending the evolutionary genealogy of genes with enhanced non-supervised orthologous groups, species and functional annotations
Nucleic Acids Res., November 9, 2009; (2009) gkp951v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. Davidsen, E. Beck, A. Ganapathy, R. Montgomery, N. Zafar, Q. Yang, R. Madupu, P. Goetz, K. Galinsky, O. White, et al.
The comprehensive microbial resource
Nucleic Acids Res., November 5, 2009; (2009) gkp912v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
N. D. Rawlings, A. J. Barrett, and A. Bateman
MEROPS: the peptidase database
Nucleic Acids Res., November 5, 2009; (2009) gkp971v1.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
J. N. Murphy and C. W. Saltikov
The ArsR Repressor Mediates Arsenite-Dependent Regulation of Arsenate Respiration and Detoxification Operons of Shewanella sp. Strain ANA-3
J. Bacteriol., November 1, 2009; 191(21): 6722 - 6731.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
S. D. Friedman, F. J. Genthner, J. Gentry, M. D. Sobsey, and J. Vinje
Gene Mapping and Phylogenetic Analysis of the Complete Genome from 30 Single-Stranded RNA Male-Specific Coliphages (Family Leviviridae)
J. Virol., November 1, 2009; 83(21): 11233 - 11243.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
B. Li, V. G. Krishnan, M. E. Mort, F. Xin, K. K. Kamati, D. N. Cooper, S. D. Mooney, and P. Radivojac
Automated inference of molecular mechanisms of disease from amino acid substitutions
Bioinformatics, November 1, 2009; 25(21): 2744 - 2750.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
M. Libault, T. Joshi, V. A. Benedito, D. Xu, M. K. Udvardi, and G. Stacey
Legume Transcription Factor Genes: What Makes Legumes So Special?
Plant Physiology, November 1, 2009; 151(3): 991 - 1001.
[Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Vanhee, J. Reumers, F. Stricher, L. Baeten, L. Serrano, J. Schymkowitz, and F. Rousseau
PepX: a structural database of non-redundant protein-peptide complexes
Nucleic Acids Res., October 30, 2009; (2009) gkp893v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
V. M. Markowitz, I-M. A. Chen, K. Palaniappan, K. Chu, E. Szeto, Y. Grechkin, A. Ratner, I. Anderson, A. Lykidis, K. Mavromatis, et al.
The integrated microbial genomes system: an expanding comparative analysis resource
Nucleic Acids Res., October 28, 2009; (2009) gkp887v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Perez-Rodriguez, D. M. Riano-Pachon, L. G. G. Correa, S. A. Rensing, B. Kersten, and B. Mueller-Roeber
PlnTFDB: updated content and new features of the plant transcription factor database
Nucleic Acids Res., October 25, 2009; (2009) gkp805v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. J. A. Sigrist, L. Cerutti, E. de Castro, P. S. Langendijk-Genevaux, V. Bulliard, A. Bairoch, and N. Hulo
PROSITE, a protein domain database for functional characterization and annotation
Nucleic Acids Res., October 25, 2009; (2009) gkp885v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Velankar, C. Best, B. Beuth, C. H. Boutselakis, N. Cobley, A. W. Sousa Da Silva, D. Dimitropoulos, A. Golovin, M. Hirshberg, M. John, et al.
PDBe: Protein Data Bank in Europe
Nucleic Acids Res., October 25, 2009; (2009) gkp916v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Mihara, T. Itoh, and T. Izawa
SALAD database: a motif-based database of protein annotations for plant comparative genomics
Nucleic Acids Res., October 23, 2009; (2009) gkp831v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Z.-X. Tian, E. Fargier, M. Mac Aogain, C. Adams, Y.-P. Wang, and F. O'Gara
Transcriptome profiling defines a novel regulon modulated by the LysR-type transcriptional regulator MexT in Pseudomonas aeruginosa
Nucleic Acids Res., October 21, 2009; (2009) gkp828v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. J. Roberts, T. Vincze, J. Posfai, and D. Macelis
REBASE--a database for DNA restriction and modification: enzymes, genes and genomes
Nucleic Acids Res., October 21, 2009; (2009) gkp874v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. M. Brinkac, T. Davidsen, E. Beck, A. Ganapathy, E. Caler, R. J. Dodson, A. S. Durkin, D. M. Harkins, H. Lorenzi, R. Madupu, et al.
Pathema: a clade-specific bioinformatics resource center for pathogen research
Nucleic Acids Res., October 20, 2009; (2009) gkp850v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
K. Kuchta, L. Knizewski, L. S. Wyrwicz, L. Rychlewski, and K. Ginalski
Comprehensive classification of nucleotidyltransferase fold proteins: identification of novel families and their representatives in human
Nucleic Acids Res., October 15, 2009; (2009) gkp854v1.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
C. Rancurel, M. Khosravi, A. K. Dunker, P. R. Romero, and D. Karlin
Overlapping Genes Produce Proteins with Unusual Sequence Properties and Offer Insight into De Novo Protein Creation
J. Virol., October 15, 2009; 83(20): 10719 - 10736.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
A. Briegel, D. R. Ortega, E. I. Tocheva, K. Wuichet, Z. Li, S. Chen, A. Muller, C. V. Iancu, G. E. Murphy, M. J. Dobro, et al.
Universal architecture of bacterial chemoreceptor arrays
PNAS, October 6, 2009; 106(40): 17181 - 17186.
[Abstract] [Full Text] [PDF]


Home page
JCBHome page
J. R. Skaar, D. J. Richard, A. Saraf, A. Toschi, E. Bolderson, L. Florens, M. P. Washburn, K. K. Khanna, and M. Pagano
INTS3 controls the hSSB1-mediated DNA damage response
J. Cell Biol., October 5, 2009; 187(1): 25 - 32.
[Abstract] [Full Text] [PDF]


Home page
Clin. Microbiol. Rev.Home page
J. Strahilevitz, G. A. Jacoby, D. C. Hooper, and A. Robicsek
Plasmid-Mediated Quinolone Resistance: a Multifaceted Threat
Clin. Microbiol. Rev., October 1, 2009; 22(4): 664 - 689.
[Abstract] [Full Text] [PDF]


Home page
J. Gen. Virol.Home page
K. Rosario, S. Duffy, and M. Breitbart
Diverse circovirus-like genome architectures revealed by environmental metagenomics
J. Gen. Virol., October 1, 2009; 90(10): 2418 - 2424.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. A. Grobei, E. Qeli, E. Brunner, H. Rehrauer, R. Zhang, B. Roschitzki, K. Basler, C. H. Ahrens, and U. Grossniklaus
Deterministic protein inference for shotgun proteomics data provides new insights into Arabidopsis pollen development and function
Genome Res., October 1, 2009; 19(10): 1786 - 1800.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
T. J. Sharpton, J. E. Stajich, S. D. Rounsley, M. J. Gardner, J. R. Wortman, V. S. Jordar, R. Maiti, C. D. Kodira, D. E. Neafsey, Q. Zeng, et al.
Comparative genomic analyses of the human fungal pathogens Coccidioides and their relatives
Genome Res., October 1, 2009; 19(10): 1722 - 1731.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. Forslund and E. L. Sonnhammer
Benchmarking homology detection procedures with low complexity filters
Bioinformatics, October 1, 2009; 25(19): 2500 - 2505.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. Liu, X.-w. Chen, and R. Jothi
Knowledge-guided inference of domain-domain interactions from incomplete protein-protein interaction networks
Bioinformatics, October 1, 2009; 25(19): 2492 - 2499.
[Abstract] [Full Text] [PDF]


Home page
DatabaseHome page
R. P. Huntley, D. Binns, E. Dimmer, D. Barrell, C. O'Donovan, and R. Apweiler
QuickGO: a user tutorial for the web-based Gene Ontology browser
Database, September 30, 2009; 2009(0): bap010 - bap010.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. P. C. da Cunha, P. A. F. Galante, J. E. de Souza, R. F. de Souza, P. M. Carvalho, D. T. Ohara, R. P. Moura, S. M. Oba-Shinja, S. K. N. Marie, W. A. Silva Jr, et al.
Bioinformatics construction of the human cell surfaceome
PNAS, September 29, 2009; 106(39): 16752 - 16757.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Steinbiss, U. Willhoeft, G. Gremme, and S. Kurtz
Fine-grained annotation and classification of de novo predicted LTR retrotransposons
Nucleic Acids Res., September 28, 2009; (2009) gkp759v1.
[Abstract] [Full Text] [PDF]


Home page
J R Soc InterfaceHome page
M. Izumi, A. M. Sweeney, D. DeMartini, J. C. Weaver, M. L. Powers, A. Tao, T. V. Silvas, R. M. Kramer, W. J. Crookes-Goodson, L. M. Mathger, et al.
Changes in reflectin protein phosphorylation are associated with dynamic iridescence in squid
J R Soc Interface, September 23, 2009; (2009) rsif.2009.0299v1.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
D. Albanesi, M. Martin, F. Trajtenberg, M. C. Mansilla, A. Haouz, P. M. Alzari, D. de Mendoza, and A. Buschiazzo
Structural plasticity and catalysis regulation of a thermosensor histidine kinase
PNAS, September 22, 2009; 106(38): 16185 - 16190.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
N. Delmotte, C. Knief, S. Chaffron, G. Innerebner, B. Roschitzki, R. Schlapbach, C. von Mering, and J. A. Vorholt
Community proteogenomics reveals insights into the physiology of phyllosphere bacteria
PNAS, September 22, 2009; 106(38): 16428 - 16433.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
B. Contreras-Moreira
3D-footprint: a database for the structural analysis of protein-DNA complexes
Nucleic Acids Res., September 18, 2009; (2009) gkp781v1.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
S. Campuzano, B. Serra, D. Llull, J. L. Garcia, and P. Garcia
Cloning, Expression, and Characterization of a Peculiar Choline-Binding {beta}-Galactosidase from Streptococcus mitis
Appl. Envir. Microbiol., September 15, 2009; 75(18): 5972 - 5980.
[Abstract] [Full Text] [PDF]


Home page
Sci SignalHome page
A. Ernst, S. L. Sazinsky, S. Hui, B. Currell, M. Dharsee, S. Seshagiri, G. D. Bader, and S. S. Sidhu
Rapid Evolution of Functional Complexity in a Domain Family
Sci. Signal., September 8, 2009; 2(87): ra50 - ra50.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. C. Friedel, L. Dolken, Z. Ruzsics, U. H. Koszinowski, and R. Zimmer
Conserved principles of mammalian transcriptional regulation revealed by RNA half-life
Nucleic Acids Res., September 1, 2009; 37(17): e115 - e115.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Y.-Q. Shen, B. F. Lang, and G. Burger
Diversity and dispersal of a ubiquitous protein family: acyl-CoA dehydrogenases
Nucleic Acids Res., September 1, 2009; 37(17): 5619 - 5631.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
W. Xiong, T. Li, K. Chen, and K. Tang
Local combinational variables: an approach used in DNA-binding helix-turn-helix motif prediction with sequence information
Nucleic Acids Res., September 1, 2009; 37(17): 5632 - 5640.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
W. Liu, D. Li, J. Wang, H. Xie, Y. Zhu, and F. He
Proteome-wide Prediction of Signal Flow Direction in Protein Interaction Networks Based on Interacting Domains
Mol. Cell. Proteomics, September 1, 2009; 8(9): 2063 - 2070.
[Abstract] [Full Text] [PDF]


Home page
Mol PlantHome page
M. Mutwil, C. Ruprecht, F. M. Giorgi, M. Bringmann, B. Usadel, and S. Persson
Transcriptional Wiring of Cell Wall-Related Genes in Arabidopsis
Mol Plant, September 1, 2009; 2(5): 1015 - 1024.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
N. Conde e Silva, I. R. Goncalves, M. Lemaire, E. Lesuisse, J. M. Camadro, and P. L. Blaiseau
KlAft, the Kluyveromyces lactis Ortholog of Aft1 and Aft2, Mediates Activation of Iron-Responsive Transcription Through the PuCACCC Aft-Type Sequence
Genetics, September 1, 2009; 183(1): 93 - 106.
[Abstract] [Full Text] [PDF]


Home page
J BiochemHome page
K. Mitsunaga, J. Harada-Itadani, T. Shikanai, H. Tateno, Y. Ikehara, J. Hirabayashi, H. Narimatsu, and T. Angata
Human C21orf63 is a Heparin-binding Protein
J. Biochem., September 1, 2009; 146(3): 369 - 373.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. Rambaldi and F. D. Ciccarelli
FancyGene: dynamic visualization of gene structures and protein domain architectures on genomic loci
Bioinformatics, September 1, 2009; 25(17): 2281 - 2282.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
V. M. Markowitz, K. Mavromatis, N. N. Ivanova, I-M. A. Chen, K. Chu, and N. C. Kyrpides
IMG ER: a system for microbial genome annotation expert review and curation
Bioinformatics, September 1, 2009; 25(17): 2271 - 2278.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
S. Teotia and R. S. Lamb
The Paralogous Genes RADICAL-INDUCED CELL DEATH1 and SIMILAR TO RCD ONE1 Have Partially Redundant Functions during Arabidopsis Development
Plant Physiology, September 1, 2009; 151(1): 180 - 198.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
T. Uehara, T. Dinh, and T. G. Bernhardt
LytM-Domain Factors Are Required for Daughter Cell Separation and Rapid Ampicillin-Induced Lysis in Escherichia coli
J. Bacteriol., August 15, 2009; 191(16): 5094 - 5107.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
C. Choudhary, C. Kumar, F. Gnad, M. L. Nielsen, M. Rehman, T. C. Walther, J. V. Olsen, and M. Mann
Lysine Acetylation Targets Protein Complexes and Co-Regulates Major Cellular Functions
Science, August 14, 2009; 325(5942): 834 - 840.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
M. Guo, Y. E. Chong, K. Beebe, R. Shapiro, X.-L. Yang, and P. Schimmel
The C-Ala Domain Brings Together Editing and Aminoacylation Functions on One tRNA
Science, August 7, 2009; 325(5941): 744 - 747.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
Q. H. Christensen and J. E. Cronan
The Thermoplasma acidophilum LplA-LplB Complex Defines a New Class of Bipartite Lipoate-protein Ligases
J. Biol. Chem., August 7, 2009; 284(32): 21317 - 21326.
[Abstract] [Full Text] [PDF]


Home page
DatabaseHome page
L. Wissler, E. Dattolo, A. D. Moore, T. B. H. Reusch, J. L. Olsen, M. Migliaccio, E. Bornberg-Bauer, and G. Procaccini
Dr. Zompo: an online data repository for Zostera marina and Posidonia oceanica ESTs
Database, August 4, 2009; 2009(0): bap009 - bap009.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
A. Piotrowski, P. Burghout, and D. A. Morrison
spr1630 Is Responsible for the Lethality of clpX Mutations in Streptococcus pneumoniae
J. Bacteriol., August 1, 2009; 191(15): 4888 - 4895.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. A. McMahon, G. A. Roberts, K. A. Johnson, L. P. Cooper, H. Liu, J. H. White, L. G. Carter, B. Sanghvi, M. Oke, M. D. Walkinshaw, et al.
Extensive DNA mimicry by the ArdA anti-restriction protein and its role in the spread of antibiotic resistance
Nucleic Acids Res., August 1, 2009; 37(15): 4887 - 4897.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. F. Neuwald
Rapid detection, classification and accurate alignment of up to a million or more related protein sequences
Bioinformatics, August 1, 2009; 25(15): 1869 - 1875.
[Abstract] [Full Text] [PDF]


Home page
Biol. Reprod.Home page
J. Tejomurtula, K.-B. Lee, S. K. Tripurani, G. W. Smith, and J. Yao
Role of Importin Alpha8, a New Member of the Importin Alpha Family of Nuclear Transport Proteins, in Early Embryonic Development in Cattle
Biol Reprod, August 1, 2009; 81(2): 333 - 342.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
J. C. Setubal, P. dos Santos, B. S. Goldman, H. Ertesvag, G. Espin, L. M. Rubio, S. Valla, N. F. Almeida, D. Balasubramanian, L. Cromes, et al.
Genome Sequence of Azotobacter vinelandii, an Obligate Aerobe Specialized To Support Diverse Anaerobic Metabolic Processes
J. Bacteriol., July 15, 2009; 191(14): 4534 - 4545.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
Y.-J. Pan, C.-C. Cho, Y.-Y. Kao, and C.-H. Sun
A Novel WRKY-like Protein Involved in Transcriptional Activation of Cyst Wall Protein Genes in Giardia lamblia
J. Biol. Chem., July 3, 2009; 284(27): 17975 - 17988.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
S. Kimura, Y. Shiraiwa, and I. Suzuki
Function of the N-terminal region of the phosphate-sensing histidine kinase, SphS, in Synechocystis sp. PCC 6803
Microbiology, July 1, 2009; 155(7): 2256 - 2264.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
B. W. Brandt and J. Heringa
webPRC: the Profile Comparer for alignment-based searching of public domain databases
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W48 - W52.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. I. Sadreyev, M. Tang, B.-H. Kim, and N. V. Grishin
COMPASS server for homology detection: improved statistical accuracy, speed and functionality
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W90 - W94.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C.-C. Chen, C.-Y. Lin, Y.-S. Lo, and J.-M. Yang
PPISearch: a web server for searching homologous protein-protein interactions across multiple species
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W369 - W375.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Bruggeman, J. Heringa, and B. W. Brandt
PhyloPars: estimation of missing parameter values using phylogeny
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W179 - W184.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Rose, S. Lorenzen, A. Goede, B. Gruening, and P. W. Hildebrand
RHYTHM--a server to predict the orientation of transmembrane helices in channels and membrane-coils
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W575 - W580.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Shridhar, D. Chattopadhyay, and G. Yadav
PLecDom: a program for identification and analysis of plant lectin domains
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W452 - W458.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. T.-H. Chang, T.-Y. Chien, and C.-Y. Chen
seeMotif: exploring and visualizing sequence motifs in 3D structures
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W552 - W558.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Kuzniar, K. Lin, Y. He, H. Nijveen, S. Pongor, and J. A. M. Leunissen
ProGMap: an integrated annotation resource for protein orthology
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W428 - W434.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. Blankenburg, F. Ramirez, J. Buch, and M. Albrecht
DASMIweb: online integration, analysis and assessment of distributed protein interaction data
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W122 - W128.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
M. Sandoval-Calderon, O. Geiger, Z. Guan, F. Barona-Gomez, and C. Sohlenkamp
A Eukaryote-like Cardiolipin Synthase Is Present in Streptomyces coelicolor and in Most Actinobacteria
J. Biol. Chem., June 26, 2009; 284(26): 17383 - 17390.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
S. Ehira, H. Ogino, H. Teramoto, M. Inui, and H. Yukawa
Regulation of Quinone Oxidoreductase by the Redox-sensing Transcriptional Regulator QorR in Corynebacterium glutamicum
J. Biol. Chem., June 19, 2009; 284(25): 16736 - 16742.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
O. Fornes, R. Aragues, J. Espadaler, M. A. Marti-Renom, A. Sali, and B. Oliva
ModLink+: improving fold recognition by using protein-protein interactions
Bioinformatics, June 15, 2009; 25(12): 1506 - 1512.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Ramana and D. Gupta
ProtVirDB: a database of protozoan virulent proteins
Bioinformatics, June 15, 2009; 25(12): 1568 - 1569.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
X. Guo and A. J. Hartemink
Domain-oriented edge-based alignment of protein interaction networks
Bioinformatics, June 15, 2009; 25(12): i240 - 1246.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Y. Han, J. M. Burnette III, and S. R. Wessler
TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences
Nucleic Acids Res., June 1, 2009; 37(11): e78 - e78.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. N. Messina and E. L. L. Sonnhammer
DASher: a stand-alone protein sequence client for DAS, the Distributed Annotation System
Bioinformatics, May 15, 2009; 25(10): 1333 - 1334.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Blankenburg, R. D. Finn, A. Prlic, A. M. Jenkinson, F. Ramirez, D. Emig, S.-E. Schelhorn, J. Buch, T. Lengauer, and M. Albrecht
DASMI: exchanging, annotating and assessing molecular interaction data
Bioinformatics, May 15, 2009; 25(10): 1321 - 1328.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
G. Minasov, S. Padavattan, L. Shuvalova, J. S. Brunzelle, D. J. Miller, A. Basle, C. Massa, F. R. Collart, T. Schirmer, and W. F. Anderson
Crystal Structures of YkuI and Its Complex with Second Messenger Cyclic Di-GMP Suggest Catalytic Mechanism of Phosphodiester Bond Cleavage by EAL Domains
J. Biol. Chem., May 8, 2009; 284(19): 13174 - 13184.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
M. K. Basu, E. Poliakov, and I. B. Rogozin
Domain mobility in proteins: functional and evolutionary implications
Brief Bioinform, May 1, 2009; 10(3): 205 - 216.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (8891K) Freely available
Right arrow Screen PDF (1177K) Freely available
Right arrowOA All Versions of this Article:
36/suppl_1/D281    most recent
gkm960v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Finn, R. D.
Right arrow Articles by Bateman, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Finn, R. D.
Right arrow Articles by Bateman, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?