Nucleic Acids Research Advance Access originally published online on November 26, 2007
Nucleic Acids Research 2008 36(Database issue):D281-D288; doi:10.1093/nar/gkm960
Nucleic Acids Research, 2008, Vol. 36, Database issue D281-D288
© 2007 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
The Pfam protein families database
1Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton Hall, Hinxton, Cambridgeshire, CB10 1SA, UK, 2Howard Hughes Medical Institute Janelia Farm Research Campus, 19700 Helix Drive, Ashburn, VA 20147, USA and 3Stockholm Bioinformatics Center, Albanova, Stockholm University, SE-10691 Stockholm, Sweden
*To whom correspondence should be addressed. Tel: +44 1223 495330; Fax: +44 1223 494919; Email: rdf{at}sanger.ac.uk
Received September 15, 2007. Revised October 10, 2007. Accepted October 16, 2007.
| ABSTRACT |
|---|
Pfam is a comprehensive collection of protein domains and families, represented as multiple sequence alignments and as profile hidden Markov models. The current release of Pfam (22.0) contains 9318 protein families. Pfam is now based not only on the UniProtKB sequence database, but also on NCBI GenPept and on sequences from selected metagenomics projects. Pfam is available on the web from the consortium members using a new, consistent and improved website design in the UK (http://pfam.sanger.ac.uk/), the USA (http://pfam.janelia.org/) and Sweden (http://pfam.sbc.su.se/), as well as from mirror sites in France (http://pfam.jouy.inra.fr/) and South Korea (http://pfam.ccbb.re.kr/).
| INTRODUCTION |
|---|
Pfam is designed to be a comprehensive and accurate collection of protein domains and families (1,2). Pfam families are divided into two categories, Pfam-A and Pfam-B. Each Pfam-A family consists of a curated seed alignment containing a small set of representative members of the family, profile hidden Markov models (profile HMMs) built from the seed alignment and an automatically generated full alignment which contains all detectable protein sequences belonging to the family, as defined by profile HMM searches of primary sequence databases. Pfam-B entries are automatically generated from the ProDom database (3), and are represented by a single alignment. The use of representative seed alignments for Pfam-A families allows efficient and sustainable manual curation of alignments and annotation, while the automatic generation of full alignments and Pfam-B clusters ensures that Pfam is a comprehensive classification of protein families that scales effectively with the growth of the sequence databases. Pfam data are freely accessible via the web and are available for download in a variety of forms (see availability section).
| COVERAGE UPDATE |
|---|
We quantify the completeness of the Pfam classification of sequence space using two measures of coverage. Sequence coverage is the fraction of protein sequences listed in UniProtKB (4) that have at least one Pfam domain, whilst residue coverage is the fraction of protein residues that fall within Pfam domains, as defined by the sub-sequences included in Pfam-A full alignments. The current Pfam release (version 22.0) contains a total of 9318 Pfam-A families, which cover 73.23% of sequences and 50.79% of residues found in UniProtKB version 9.7. Since the last Pfam publication (release 18.0), we have added 1335 families (a 17% increase) and maintained approximately the same coverage of UniProtKB, despite a 100% increase in the number of UniProtKB sequences.
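The two coverage measures can be illustrated with a minimal Python sketch. The function name `coverage`, the toy sequence lengths and the domain coordinates are illustrative assumptions, not Pfam code or data. Regions are taken to be 1-based and inclusive, and overlapping regions are merged so that no residue is counted twice.

```python
def coverage(seq_lengths, domain_regions):
    """Return (sequence_coverage, residue_coverage) as fractions.

    seq_lengths: dict mapping sequence ID -> length in residues.
    domain_regions: dict mapping sequence ID -> list of (start, end)
        domain regions, 1-based and inclusive.
    """
    covered_seqs = 0
    covered_residues = 0
    for seq_id, length in seq_lengths.items():
        regions = sorted(domain_regions.get(seq_id, []))
        if regions:
            covered_seqs += 1
        last_end = 0  # highest residue position counted so far
        for start, end in regions:
            start = max(start, last_end + 1)  # skip residues already counted
            end = min(end, length)
            if end >= start:
                covered_residues += end - start + 1
                last_end = end
    total_residues = sum(seq_lengths.values())
    return covered_seqs / len(seq_lengths), covered_residues / total_residues


# Toy data (hypothetical): three sequences, two of which carry domains.
lengths = {"seqA": 100, "seqB": 200, "seqC": 100}
regions = {"seqA": [(1, 50)], "seqB": [(10, 109), (100, 159)]}
seq_cov, res_cov = coverage(lengths, regions)
print(f"sequence coverage {seq_cov:.2%}, residue coverage {res_cov:.2%}")
```

In this toy set two of the three sequences have a domain, and 200 of the 400 residues fall inside a domain region.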
Coverage of protein structures
The availability of 3D protein structures has been essential for finding distant evolutionary relationships and understanding protein function at the molecular level. Sequences with a known structure usually have a clear domain organization and ideally each domain should be represented in Pfam. We looked at a non-redundant set of sequences whose structures were deposited in the Protein Data Bank (PDB) (5), and found more than 1000 structures that did not have any corresponding Pfam-A domains. We also found that residue coverage of some PDB structures was <50%. To improve this situation, we built over 500 new Pfam-A families for PDB sequences and SCOP (Structural Classification of Proteins) database entries (6) that were not previously covered. As measured on the current database of non-redundant sequences of known structure (3 August 2007), our sequence and residue coverages are now 94.7% and 77.5%, respectively. We have developed a protocol to ensure that this level of coverage is maintained as the number of protein structures increases. As novel structures are identified through structural genomics programs, their sequences will be made a priority for curation into Pfam.
Expansion of Pfam clans
The concept of clans was introduced into Pfam in 2005 (2). Briefly, a clan is a collection of Pfam-A entries that are judged likely to be homologous. Clans are built manually, based on a wide variety of information sources: the primary literature, known structures, profile–profile comparisons and other databases such as SCOP (6). We try to represent the relationships between member families graphically by providing clan alignments. Clans thus define a simple hierarchical classification of Pfam entries, allowing better transfer of structural and/or functional information between families, and better predictions of function and structure for families of unknown function.
To infer relationships between families, we now use three different tools. In addition to the profile–profile comparison tool PRC (http://supfam.mrc-lmb.cam.ac.uk/PRC/), we now use a second profile–profile comparison tool, HHsearch (7), and the simple comparison of outputs program, SCOOP (8). We use three different computational methods because each is sensitive to a slightly different set of relationships and, more importantly, the combination of the three tools reinforces independently detected relationships. Notably, SCOOP has allowed us to infer many novel relationships that were not detected using either of the profile–profile comparison tools. Pfam version 18.0 (the first major release of Pfam with clans) contained 172 clans, comprising 1181 Pfam-A families. As new families are added to Pfam and new relationships are discovered, we build new clans and add families to existing ones. In the current release of Pfam (version 22.0) there are now 283 clans, comprising a total of 1808 Pfam-A families, an increase of 53% in the number of clan member families since release 18.0. The proportion of Pfam domain hits that fall within a clan has increased from 31% in release 18.0 to 43% in release 22.0. This shows that many families in Pfam are related and that, to date, many of the largest Pfam-A families have been assigned to clans. We expect the clan classification to grow still further, since many automatically detected relationships still need to be manually verified.
| IMPROVING ACCESS TO PFAM |
|---|
Pfam website development
Although Pfam data have always been centrally maintained and curated, historically each member of the Pfam consortium has run a separate website to serve the same data. The three primary mirror sites are based in the UK, Sweden and the USA, with a further two recognized mirror sites in France and South Korea. Each of the primary consortium sites has tended to adopt a different look and feel and, although all sites have provided the same set of core services, each has also provided some additional tools and services that are unique to that particular site. This has led to an entirely different user experience at each Pfam site, and to confusion among users as to which site provides which services. The development of three main websites also caused a significant duplication of effort for the Pfam consortium.
A new Pfam website has been developed, with the goal of providing a single, unified website for Pfam data and services that combines the best features of the separate sites in a single, common interface. In re-designing the website, we have been able not only to improve the navigation and architecture of the website itself, but also to design a more easily extensible and maintainable code-base for the future. This new code-base will be common to and developed by all members of the Pfam consortium. Furthermore, the new website code has been written with portability in mind and has been made publicly available, so that users may install and run the website locally if desired.
We have improved the organization and presentation of Pfam data. Everything related to, for example, a Pfam-A family, is collected into a single page, which is sub-divided into tab-panes that the user can easily switch between. Figure 1 shows a typical page for a Pfam-A family. We have similar tab-layout pages for data related to protein sequences, Pfam-B families, Pfam clans, proteome data from completed genomes and 3D protein structures. Each type of page represents a different route into the Pfam data, and each tabbed page provides links that allow the user to navigate easily between these different sections of Pfam. Additionally, users can browse lists of Pfam families or clans and can jump quickly between any type of entry in the site via a 'jump to' box found on most pages.
A common feature of every type of page is a summary box, providing the salient details of every entry at a single glance. The five summarized features of the entry are: the number of architectures associated with the entry; the number of protein sequences; the number of interactions [as determined by iPfam (9)]; the number of species; and the number of 3D structures. The exact meaning of each value is context-dependent, so that in the Pfam family page, for example, the structure icon shows the number of structures associated with that family, whilst in a protein sequence page the structure icon shows the number of structures that map to that sequence. The link for each icon is also context-dependent, taking the user to the most appropriate section of the page for the icon clicked.
Previously, it has been difficult to search Pfam by species or taxonomic division. In addition to the species tree found on each family page, which provides a breakdown of the species found in that family, we have implemented a new taxonomy search tool. As with the taxonomy search tool in the old Pfam website, the new tool returns a list of Pfam domains that match a Boolean query expression. For example, the query 'Caenorhabditis elegans AND NOT Homo sapiens' will return all Pfam domains that are found in C. elegans but not in H. sapiens. As well as being less error prone and significantly quicker than the version in the old Pfam website, the new taxonomy search tool also provides a feedback mechanism that suggests organism names as the user enters them. This reduces the likelihood of typographical or spelling errors in queries, since incorrectly entered species terms are immediately highlighted in the interface, and it also provides an insight into the organisms that are found in the database.
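A Boolean taxonomy query of this kind reduces to set operations over a species-to-domain index. The following Python sketch is an illustration under stated assumptions, not the website's implementation: the index `domains_by_species`, both function names and the domain labels are hypothetical.

```python
def domains_a_and_not_b(domains_by_species, include, exclude):
    """Return domains found in `include` but not in `exclude`."""
    found = domains_by_species.get(include, set())
    unwanted = domains_by_species.get(exclude, set())
    return found - unwanted


def suggest_species(domains_by_species, typed):
    """Suggest known species names that start with the text typed so far."""
    prefix = typed.strip().lower()
    return sorted(
        name for name in domains_by_species if name.lower().startswith(prefix)
    )


# Toy index (hypothetical domain labels, not real Pfam accessions).
domains_by_species = {
    "Caenorhabditis elegans": {"domA", "domB", "domC"},
    "Caenorhabditis briggsae": {"domA"},
    "Homo sapiens": {"domB", "domD"},
}

result = domains_a_and_not_b(
    domains_by_species, "Caenorhabditis elegans", "Homo sapiens"
)
print(sorted(result))
print(suggest_species(domains_by_species, "caeno"))
```

Here the query keeps `domA` and `domC`, and typing `caeno` suggests both Caenorhabditis species held in the toy index.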
A commonly requested capability for the Pfam site is the ability to find Pfam domains that are unique to a given taxonomic division or species. This feature is now available. For example, searching for unique Metazoa families returns a list of domains that are found only in metazoans.
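One way to picture a "unique to taxon" search is as the set of domains seen inside the taxon minus the set seen anywhere outside it. The sketch below assumes a hypothetical lineage table (`lineage_by_species`) and toy domain labels; it is not taken from the Pfam code-base.

```python
def unique_to_taxon(domains_by_species, lineage_by_species, taxon):
    """Return domains found only in species whose lineage includes `taxon`."""
    inside, outside = set(), set()
    for species, domains in domains_by_species.items():
        if taxon in lineage_by_species.get(species, ()):
            inside |= domains
        else:
            outside |= domains
    return inside - outside


# Toy data (hypothetical labels and simplified lineages).
lineage_by_species = {
    "Caenorhabditis elegans": ("Eukaryota", "Metazoa"),
    "Homo sapiens": ("Eukaryota", "Metazoa"),
    "Saccharomyces cerevisiae": ("Eukaryota", "Fungi"),
}
domains_by_species = {
    "Caenorhabditis elegans": {"domA", "domB"},
    "Homo sapiens": {"domB", "domC"},
    "Saccharomyces cerevisiae": {"domC", "domD"},
}

metazoa_only = unique_to_taxon(domains_by_species, lineage_by_species, "Metazoa")
print(sorted(metazoa_only))
```

`domC` is excluded because it also occurs in the fungal species, leaving `domA` and `domB`.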
In addition to the standard features of the old Pfam websites, such as search tools for quickly finding Pfam domains on a protein sequence or for locating sequences with a specified domain architecture, we have also introduced several new features in the new site, many of which use the Distributed Annotation System (DAS) (10) to aggregate multiple data sources in a single display.
The Distributed Annotation System
We have improved access to Pfam by providing data through the DAS. DAS is a system for disseminating annotations and alignments of DNA or protein sequences through a simple, web-based protocol. Three types of Pfam data are now available via DAS (11): domain annotations for both Pfam-A and Pfam-B families; sequence features such as active sites (12) and transmembrane region predictions; and seed and full alignments for Pfam-A families. The availability of Pfam data via DAS enables users to access specific parts of the database as a web service, without the need to download and install it in its entirety.
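As a rough illustration of what accessing domain annotations as a web service looks like, the Python sketch below builds a DAS-style `features` request and parses a feature list. The request layout (`/das/<source>/features?segment=<id>`) and the `FEATURE`, `TYPE`, `START` and `END` elements follow DAS 1.x conventions as commonly documented; the server address, the source name `pfam_example` and the sample response are invented for illustration, and no network request is made.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET


def features_url(server, source, segment):
    """Build a DAS 'features' request URL for one segment (sequence ID)."""
    query = urlencode({"segment": segment})
    return f"{server.rstrip('/')}/das/{source}/features?{query}"


def parse_features(xml_text):
    """Extract (label, type, start, end) tuples from a DAS feature response."""
    root = ET.fromstring(xml_text.strip())
    features = []
    for feat in root.iter("FEATURE"):
        features.append(
            (
                feat.get("label"),
                feat.findtext("TYPE"),
                int(feat.findtext("START")),
                int(feat.findtext("END")),
            )
        )
    return features


# Hand-written sample response (hypothetical content).
SAMPLE = """
<DASGFF><GFF><SEGMENT id="EXAMPLE_SEQ" start="1" stop="300">
  <FEATURE id="f1" label="domA">
    <TYPE id="Pfam-A">Pfam-A</TYPE><START>10</START><END>120</END>
  </FEATURE>
  <FEATURE id="f2" label="domB">
    <TYPE id="Pfam-B">Pfam-B</TYPE><START>150</START><END>280</END>
  </FEATURE>
</SEGMENT></GFF></DASGFF>
"""

url = features_url("http://das.example.org", "pfam_example", "EXAMPLE_SEQ")
print(url)
print(parse_features(SAMPLE))
```

A real client would fetch the URL and pass the response body to `parse_features`; the exact source names offered by a given server have to be looked up on that server.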
We have also been able to incorporate other data sources that are accessible through DAS, in order to enrich our own display of Pfam data. One feature of the new website is a DAS-based viewer for sequence annotations (Figure 2). This allows the user to view annotations of protein sequences from a wide range of third-party databases alongside information from Pfam itself. The viewer presents the standard Pfam domain structure image, showing the arrangement of Pfam domains on the sequence in question, and allows users to add or hide annotations from any of the available DAS sources. As the user moves their mouse over each feature, a tool-tip gives detailed information about it. If provided by the external DAS source, a link to further information is also given.
Another use of DAS within the new website is in the Pfam sequence alignment viewer. Pfam provides two alignments for every family: the seed alignment is a manually curated alignment of related sequences and generally contains a relatively small number of sequences; the full alignment is generated by searching the sequence database using the HMM for the family and may contain a very large number of sequences (the largest alignment, that of GP120, currently contains over 68 000 sequences). Historically, it has been difficult, if not impossible, to view the largest sequence alignments in a web browser, due simply to the size of the resulting web page. We have implemented a DAS-based sequence alignment viewer (shown in Figure 3) that is able to present even the largest alignments in manageable portions, by retrieving only the required section of the alignment and rendering it as HTML. This allows the user to scroll through wide alignments (those with long sequences) or to page through long alignments (those with a large number of sequences), without having to load the entire alignment into their browser. Alignments are coloured according to a pre-calculated consensus sequence, which is also retrieved via DAS, and in this way even alignment fragments can be marked-up using the properties of the whole alignment.
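The idea of serving an alignment in manageable portions can be sketched as slicing a rectangular block of rows and columns and marking it up against a consensus computed once for the whole alignment. In the Python sketch below, both function names, the toy alignment and the consensus string are hypothetical, and the upper/lower-case marking is a stand-in for the colouring used on the website.

```python
def alignment_window(rows, first_row, n_rows, first_col, n_cols):
    """Return a rectangular block of an alignment.

    rows: list of (sequence_id, aligned_string); indices are 0-based.
    """
    block = rows[first_row:first_row + n_rows]
    return [(sid, seq[first_col:first_col + n_cols]) for sid, seq in block]


def mark_against_consensus(block, consensus, first_col):
    """Upper-case residues that match the whole-alignment consensus.

    Mismatching residues are lower-cased; gap characters are left unchanged.
    """
    marked = []
    for sid, frag in block:
        cons = consensus[first_col:first_col + len(frag)]
        text = "".join(
            r if r == "-" else (r.upper() if r == c else r.lower())
            for r, c in zip(frag, cons)
        )
        marked.append((sid, text))
    return marked


# Toy alignment and pre-calculated consensus (both hypothetical).
rows = [("s1", "MKV-LA"), ("s2", "MRVTLA"), ("s3", "MKI-LG"), ("s4", "MKVTIA")]
consensus = "MKV-LA"

block = alignment_window(rows, first_row=1, n_rows=2, first_col=2, n_cols=3)
print(mark_against_consensus(block, consensus, first_col=2))
```

Because the consensus is sliced with the same column offset as the block, a fragment is marked exactly as it would be within the full alignment.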
| WIDENING THE SCOPE OF PFAM ANNOTATIONS |
|---|
NCBI GenPept sequences
The two main public repositories of protein sequence data are UniProtKB (4) and the GenPept database from NCBI (13). These two resources are independent of each other and consequently contain two separate, though often overlapping, sets of sequences, which are referred to by entirely separate sets of accessions. Historically, Pfam has been based on a sequence database termed pfamseq, which is a frozen version of the UniProtKB database that we update at each major Pfam release. This has caused problems for users wanting to retrieve Pfam data for a sequence for which they have only a GenPept accession or the NCBI GI number.
To make Pfam sequence annotation accessible via both UniProtKB and GenPept accessions, we now provide Pfam domain assignments for members of both sequence databases, each as a separate section of Pfam. Where possible, we transfer the annotation from UniProtKB sequences to the equivalent sequence in GenPept, by using the EMBL/GenBank cross-references (13,14) in the UniProtKB entry and ensuring that the CRC64 checksums of the sequences from the two databases are the same. For example, of the five listed protein identifiers cross-referenced by the UniProtKB accession P51003 (AAH00927.1, AAH36014.1, CAD62628.1, CAD66560.1, CAD61935.1), only one GenPept protein (AAH36014.1) has the same CRC64 checksum as the UniProtKB entry. As a quality control procedure, we make use of the UniProtKB/Swiss-Prot mapping provided by GenPept to perform the reverse mapping. Overall, this mapping procedure provides a UniProtKB equivalent for around 75% of sequences in GenPept. We search the remaining GenPept sequences against the library of Pfam HMMs and use the pre-defined, curated thresholds to determine which sequences should be included in the full alignment for a family. Two or more families belonging to the same clan may match the same sequence region. We resolve such overlaps using the same method as we use when resolving overlapping matches between clan families that have been searched against the UniProtKB sequence database (2). This ensures that there are no overlapping sequences between families that belong to the same clan.
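The checksum-based transfer step can be sketched as follows. This is a minimal Python illustration, not the Pfam pipeline: the accessions and sequences are invented, and `zlib.crc32` is used purely as a stand-in for CRC64 because the Python standard library does not provide a CRC64 function.

```python
import zlib


def checksum(seq):
    """Stand-in checksum: CRC32 here, where the text above uses CRC64."""
    return zlib.crc32(seq.encode("ascii"))


def matching_xrefs(uniprot_seq, xrefs, genpept_seqs):
    """Return cross-referenced GenPept IDs whose sequence checksum equals
    that of the UniProtKB sequence; IDs missing from genpept_seqs are skipped."""
    target = checksum(uniprot_seq)
    return [
        acc
        for acc in xrefs
        if acc in genpept_seqs and checksum(genpept_seqs[acc]) == target
    ]


# Toy data (hypothetical accessions and sequences).
uniprot_seq = "MKVLAAGIT"
genpept_seqs = {
    "GP_A.1": "MKVLAAGIT",  # identical to the UniProtKB sequence
    "GP_B.1": "MKVLAAGIS",  # differs at the last residue
    "GP_C.1": "MKVLAAGIA",  # differs at the last residue
}
xrefs = ["GP_A.1", "GP_B.1", "GP_C.1", "GP_D.1"]

print(matching_xrefs(uniprot_seq, xrefs, genpept_seqs))
```

Only the cross-reference with an identical sequence survives the check, mirroring the situation described for the five identifiers linked to a single UniProtKB entry.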
The Pfam domain annotations and alignments for GenPept (release 158) are available for download in a flat-file format (Pfam-A.full.ncbi), as an ASCII representation of the domain matches on each sequence (similar to the swisspfam file) and in the Pfam MySQL database. We also provide access to these data via the websites, where GenBank identifiers (GI numbers) or GenPept protein identifiers can be entered into the 'jump to' box. Searching in this way will take users to a page that is similar to the protein pages produced for UniProtKB sequences. One caveat when using these data is that we do not resolve overlaps between domains for GenPept sequences. However, since we curate Pfam-A domain thresholds in a conservative manner to ensure high specificity (at the expense of some sensitivity), we expect the number of domain overlaps for the GenPept data to be low. Furthermore, three-quarters of the sequences in GenPept are identical to a UniProtKB entry, and the domain assignments for UniProtKB entries are guaranteed to be non-overlapping.
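Users who want to check how often unresolved overlaps occur in a downloaded set of domain matches could count them per sequence. The following Python sketch assumes matches have already been parsed into `(family, start, end)` tuples with 1-based inclusive coordinates; the function name and toy matches are hypothetical.

```python
def overlapping_pairs(matches):
    """Return pairs of families whose matches overlap on one sequence.

    matches: list of (family, start, end), 1-based inclusive coordinates.
    """
    ordered = sorted(matches, key=lambda m: (m[1], m[2]))
    pairs = []
    for i, (fam_a, _start_a, end_a) in enumerate(ordered):
        for fam_b, start_b, _end_b in ordered[i + 1:]:
            if start_b > end_a:
                break  # matches are sorted by start, so none further overlap
            if fam_a != fam_b:
                pairs.append((fam_a, fam_b))
    return pairs


# Toy matches on a single sequence (hypothetical families).
matches = [("famA", 1, 100), ("famB", 90, 150), ("famC", 160, 200)]
print(overlapping_pairs(matches))
```

In the toy data only `famA` and `famB` share residues (positions 90 to 100), so a single overlapping pair is reported.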
Metagenomic samples
Current microbiological culturing techniques are inadequate for studying the vast majority of microorganisms (15). Consequently, many organisms remain under-represented in the main sequence databases. An emerging field in biology is metagenomics, the analysis of genomic material from environmental samples. Recently, with the advent of better sequencing technologies, large samples from environments such as the sea have been sequenced directly, thereby avoiding the need for culturing. Sequencing using this approach gives rise to many sequences from a diverse set of organisms, albeit at low read coverage and with no knowledge of the source organisms. In addition, when compared with proteins in UniProtKB, the sequences from metagenomic samples are more fragmentary.
Within Pfam we have collected together several available metagenomic datasets and amalgamated them into a single sequence collection (listed on the ftp site), which we have termed metaseq. Currently, the only centralized public repository for such sequences that we are aware of is the UniProt Metagenomic and Environmental Sequence database (UniMes). However, UniMes currently only contains data from the Global Ocean Sampling (GOS) Expedition (16), the largest and most publicized of such environmental sampling projects. When the UniMes database becomes more comprehensive, we will use this as the underlying sequence database.
There are currently more than 6.6 million sequences in the metaseq database, making it significantly larger than the current version of pfamseq, which contains around 4.5 million sequences. We have searched the sequences in metaseq against the library of Pfam HMMs. All sequence regions that score above the pre-defined curated thresholds have been recorded, and for each family, the significant matches aligned, in the same manner as we generate our full alignments. These domain annotations and alignments are available both in flat-file formats (as with the NCBI GenPept dataset) and in the MySQL database. The metagenomics domain annotations can also be retrieved via the website. Similar to the NCBI dataset, metaseq accessions and identifiers can be used to retrieve a graphical representation of the sequence and Pfam domains (if any have been found). As with the GenPept data, we have not resolved any overlaps, but, unlike the situation with the GenPept data, we have not competed the overlapping sequence hits for families within clans, which means that there will inevitably be some overlapping hits between families that belong to the same clan.
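The thresholding step described above can be pictured as a per-family filter over search hits. The Python sketch below is a simplification under stated assumptions: it uses a single score cut-off per family and treats a score equal to the cut-off as significant, whereas the curated Pfam thresholds may be more detailed; the function name, families, scores and cut-offs are invented.

```python
from collections import defaultdict


def significant_matches(hits, thresholds):
    """Group hits that reach their family's threshold.

    hits: iterable of (family, seq_id, start, end, score).
    thresholds: dict mapping family -> minimum score (assumed inclusive).
    Returns a dict mapping family -> list of (seq_id, start, end).
    """
    by_family = defaultdict(list)
    for family, seq_id, start, end, score in hits:
        if score >= thresholds[family]:
            by_family[family].append((seq_id, start, end))
    return dict(by_family)


# Toy search results (hypothetical families, sequences and scores).
hits = [
    ("famA", "m1", 5, 80, 30.2),
    ("famA", "m2", 1, 60, 12.0),
    ("famB", "m1", 90, 150, 25.0),
]
thresholds = {"famA": 20.0, "famB": 25.0}

print(significant_matches(hits, thresholds))
```

The regions kept for each family are the ones that would then be aligned to form that family's alignment for the dataset.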
The metagenomics dataset contains many novel protein sequences, which are currently unannotated. This section within Pfam enables the community to assess our current understanding of the domain composition found in such environmental datasets. Uniquely, users will also be able to access domain alignments that can be compared to those historically found in Pfam. It was estimated that there are over 1000 new protein families that are not currently represented by Pfam in the GOS dataset alone (17). Thus, this dataset will provide a potential source of new Pfam families and/or allow verification of families where there are few representatives found in UniProtKB.
| SUMMARY |
|---|
In the last 2 years the Pfam database has continued to grow, improving both coverage and quality of families. In particular, we have widened the scope of Pfam to include sequences from the GenPept database, as well as providing matches to new metagenomics sequence data. We have also developed a new website that provides a unified view for the primary Pfam consortium sites. Pfam has been curating protein families for over 10 years, but there is still much to be done to provide a complete and accurate classification of proteins.
| AVAILABILITY |
|---|
Pfam data can be downloaded directly from the WTSI FTP site (ftp://ftp.sanger.ac.uk/pub/databases/Pfam), either as flat files or in the form of MySQL table dumps. The new Pfam websites can be visited at WTSI (http://pfam.sanger.ac.uk/), Stockholm Bioinformatics Center (http://pfam.sbc.su.se/) and Janelia Farm (http://pfam.janelia.org/). The source code for the website can be retrieved by CVS from the WTSI CVS repository, which can be browsed at http://cvs.sanger.ac.uk/cgi-bin/viewcvs.cgi/?root=PfamWeb. Instructions for downloading the code directly from CVS are available at http://cvs.sanger.ac.uk/cvs.users.shtml.
| ACKNOWLEDGEMENTS |
|---|
The authors would like to thank Roger Pettett and Jody Clements for useful discussions on the design and implementation of the new website. We are grateful for the infrastructure support provided by Guy Coates, Tim Cutts and Andy Bryant at Wellcome Trust Sanger Institute (WTSI). Finally, we would like to thank all of the users of Pfam who have submitted new families and/or annotation updates for existing entries. R.D.F., J.T., J.M., P.C.C., S.J.S., H.-R.H. and A.B. are funded by The Wellcome Trust. R.D.F. and J.M. were funded partly by an MRC (UK) E-science grant (G0100305). S.R.E. and G.C. are supported by the Howard Hughes Medical Institute. K.F. and E.L.L.S. are funded by Stockholm University, Royal Institute of Technology and the Swedish Natural Sciences Research Council. Funding to pay the Open Access publication charges for this article was provided by The Wellcome Trust.
Conflict of interest statement. None declared.
| REFERENCES |
|---|
- Sonnhammer ELL, Eddy SR, Durbin R. Pfam: a comprehensive database of protein domain families based on seed alignments. Proteins (1997) 28:405–420.
- Finn RD, Mistry J, Schuster-Bockler B, Griffiths-Jones S, Hollich V, Lassmann T, Moxon S, Marshall M, Khanna A, et al. Pfam: clans, web tools and services. Nucleic Acids Res. (2006) 34:D247–D251.
- Bru C, Courcelle E, Carrere S, Beausse Y, Dalmar S, Kahn D. The ProDom database of protein domain families: more emphasis on 3D. Nucleic Acids Res. (2005) 33:D212–D215.
- Bairoch A, Apweiler R, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, et al. The Universal Protein Resource (UniProt). Nucleic Acids Res. (2007) 35:D193–D197.
- Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. The Protein Data Bank. Nucleic Acids Res. (2000) 28:235–242.
- Andreeva A, Howorth D, Brenner SE, Hubbard TJ, Chothia C, Murzin AG. SCOP database in 2004: refinements integrate structure and sequence family data. Nucleic Acids Res. (2004) 32:D226–D229.
- Soding J. Protein homology detection by HMM-HMM comparison. Bioinformatics (2005) 21:951–960.
- Bateman A, Finn RD. SCOOP: a simple method for identification of novel protein superfamily relationships. Bioinformatics (2007) 23:809–814.
- Finn RD, Marshall M, Bateman A. iPfam: visualization of protein–protein interactions in PDB at domain and amino acid resolutions. Bioinformatics (2005) 21:410–412.
- Dowell RD, Jokerst RM, Day A, Eddy SR, Stein L. The distributed annotation system. BMC Bioinformatics (2001) 2:7.
- Finn RD, Stalker JW, Jackson DK, Kulesha E, Clements J, Pettett R. ProServer: a simple, extensible Perl DAS server. Bioinformatics (2007) 23:1568–1570.
- Mistry J, Bateman A, Finn RD. Predicting active site residue annotations in the Pfam Database. BMC Bioinformatics (2007) 8:298.
- Wheeler DL, Barrett T, Benson DA, Bryant SH, Canese K, Chetvernin V, Church DM, DiCuccio M, Edgar R, et al. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. (2007) 35:D5–D12.
- Kulikova T, Akhtar R, Aldebert P, Althorpe N, Andersson M, Baldwin A, Bates K, Bhattacharyya S, Bower L, et al. EMBL Nucleotide Sequence Database in 2006. Nucleic Acids Res. (2007) 35:D16–D20.
- Schloss PD, Handelsman J. Metagenomics for studying unculturable microorganisms: cutting the Gordian knot. Genome Biol. (2005) 6:229.
- Rusch DB, Halpern AL, Sutton G, Heidelberg KB, Williamson S, Yooseph S, Wu D, Eisen JA, Hoffman JM, et al. The Sorcerer II Global Ocean Sampling expedition: northwest Atlantic through eastern tropical Pacific. PLoS Biol. (2007) 5:e77.
- Yooseph S, Sutton G, Rusch DB, Halpern AL, Williamson SJ, Remington K, Eisen JA, Heidelberg KB, Manning G, et al. The Sorcerer II Global Ocean Sampling expedition: expanding the universe of protein families. PLoS Biol. (2007) 5:e16.
- Kall L, Krogh A, Sonnhammer EL. A combined transmembrane topology and signal peptide prediction method. J. Mol. Biol. (2004) 338:1027–1036.
- Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, et al. ClustalW and ClustalX version 2.0. Bioinformatics (2007) 23:2947–2948.
||||
![]() |
M. A. Grobei, E. Qeli, E. Brunner, H. Rehrauer, R. Zhang, B. Roschitzki, K. Basler, C. H. Ahrens, and U. Grossniklaus Deterministic protein inference for shotgun proteomics data provides new insights into Arabidopsis pollen development and function Genome Res., October 1, 2009; 19(10): 1786 - 1800. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. J. Sharpton, J. E. Stajich, S. D. Rounsley, M. J. Gardner, J. R. Wortman, V. S. Jordar, R. Maiti, C. D. Kodira, D. E. Neafsey, Q. Zeng, et al. Comparative genomic analyses of the human fungal pathogens Coccidioides and their relatives Genome Res., October 1, 2009; 19(10): 1722 - 1731. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Forslund and E. L. Sonnhammer Benchmarking homology detection procedures with low complexity filters Bioinformatics, October 1, 2009; 25(19): 2500 - 2505. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Liu, X.-w. Chen, and R. Jothi Knowledge-guided inference of domain-domain interactions from incomplete protein-protein interaction networks Bioinformatics, October 1, 2009; 25(19): 2492 - 2499. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. P. Huntley, D. Binns, E. Dimmer, D. Barrell, C. O'Donovan, and R. Apweiler QuickGO: a user tutorial for the web-based Gene Ontology browser Database, September 30, 2009; 2009(0): bap010 - bap010. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. C. da Cunha, P. A. F. Galante, J. E. de Souza, R. F. de Souza, P. M. Carvalho, D. T. Ohara, R. P. Moura, S. M. Oba-Shinja, S. K. N. Marie, W. A. Silva Jr, et al. Bioinformatics construction of the human cell surfaceome PNAS, September 29, 2009; 106(39): 16752 - 16757. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Steinbiss, U. Willhoeft, G. Gremme, and S. Kurtz Fine-grained annotation and classification of de novo predicted LTR retrotransposons Nucleic Acids Res., September 28, 2009; (2009) gkp759v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Izumi, A. M. Sweeney, D. DeMartini, J. C. Weaver, M. L. Powers, A. Tao, T. V. Silvas, R. M. Kramer, W. J. Crookes-Goodson, L. M. Mathger, et al. Changes in reflectin protein phosphorylation are associated with dynamic iridescence in squid J R Soc Interface, September 23, 2009; (2009) rsif.2009.0299v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Albanesi, M. Martin, F. Trajtenberg, M. C. Mansilla, A. Haouz, P. M. Alzari, D. de Mendoza, and A. Buschiazzo Structural plasticity and catalysis regulation of a thermosensor histidine kinase PNAS, September 22, 2009; 106(38): 16185 - 16190. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Delmotte, C. Knief, S. Chaffron, G. Innerebner, B. Roschitzki, R. Schlapbach, C. von Mering, and J. A. Vorholt Community proteogenomics reveals insights into the physiology of phyllosphere bacteria PNAS, September 22, 2009; 106(38): 16428 - 16433. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Contreras-Moreira 3D-footprint: a database for the structural analysis of protein-DNA complexes Nucleic Acids Res., September 18, 2009; (2009) gkp781v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Campuzano, B. Serra, D. Llull, J. L. Garcia, and P. Garcia Cloning, Expression, and Characterization of a Peculiar Choline-Binding {beta}-Galactosidase from Streptococcus mitis Appl. Envir. Microbiol., September 15, 2009; 75(18): 5972 - 5980. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ernst, S. L. Sazinsky, S. Hui, B. Currell, M. Dharsee, S. Seshagiri, G. D. Bader, and S. S. Sidhu Rapid Evolution of Functional Complexity in a Domain Family Sci. Signal., September 8, 2009; 2(87): ra50 - ra50. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. C. Friedel, L. Dolken, Z. Ruzsics, U. H. Koszinowski, and R. Zimmer Conserved principles of mammalian transcriptional regulation revealed by RNA half-life Nucleic Acids Res., September 1, 2009; 37(17): e115 - e115. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-Q. Shen, B. F. Lang, and G. Burger Diversity and dispersal of a ubiquitous protein family: acyl-CoA dehydrogenases Nucleic Acids Res., September 1, 2009; 37(17): 5619 - 5631. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Xiong, T. Li, K. Chen, and K. Tang Local combinational variables: an approach used in DNA-binding helix-turn-helix motif prediction with sequence information Nucleic Acids Res., September 1, 2009; 37(17): 5632 - 5640. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Liu, D. Li, J. Wang, H. Xie, Y. Zhu, and F. He Proteome-wide Prediction of Signal Flow Direction in Protein Interaction Networks Based on Interacting Domains Mol. Cell. Proteomics, September 1, 2009; 8(9): 2063 - 2070. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Mutwil, C. Ruprecht, F. M. Giorgi, M. Bringmann, B. Usadel, and S. Persson Transcriptional Wiring of Cell Wall-Related Genes in Arabidopsis Mol Plant, September 1, 2009; 2(5): 1015 - 1024. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Conde e Silva, I. R. Goncalves, M. Lemaire, E. Lesuisse, J. M. Camadro, and P. L. Blaiseau KlAft, the Kluyveromyces lactis Ortholog of Aft1 and Aft2, Mediates Activation of Iron-Responsive Transcription Through the PuCACCC Aft-Type Sequence Genetics, September 1, 2009; 183(1): 93 - 106. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Mitsunaga, J. Harada-Itadani, T. Shikanai, H. Tateno, Y. Ikehara, J. Hirabayashi, H. Narimatsu, and T. Angata Human C21orf63 is a Heparin-binding Protein J. Biochem., September 1, 2009; 146(3): 369 - 373. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Rambaldi and F. D. Ciccarelli FancyGene: dynamic visualization of gene structures and protein domain architectures on genomic loci Bioinformatics, September 1, 2009; 25(17): 2281 - 2282. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. M. Markowitz, K. Mavromatis, N. N. Ivanova, I-M. A. Chen, K. Chu, and N. C. Kyrpides IMG ER: a system for microbial genome annotation expert review and curation Bioinformatics, September 1, 2009; 25(17): 2271 - 2278. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Teotia and R. S. Lamb The Paralogous Genes RADICAL-INDUCED CELL DEATH1 and SIMILAR TO RCD ONE1 Have Partially Redundant Functions during Arabidopsis Development Plant Physiology, September 1, 2009; 151(1): 180 - 198. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Uehara, T. Dinh, and T. G. Bernhardt LytM-Domain Factors Are Required for Daughter Cell Separation and Rapid Ampicillin-Induced Lysis in Escherichia coli J. Bacteriol., August 15, 2009; 191(16): 5094 - 5107. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Choudhary, C. Kumar, F. Gnad, M. L. Nielsen, M. Rehman, T. C. Walther, J. V. Olsen, and M. Mann Lysine Acetylation Targets Protein Complexes and Co-Regulates Major Cellular Functions Science, August 14, 2009; 325(5942): 834 - 840. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Guo, Y. E. Chong, K. Beebe, R. Shapiro, X.-L. Yang, and P. Schimmel The C-Ala Domain Brings Together Editing and Aminoacylation Functions on One tRNA Science, August 7, 2009; 325(5941): 744 - 747. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. H. Christensen and J. E. Cronan The Thermoplasma acidophilum LplA-LplB Complex Defines a New Class of Bipartite Lipoate-protein Ligases J. Biol. Chem., August 7, 2009; 284(32): 21317 - 21326. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Wissler, E. Dattolo, A. D. Moore, T. B. H. Reusch, J. L. Olsen, M. Migliaccio, E. Bornberg-Bauer, and G. Procaccini Dr. Zompo: an online data repository for Zostera marina and Posidonia oceanica ESTs Database, August 4, 2009; 2009(0): bap009 - bap009. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Piotrowski, P. Burghout, and D. A. Morrison spr1630 Is Responsible for the Lethality of clpX Mutations in Streptococcus pneumoniae J. Bacteriol., August 1, 2009; 191(15): 4888 - 4895. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. McMahon, G. A. Roberts, K. A. Johnson, L. P. Cooper, H. Liu, J. H. White, L. G. Carter, B. Sanghvi, M. Oke, M. D. Walkinshaw, et al. Extensive DNA mimicry by the ArdA anti-restriction protein and its role in the spread of antibiotic resistance Nucleic Acids Res., August 1, 2009; 37(15): 4887 - 4897. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. F. Neuwald Rapid detection, classification and accurate alignment of up to a million or more related protein sequences Bioinformatics, August 1, 2009; 25(15): 1869 - 1875. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Tejomurtula, K.-B. Lee, S. K. Tripurani, G. W. Smith, and J. Yao Role of Importin Alpha8, a New Member of the Importin Alpha Family of Nuclear Transport Proteins, in Early Embryonic Development in Cattle Biol Reprod, August 1, 2009; 81(2): 333 - 342. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Setubal, P. dos Santos, B. S. Goldman, H. Ertesvag, G. Espin, L. M. Rubio, S. Valla, N. F. Almeida, D. Balasubramanian, L. Cromes, et al. Genome Sequence of Azotobacter vinelandii, an Obligate Aerobe Specialized To Support Diverse Anaerobic Metabolic Processes J. Bacteriol., July 15, 2009; 191(14): 4534 - 4545. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-J. Pan, C.-C. Cho, Y.-Y. Kao, and C.-H. Sun A Novel WRKY-like Protein Involved in Transcriptional Activation of Cyst Wall Protein Genes in Giardia lamblia J. Biol. Chem., July 3, 2009; 284(27): 17975 - 17988. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kimura, Y. Shiraiwa, and I. Suzuki Function of the N-terminal region of the phosphate-sensing histidine kinase, SphS, in Synechocystis sp. PCC 6803 Microbiology, July 1, 2009; 155(7): 2256 - 2264. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. W. Brandt and J. Heringa webPRC: the Profile Comparer for alignment-based searching of public domain databases Nucleic Acids Res., July 1, 2009; 37(suppl_2): W48 - W52. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. I. Sadreyev, M. Tang, B.-H. Kim, and N. V. Grishin COMPASS server for homology detection: improved statistical accuracy, speed and functionality Nucleic Acids Res., July 1, 2009; 37(suppl_2): W90 - W94. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-C. Chen, C.-Y. Lin, Y.-S. Lo, and J.-M. Yang PPISearch: a web server for searching homologous protein-protein interactions across multiple species Nucleic Acids Res., July 1, 2009; 37(suppl_2): W369 - W375. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Bruggeman, J. Heringa, and B. W. Brandt PhyloPars: estimation of missing parameter values using phylogeny Nucleic Acids Res., July 1, 2009; 37(suppl_2): W179 - W184. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Rose, S. Lorenzen, A. Goede, B. Gruening, and P. W. Hildebrand RHYTHM--a server to predict the orientation of transmembrane helices in channels and membrane-coils Nucleic Acids Res., July 1, 2009; 37(suppl_2): W575 - W580. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Shridhar, D. Chattopadhyay, and G. Yadav PLecDom: a program for identification and analysis of plant lectin domains Nucleic Acids Res., July 1, 2009; 37(suppl_2): W452 - W458. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. T.-H. Chang, T.-Y. Chien, and C.-Y. Chen seeMotif: exploring and visualizing sequence motifs in 3D structures Nucleic Acids Res., July 1, 2009; 37(suppl_2): W552 - W558. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kuzniar, K. Lin, Y. He, H. Nijveen, S. Pongor, and J. A. M. Leunissen ProGMap: an integrated annotation resource for protein orthology Nucleic Acids Res., July 1, 2009; 37(suppl_2): W428 - W434. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Blankenburg, F. Ramirez, J. Buch, and M. Albrecht DASMIweb: online integration, analysis and assessment of distributed protein interaction data Nucleic Acids Res., July 1, 2009; 37(suppl_2): W122 - W128. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Sandoval-Calderon, O. Geiger, Z. Guan, F. Barona-Gomez, and C. Sohlenkamp A Eukaryote-like Cardiolipin Synthase Is Present in Streptomyces coelicolor and in Most Actinobacteria J. Biol. Chem., June 26, 2009; 284(26): 17383 - 17390. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Ehira, H. Ogino, H. Teramoto, M. Inui, and H. Yukawa Regulation of Quinone Oxidoreductase by the Redox-sensing Transcriptional Regulator QorR in Corynebacterium glutamicum J. Biol. Chem., June 19, 2009; 284(25): 16736 - 16742. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Fornes, R. Aragues, J. Espadaler, M. A. Marti-Renom, A. Sali, and B. Oliva ModLink+: improving fold recognition by using protein-protein interactions Bioinformatics, June 15, 2009; 25(12): 1506 - 1512. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Ramana and D. Gupta ProtVirDB: a database of protozoan virulent proteins Bioinformatics, June 15, 2009; 25(12): 1568 - 1569. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Guo and A. J. Hartemink Domain-oriented edge-based alignment of protein interaction networks Bioinformatics, June 15, 2009; 25(12): i240 - 1246. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Han, J. M. Burnette III, and S. R. Wessler TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences Nucleic Acids Res., June 1, 2009; 37(11): e78 - e78. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. N. Messina and E. L. L. Sonnhammer DASher: a stand-alone protein sequence client for DAS, the Distributed Annotation System Bioinformatics, May 15, 2009; 25(10): 1333 - 1334. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Blankenburg, R. D. Finn, A. Prlic, A. M. Jenkinson, F. Ramirez, D. Emig, S.-E. Schelhorn, J. Buch, T. Lengauer, and M. Albrecht DASMI: exchanging, annotating and assessing molecular interaction data Bioinformatics, May 15, 2009; 25(10): 1321 - 1328. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Minasov, S. Padavattan, L. Shuvalova, J. S. Brunzelle, D. J. Miller, A. Basle, C. Massa, F. R. Collart, T. Schirmer, and W. F. Anderson Crystal Structures of YkuI and Its Complex with Second Messenger Cyclic Di-GMP Suggest Catalytic Mechanism of Phosphodiester Bond Cleavage by EAL Domains J. Biol. Chem., May 8, 2009; 284(19): 13174 - 13184. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. K. Basu, E. Poliakov, and I. B. Rogozin Domain mobility in proteins: functional and evolutionary implications Brief Bioinform, May 1, 2009; 10(3): 205 - 216. [Abstract] [Full Text] [PDF] |
||||