Published online 12 October 2005
Article |
Specificity prediction of adenylation domains in nonribosomal peptide synthetases (NRPS) using transductive support vector machines (TSVMs)
Center for Bioinformatics Tübingen (ZBIT), University of Tübingen Germany 1Department of Microbiology/Biotechnology, University of Tübingen Germany
*To whom correspondence should be addressed. Tel: +49 7071 29 70454; Fax: +49 7071 29 5148; Email: rausch{at}informatik.uni-tuebingen.de
Received June 3, 2005. Revised July 29, 2005. Accepted September 20, 2005.
We present a new support vector machine (SVM)-based approach to predict the substrate specificity of subtypes of a given protein sequence family. We demonstrate the usefulness of this method on the example of aryl acid-activating and amino acid-activating adenylation domains (A domains) of nonribosomal peptide synthetases (NRPS). The residues of gramicidin synthetase A that are 8 Å around the substrate amino acid and corresponding positions of other adenylation domain sequences with 397 known and unknown specificities were extracted and used to encode this physico-chemical fingerprint into normalized real-valued feature vectors based on the physico-chemical properties of the amino acids. The SVM software package SVMlight was used for training and classification, with transductive SVMs to take advantage of the information inherent in unlabeled data. Specificities for very similar substrates that frequently show cross-specificities were pooled to the so-called composite specificities and predictive models were built for them. The reliability of the models was confirmed in cross-validations and in comparison with a currently used sequence-comparison-based method. When comparing the predictions for 1230 NRPS A domains that are currently detectable in UniProt, the new method was able to give a specificity prediction in an additional 18% of the cases compared with the old method. For 70% of the sequences both methods agreed, for <6% they did not, mainly on low-confidence predictions by the existing method. None of the predictive methods could infer any specificity for 2.4% of the sequences, suggesting completely new types of specificity.
Correspondence may also be addressed to Daniel H. Huson. Email: huson{at}informatik.uni-tuebingen.de
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. J. Allender, G. R. LeCleir, J. M. Rinta-Kanto, R. L. Small, M. F. Satchwell, G. L. Boyer, and S. W. Wilhelm Identifying the Source of Unknown Microcystin Genes and Predicting Microcystin Variants by Comparing Genes within Uncultured Cyanobacterial Cells Appl. Envir. Microbiol., June 1, 2009; 75(11): 3598 - 3604. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Ishida, M. Welker, G. Christiansen, S. Cadel-Six, C. Bouchier, E. Dittmann, C. Hertweck, and N. Tandeau de Marsac Plasticity and Evolution of Aeruginosin Biosynthesis in Cyanobacteria Appl. Envir. Microbiol., April 1, 2009; 75(7): 2017 - 2026. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Ortel and U. Keller Combinatorial Assembly of Simple and Complex D-Lysergic Acid Alkaloid Peptide Classes in the Ergot Fungus Claviceps purpurea J. Biol. Chem., March 13, 2009; 284(11): 6650 - 6660. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Starcevic, J. Zucko, J. Simunkovic, P. F. Long, J. Cullum, and D. Hranueli ClustScan: an integrated program package for the semi-automatic annotation of modular biosynthetic gene clusters and in silico prediction of novel chemical structures Nucleic Acids Res., December 1, 2008; 36(21): 6882 - 6892. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. R. Waterfield, M. Sanchez-Contreras, I. Eleftherianos, A. Dowling, G. Yang, P. Wilkinson, J. Parkhill, N. Thomson, S. E. Reynolds, H. B. Bode, et al. Rapid Virulence Annotation (RVA): Identification of virulence factors using a bacterial genome library and multiple invertebrate hosts PNAS, October 14, 2008; 105(41): 15967 - 15972. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. J. Dimise, P. F. Widboom, and S. D. Bruner Structure elucidation and biosynthesis of fuscachelins, peptide siderophores from the moderate thermophile Thermobifida fusca PNAS, October 7, 2008; 105(40): 15311 - 15316. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-S. Moon, B. G. G. Donzelli, S. B. Krasnoff, H. McLane, M. H. Griggs, P. Cooke, J. D. Vandenberg, D. M. Gibson, and A. C. L. Churchill Agrobacterium-Mediated Disruption of a Nonribosomal Peptide Synthetase Gene in the Invertebrate Pathogen Metarhizium anisopliae Reveals a Peptide Spore Factor Appl. Envir. Microbiol., July 15, 2008; 74(14): 4366 - 4380. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. L. Challis Mining microbial genomes for new natural products and biosynthetic pathways Microbiology, June 1, 2008; 154(6): 1555 - 1569. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Eys, D. Schwartz, W. Wohlleben, and E. Schinko Three Thioesterases Are Involved in the Biosynthesis of Phosphinothricin Tripeptide in Streptomyces viridochromogenes Tu494 Antimicrob. Agents Chemother., May 1, 2008; 52(5): 1686 - 1696. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Neuhof, R. Dieckmann, I. S. Druzhinina, C. P. Kubicek, and H. von Dohren Intact-cell MALDI-TOF mass spectrometry analysis of peptaibol formation by the genus Trichoderma/Hypocrea: can molecular phylogeny of species predict peptaibol structures? Microbiology, October 1, 2007; 153(10): 3417 - 3437. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Stack, C. Neville, and S. Doyle Nonribosomal peptide synthesis in Aspergillus fumigatus and other fungi Microbiology, May 1, 2007; 153(5): 1297 - 1306. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Balado, C. R. Osorio, and M. L. Lemos A gene cluster involved in the biosynthesis of vanchrobactin, a chromosome-encoded siderophore produced by Vibrio anguillarum Microbiology, December 1, 2006; 152(12): 3517 - 3528. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. G. Van Lanen, S. Lin, P. C. Dorrestein, N. L. Kelleher, and B. Shen Substrate Specificity of the Adenylation Enzyme SgcC1 Involved in the Biosynthesis of the Enediyne Antitumor Antibiotic C-1027 J. Biol. Chem., October 6, 2006; 281(40): 29633 - 29640. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Yin and T. M. Zabriskie The enduracidin biosynthetic gene cluster from Streptomyces fungicidicus. Microbiology, October 1, 2006; 152(Pt 10): 2969 - 2983. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Agnoli, C. A. Lowe, K. L. Farmer, S. I. Husnain, and M. S. Thomas The Ornibactin Biosynthesis and Transport Genes of Burkholderia cenocepacia Are Regulated by an Extracytoplasmic Function {sigma} Factor Which Is a Part of the Fur Regulon. J. Bacteriol., May 1, 2006; 188(10): 3631 - 3644. [Abstract] [Full Text] [PDF] |
||||






