Skip Navigation



Nucleic Acids Research Advance Access published online on April 28, 2008

Nucleic Acids Research, doi:10.1093/nar/gkn194
This Article
Right arrow Abstract Freely available
Right arrow Print PDF (362K) Freely available
Right arrow Screen PDF (367K) Freely available
Right arrowOA All Versions of this Article:
36/suppl_2/W481    most recent
gkn194v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Suhre, K.
Right arrow Articles by Schmitt-Kopplin, P.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Suhre, K.
Right arrow Articles by Schmitt-Kopplin, P.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2008 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.


Web Server Issue

MassTRIX: mass translator into pathways

Karsten Suhre1,2,* and Philippe Schmitt-Kopplin3

1Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München, German Research Center for Environmental Health (GmbH), Ingolstädter Landstraße 1, 85764 Neuherberg, 2Department of Biology, University of Munich (LMU), Großhaderner Straße 2, 82152 Planegg-Martinsried and 3Institute of Ecological Chemistry, Helmholtz Zentrum München, German Research Center for Environmental Health (GmbH), Ingolstädter Landstraße 1, 85764 Neuherberg, Germany

*To whom correspondence should be addressed. Tel: +49 89 3187 2627; Fax: +49 89 3187 3585; Email: karsten.suhre{at}helmholtz-muenchen.de

Received January 30, 2008. Revised March 26, 2008. Accepted April 2, 2008.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 USING THE WEB SERVER
 INTERPRETING THE RESULTS
 CONCLUDING REMARKS
 REFERENCES
 
Recent technical advances in mass spectrometry (MS) have brought the field of metabolomics to a point where large numbers of metabolites from numerous prokaryotic and eukaryotic organisms can now be easily and precisely detected. The challenge today lies in the correct annotation of these metabolites on the basis of their accurate measured masses. Assignment of bulk chemical formula is generally possible, but without consideration of the biological and genomic context, concrete metabolite annotations remain difficult and uncertain. MassTRIX responds to this challenge by providing a hypothesis-driven approach to high precision MS data annotation. It presents the identified chemical compounds in their genomic context as differentially colored objects on KEGG pathway maps. Information on gene transcription or differences in the gene complement (e.g. samples from different bacterial strains) can be easily added. The user can thus interpret the metabolic state of the organism in the context of its potential and, in the case of submitted transcriptomics data, real enzymatic capacities. The MassTRIX web server is freely accessible at http://masstrix.org


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 USING THE WEB SERVER
 INTERPRETING THE RESULTS
 CONCLUDING REMARKS
 REFERENCES
 
Metabolomics is the systems level approach to the quest of understanding all relevant metabolic processes of an organism. The starting point of any metabolomics study is the measurement of the largest possible set of all naturally occurring organic compounds in a biological sample. It is assumed that the substances that are identified represent the principal metabolites of the organism under study. In contrast to genomics (yielding the proteins that an organism can make), transcriptomics (indicating the proteins an organism intends to make), and proteomics (giving the proteins that are made), metabolomics ideally represents a true endpoint to the classical chain of ‘omics’ technologies in the paradigm of life (DNA–RNA–protein–metabolites), by quantifying the outcome of the action of all regulatory and enzymatic processes that are active in a cell at any given time (1–4).

Recent technology advances in ionization techniques and mass spectrometry (MS), in particular high-resolution instruments based on Fourier transform (FT) technology (ion cyclotron resonance-FT/MS and Orbitrap) or the newest time of flight (TOF) MS systems, now allow high-precision measurements of single metabolite masses within an error range down to only a few parts per million (p.p.m.) (5–8). Sub-p.p.m. precision in ultrahigh resolution can even be reached in routine and full scan with higher field magnets in ICR-FT/MS (9). Accordingly, a number of algorithms have been developed that allow deriving the most likely bulk chemical formula for any given individual mass (10–13). It is well known that a mass precision as high as 1 p.p.m. may not always suffice for an unambiguous annotation of all mass peaks considering all possible elements (14), but may be sufficient with materials of known elementary composition (15) or in the context of restricted metabolite possibilities of a given organism, such as presented here.

In a metabolomics setting, it is essential to go beyond the simple derivation of bulk chemical formula, and to annotate the MS data in the context of the actual metabolic pathways of the organism under study, as some recent studies in plants and bacteria show (16–18). A limited number of software tools for this task are available in the form of downloadable program code (19), but to our knowledge no online web server presently provides the possibility to simply upload a high precision mass spectrum and to readily annotate all mass peaks as potential metabolites of a given organism from an actualized database, and then to map automatically the identified compounds onto the metabolic pathway maps of that organism, optionally including information on gene expression or gene coding potential. The MassTRIX web server that we present here responds to this task.


    USING THE WEB SERVER
 TOP
 ABSTRACT
 INTRODUCTION
 USING THE WEB SERVER
 INTERPRETING THE RESULTS
 CONCLUDING REMARKS
 REFERENCES
 
The required input data of MassTRIX is a mass peak list (i.e. mass–intensity pairs) from high-precision MS experiments, along with some parameters describing the dataset, such as the estimated error limit of the instrument and the ionization mode of the measurements. Typically, observations are from particular eukaryotic or prokaryotic cell extracts (e.g. using different gene knock-outs or growth conditions), but analysis of samples such as gut metabolomics have also been tested on our server. As basis for the annotation, an organism from the KEGG database (20) has to be selected. Optionally, a list of enzymes (EC-numbers) or genes (KEGG identifier) can be submitted to selectively highlight user-defined genes on the pathway maps.

MassTRIX then processes the submitted mass peak list by comparing the input experimental masses against all compounds of the KEGG chemical compound database, additionally including 13C, 15N and other isotopes, and optionally adding selected lipids with variable fatty acid chain lengths. Raw input masses from electrospray ionization (ESI) MS can be corrected on-the-fly for the addition or the abstraction of a proton (and optionally a Na+ ion in positive mode). To cope with the requirement of very low measurement errors (in the sub-ppm range), exact masses of all KEGG compounds have been recomputed from the corresponding chemical formula using high-precision atomic mass data (21). MassTRIX then calls the KEGG/API (http://www.genome.jp/kegg/soap/) to generate pathway maps, where the identified compounds and genes are highlighted using different colors—thus differentiating between organism-specific and extra-organism items (Figure 1).


Figure 1
View larger version (48K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 1. MassTRIX sample output from the yeast cell extract example job (http://masstrix.org/examples.html). The number of potentially identifiable and actually identified metabolites per pathway is reported, where Nmap specifies the number of metabolites on the map, Norg gives the number of metabolites related to the organism, Nid is the number of experimentally identified metabolites on the map and Norgid is the number of the identified metabolites that are related to the organism (top left). Clickable KEGG pathway maps (right) are colored according to the legend presented below. Metabolite nodes link to pages describing the mass peak annotations (bottom left), pathway nodes link to related annotated KEGG pathway maps on MassTRIX, while enzyme nodes link to the KEGG gene database. Auxiliary job information (input data, log files, error plot) can be accessed from all pages.

 
By default, all available KEGG pathway maps for the selected organism will be annotated (about 100–250, depending on the organism), but to speed up the computation, a subset of pathways can be specified by the user. A full run takes about 1 h; four parallel jobs can run at any time (this limit can be increased if the server usage warrants the upgrade, since most of the run time is spent in communication with the KEGG/API). The resulting colored pathway maps are fully clickable and cross-linked between different result pages from the server. These result pages include error plots, and pathway- and compound-specific pages, where also alternative annotations are visualized. Out-links to relevant KEGG enzyme and compound pages are provided. A full documentation together with a list of frequently asked questions and a description of all options is maintained online at http://masstrix.org/doc.html.


    INTERPRETING THE RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 USING THE WEB SERVER
 INTERPRETING THE RESULTS
 CONCLUDING REMARKS
 REFERENCES
 
To illustrate the interpretation of a typical MassTRIX job, a mass spectrum from a cytosolic yeast cell extract under exponential growth, collected in positive scan mode on a Bruker-Daltonics APEXQ 12 Tesla ICR-FT mass spectrometer is provided as a work-through example at http://masstrix.org/examples.html. The input data for this test case are available through this web page, so that this test job can be rerun at any time. After completion of the job, two major starting points for interpretation are provided by MassTRIX (Figure 1): the ‘Pathways’ page summarizes the number of identified metabolites on all available pathway maps, while the ‘Compounds’ page gives a list of all metabolites that are annotated on any given pathway of the organism (here Saccharomyces cerevisiae). From these two major starting points, the user can navigate to the different maps and detailed annotations of the identified metabolites. Here it is important to note that in some cases multiple alternative annotations may be found, e.g. in cases where metabolites with identical bulk formula exist, such as fructose, mannose, allose and glucose 6-phosphates. In such a case, the pathway maps may be useful for deciding which of the annotated metabolites is likely to be present under the conditions of the study (here allose 6-phosphate is not a known metabolite of yeast in KEGG and may thus be ruled out). Other potential reasons to rule out some of the annotations could be related to too low concentrations of the metabolites to be seen with the used ionization and mass spectrometric technique, missing major isotopic mass peaks, or the isolated presence of some metabolites in the pathway maps. Multiple annotations may also occur when the masses of two different metabolites lie both within the error range of the observed mass peak. Here again, the pathway maps may be useful for making an educated guess about which metabolite is the most likely candidate.

In this concrete example, some specific biological facts can be readily gleaned from the MassTRIX output. For instance, as expected for yeast cells in exponential growth, a large number of metabolites in the glycolysis/gluconeogenesis pathway has been identified. These metabolites are preferentially connected to the highly expressed genes, indicated by red–gray enzyme boxes on the KEGG pathway map. On the other hand, the citrate cycle (TCA cycle) is shut off during yeast exponential growth. Accordingly, no metabolites have been identified that are linked to expressed genes of this pathway.


    CONCLUDING REMARKS
 TOP
 ABSTRACT
 INTRODUCTION
 USING THE WEB SERVER
 INTERPRETING THE RESULTS
 CONCLUDING REMARKS
 REFERENCES
 
The first purpose of the MassTRIX server presented here is the web-based analysis of MS-based metabolomics data, and its hypothesis-driven interpretation within the genomic context of the organism under study. To continually respond to its objectives, future developments of the MassTRIX web server shall include an automated preprocessing step of the input data, using for example heuristic filtering rules of molecular formulas (8,10,13), together with an inclusion of other metabolite databases, such as MetaCyc (22), the Human Metabolome Database (23) and the Database of Natural Products (24).

At this point, it is important to emphasize the limitations of this approach: we reported recently that high-resolution mass spectra reflect the isomer filtered complement of the entire space of molecular structures (9). An annotation such as the one proposed here thus associates experimental accurate mass (within an experimental error) with a limited number of bulk chemical formula (isomers), derived from the unique elementary composition space and restricted by the choice of the organism (and its annotated genome). The differentiation between isomers and the final metabolite identification can only be done on a case by case basis in further identification steps, using classical analytical chemistry approaches involving metabolite orthogonal separation, spectroscopy and further spectrometry together with chemical synthesis (25). An educated interpretation of the resulting pathway in the light of the genome of the organism thus remains the golden rule.


    ACKNOWLEDGEMENTS
 
We thank Judith Glöckner-Pagel and Philipp Pagel for providing us with the yeast cell extract. Funding to pay the Open Access publication charges for this article was provided by the Helmholtz Zentrum München.

Conflict of interest statement. None declared.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 USING THE WEB SERVER
 INTERPRETING THE RESULTS
 CONCLUDING REMARKS
 REFERENCES
 

  1. Nicholson JK, Connelly J, Lindon JC, Holmes E. Metabonomics: a platform for studying drug toxicity and gene function. Nat. Rev. Drug Discov. (2002) 1:153–161.[CrossRef][ISI][Medline]

  2. Fiehn O. Metabolomics––the link between genotypes and phenotypes. Plant Mol. Biol. (2002) 48:155–171.[CrossRef][ISI][Medline]

  3. Bino RJ, Hall RD, Fiehn O, Kopka J, Saito K, Draper J, Nikolau BJ, Mendes P, Roessner-Tunali U, Beale MH, et al. Potential of metabolomics as a functional genomics tool. Trends Plant Sci. (2004) 9:418–425.[CrossRef][ISI][Medline]

  4. Hirai MY, Yano M, Goodenowe DB, Kanaya S, Kimura T, Awazuhara M, Arita M, Fujiwara T, Saito K. Integration of transcriptomics and metabolomics for understanding of global responses to nutritional stresses in Arabidopsis thaliana. Proc. Natl Acad. Sci. USA (2004) 101:10205–10210.[Abstract/Free Full Text]

  5. Breitling R, Pitt AR, Barrett MP. Precision mapping of the metabolome. Trends Biotechnol. (2006) 24:543–548.[CrossRef][ISI][Medline]

  6. Makarov A, Denisov E, Lange O, Horning S. Dynamic range of mass accuracy in LTQ Orbitrap hybrid mass spectrometer. J. Am. Soc. Mass Spectrom. (2006) 17:977–982.[CrossRef][ISI][Medline]

  7. Satoh T, Sato T, Tamura J. Development of a high-performance MALDI-TOF mass spectrometer utilizing a spiral ion trajectory. J. Am. Soc. Mass Spectrom. (2007) 18:1318–1323.[CrossRef][ISI][Medline]

  8. Schmitt-Kopplin P, Hertkorn N. Ultrahigh resolution mass spectrometry. Anal. Bioanal. Chem. (2007) 389:1309–1310.[CrossRef][ISI]

  9. Hertkorn N, Ruecker C, Meringer M, Gugisch R, Frommberger M, Perdue EM, Witt M, Schmitt-Kopplin P. High-precision frequency measurements: indispensable tools at the core of the molecular-level analysis of complex systems. Anal. Bioanal. Chem. (2007) 389:1311–1327.[CrossRef][ISI][Medline]

  10. Kind T, Fiehn O. Seven golden rules for heuristic filtering of molecular formulas obtained by accurate mass spectrometry. BMC Bioinform. (2007) 8:105.[CrossRef][Medline]

  11. Ohta D, Shibata D, Kanaya S. Metabolic profiling using Fourier-transform ion-cyclotron-resonance mass spectrometry. Anal. Bioanal. Chem. (2007) 389:1469–1475.[CrossRef][ISI][Medline]

  12. Katajamaa M, Oresic M. Data processing for mass spectrometry-based metabolomics. J. Chromatogr. A (2007) 1158:318–328.[CrossRef][ISI][Medline]

  13. Rosselló-Mora R, Lucio M, Peña A, Brito-Echeverría J, López-López A, Valens-Vadell M, Frommberger M, Antón J, Schmitt-Kopplin P. Metabolic evidences of biogeographic isolation of the extremophilic bacterium Salinibacter ruber. In: The ISME Journal (2008) (in press).

  14. Kind T, Fiehn O. Metabolomic database annotations via query of elemental compositions: mass accuracy is insufficient even at less than 1 ppm. BMC Bioinform. (2006) 7:234.[CrossRef][Medline]

  15. Hertkorn N, Benner R, Schmitt-Kopplin P, Kaiser K, Kettrup A, Hedges IJ. Characterization of a major refractory component of marine organic matter. Geochim. Cosmochim. Acta. (2006) 70:2990–3010.[CrossRef][ISI]

  16. Aharoni A, Ric de Vos CH, Verhoeven HA, Maliepaard CA, Kruppa G, Bino R, Goodenowe DB. Nontargeted metabolome analysis by use of Fourier transform ion cyclotron mass spectrometry. Omics (2002) 6:217–234.[CrossRef][Medline]

  17. Hirai MY, Klein M, Fujikawa Y, Yano M, Goodenowe DB, Yamazaki Y, Kanaya S, Nakamura Y, Kitayama M, Suzuki H, et al. Elucidation of gene-to-gene and metabolite-to-gene networks in Arabidopsis by integration of metabolomics and transcriptomics. J. Biol. Chem. (2005) 280:25590–25595.[Abstract/Free Full Text]

  18. Tang Y, Pingitore F, Mukhopadhyay A, Phan R, Hazen TC, Keasling JD. Pathway confirmation and flux analysis of central metabolic pathways in Desulfovibrio vulgaris hildenborough using gas chromatography-mass spectrometry and Fourier transform-ion cyclotron resonance mass spectrometry. J. Bacteriol. (2007) 189:940–949.[Abstract/Free Full Text]

  19. Jourdan F, Breitling R, Barrett MP, Gilbert D. MetaNetter: inference and visualization of high-resolution metabolomic networks. Bioinformatics. (2008) 24:143–145.[Abstract/Free Full Text]

  20. Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita KF, Itoh M, Kawashima S, Katayama T, Araki M, Hirakawa M. From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res. (2006) 34:D354–D357.[Abstract/Free Full Text]

  21. Wapstra AH, Audi G, Thibault C. The AME2003 atomic mass evaluation (I). Evaluation of input data, adjustment procedures. Nuclear Phys. A. (2003) 729:129–336.[CrossRef]

  22. Caspi R, Foerster H, Fulcher CA, Kaipa P, Krummenacker M, Latendresse M, Paley S, Rhee SY, Shearer AG, Tissier C, et al. The MetaCyc Database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Res. (2008) 36:D623–D631.[Abstract/Free Full Text]

  23. Wishart DS, Tzur D, Knox C, Eisner R, Guo AC, Young N, Cheng D, Jewell K, Arndt D, Sawhney S, et al. HMDB: the human metabolome database. Nucleic Acids Res. (2007) 35:D521–D526.[Abstract/Free Full Text]

  24. Buckingham J. C.H.C. Dictionary of Natural Products. (1993) 6th edn. England: Chapman & Hall, CRC Press.

  25. Chen J, Zhao X, Fritsche J, Yin P, Schmitt-Kopplin P, Wang W, Lu X, Häring HU, Schleicher ED, Lehmann R, et al. Practical approach for the identification and isomer elucidation of biomarkers detected in a metabonomic study for the discovery of individuals at risk for diabetes by integrating the chromatographic and mass spectrometric information. Anal Chem. (2008) 80:1280–1289.[Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow Print PDF (362K) Freely available
Right arrow Screen PDF (367K) Freely available
Right arrowOA All Versions of this Article:
36/suppl_2/W481    most recent
gkn194v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Suhre, K.
Right arrow Articles by Schmitt-Kopplin, P.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Suhre, K.
Right arrow Articles by Schmitt-Kopplin, P.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?