Skip Navigation


Nucleic Acids Research Advance Access originally published online on October 28, 2006
Nucleic Acids Research 2007 35(Database issue):D213-D218; doi:10.1093/nar/gkl863
This Article
Right arrow Abstract Freely available
Right arrow Print PDF (1077K) Freely available
Right arrow Screen PDF (224K) Freely available
Right arrowOA All Versions of this Article:
35/suppl_1/D213    most recent
gkl863v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Heazlewood, J. L.
Right arrow Articles by Millar, A. H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Heazlewood, J. L.
Right arrow Articles by Millar, A. H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2007, Vol. 35, Database issue D213-D218
© 2006 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.


Articles

SUBA: the Arabidopsis Subcellular Database

Joshua L. Heazlewood*, Robert E. Verboom, Julian Tonti-Filippini, Ian Small and A. Harvey Millar

ARC Centre of Excellence in Plant Energy Biology, School of Biomedical, Biomolecular and Chemical Sciences, The University of Western Australia 35 Stirling Highway, Crawley 6009, Western Australia, Australia

*To whom correspondence should be addressed. ARC Centre of Excellence in Plant Energy Biology, CMS Building, M316, The University of Western Australia Crawley 6009, Western Australia, Australia. Tel: +61 8 6648 2795; Fax: +61 8 6648 4401; Email: botjlh{at}cyllene.uwa.edu.au

Received August 15, 2006. Revised September 14, 2006. Accepted October 3, 2006.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 DATABASE STRUCTURE AND INTERFACE
 NUMERICAL ASSESSMENT OF THE...
 HOW TO BUILD QUERIES...
 FUTURE PROSPECTS
 SUPPLEMENTARY DATA
 REFERENCES
 
Knowledge of protein localisation contributes towards our understanding of protein function and of biological inter-relationships. A variety of experimental methods are currently being used to produce localisation data that need to be made accessible in an integrated manner. Chimeric fluorescent fusion proteins have been used to define subcellular localisations with at least 1100 related experiments completed in Arabidopsis. More recently, many studies have employed mass spectrometry to undertake proteomic surveys of subcellular components in Arabidopsis yielding localisation information for ~2600 proteins. Further protein localisation information may be obtained from other literature references to analysis of locations (AmiGO: ~900 proteins), location information from Swiss-Prot annotations (~2000 proteins); and location inferred from gene descriptions (~2700 proteins). Additionally, an increasing volume of available software provides location prediction information for proteins based on amino acid sequence. We have undertaken to bring these various data sources together to build SUBA, a SUBcellular location database for Arabidopsis proteins. The localisation data in SUBA encompasses 10 distinct subcellular locations, >6743 non-redundant proteins and represents the proteins encoded in the transcripts responsible for 51% of Arabidopsis expressed sequence tags. The SUBA database provides a powerful means by which to assess protein subcellular localisation in Arabidopsis (http://www.suba.bcs.uwa.edu.au).


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 DATABASE STRUCTURE AND INTERFACE
 NUMERICAL ASSESSMENT OF THE...
 HOW TO BUILD QUERIES...
 FUTURE PROSPECTS
 SUPPLEMENTARY DATA
 REFERENCES
 
Genomic sequencing efforts of the past decade have provided the scientific community with a vast array of basic information regarding the structure of genomes and their coding capacities. In plants, recent years have seen genome sequencing efforts completed on the model dicot species Arabidopsis (1) and on the important agronomic monocot species, rice (2,3). Curation and analysis of the Arabidopsis genome by The Institute for Genomic Research (TIGR) (4) and more recently by The Arabidopsis Information Resource (TAIR) (5) has produced an annotated genome of high quality (6) that is currently in its sixth revision. While these studies and resources have provided significant insights into the genetic composition of Arabidopsis, the function of many proteins remains poorly understood. A common method employed to assist in the attribution of function to a protein is to establish subcellular location (7). Separate subcellular locations often represent distinct cellular environments and proteins found within such locales share similar attributes and fundamentally play roles in defining the function of a subcellular compartment.

The subcellular location of a protein can be predicted from the primary protein sequence using bioinformatic means. A variety of these bioinformatic programs are available and several studies have assessed their performances using genome wide prediction of localisation (8,9). Experimental approaches can also be used for subcellular determination and commonly utilize cellular fractionation, centrifugation-based purification of organelles or cellular compartments, and mass spectrometry to identify peptides and have been used to define sets of proteins grouped by subcellular location (1012). Such approaches have been extensively undertaken in Arabidopsis with a number of studies producing significant protein sets from major compartments and locations such as plastids (1319), the nucleus (2022), the plasma membrane (23) and the mitochondrion (24,25). A complementary approach to mass spectrometry is the expression and visualisation of fluorescent proteins (FPs) attached to proteins of interest. Many hundreds of Arabidopsis proteins have been visualised in this manner to date and form an important dataset for determining subcellular location (2629). Importantly, they represent the only subcellular location data for intact, living cell structures. Swiss-Prot is a managed database of protein entries that allows hand-annotation, by researchers, of important data regarding a protein's catalytic action, cellular location and broader functional role in biology (30,31). This represents a rich source of data collected over several decades directly from researchers and indirectly from the literature. The Genome Ontology (GO) database maintained by The Gene Ontology Consortium is searchable using the AmiGO browser (32) and contains an array of information regarding genes and associated data that conforms to a consistent and well defined vocabulary.

Currently, the data in most of these literature sources and compiled datasets are relatively inaccessible to the wider research community and the specific proteins localised in some reports are often so numerous that they do not appear in the text, title or abstract of a paper of interest. While TAIR represents the primary source for the majority of information concerning Arabidopsis, it tends to focus on genomic and transcript data. Currently subcellular targeting prediction information is only available for one program (TargetP), while subcellular proteomes and FP targeting information are limited and not accessible as defined sets. Although GO annotations and descriptor fields can be readily searched at TAIR, all these data cannot be collectively investigated as defined sets using Boolean queries. Previously using SUBA we have demonstrated that an analysis of a variety of localisation data sources can be readily utilized to assess subcellular prediction and protein family localisation (9,33). By bringing together additional protein localisation information provided by prediction algorithms, mass spectrometry, GFP studies, Swiss-Prot annotation, AmiGO data and TAIR annotation in the context of defining the subcellular location of proteins in Arabidopsis, we have aimed to provide a one-stop-shop for protein localisation in this model plant.


    DATABASE STRUCTURE AND INTERFACE
 TOP
 ABSTRACT
 INTRODUCTION
 DATABASE STRUCTURE AND INTERFACE
 NUMERICAL ASSESSMENT OF THE...
 HOW TO BUILD QUERIES...
 FUTURE PROSPECTS
 SUPPLEMENTARY DATA
 REFERENCES
 
SUBA is a MySQL relational database, operating on a UNIX-based system. The non-redundant nuclear protein dataset originally utilized to populate the database was obtained from TIGR contained in the file ATH1.pep (release 5) comprising 28 952 non-redundant proteins. Arabidopsis mitochondrial (117) and chloroplast (87) open reading frame sets were obtained from GenBank. Updating of the database to keep pace with new gene model data is an ongoing process. SUBA currently contains a total of 29 156 proteins. Data used to populate the database has been derived from a multitude of sources details of which can be found in Supplementary Data.

SUBA consists of a web-browser based GUI (Graphical User Interface) written in DHTML (Dynamic HTML) that makes use of AJAX (Asynchronous Javascript and XML) to interact with the SUBA server. The back-end of SUBA consists of a number of PHP programs that interact with the MySQL tables that house SUBA data. The SUBA GUI has been designed to allow users with no prior knowledge of SQL (structured query language) to dynamically construct moderately complex SQL queries in a point and click manner. For more advanced users, SQL queries may be manually defined although for security reasons queries are restricted to SELECT type only. The GUI is accessible via http://www.suba.bcs.uwa.edu.au (Figure 1) and is suitable for use with most modern web browsers, although we recommend using the Firefox web browser as it handles the DHTML of SUBA in a significantly faster manner than other common browsers. The interface utilises a tabbed design and refreshes content via AJAX, removing the need for users to depend upon the ‘back’ button of their browsers. The ‘Search’ tab is designed for ease of use consisting of pull down menus and simple text boxes. Complex Boolean queries can be performed using AND, OR and appropriate bracketing. For more advanced users, the ‘SQL search’ tab allows direct entry of SQL queries. Query results are accessed via the ‘Results’ tab in a tabular format with each row containing, by default, protein Arabidopsis genome initiative (AGI) identifier and localisation summary information. The ‘Columns’ tab allows users to control exactly which data columns are loaded whilst the ‘Format’ tab allows users to control which data column are visible and the order in which they are displayed. Some control over column order and visibility is also provided directly via the ‘Results’ tab. Finally, the ‘Rows’ tab keeps a record of all proteins selected by the user via the ‘Results’ tab allowing for simple cutting and pasting of lists and filtering of results. If desired, the results from a query may be downloaded as a tab delimited file for further analysis.


Figure 1
View larger version (34K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 1 The SUBA graphical user interface showing a search (A) where complex queriescan be built from pull down menus and visualised in the sentence structure represented in the text box. In this case for proteins localised to mitochondria by three different experimental data sets. The results (B) of this search is shown, and the AGI page (C) for one of these identifications giving links and information on the requested protein.

 
Each protein match in the result table is hyperlinked to a flatfile that displays further information and provides links to related resources such as BLAST and other Arabidopsis databases including TAIR, TIGR, MIPS Arabidopsis thaliana Database (MAtDB) (34), The Plant Specific Database (35), ARAMEMNON (36), Salk Insertion Sequence Database (37).


    NUMERICAL ASSESSMENT OF THE COMPILED DATA RESOURCES IN ARABIDOPSIS SUBCELLULAR DATABASE (SUBA)
 TOP
 ABSTRACT
 INTRODUCTION
 DATABASE STRUCTURE AND INTERFACE
 NUMERICAL ASSESSMENT OF THE...
 HOW TO BUILD QUERIES...
 FUTURE PROSPECTS
 SUPPLEMENTARY DATA
 REFERENCES
 
The numbers of accumulated sub-cellular location annotations in SUBA are outlined in Table 1. These represent data in 12 subcellular locations (Cell plate, Cytoskeleton, Cytosol, ER, Extracellular, Golgi, Mitochondria, Nucleus, Peroxisome, PM, Plastid, Vacuole) and a range of data in a 13th category in which location is considered speculative (Unclear). Mass spectrometry (MS) leads the number of matches contributed from the direct experimental data sets of MS, FP and AmiGO by ≥2:1 by contributing 3500 location data pieces on 2600 non-redundant proteins, compared to the MS+FP+AmiGO total of 5818 data pieces on 3781 non-redundant identifications. Swiss-Prot and Description data contribute similar number of localisations to MS; to date, 1981 and 2701, respectively. Combined, there are some 10 800 pieces of assembled sub-location data in SUBA on a set of 6743 non-redundant proteins.


View this table:
[in this window]
[in a new window]

 
Table 1 Compiled data resources collected in the Arabidopsis Subcellular Database (SUBA)

 

    HOW TO BUILD QUERIES FOR THE ARABIDOPSIS SUBCELLULAR DATABASE (SUBA)
 TOP
 ABSTRACT
 INTRODUCTION
 DATABASE STRUCTURE AND INTERFACE
 NUMERICAL ASSESSMENT OF THE...
 HOW TO BUILD QUERIES...
 FUTURE PROSPECTS
 SUPPLEMENTARY DATA
 REFERENCES
 
After loading the interface, (http://www.suba.bcs.uwa.edu.au) the query tab is active and from this view any number of characteristics or sets can be selected to define a database query. Simple to moderately complex searches may be constructed using AND, OR and NOT functions to link together any number of data components. Selected data can be easily downloaded using the ‘Download as Excel’ button at the bottom of the results window. Below are some examples that show how this database can help with particular questions.

Building protein sets of known subcellular location from published datasets
In analysis of bioinformatic data, such as transcript data from microarray or yeast two-hybrid interactions, it is often helpful to have lists of gene loci with known location properties. GO annotation provides some tools for this use in many data analysis packages, however, SUBA provides more up to date lists of proteins based on location sets and these can be tailored by the user to include only experimental data or a combination of experimental and prediction data. For example a set of all proteins in the chloroplast, ‘the chloroplast proteome’, can be developed by combining MS, GFP and AmiGO data to give a set of 1309 proteins, this could be expanded by adding other proteins predicted to be in chloroplasts by the prediction programme Predotar to give 2437, or it could be minimised by only taking the experimental set (MS, GFP and AmiGO) that is also firmly predicted by Predotar to give 555 proteins. These AGIs can then be downloaded using the Excel download button to be imported into another programme as a tailored chloroplast location set. These sets might also be browsed by chloroplast researchers interested in which proteins have been located in chloroplasts recently.

Comparison of published proteome sets to each other and to new sets
As the number of reports of proteins identified from different locations accumulates in the literature, it is increasingly difficult to know how accurate these sets are, whether they agree with previous reports or whether claimed new findings have also been reported by other groups in the same or different locations in the cell. SUBA allows a direct comparison of published datasets using the ‘found in reference/not found in reference’ option on the query page that gives access to the lists from each particular paper used to build the sets in SUBA. These can be compared against each other using OR/AND linkages in the query window. For example, Kruft et al. (24) and Millar et al. (25) began the development of the mitochondrial proteome, listing 40 and 83 IDs respectively. The common set between the two studies numbers 28 (Kruft AND Millar). More recently, Froehlich et al. (18) and Ferro et al. (17) both analysed the chloroplast envelope membrane by different methods, identifying 340 and 105 proteins, respectively, with only 59 in common (Froehlich AND Ferro). Interestingly, only 174 of either of these sets were found in the intact chloroplast analysis by Kleffman et al. (19) the following year [(Froehlich OR Ferro) AND Kleffman]. New sets that are unpublished or in review can be easily compared against the published sets by combining in reference queries with AGI lists inserted into the ‘loci in list/loci not in list’ input box.

Comparison of targeting prediction programs
Targeting prediction programs are mostly used to determine the likelihood that a specific protein sequence is targeted to a particular subcellular structure. Usually only one is used, or at best, two programs are used one at a time. However, as SUBA contains pre-analysed predictions across a set of ten prediction programmes for all nuclear-encoded protein sequences from Arabidopsis, the set of proteins predicted to a specific location by multiple targeting prediction programs can be rapidly compared. For example, 3182 proteins are predicted to be mitochondrial by TargetP, but this set is reduced to 1176 if MitoProt II and iPSORT are also queried to form an inclusive consensus set across all three predictors. An assessment of false negative sets and false positive sets for individual predictors can also be made by comparing the existing experimental data with the output of individual prediction programmes. For example, of the 3181 protein predicted by TargetP to be mitochondrial, 470 have been found experimentally somewhere in Arabidopsis [TargetP in mitochondria AND (MS any location OR GFP any location OR AmiGO any location)], but only 268 were found in mitochondria (TargetP in mitochondria AND [MS in mitochondria OR GFP in mitochondria OR Amigo in mitochondira]). This means that 202 proteins, while predicted to be mitochondrial by TargetP, were experimentally found to be located elsewhere. Thus, we can calculate a false positive rate for TargetP mitochondrial predictions of 202/470 = 42%. Proteins are also found in mitochondria (MS in mitochondria OR GFP in mitochondria OR AmiGO in mitochondria = 645) that are not predicted by TargetP [TargetP in mitochondria AND (MS any location OR GFP any location OR AmiGO any location) = 262]. Using these numbers, we can calculate the false negative rate of TargetP mitochondrial predictions as 1–(262/645) = 59%.

Locating proteins or protein families of interest
Many researchers are interested in specific genes or gene families. For an interest in malate dehydrogenases, you could select proteins with a description including the word ‘malate dehydrogenase’ which yields nine entries from the entire Arabidopsis protein set with these words in the primary descriptor. Data are presented on the results page showing the peroxisomal location of three gene products, the mitochondrial localisation of two others, the plastid location of another two and the cytosolic location of a further three based on mixture of MS, Swiss-Prot annotation and AmiGO direct assay evidence. For less analysed gene families, this type of search can be used to assess the predicted location by TargetP for the members of a gene family to identify the most likely member with a particular subcellular location, for example to drive an experimental design to identify an endoplasmic reticulum (ER) member of a particular gene family. The incorporation of a TIGR Paralogous Families list and PFAM (38) Protein Family IDs in the search tab allows the user to quickly consider a set of related proteins based on sequence similarity and functional domains rather than on an intimate knowledge of gene annotations or AGI locus numbers.

In this manner, complex queries may be built to investigate many different questions relating to the assessment of localisation data based on experimentation and prediction, and to link this to assessment of expression levels (using expressed sequence tags) to physical location on chromosomes (using AGI identifiers and chromosome selections) to physical characteristics of proteins (GRAVY, pI, MW, functional motifs).


    FUTURE PROSPECTS
 TOP
 ABSTRACT
 INTRODUCTION
 DATABASE STRUCTURE AND INTERFACE
 NUMERICAL ASSESSMENT OF THE...
 HOW TO BUILD QUERIES...
 FUTURE PROSPECTS
 SUPPLEMENTARY DATA
 REFERENCES
 
Incorporation of further experimental localisation datasets is planned to enhance the value of this database as a resource for plant molecular analysis. Furthermore, new prediction programs, the results of new high-throughput GFP tagging experiments and new mass spectrometry analysis of organelle fractions will be added and protein-protein interaction studies will also be incorporated. This will provide a relational search environment bringing together large datasets with a focus on questions pertaining to subcellular location and organelle function.


    SUPPLEMENTARY DATA
 TOP
 ABSTRACT
 INTRODUCTION
 DATABASE STRUCTURE AND INTERFACE
 NUMERICAL ASSESSMENT OF THE...
 HOW TO BUILD QUERIES...
 FUTURE PROSPECTS
 SUPPLEMENTARY DATA
 REFERENCES
 
Supplementary Data are available at NAR online.


    ACKNOWLEDGEMENTS
 
We would like to thank Torsten Blum (Wilhelm-Schickard-Institut für Informatik Universität Tübingen) for pre-computed Arabidopsis localisation set using MultiLoc and Rajesh Nair (Columbia University Bioinformatics Center) for the pre-computed Arabidopsis localisation set using LOCTree. This work is supported through the Australian Research Council Centre of Excellence Programme, an ARC QEII Research Fellowship to AHM, an ARC Postdoctoral Fellowship to JLH and a WA Premier's Research Fellowship to IS. Funding to pay the Open Access publication charges for this article was provided by the University of Western Australia.

Conflict of interest statement. None declared.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 DATABASE STRUCTURE AND INTERFACE
 NUMERICAL ASSESSMENT OF THE...
 HOW TO BUILD QUERIES...
 FUTURE PROSPECTS
 SUPPLEMENTARY DATA
 REFERENCES
 

  1. Kaul, S., Koo, H.L., Jenkins, J., Rizzo, M., Rooney, T., Tallon, L.J., Feldblyum, T., Nierman, W., Benito, M.I., Lin, X.Y., et al. (2000) Analysis of the genome sequence of the flowering plant Arabidopsis thaliana Nature, 408, 796–815[CrossRef][Medline] .

  2. Goff, S.A., Ricke, D., Lan, T.H., Presting, G., Wang, R., Dunn, M., Glazebrook, J., Sessions, A., Oeller, P., Varma, H., et al. (2002) A draft sequence of the rice genome (Oryza sativa L. ssp. japonica) Science, 296, 92–100[Abstract/Free Full Text] .

  3. Yu, J., Hu, S., Wang, J., Wong, G.K., Li, S., Liu, B., Deng, Y., Dai, L., Zhou, Y., Zhang, X., et al. (2002) A draft sequence of the rice genome (Oryza sativa L. ssp. indica) Science, 296, 79–92[Abstract/Free Full Text] .

  4. Haas, B.J., Wortman, J.R., Ronning, C.M., Hannick, L.I., Smith, R.K., Jr, Maiti, R., Chan, A.P., Yu, C., Farzad, M., Wu, D., et al. (2005) Complete reannotation of the Arabidopsis genome: methods, tools, protocols and the final release BMC Biol, . 3, 7[CrossRef][Medline] .

  5. He, Z.H., Cheeseman, I., He, D., Kohorn, B.D. (1999) A cluster of five cell wall-associated receptor kinase genes, Wak1-5, are expressed in specific organs of Arabidopsis Plant Mol. Biol, . 39, 1189–1196[CrossRef][Web of Science][Medline] .

  6. Rhee, S.Y., Beavis, W., Berardini, T.Z., Chen, G., Dixon, D., Doyle, A., Garcia-Hernandez, M., Huala, E., Lander, G., Montoya, M., et al. (2003) The Arabidopsis Information Resource (TAIR): a model organism database providing a centralized, curated gateway to Arabidopsis biology, research materials and community Nucleic Acids Res, . 31, 224–228[Abstract/Free Full Text] .

  7. Chou, K.C. and Cai, Y.D. (2003) Prediction and classification of protein subcellular location-sequence-order effect and pseudo amino acid composition J. Cell. Biochem, . 90, 1250–1260[CrossRef][Web of Science][Medline] .

  8. Richly, E., Chinnery, P.F., Leister, D. (2003) Evolutionary diversification of mitochondrial proteomes: implications for human disease Trends Genet, . 19, 356–362[CrossRef][Web of Science][Medline] .

  9. Heazlewood, J.L., Tonti-Filippini, J.S., Gout, A.M., Day, D.A., Whelan, J., Millar, A.H. (2004) Experimental analysis of the Arabidopsis mitochondrial proteome highlights signalling and regulatory components, provides assessment of targeting prediction programs and points to plant specific mitochondrial proteins Plant Cell, . 16, 241–256[Abstract/Free Full Text] .

  10. Brunet, S., Thibault, P., Gagnon, E., Kearney, P., Bergeron, J.J., Desjardins, M. (2003) Organelle proteomics: looking at less to see more Trends Cell Biol, . 13, 629–638[CrossRef][Web of Science][Medline] .

  11. Komatsu, S., Kojima, K., Suzuki, K., Ozaki, K., Higo, K. (2004) Rice Proteome Database based on two-dimensional polyacrylamide gel electrophoresis: its status in 2003 Nucleic Acids Res, . 32, D388–D392[Abstract/Free Full Text] .

  12. Millar, A.H. (2004) Location, location, location: surveying the intracellular real estate with proteomics in plants Funct. Plant Biol, . 31, 563–571[CrossRef] .

  13. Peltier, J.B., Emanuelsson, O., Kalume, D.E., Ytterberg, J., Friso, G., Rudella, A., Liberles, D.A., Soderberg, L., Roepstorff, P., von Heijne, G., et al. (2002) Central functions of the lumenal and peripheral thylakoid proteome of Arabidopsis determined by experimentation and genome-wide prediction Plant Cell, . 14, 211–236[Abstract/Free Full Text] .

  14. Schubert, M., Petersson, U.A., Haas, B.J., Funk, C., Schroder, W.P., Kieselbach, T. (2002) Proteome map of the chloroplast lumen of Arabidopsis thaliana J. Biol. Chem, . 277, 8354–8365[Abstract/Free Full Text] .

  15. Friso, G., Giacomelli, L., Ytterberg, A.J., Peltier, J.B., Rudella, A., Sun, Q., Wijk, K.J. (2004) In-depth analysis of the thylakoid membrane proteome of Arabidopsis thaliana chloroplasts: new proteins, new functions, and a plastid proteome database Plant Cell, . 16, 478–499[Abstract/Free Full Text] .

  16. Peltier, J.B., Cai, Y., Sun, Q., Zabrouskov, V., Giacomelli, L., Rudella, A., Ytterberg, A.J., Rutschow, H., van Wijk, K.J. (2006) The oligomeric stromal proteome of Arabidopsis thaliana chloroplasts Mol. Cell. Proteomics, 5, 114–133[Abstract/Free Full Text] .

  17. Ferro, M., Salvi, D., Brugiere, S., Miras, S., Kowalski, S., Louwagie, M., Garin, J., Joyard, J., Rolland, N. (2003) Proteomics of the chloroplast envelope membranes from Arabidopsis thaliana Mol. Cell. Proteomics, 2, 325–345[Abstract/Free Full Text] .

  18. Froehlich, J.E., Wilkerson, C.G., Ray, K., McAndrew, R.S., Osteryoung, K.W., Gage, D.A., Phinney, B.S. (2003) Proteomic study of the Arabidopsis thaliana chloroplastic envelope membrane utilizing alternatives to traditional two-dimensional electrophoresis J. Proteome Res, . 2, 413–425[CrossRef][Web of Science][Medline] .

  19. Kleffmann, T., Russenberger, D., von Zychlinski, A., Christopher, W., Sjolander, K., Gruissem, W., Baginsky, S. (2004) The Arabidopsis thaliana chloroplast proteome reveals pathway abundance and novel protein functions Curr. Biol, . 14, 354–362[CrossRef][Web of Science][Medline] .

  20. Bae, M.S., Cho, E.J., Choi, E.Y., Park, O.K. (2003) Analysis of the Arabidopsis nuclear proteome and its response to cold stress Plant J, . 36, 652–663[CrossRef][Web of Science][Medline] .

  21. Calikowski, T.T., Meulia, T., Meier, I. (2003) A proteomic study of the arabidopsis nuclear matrix J. Cell. Biochem, . 90, 361–378[CrossRef][Web of Science][Medline] .

  22. Pendle, A.F., Clark, G.P., Boon, R., Lewandowska, D., Lam, Y.W., Andersen, J., Mann, M., Lamond, A.I., Brown, J.W., Shaw, P.J. (2005) Proteomic analysis of the Arabidopsis nucleolus suggests novel nucleolar functions Mol. Biol. Cell, . 16, 260–269[Abstract/Free Full Text] .

  23. Dunkley, T.P., Hester, S., Shadforth, I.P., Runions, J., Weimar, T., Hanton, S.L., Griffin, J.L., Bessant, C., Brandizzi, F., Hawes, C., et al. (2006) Mapping the Arabidopsis organelle proteome Proc. Natl Acad. Sci. USA, 103, 6518–6523[Abstract/Free Full Text] .

  24. Kruft, V., Eubel, H., Jansch, L., Werhahn, W., Braun, H.P. (2001) Proteomic approach to identify novel mitochondrial proteins in Arabidopsis Plant Physiol, . 127, 1694–1710[Abstract/Free Full Text] .

  25. Millar, A.H., Sweetlove, L.J., Giege, P., Leaver, C.J. (2001) Analysis of the Arabidopsis mitochondrial proteome Plant Physiol, . 127, 1711–1727[Abstract/Free Full Text] .

  26. Koroleva, O.A., Tomlinson, M.L., Leader, D., Shaw, P., Doonan, J.H. (2005) High-throughput protein localization in Arabidopsis using Agrobacterium-mediated transient expression of GFP-ORF fusions Plant J, . 41, 162–174[CrossRef][Web of Science][Medline] .

  27. Tian, G.W., Mohanty, A., Chary, S.N., Li, S., Paap, B., Drakakaki, G., Kopec, C.D., Li, J., Ehrhardt, D., Jackson, D., et al. (2004) High-throughput fluorescent tagging of full-length Arabidopsis gene products in planta Plant Physiol, . 135, 25–38[Abstract/Free Full Text] .

  28. Cutler, S.R., Ehrhardt, D.W., Griffitts, J.S., Somerville, C.R. (2000) Random GFP::cDNA fusions enable visualization of subcellular structures in cells of Arabidopsis at a high frequency Proc. Natl Acad. Sci. USA, 97, 3718–3723[Abstract/Free Full Text] .

  29. Li, S., Ehrhardt, D.W., Rhee, S.Y. (2006) Systematic analysis of Arabidopsis organelles and a protein localization database for facilitating fluorescent tagging of full-length Arabidopsis proteins Plant Physiol, . 141, 527–539[Abstract/Free Full Text] .

  30. Bairoch, A., Boeckmann, B., Ferro, S., Gasteiger, E. (2004) Swiss-Prot: juggling between evolution and stability Brief Bioinform, . 5, 39–55[Abstract/Free Full Text] .

  31. Schneider, M., Tognolli, M., Bairoch, A. (2004) The Swiss-Prot protein knowledgebase and ExPASy: providing the plant community with high quality proteomic data and tools Plant Physiol. Biochem, . 42, 1013–1021[CrossRef][Web of Science][Medline] .

  32. Gene Ontology Consortium. (2006) The Gene Ontology (GO) project in 2006 Nucleic Acids Res, . 34, D322–326[Abstract/Free Full Text] .

  33. Heazlewood, J.L., Tonti-Filippini, J., Verboom, R.E., Millar, A.H. (2005) Combining experimental and predicted datasets for determination of the subcellular location of proteins in Arabidopsis Plant Physiol, . 139, 598–609[Abstract/Free Full Text] .

  34. Schoof, H., Zaccaria, P., Gundlach, H., Lemcke, K., Rudd, S., Kolesov, G., Arnold, R., Mewes, H.W., Mayer, K.F. (2002) MIPS Arabidopsis thaliana Database (MAtDB): an integrated biological knowledge resource based on the first complete plant genome Nucleic Acids Res, . 30, 91–93[Abstract/Free Full Text] .

  35. Gutierrez, R.A., Larson, M.D., Wilkerson, C. (2004) The plant-specific database. Classification of Arabidopsis proteins based on their phylogenetic profile Plant Physiol, . 135, 1888–1892[Free Full Text] .

  36. Schwacke, R., Schneider, A., van der Graaff, E., Fischer, K., Catoni, E., Desimone, M., Frommer, W.B., Flugge, U.I., Kunze, R. (2003) ARAMEMNON, a novel database for Arabidopsis integral membrane proteins Plant Physiol, . 131, 16–26[Abstract/Free Full Text] .

  37. Alonso, J.M., Stepanova, A.N., Leisse, T.J., Kim, C.J., Chen, H., Shinn, P., Stevenson, D.K., Zimmerman, J., Barajas, P., Cheuk, R., et al. (2003) Genome-wide insertional mutagenesis of Arabidopsis thaliana Science, 301, 653–657[Abstract/Free Full Text] .

  38. Bateman, A., Coin, L., Durbin, R., Finn, R.D., Hollich, V., Griffiths-Jones, S., Khanna, A., Marshall, M., Moxon, S., Sonnhammer, E.L., et al. (2004) The Pfam protein families database Nucleic Acids Res, . 32, D138–141[Abstract/Free Full Text] .


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
P. Durek, R. Schmidt, J. L. Heazlewood, A. Jones, D. MacLean, A. Nagel, B. Kersten, and W. X. Schulze
PhosPhAt: the Arabidopsis thaliana phosphorylation site database. An update
Nucleic Acids Res., October 30, 2009; (2009) gkp810v1.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
E. H. Meyer, T. Tomaz, A. J. Carroll, G. Estavillo, E. Delannoy, S. K. Tanz, I. D. Small, B. J. Pogson, and A. H. Millar
Remodeled Respiration in ndufs4 with Low Phosphorylation Efficiency Suppresses Arabidopsis Germination and Growth and Alters Control of Metabolism at Night
Plant Physiology, October 1, 2009; 151(2): 603 - 619.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
M. Lin, B. Hu, L. Chen, P. Sun, Y. Fan, P. Wu, and X. Chen
Computational Identification of Potential Molecular Interactions in Arabidopsis
Plant Physiology, September 1, 2009; 151(1): 34 - 46.
[Abstract] [Full Text] [PDF]


Home page
Gen Biol EvolHome page
M. Ziemann, M. Bhave, and S. Zachgo
Origin and Diversification of Land Plant CC-Type Glutaredoxins
Gen Biol Evol, August 21, 2009; 2009(0): 265 - 277.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
M. Mazourek, A. Pujar, Y. Borovsky, I. Paran, L. Mueller, and M. M. Jahn
A Dynamic Interface for Capsaicinoid Systems Biology
Plant Physiology, August 1, 2009; 150(4): 1806 - 1821.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
J. Jouhet and J. C. Gray
Interaction of Actin and the Chloroplast Protein Import Apparatus
J. Biol. Chem., July 10, 2009; 284(28): 19132 - 19141.
[Abstract] [Full Text] [PDF]


Home page
Plant Cell PhysiolHome page
N. Mitsuda and M. Ohme-Takagi
Functional Analysis of Transcription Factors in Arabidopsis
Plant Cell Physiol., July 1, 2009; 50(7): 1232 - 1248.
[Abstract] [Full Text] [PDF]


Home page
Plant Cell PhysiolHome page
Y. Makita, N. Kobayashi, Y. Mochizuki, Y. Yoshida, S. Asano, N. Heida, M. Deshpande, R. Bhatia, A. Matsushima, M. Ishii, et al.
PosMed-plus: An Intelligent Search Engine that Inferentially Integrates Cross-Species Information Resources for Molecular Breeding of Plants
Plant Cell Physiol., July 1, 2009; 50(7): 1249 - 1259.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
S. Huang, N. L. Taylor, J. Whelan, and A. H. Millar
Refining the Definition of Plant Mitochondrial Presequences through Analysis of Sorting Signals, N-Terminal Modifications, and Cleavage Motifs
Plant Physiology, July 1, 2009; 150(3): 1272 - 1285.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Mintz-Oron, A. Aharoni, E. Ruppin, and T. Shlomi
Network-based prediction of metabolic enzymes' subcellular localization
Bioinformatics, June 15, 2009; 25(12): i247 - i1252.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
A. H. Millar, C. Carrie, B. Pogson, and J. Whelan
Exploring the Function-Location Nexus: Using Multiple Lines of Evidence in Defining the Subcellular Location of Plant Proteins
PLANT CELL, June 1, 2009; 21(6): 1625 - 1631.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
S. Reiland, G. Messerli, K. Baerenfaller, B. Gerrits, A. Endler, J. Grossmann, W. Gruissem, and S. Baginsky
Large-Scale Arabidopsis Phosphoproteome Profiling Reveals Novel Chloroplast Kinase Substrates and Phosphorylation Networks
Plant Physiology, June 1, 2009; 150(2): 889 - 903.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
S. Reumann, S. Quan, K. Aung, P. Yang, K. Manandhar-Shrestha, D. Holbrook, N. Linka, R. Switzenberg, C. G. Wilkerson, A. P.M. Weber, et al.
In-Depth Proteome Analysis of Arabidopsis Leaf Peroxisomes Combined with in Vivo Subcellular Targeting Verification Indicates Novel Metabolic and Regulatory Functions of Peroxisomes
Plant Physiology, May 1, 2009; 150(1): 125 - 143.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
S. Kierszniowska, B. Seiwert, and W. X. Schulze
Definition of Arabidopsis Sterol-rich Membrane Microdomains by Differential Treatment with Methyl-{beta}-cyclodextrin and Quantitative Proteomics
Mol. Cell. Proteomics, April 1, 2009; 8(4): 612 - 623.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
S. M. Brady and N. J. Provart
Web-Queryable Large-Scale Data Sets for Hypothesis Generation in Plant Biology
PLANT CELL, April 1, 2009; 21(4): 1034 - 1051.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
A. Jamai, P. A. Salome, S. H. Schilling, A. P.M. Weber, and C. R. McClung
Arabidopsis Photorespiratory Serine Hydroxymethyltransferase Activity Requires the Mitochondrial Accumulation of Ferredoxin-Dependent Glutamate Synthase
PLANT CELL, February 1, 2009; 21(2): 595 - 606.
[Abstract] [Full Text] [PDF]


Home page
Plant Cell PhysiolHome page
Y. Ogata, N. Sakurai, K. Aoki, H. Suzuki, K. Okazaki, K. Saito, and D. Shibata
KAGIANA: An Excel-Based Tool for Retrieving Summary Information on Arabidopsis Genes
Plant Cell Physiol., January 1, 2009; 50(1): 173 - 177.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
H. Gao, F. Brandizzi, C. Benning, and R. M. Larkin
A membrane-tethered transcription factor defines a branch of the heat stress response in Arabidopsis thaliana
PNAS, October 21, 2008; 105(42): 16398 - 16403.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
C. P. Lee, H. Eubel, N. O'Toole, and A. H. Millar
Heterogeneity of the Mitochondrial Proteome for Photosynthetic and Non-photosynthetic Arabidopsis Metabolism
Mol. Cell. Proteomics, July 1, 2008; 7(7): 1297 - 1316.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
E. Collakova, A. Goyer, V. Naponelli, I. Krassovskaya, J. F. Gregory III, A. D. Hanson, and Y. Shachar-Hill
Arabidopsis 10-Formyl Tetrahydrofolate Deformylases Are Essential for Photorespiration
PLANT CELL, July 1, 2008; 20(7): 1818 - 1832.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
N. Tsesmetzis, M. Couchman, J. Higgins, A. Smith, J. H. Doonan, G. J. Seifert, E. E. Schmidt, I. Vastrik, E. Birney, G. Wu, et al.
Arabidopsis Reactome: A Foundation Knowledgebase for Plant Systems Biology
PLANT CELL, June 1, 2008; 20(6): 1426 - 1436.
[Full Text] [PDF]


Home page
GeneticsHome page
J. Stout, E. Romero-Severson, M. O. Ruegger, and C. Chapple
Semidominant Mutations in Reduced Epidermal Fluorescence 4 Reduce Phenylpropanoid Content in Arabidopsis
Genetics, April 1, 2008; 178(4): 2237 - 2251.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Mano, T. Miwa, S.-i. Nishikawa, T. Mimura, and M. Nishimura
The plant organelles database (PODB): a collection of visualized plant organelles and protocols for plant organelle research
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D929 - D937.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Cui, P. Li, G. Li, F. Xu, C. Zhao, Y. Li, Z. Yang, G. Wang, Q. Yu, Y. Li, et al.
AtPID: Arabidopsis thaliana protein interactome database an integrative platform for plant systems biology
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D999 - D1008.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
R. Narsai, K. A. Howell, A. H. Millar, N. O'Toole, I. Small, and J. Whelan
Genome-Wide Analysis of mRNA Decay Rates and Their Determinants in Arabidopsis thaliana
PLANT CELL, November 1, 2007; 19(11): 3418 - 3436.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
R. Zentella, Z.-L. Zhang, M. Park, S. G. Thomas, A. Endo, K. Murase, C. M. Fleet, Y. Jikumaru, E. Nambara, Y. Kamiya, et al.
Global Analysis of DELLA Direct Targets in Early Gibberellin Signaling in Arabidopsis
PLANT CELL, October 1, 2007; 19(10): 3037 - 3057.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
J. Geisler-Lee, N. O'Toole, R. Ammar, N. J. Provart, A. H. Millar, and M. Geisler
A Predicted Interactome for Arabidopsis
Plant Physiology, October 1, 2007; 145(2): 317 - 329.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
M. N. Martin, P. H. Saladores, E. Lambert, A. O. Hudson, and T. Leustek
Localization of Members of the {gamma}-Glutamyl Transpeptidase Family Identifies Sites of Glutathione and Glutathione S-Conjugate Hydrolysis
Plant Physiology, August 1, 2007; 144(4): 1715 - 1732.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (1077K) Freely available
Right arrow Screen PDF (224K) Freely available
Right arrowOA All Versions of this Article:
35/suppl_1/D213    most recent
gkl863v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Heazlewood, J. L.
Right arrow Articles by Millar, A. H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Heazlewood, J. L.
Right arrow Articles by Millar, A. H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?