Deterministic protein inference for shotgun proteomics data provides new insights into Arabidopsis pollen development and function

  1. Monica A. Grobei1,2,
  2. Ermir Qeli1,
  3. Erich Brunner1,
  4. Hubert Rehrauer3,
  5. Runxuan Zhang1,6,
  6. Bernd Roschitzki3,
  7. Konrad Basler1,4,
  8. Christian H. Ahrens1,5,7 and
  9. Ueli Grossniklaus1,2,5,7
  1. 1 Center for Model Organism Proteomes, University of Zürich, 8057 Zürich, Switzerland;
  2. 2 Institute of Plant Biology and Zürich-Basel Plant Science Center, University of Zürich, 8008 Zürich, Switzerland;
  3. 3 Functional Genomics Center Zürich, ETH and University of Zürich, 8057 Zürich, Switzerland;
  4. 4 Institute for Molecular Biology, University of Zürich, 8057 Zürich, Switzerland
    • 6 Present address: Translational Medicine Research Collaboration Laboratory, College of Life Sciences, University of Dundee, James Arrott Dr., Dundee DD1 9SY, UK.

    1. 5 These authors contributed equally to this work as senior authors.

    Abstract

    Pollen, the male gametophyte of flowering plants, represents an ideal biological system to study developmental processes, such as cell polarity, tip growth, and morphogenesis. Upon hydration, the metabolically quiescent pollen rapidly switches to an active state, exhibiting extremely fast growth. This rapid switch requires relevant proteins to be stored in the mature pollen, where they have to retain functionality in a desiccated environment. Using a shotgun proteomics approach, we unambiguously identified ∼3500 proteins in Arabidopsis pollen, including 537 proteins that were not identified in genetic or transcriptomic studies. To generate this comprehensive reference data set, which extends the previously reported pollen proteome by a factor of 13, we developed a novel deterministic peptide classification scheme for protein inference. This generally applicable approach considers the gene model–protein sequence–protein accession relationships. It allowed us to classify and eliminate ambiguities inherently associated with any shotgun proteomics data set, to report a conservative list of protein identifications, and to seamlessly integrate data from previous transcriptomics studies. Manual validation of proteins unambiguously identified by a single, information-rich peptide enabled us to significantly reduce the false discovery rate, while keeping valuable identifications of shorter and lower abundant proteins. Bioinformatic analyses revealed a higher stability of pollen proteins compared to those of other tissues and implied a protein family of previously unknown function in vesicle trafficking. Interestingly, the pollen proteome is most similar to that of seeds, indicating physiological similarities between these developmentally distinct tissues.

    Footnotes

    • 7 Corresponding authors.

      E-mail grossnik{at}botinst.uzh.ch; fax +41-44-634-8204.

      E-mail christian.ahrens{at}molbio.uzh.ch; fax +41-44-635-6864.

    • [Supplemental material is available online at http://www.genome.org. The data from this study have been submitted to public protein repository PRIDE (http://www.ebi.ac.uk/pride/) under accession nos. 8743–8750.]

    • Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.089060.108.

      • Received November 10, 2008.
      • Accepted June 16, 2009.
    | Table of Contents

    Preprint Server