Skip to main content
Advertisement
  • Loading metrics

Quantitative proteomics analysis reveals important roles of N-glycosylation on ER quality control system for development and pathogenesis in Magnaporthe oryzae

  • Xiao-Lin Chen,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliations The Provincial Key Laboratory of Plant Pathology of Hubei Province, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, China, State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing, China

  • Caiyun Liu,

    Roles Investigation, Methodology

    Affiliation The Provincial Key Laboratory of Plant Pathology of Hubei Province, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, China

  • Bozeng Tang,

    Roles Formal analysis, Software

    Affiliation The Sainsbury Laboratory, University of East Anglia, Norwich, United Kingdom

  • Zhiyong Ren,

    Roles Investigation, Methodology

    Affiliation The Provincial Key Laboratory of Plant Pathology of Hubei Province, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, China

  • Guo-Liang Wang,

    Roles Supervision, Writing – review & editing

    Affiliations State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing, China, Department of Plant Pathology, Ohio State University, Columbus, Ohio, United States of America

  • Wende Liu

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    wendeliu@126.com

    Affiliation State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing, China

Abstract

Genetic studies have shown essential functions of N-glycosylation during infection of the plant pathogenic fungi, however, systematic roles of N-glycosylation in fungi is still largely unknown. Biological analysis demonstrated N-glycosylated proteins were widely present at different development stages of Magnaporthe oryzae and especially increased in the appressorium and invasive hyphae. A large-scale quantitative proteomics analysis was then performed to explore the roles of N-glycosylation in M. oryzae. A total of 559 N-glycosites from 355 proteins were identified and quantified at different developmental stages. Functional classification to the N-glycosylated proteins revealed N-glycosylation can coordinate different cellular processes for mycelial growth, conidium formation, and appressorium formation. N-glycosylation can also modify key components in N-glycosylation, O-glycosylation and GPI anchor pathways, indicating intimate crosstalk between these pathways. Interestingly, we found nearly all key components of the endoplasmic reticulum quality control (ERQC) system were highly N-glycosylated in conidium and appressorium. Phenotypic analyses to the gene deletion mutants revealed four ERQC components, Gls1, Gls2, GTB1 and Cnx1, are important for mycelial growth, conidiation, and invasive hyphal growth in host cells. Subsequently, we identified the Gls1 N-glycosite N497 was important for invasive hyphal growth and partially required for conidiation, but didn’t affect colony growth. Mutation of N497 resulted in reduction of Gls1 in protein level, and localization from ER into the vacuole, suggesting N497 is important for protein stability of Gls1. Our study showed a snapshot of the N-glycosylation landscape in plant pathogenic fungi, indicating functions of this modification in cellular processes, developments and pathogenesis.

Author summary

The fungal pathogen Magnaporthe oryzae can cause rice blast and wheat blast diseases, which threatens worldwide food production. During infection, M. oryzae follows a sequence of distinct developmental stages adapted to survival and invasion of the host environment. M. oryzae attaches onto the host by the conidium, and then develops an appressorium to breach the host cuticle. After penetrating, it forms invasive hyphae to quickly spread in the host cells. Numerous genetic studies have focused on the mechanisms underlying each step in the infection process, but systemic approaches are needed for a broader, integrated understanding of regulatory events during M. oryzae pathogenesis. Many infection-related signaling events are regulated through post-translational protein modifications within the pathogen. N-linked glycosylation, in which a glycan moiety is added to the amide group of an asparagine residue, is an abundant modification known to be essential for M. oryzae infection. In this study, we employed a quantitative proteomics analysis to unravel the overall regulatory mechanisms of N-glycosylation at different developmental stages of M. oryzae. We detected changes in N-glycosylation levels at 559 glycosylated residues (N-glycosites) in 355 proteins during different stages, and determined that the ER quality control system is elaborately regulated by N-glycosylation. The insights gained will help us to better understand the regulatory mechanisms of infection in pathogenic fungi. These findings may be also important for developing novel strategies for fungal disease control.

Introduction

Asparagine-linked protein glycosylation, or N-glycosylation, is one of the most complex and abundant post-translational modifications of eukaryotic proteins [1]. N-glycosylation can change the folding, stability, quality control, sorting, and localization of target proteins [2]. By altering protein functions, N-glycosylation mediates diverse biological processes, including differentiation, growth, development, and intercellular communication [34]. As an enzymatically catalyzed process, N-glycosylation occurs in the endoplasmic reticulum (ER). This process includes assembling the glycans on a lipid carrier dolichol pyrophosphate (Dol-PP) in the ER membrane and transferring the glycans to target proteins [5]. During the latter process, the assembled oligosaccharide is transferred en bloc from the lipid carrier Dol-PP to the protein substrate, which occurs on select asparagine glycosylation sequons (N-X-S/T; X≠P) as soon as the protein substrate arrives in the ER lumen [6]. The N-linked glycan structure affects protein folding and facilitates the ER quality control system in identifying properly folded proteins. In the secretory pathway, N-glycan linked proteins are transported to the Golgi apparatus, where more complex, hybrid and paucimannose-type N-glycans are added into the N-glycan structures to form mature N-glycosylated proteins [78]. If an N-glycan linked protein is not properly folded, it will be recycled in the ER-associated degradation pathway [9].

Over the past decade, the biological functions of protein N-glycosylation have been analyzed in fungi including the human pathogen Candida albicans [1012] and the plant pathogens Ustilago maydis, Magnaporthe oryzae and Mycosphaerella graminicola [1316], establishing that N-glycosylation is essential for evasion of host immunity or establishment of infection of fungi. However, it is still largely unknown why N-glycosylation is important for fungal pathogenesis. Current understanding of N-glycosylation has largely been established through functional genetic studies of the N-glycosylation pathway, while the N-glycosylated target proteins executing biological functions have received far less attention. Therefore, systematic studies of the targets of N-glycosylation, including profiling of the N-glycoproteins and N-glycosites, will be critical to understand the mechanistic role of this modification in the infection process. N-glycoproteomes of several model organisms have been performed and investigated in the past decade, including humans and plants [1721]. However, few N-glycoproteome studies focused on the global functions of N-glycosylation during fungal pathogenesis.

Magnaporthe oryzae is the causal agent of rice blast disease, one of the most destructive rice diseases in the world [22]. M. oryzae begins the infection process when a conidium touches the surface of a host leaf. The conidium then germinates and develops into a dome-like appressorium [2324]. The appressorium generates high turgor pressure by accumulating compatible solutes to facilitate the penetration of the plant surface. After penetration, the fungus initiates a biotrophic growth stage for a short period, then switches into a necrotrophic growth stage and produces asexual conidia to disseminate. In this study, we used label-free quantification of N-glycosylation sites at different infection stages to identify the functions of N-glycosylation during the development and infection of M. oryzae. A total of 559 unique N-glycosite-containing peptides from 355 N-glycoproteins were identified and quantified. We found that N-glycosylation prominently modifies not only the cell wall and plasma membrane proteins, but also a large number of proteins from the secretory pathway with crucial functions in protein glycosylation, folding, quality control and secretion. N-glycosylation coordinated proteins for mycelial growth, conidium formation, and appressorium formation. We demonstrated that N-glycosylation can regulate endoplasmic reticulum quality control (ERQC) factors to facilitate penetration and host infection. The results of this study provide an overview and shed new light on the N-glycosylation protein mediated regulatory functions, which can help to unravel the relationship of N-glycosylation and pathogenesis in M. oryzae.

Results

Biological importance of N-glycosylation at different development stages of M. oryzae

We firstly analyzed the changes in global N-glycosylation levels for the mycelium, conidium and appressorium stages. The invasive hyphae (IH) stage was omitted due to difficulty separating the IH from host tissue. Western blot analysis using horseradish peroxidase-conjugated concanavalin A (ConA-HRP) as an antibody revealed that the N-glycosylated proteins of the appressoria and conidia were more abundant than that of the mycelia (Fig 1A). We also performed gene expression profiling of the N-glycosylation pathway. Results showed that expression levels of the late pathway genes, which form complex-, hybrid- and paucimannose-type N-glycans in the Golgi apparatus, were elevated in the conidia and early appressorium stage (S1 Fig). These results suggested that the strength of N-glycosylation increased by adding more N-glycan chains during conidium and appressorium formation.

thumbnail
Fig 1. Biological importance of N-glycosylation.

(A) Different N-glycosylated protein levels tested by Western blot using the ConA-HRP antibody. MY, mycelia; AP, appressoria; CO, conidia. (B) ConA-FITC and FLAER staining assay in different tissues of M. oryzae. Mycelia, conidia, appressoria and invasive hyphae were stained with 10 μg/mL ConA-FITC or 50 nM FLAER. ConA-FITC, FITC fluorescence-fused lectin concanavalin A; FLAER, fluorochrome (Alexa 488)–labeled inactivated aerolysin. Bar, 10 μm. Effect of 5 μg/mL tunicamycin (Tu) or 10 mM dithiothreitol (DTT) on: (C) mycelia dry weight, (D) the length of mycelial cell, (E) conidiation, (F) conidiophore formation, (G) appressorium formation ratio, (H) invasive hypha formation. Bar, 10 μm.

https://doi.org/10.1371/journal.ppat.1008355.g001

The existence and localization of N-glycoproteins was assessed in the mycelia, conidia, appressoria and invasive hyphae (IH) of M. oryzae using the FITC fluorescence-fused lectin concanavalin A (ConA-FITC) staining assay. ConA can recognize terminal α-D-mannose and α-D-glucose residues, which make up the main N-glycan structure [25]. Fluorescence was detected in the outer cell layer of all ConA-FITC stained tissues (Fig 1B), indirectly suggesting that N-glycosylated proteins were present in all developmental stages of M. oryzae. Compared with that in the mycelium, fluorescence was much stronger in the conidium, appressorium and invasive hyphal growth stages, indicating increased N-glycosylation during these development stages. We also specifically stained the GPI-anchored glycoproteins (the main glycoproteins in cell wall) with a fluorescently labelled aerolysin (FLAER) dye (Alexa Fluor 488 proaerolysin). The FLAER can selectively bind to GPI-anchored glycoproteins on the anchor [26], which is independent of N-glycan [27] and can exclude interference of carbohydrates and glycolipids. The result indicated that the FLAER staining showed a similar fluorescence level changes as detected by the ConA-FITC (Fig 1B). Taken together, these results were consistent with that of the western blot analysis (Fig 1A).

Tunicamycin, an N-glycosylation inhibitor that blocks the formation of N-glycosidic protein-carbohydrate linkages [28], was used to determine whether N-glycosylation is important for M. oryzae development and pathogenesis. We found that 5 μg/mL tunicamycin significantly inhibited mycelial growth, mycelial cell length, conidiation and conidiophore formation (Fig 1C–1F). More importantly, for infection, tunicamycin suppressed appressorium formation, reducing the appressorium formation rate from 80% in non-treated conidia to 30% in treated conidia (Fig 1G). To establish whether tunicamycin could block invasive hyphal growth within host cells, we added 5 μg/mL tunicamycin to conidia droplets on barley leaves at 18 dpi, when the fungal appressoria were beginning to form penetration pegs. At 24 hpi, fewer branches had formed in the IH treated with tunicamycin than in the untreated strain (60%, Fig 1H), demonstrating that tunicamycin can prevent invasive hyphal growth in host cells. Because tunicamycin (TM) and dithiothreitol (DTT) are two well characterized ER stress inducers, we also used low concentration of DTT (10 mM) to test its effect on M. oryzae development. Obviously, similar phenotypic defects of the wild-type strain were observed comparing with the tunicamycin treatment (Fig 1C–1H). These data indicated that N-glycosylation is therefore crucial for all tested development and infection stages, probably by partially affecting process of the ER stress response.

Quantitative N-glycoproteomics profiling of different development stages in M. oryzae

Label-free quantitative N-glycoproteomics was used to identify changes in the N-glycosylation state of target proteins during M. oryzae mycelium, conidium, and appressorium formation. Sampling points corresponded to five stages of fungal development (Fig 2A). At 6 h after conidial inoculation onto a hydrophobic surface, germ tube tips swelled and cellular components migrated into the nascent appressorium. At 12 hpi, the appressorium became pigmented with melanin, and at 24 hpi the appressorium had matured and began to penetrate the surface.

thumbnail
Fig 2. Schematic for quantitative analysis strategy of the N-glycoproteome in M. oryzae.

(A) Light micrographs of M. oryzae mycelia, conidia, and conidia germinated on a hydrophobic surface showing the development of appressoria at the time points used in the N-glycoproteomics study. (B) Flow chart of the integrated strategy for quantitative analysis of the N-glycoproteome. Samples were digested with trypsin after removal of high-abundance proteins. Glycopeptides were enriched with lectin chromatography from two different samples, desalted, and then catalyzed by immobilized trypsin and PNGase-F in 18O or 16O water, as indicated. Equal amounts of 16O and 18O -labeled glycopeptides were mixed, and the 6 Da mass shifts were generated between paired, labeled glycopeptides, which could be identified by subsequent LC-MS/MS. (C) Venn diagram showing the number of N-proteins identified in different stages. (D) Distribution of the number of glycosylated sites on proteins. (E) Relative frequency plots of the N-glycosylation sequon (N-X-S/T) in the entire population of N-glycopeptides. MY, mycelia; CO, conidia; AP, appressoria.

https://doi.org/10.1371/journal.ppat.1008355.g002

Total protein was collected from mycelia and conidia germinated on the hydrophobic surface at 0, 6, 12 and 24 h to identify N-glycosylation changes associated with developmental stage. The experimental strategy is depicted in Fig 2B. Total protein collected from two biological replicates of each sample was digested with trypsin and subjected to glycopeptide enrichment by lectin affinity chromatography. Efficiency of N-glycoprotein enrichment was enhanced by combining three widely used lectins: Concanavalin A (ConA), Wheat Germ Agglutinin (WGA) and Ricinus communis Agglutinin (RCA120). The enriched peptides were treated with immobilized trypsin and PNGase-F in 16O or 18O. During this process, the N-glycosylation site was labeled with one atom of 16O or 18O and the C terminus of the peptide was labeled with two 16O or 18O. When 16O and 18O-labeled samples were mixed at a 1:1 ratio, the labeled glycopeptides could be detected by a mass shift of 6 Da using Q-Exactive LC-MS/MS, while a 4 Da mass shift represented the 16O or 18O-labeled non-glycopeptides. Analyzing the concentration ratio yielded the quantity of the N-glycosylated peptides. Only peptides identified with a PeptideProphet probability score ≥ 0.95 (∼1.2% error rate) were retained for analysis. The resulting data were validated against a M. oryzae proteome database (http://fungi.ensembl.org/Magnaporthe_oryzae/Info/Index) for peptide/protein identification. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium [29] via the PRIDE [30] partner repository with the dataset identifier PXD015522.

We identified 559 N-glycosylation sites from a total of 355 proteins in all tissues (S1 Table and S2 Table). Of these, 313, 324 and 274 N-glycosylation sites were identified in the mycelium, conidium and appressorium, respectively, with 156 sites common to all tissues (Fig 2C). We found that 204 sites were identified in both the mycelium and conidium, 222 sites in both the conidium and appressorium and 179 sites in both the mycelium and appressorium. There were 86, 54 and 29 N-glycosylation sites that were identified specifically in the mycelium, conidium and appressorium, respectively.

Sequence features of N-glycoproteome

Among the 355 total N-glycosylated proteins, 59.7% had only one N-glycosite. Proteins with two, three and four N-glycosites accounted for 20.6%, 9.9% and 4.2% of the total N-glycosylated proteins, respectively (Fig 2D). Notably, 12 proteins (3.4%) were N-glycosylated at six or more sites (S1 Table), including levanase MGG_05785 (13 sites), trehalase MGG_01261 (12 sites) and glycoside hydrolase MGG_13429 (10 sites).

We used a motif-X algorithm software to generate sequence logos [31]. Probable amino acid positions surrounding N-glycosylation sites were computed to identify specific sequence motifs surrounding asparagine residues in identified N-glycosites. An oligosaccharide chain is commonly attached to asparagine (N) occurring in the tripeptide sequence N-X-T or N-X-S (X represents any amino acid except Pro). As expected, the majority of the N-oligosaccharide chains were attached to N-X-T (47.2%) and N-X-S (18.5%), although other motifs accounted for an over a third of all N-glycosylation sites (Fig 2E).

Classification of identified N-glycoproteins

The BLAST2GO application GO (Gene Ontology) functional classification analysis was used with all identified N-glycosylated proteins to understand the potential regulatory roles (biological, molecular and cellular) of N-glycosylation in M. oryzae (Fig 3A). The largest number of N-glycosylated proteins were linked to processes of carbohydrate metabolism. Numerous N-glycosylation sites were also observed on proteins with predicted involvement in the oxidation-reduction reactions or the metabolism of nitrogen or lipid compounds, suggesting that N-glycosylation is important for regulating the metabolism of various nutrients and maintaining redox balance. Diverse other biological processes were also predicted to be targeted by N-glycosylation, including localization of cellular components, response to stimuli, and host interactions (Fig 3A).

thumbnail
Fig 3. Classification of identified and protein interactions of the N-glycoproteome.

(A) Functional classification of identified N-glycoproteins based on Gene Ontology analysis. (B) KEGG pathway enrichment of the N-glycosylated proteins. (C) Protein interaction networks of the N-glycoproteins. Protein interaction networks were generated with the complete list of N-glycosylated proteins using the STRING database and visualized using the Cytoscape program.

https://doi.org/10.1371/journal.ppat.1008355.g003

For the molecular functions, the majority of N-glycosylated sites were found in various predicted protein hydrolases and peptidases, suggesting that N-glycosylation may regulate protein degradation and amino acid cycling. Proteins with predicted oxidoreductase and transferase activities were also N-glycosylated (Fig 3A). Cellular component predictions placed most of the N-glycosylated proteins in the cytoplasm, membrane-bounded organelles and ER (Fig 3A). These results are consistent with the understanding that N-glycosylation primarily occurs in the cytoplasm, membrane-bounded organelles and secretory system.

N-glycosylated proteins were analyzed for predicted biochemical pathway involvement using KEGG (Kyoto Encyclopedia of Genes and Genomes) analysis. Results showed that proteins involved in the ER protein secretion system were significantly enriched (Fig 3B). Other pathways enriched in N-glycosylated proteins included N-glycan biosynthesis, autophagy, synthesis of GPI-anchors, and metabolism of starch, sugars, glycerolipids, and glyoxylate (Fig 3B). Together, GO annotation and KEGG pathway analysis indicate that N-glycosylation may regulate diverse cellular processes, including the metabolism of carbohydrates, proteins and lipids, and the processing, localization and secretion of secretory proteins. Many of the N-glycosylated proteins identified also have important roles in the developmental biology of M. oryzae as discussed below.

N-glycoprotein protein interaction networks

We generated an interaction network using the Search Tool for Retrieval of Interacting Genes/Proteins (STRING) database to further understand the significance and extent of N-glycosylation in M. oryzae. A number of sub-networks were identified, including glycan biogenesis, ER secretory pathway, carbohydrate metabolism and spore cell wall biogenesis (Fig 3C). The interaction network having the highest number of N-glycosylated proteins is involved in glycan biogenesis, which includes proteins involved in N-glycosylation (STT3, OST1, WBP1, MNN2, MNN9, MNN10), O-glycosylation (PMT2, PMT4) and protein folding and quality control (GLS2, RTB1, CNE1, KRE5), indicating that N-glycosylation plays key roles in the glycan biogenesis system itself. We also observed that N-glycosylated proteins in the ER secretory pathway were enriched. Two interaction networks involved in carbohydrate metabolism and spore cell wall biogenesis, respectively, contained at least six N-glycosylated proteins (Fig 3C).

Comparing N-glycoproteins at different developmental stages of M. oryzae

We sought to understand the N-glycosylation protein dynamics of the different developmental stages of M. oryzae by identifying N-glycosites that are unique to some stages or that change in abundance during development. First, we performed principle component analysis (PCA) to the N-glycoproteome of different samples, including mycelia (MY), conidia (CO), 6h appressoria (AP), 12h AP, and 24h AP. The result demonstrated that both replicates exhibited similar expression pattern for each sample, indicated well repetitiveness. The PCA result also suggested that the MY, CO and AP samples were different in N-glycoproteins expression patterns from each other, while the 6h AP, 12h AP, and 24h AP samples were similar to each other (Fig 4A). Subsequent Pearson correlations and clustering analyses to the different samples also indicated 6 h AP, 12 h AP, and 24 h AP samples enrich a similar abundance of N-glycoproteins, but different from the MY or CO samples (Fig 4B and 4C).

thumbnail
Fig 4. Differentiation of the N-glycoproteome among different samples.

(A) The principal component analysis (PCA) was carried out to get an overview of the data to detect batch effects, and assess differences between replicates. After filtering the missing values, the retained proteins were identified in two replicates of at least one condition. (B) The heatmap is plotted to visualized the Pearson correlations between the different samples. Colors indicate no correlation (white) and strong correlation (red). (C) The heatmap shows a clustering analysis of replicates, and indicates 6 h, 12 h, and 24 h appressoria enrich a similar abundance of proteins. Significant expression differences can be compared between individual samples during the differentiation process. The rows represent different proteins and are clustered in six groups by k-means clustering. Rows and columns are hierarchically clustered on Euclidean distance. Colors represent enrichment in the linkage versus control (red: enriched; blue: depleted). MY, mycelia; CO, conidia; 6h, 12h, 24h, appressoria (AP) at 6, 12 and 24 hpi.

https://doi.org/10.1371/journal.ppat.1008355.g004

Therefore, we combined the average N-glycosylation levels of N-glycosites at the 6h AP, 12h AP, and 24h AP formation stages. This resulted in three sets of N-glycosites corresponding to the highest N-glycosylation levels in the mycelia, conidia and appressoria (S3 Table): 212 sites were highly N-glycosylated in mycelia, 254 in conidia, and 261 in appressoria (Fig 2C). A total of 486 sites on 278 proteins were differentially N-glycosylated in different developmental stages (S3 Table). One hundred thirty-one proteins of diverse predicted functions were highly N-glycosylated in all tested tissues (S4 Table). Conversely, some proteins showed a clear development-specific pattern of N-glycosylation levels. Transient increases and decreases in N-glycosylation levels of these proteins might shed more light on the function of several basic biological processes during the differentiation process. The biological impact of the observed N-glycosylation mediated function is addressed in the following sections.

N-glycosylation is involved in different cellular processes during development

The strength and rigidity of the fungal cell wall can be adjusted to facilitate growth and development. For the pathogenic fungi, the cell wall is vital for adhesion, penetration and invasive hyphal growth during infection. Our results identified numerous N-glycosylated cell wall-related proteins, including cell wall chitin transglycosylases UTR2 and CRH1 (transfer chitin to beta(1–6) and beta(1–3) glucans), endo-beta-1,3-glucanase BGL2 (incorporate newly synthesized mannoprotein molecules into the cell wall), β-1,3-glucanosyltransferases Gel3 and Gel4 (the GPI-anchored membrane proteins required for cell wall biosynthesis), mannan endo-1,6-alpha-mannosidase DCW1 and chitin synthase CHS6 (Fig 5A). Some cell wall proteins, such as DCW1, ECM14, Mad1 and ECM33, had noticeably increased N-glycosylation levels in the mycelia (Fig 5A). Others, including fasciclin-like protein MoFLP1 [32], flocculin FLO11 [33], chitin synthase genes CHS5 and CHS6 [34], chitin-binding protein CBP1 [35], ECM38 and Gel3 [36], were highly N-glycosylated in the conidia and appressoria (Fig 5A). Chitin is a major component of fungal cell wall and is synthesized by chitin synthases (Chs), including CHS5 and CHS6, which was found to be important for plant infection in maintaining polarized growth in vegetative and invasive hyphae [34]. MoFLP1 was reported to encode a fungal fasciclin-like protein, which is important for conidiation and pathogenicity in M. oryzae [32]. CBP1 encodes a putative extracellular chitin-binding protein, which plays an important role in the hydrophobic surface sensing during appressorium differentiation [35]. These results show that M. oryzae uses N-glycosylation to modify different cell wall proteins and structures when coordinating distinct developmental processes.

thumbnail
Fig 5. Different cellular processes mediated by N-glycoproteins.

The heat map shows the N-glycosylation profile of various N-glycoproteins in different developmental stages, including (A) cell wall proteins, (B) protease and peptidase, and (C) glycoside hydrolyses. Data is the mean of two biological replicates. The colored bar represents the scale for the log10 fold change in expression from green (-2) to red (2), white is set as 0. MY, mycelia; CO, conidia; AP, appressoria.

https://doi.org/10.1371/journal.ppat.1008355.g005

Fungi need to activate various cellular processes to assimilate nutrients and facilitate vegetative growth. Some proteins related to nutrient assimilation, such as proteases and peptidases, cell wall, lipid metabolism, and cellular redox homeostasis related proteins, were highly N-glycosylated in the mycelia (Fig 5B and S4 Table). Collectively, N-glycosylated proteins in the mycelia were mainly involved in cell wall biogenesis and nutrient utilization.

In conidia, the storage reserves, including glycogen, trehalose and lipid bodies, are accumulated during the conidium formation, and is required for the function of appressorium. The conidium storage reserves are presumably used to fuel biosynthetic processes and turgor generation in the developing appressorium. Gph1 is a glycogen phosphorylase that is involved in the breakdown of glycogen and is required to mobilize glycogen and full virulence [37]. One glycosylation site, N716, was detected on Gph1 (S1 Table). TRE1, which encodes an enzyme localized in the cell wall with characteristics of both neutral and acidic trehalases [38], contained 10 N-glycosites, most of which had elevated N-glycosylation levels in the conidia (Fig 5C). Glucan 1,3-beta-glucosidases BGL2 and BGL3 and UDP-glucose-4-epimerase GAL10 are involved in glucose metabolism and important for accumulating conidium storages. BGL2, BGL3 and GAL10 had elevated N-glycosylation levels in the conidia at multiple sites (Fig 5C). These results suggest that N-glycosylation is important for accumulating conidium storage reserves.

N-glycosylation coordinates different glycosylation pathways during conidium and appressorium formation

We identified several proteins involved in various glycosylation pathways, including the N-glycosylation pathway, O-glycosylation pathway and GPI anchor pathway, that were modified by N-glycosylation. The N-glycosylation levels increased during conidium and appressorium formation (Fig 6).

thumbnail
Fig 6. Glycosylation pathways are mediated by N-glycosylation.

(A) N-glycan biosynthesis and (B) GPI anchor biosynthesis. (C) Changes in N-glycosylation protein levels in three glycosylation pathways at different stages. N-glycosylated proteins were marked in red font; N-glycosite was indicated by red ring. The colored bar represents the scale for the log10 fold change in expression from green (-2) to red (2), white is set as 0. MY, mycelia; CO, conidia; AP, appressoria.

https://doi.org/10.1371/journal.ppat.1008355.g006

Among the N-linked glycoprotein biosynthesis pathway, the N-linked glycan synthesis starts in the ER and continues into the Golgi apparatus. At least six proteins involved in this pathway are N-glycosylated themselves (Fig 6A). STT3, OST1 and WBP1 are subunits of the oligosaccharyltransferase complex of the ER lumen, which catalyze the linking of N-glycans onto newly synthesized proteins (Fig 6A and 6C). MNN10 and MNN9 are two alpha-1,6-mannosyltransferases, subunits of a Golgi-localized complex involved in glycan biosynthesis. MNN2 is an alpha-1,2-mannosyltransferase that is responsible for adding the first alpha-1,2-linked mannose to form the branches on the mannan backbone of oligosaccharides and was found to be localized into an early Golgi compartment. Interestingly, these proteins perform the later steps of the N-glycosylation pathway, indicating N-glycosylation has a feedback regulatory role in this part of the pathway.

GPI is a glycolipid which can be attached to the C-terminus of a protein. GPI plays a key role in numerous biological processes by targeting proteins on the cell wall or plasma membrane and modifying the anchors. Several key components of the GPI anchor pathway, PIG-N, PIG-K and PIG-T, were identified as N-glycosylated proteins (Fig 6B and 6C). PIG-N contains four N-glycosites, three of which (N145, N208 and N256) are highly N-glycosylated at the conidium and appressorium (Fig 6C). PIG-T contains one glycosite at N41 that was also highly N-glycosylated at these developmental stages (Fig 6C).

O-linked glycosylation is another type of glycosylation, which adds N-acetyl-galactosamine to serine or threonine residues and uses secretion to form the extracellular matrix components. Two of the three key components in the O-glycosylation pathway, PMT2 and PMT4, were identified as N-glycosylation proteins (Fig 6C). PMT2 and PMT4 are protein O-mannosyltransferases, which transfers mannose residues from dolichyl phosphate-D-mannose to protein serine/threonine residues in the ER membrane. Both PMT2 and PMT4 were proved to be important for infection of M. oryzae [3940].

Collectively, N-glycosylation can modify key proteins in all the N-glycosylation, O-glycosylation and GPI anchor pathways, indicating a close crosstalk between these cellular processes.

Most proteins involved in ERQC were highly N-glycosylated in the conidium and appressorium

In eukaryotic cells, the endoplasmic reticulum (ER) serves as a protein-folding factory where elaborate quality and quantity control systems monitor the efficient and accurate production of secretory proteins. The ER uses a quality control (ERQC) system to facilitate folding of secretory proteins, and eliminates misfolded proteins through the ER-associated degradation (ERAD) system or autophagic degradation system. Surprisingly, we found that near all components of the ERQC system are N-glycosylated (Fig 7), including Sec63, Kar2, Scj1, STT3, OST1, WBP1, Gls1, Gls2, GTB1, CNX1, Mns1, PDI1 and KRE5.

thumbnail
Fig 7. The ERQC system is regulated by N-glycosylation.

(A) Schematic diagram of the ERQC pathway. N-glycosylated proteins were marked in red font; N-glycosites was indicated by red ring. (B) Changes in ERQC pathway N-glycosylation protein levels in three glycosylation pathways. The colored bar represents the scale for the log10 fold change in expression from green (-2) to red (2), white is set as 0. MY, mycelia; CO, conidia; AP, appressoria.

https://doi.org/10.1371/journal.ppat.1008355.g007

We found that N-glycosylation affects every step of the ERQC pathway (Fig 7A). Newly synthesized membrane and secretion-bound proteins enter the ER in an unfolded state through a translocon channel [41], of which the N-glycosylated protein Sec63 is an essential subunit. As the newly synthesized proteins enter the ER lumen, chaperones immediately engage with the nascent polypeptides to facilitate translocation and protein folding; these chaperones include the Hsp70 family member Kar2 and the DnaJ class co-chaperone Scj1, both are targets of N-glycosylation. Oligosaccharides are transferred to the proteins by oligosaccharyltransferases containing subunits STT3, OST1 and WBP1, all highly N-glycosylated at the conidium and appressorium stages (Fig 7B). Pre-Golgi processing of the nascent N-glycoproteins involves the elimination of specific Glc and Man subunits by four conserved enzymes (Gls1, Gls2, GTB1, and MNS1), which are also highly N-glycosylated at the conidia or appressoria (Fig 7B). The processing is linked to the calnexin cycle, the main checkpoint for evaluating correct N-glycoprotein folding [42], where another highly N-glycosylated protein, the calnexin CNX1, verifies the presence of N-glycoproteins harboring the oligosaccharide core NAcGlc2Man9Glc [43]. Permanently misfolded proteins are transported to the proteasome for degradation [44], a process mediated by the N-glycosylated proteins PDI1, glucosyltransferase KRE5, and mannosidase Mns1 (Fig 7A).

N-glycosylation regulates the ERQC system for fungal development and infection

CNX1, Gls1, Gls2, and GTB1 were chosen to understand the N-glycosylation of the ERQC pathway proteins. We generated single gene deletion mutants for Gls1, Gls2, GTB1 and CNX1, and performed complementation by transforming single gene’s coding region into its deletion mutant. All of corresponding complementation strains recovered their phenotypic defects (S2 Fig). Loss of these genes produced slight defects in colony growth (Fig 8A and 8B). Mutants of these genes showed no more than 50% conidiation capacity compared with the wild-type strain (Fig 8C). Microscopic observation showed that these mutants formed sparse conidiophores with fewer conidia (Fig 8D). To determine the roles of ERQC pathway proteins in virulence, Δgls1, Δgls2, Δgtb1 and Δcnx1 were inoculated onto host leaves. Interestingly, virulence of all the mutants was significantly reduced (Fig 7E). The mutants exhibited normal appressorium formation during infection (Fig 7F), but Δgls1, Δgls2 and Δcnx1 exhibited growth arrest inside the plant tissues just after appressorium penetration, and Δgtb1 had slowed invasive hyphal growth (Fig 8G and 8H). This indicated that Δgtb1 can complete the early stages of plant interaction after appressorium penetration, but is unable to efficiently spread fungi inside plant tissues. Thus, ERQC is essential for establishing the biotrophic growth during infection of M. oryzae.

thumbnail
Fig 8. Functional analysis of the N-glycosylated components of the ERQC pathway.

(A) Colony morphology of the wild-type strain P131, ERQC pathway gene deletion mutants and complementary strains of Gls1 on oatmeal agar (OTA) plates. (B) Colony diameters of different strains. (C) Conidiation of different strains. (D) Conidiophore morphologies formed by different strains. (E) Lesions formed on barley leaves by different strains at 5 d post-inoculation (dpi). (F) Appressorium formation by different strains at 12 dpi. (G) Invasive hyphae (IH) formed by the same set of strains in barley epidermal cells at 24 h post-inoculation (hpi). Bar, 20 μm. (H) Formed ratio of IH in barley epidermal cells at 24 hpi.

https://doi.org/10.1371/journal.ppat.1008355.g008

To verify regulation of the ERQC system by N-glycosylation, Gls1 was chosen as a target for validation of N-glycosylation. We constructed a GFP fusion construct of Gls1 and transformed it into the Δgls1 or Δalg3 deletion mutant. ALG3 is one of the key components involved in N-glycan biosynthesis [13]. In mycelia, we failed to detect any difference in protein size of GFP:Gls1 between the Δgls1 mutant and the Δalg3 mutant, indicating that N-glycosylation was not frequent enough to be detected by western blot in this tissue (Fig 9A). However, in the appressorium, a protein band larger than GFP:Gls1 fusion protein in the wild-type strain was detected, while a protein band corresponding to GFP:Gls1 itself was detected in the Δalg3 mutant. When we used Endo H, a N-glycanase, to treat the wild-type proteins, the larger protein was disappeared and the GFP:Gls1 protein band was detected (Fig 9A). This experiment showed that Gls1 is N-glycosylated in the appressorium.

thumbnail
Fig 9. Western blot analysis and subcellular localization of Gls1.

(A) Western blot of N-glycosylated ERQC proteins in mycelia and appressoria. Total proteins from extracts of the Δgls1 and the Δalg3 mutant expressing GFP-fused Gls1 were separated by SDS-PAGE and then subjected to western blot analysis with an anti-GFP antibody. Total proteins isolated from the transformants were treated with or without Endo H and detected with an anti-GFP antibody by immunoblot analysis. Anti-actin antibody was used to evaluate protein loading level. (B) Subcellular co-localization of GFP:Gls1/RFP:HDEL and GFP:Gls1N497Q/CMAC in conidium. Bar, 10 μm. (C) Subcellular co-localization of GFP:Gls1/RFP:HDEL and GFP:Gls1N497Q/CMAC in appressorium. Bar, 10 μm. (D) Subcellular localization of GFP:Gls1/RFP:HDEL and GFP:Gls1N497Q in invasive hyphae. Bar, 10 μm. For (B), (C) and (D), linescan graphs show fluorescence intensity in a transverse section of individual appressorium as the arrow indicated direction, line colors refer to corresponding fluorescence colors. (E) Proposed model of the N-glycosylation regulated ERQC system for development.

https://doi.org/10.1371/journal.ppat.1008355.g009

We simultaneously validated the N-glycosite of Gls1 identified in this study (Fig 7B). The GFP:Gls1 fusion construct with or without the glycosite mutation N497Q (Asn at 497aa was mutated as Gln) driven by the native promoter was transformed into the Δgls1 mutant, respectively. In appressoria, western blot analysis showed that the N497Q mutation resulted in a GFP:Gls1 fusion protein band represent the size of the unglycosylated protein, rather than the larger glycosylated band (Fig 9A), confirming that N497 is a functional N-glycosite. Functional analyses revealed that the Gls1N497Q mutation resulted in significant reduction of virulence and a block in the infection process (Fig 8E–8H), indicating that N-glycosylation is important for functions of Gls1. Interestingly, the Gls1N497Q mutation did not affect colony growth, and only partially affected conidiation (Fig 8A–8D). Subcellular localization analysis demonstrated that the GFP:Gls1 co-localizes with RFP-HDEL protein (Fig 9B–9D), which contains a ER-retention signal [45]. Therefore, the Gls1 was located in the ER at different developmental stages. The GFP:Gls1N497Q protein co-localized with the fluorescent vacuole marker dye CMAC (7-amino-4-chloromethylcoumarin, [46]) in the conidia and appressoria, and was hardly visible in invasive hyphae (Fig 9B–9D), demonstrating that N-glycosylation is required for the proper localization of ERQC proteins in ER. In sum, N-glycosylation is required for functions of ERQC proteins, probably through affecting their proper subcellular localization and protein stability.

Discussion

Previous studies have found that protein N-glycosylation is essential during infection of plant pathogenic fungi [1316], yet the systematic regulatory patterns of N-glycosylation are not well understood. In order to fully understand the roles of N-glycosylation in pathogenic fungi, we assessed the importance and modification levels of N-glycosylation at different developmental stages in M. oryzae. Quantitative measurements provided a molecular signature specific to glycoproteins for different developmental and infection stages. The established system for N-glycoproteome quantification in M. oryzae will be helpful in elucidating N-glycoproteome dynamics in fungi and other species.

Recently, N-glycoproteomics studies have been performed in the tissues of different organisms, including yeast [4748], filamentous fungi [4950], nematode [51], Drosophila [52], the plant Arabidopsis thaliana [53] and mammals [5456]. Quantifying the N-glycoproteomes in the plant pathogenic fungus F. graminearum have revealed intensive glycosylation changes during exposure to fungicide: 774 sites in 406 proteins were identified, and the glycosylation level was found to be largely down-regulated upon fungicide treatment [50]. However, no study has focused on the quantitative N-glycosylation changes during normal development and infection processes of plant pathogenic fungi.

N-glycosylation coordinates different cellular processes for development and host infection

Our high-throughput quantitative N-glycosylation proteomics study provides important novel insight into the development and infection process of M. oryzae (S3 Fig). We found that N-glycosylation may be important in coordinating different developments. Distinct proteins that corresponding specifically to the mycelium, conidium or appressorium stages were highly N-glycosylated. The N-glycosylated proteins at the vegetative growth stage were involved in nutrient utilization, cell wall biogenesis and the redox process; at the conidium formation stage, the N-glycosylated proteins were involved in glycogen storage, cell wall biogenesis, glycosylation (N-glycosylation, O-glycosylation and GPI anchor) and ER quality control; and at the appressorium formation stage, the N-glycosylated proteins were relative to glycogen utilization, lipid utilization, cell wall biogenesis, glycosylation pathways, and ER quality control (S3 Fig). We inferred that during invasive hyphal growth, proteins related to nutrient utilization, cell wall biogenesis, redox process and effector secretion could also be heavily N-glycosylated. This study demonstrated major changes in the N-glycosylation level of different stages. This novel approach enabled us to comprehensively analyze the dynamic remodeling processes of N-glycosylated proteins during M. oryzae differentiation and to provide unique insights into the underlying mechanisms.

Systems biology studies are useful to reveal infection mechanisms of the plant pathogenic fungi. Previous studies have focused on the transcript and proteomics profiling during appressorium formation of M. oryzae [5759]. Global patterns of gene expression in appressorium were detected, highlighting the role of autophagy, sugar and lipid metabolism and melanin biosynthesis [59]. Phosphoproteomic profiling of M. oryzae during the appressorium formation stage provided insights into the metabolic regulation of conidial storage reserves and phospholipids, autophagy, actin dynamics and cell wall metabolism [58]. Here, we revealed the N-glycoproteomics profiling of M. oryzae during the appressorium formation stage. The proteins involved in glycogen utilization, lipid utilization and cell wall biogenesis were consistent with the transcript and proteomics profiling results.

However, it should be noted that some homogenous proteins of the known N-glycosylated proteins were not identified in this study. For example, M. oryzae EMP1 (MGG_00527) is a homogenous protein of Fusarium adhesive protein FEM1 [60, 61]. This protein is a putative GPI-anchored, N-glycosylated, extracellular matrix protein, which is preferentially expressed during appressorium stage and is required for appressorium formation and full virulence. Surprisingly, EMP1 was not detected in this study. We infer that this protein, as well as others, may be not abundant enough for detection, due to low expression, or missing on plastic surface during sample collection, or strong linkage with cell wall structure. On the other hand, even we used three different lectins (ConA, WGA and RCA120) for enrichment, there could be still some N-glycosylated proteins can’t be enriched. In fact, as predicted by a bioinformatic tool NetNGlyc 1.0 (http://www.cbs.dtu.dk/services/NetNGlyc/), more than 50% of the total M. oryzae proteins contain predicted N-glycosylation site(s), and only a small proportion of them were identified in this study. Therefore, more efficient enrichment methods need to be developed for identifying more N-glycosylated proteins.

N-glycosylation modifies proteins of different sugar-related biosynthesis pathways

We found that many proteins specific for ER or Golgi localization are found in the N-glycoproteome. Proteins involved in the N-glycosylation pathway itself, such as the OST subunits Ost1, Wbp1, and Stt3, as well as some Golgi mannosyltransferases involved in N-glycan processing (MAN1, MNN9, MNN2, and MNN10), were found to be N-glycosylated. O-glycosylation is initially catalyzed by evolutionarily conserved protein O-mannosyltransferases (PMTs). In this study, two of the three key members of PMTs were identified as N-glycosylated proteins. Three key components of the GPI synthesis pathway (PIG-N, PIG-T, PIG-K) were also identified as N-glycosylated proteins. Therefore, all three types of glycosylation pathways could be regulated by N-glycosylation. Coincidently, previous studies on the O-glycoproteome in yeast also observed an interaction between O- and N-glycosylation [62]. Therefore, understanding the interactions between the different glycosylation synthesis pathways may help decipher the undoubtedly crucial role of protein glycosylation.

The ER quality control system is extensively regulated by N-glycosylation

N-glycosylation of ERQC proteins provides new insight into host infection. It has been reported that ERQC plays key roles in infection of Ustilago maydis [14]. In U. maydis, glucosidase I Gls1 is required for the initial stages of infection following appressorium penetration, and glucosidase II β-subunit Gas2 (GTB1) is required for efficient fungal spreading inside infected tissues. Thus, the ERQC is important for establishing the initial biotrophic state in the plant and allows subsequent colonization of U. maydis.

In this study, we identified most of the ERQC key components were N-glycosylated target proteins in M. oryzae. We subsequently found that these ERQC components are required for host penetration and invasive hyphal growth in M. oryzae. More importantly, we proved that the N-site of Gls1 can regulate its function by affecting its subcellular localization in ER and protein stability, revealing roles of N-glycosylation in protein quality control and the infection process. As the N-site mutation of Gls1 did not affect normal colony growth but severely affected conidium formation and the infection process, we inferred that the ERQC system may be more important for conidiation and the infection processes when N-glycosylation is increased. N-glycosylation may keep protein stability of the ERQC components and help to stabilize the ERQC system (Fig 9E). This regulatory infection mechanism is a novel discovery that has not been previously demonstrated in the plant pathogenic fungi.

In summary, our study shows that N-glycosylation plays critical roles in the development and infection process of M. oryzae through a quantitative N-glycoproteome analysis. The results from this study would increase our understanding of the M. oryzae infection mechanism. The large pool of N-glycosylation targets identified here will serve as a valuable resource for further understanding how the N-glycosylation mediates fungal infection process, which may lead to developing novel fungal disease control strategies.

Materials and methods

Strains and culture conditions

The M. oryzae strain P131 was used as the wild type [13]. All of the wild-type strain and transformants used in this study (S5 Table) were grown on Oatmeal Tomato Agar (OTA) plates at 28°C. Mycelia were incubated at 28°C in liquid culture medium on a rotary shaker (180 rpm) for 36 h. Colony growth and conidiation were performed as described previously [13]. Conidia harvested from 7-day-old OTA cultures were used to test virulence and to observe the infection process. Appressorium formation was performed by dropping a conidial suspension (1 × 105 conidia/mL) onto a hydrophobic coverslip and incubated in a dark, moist chamber at 28°C.

Staining assays

N-glycoproteins during different developmental stages of M. oryzae were stained with 0.1 mg/mL FITC conjugated concanavalin A (Sigma-Aldrich, USA) for 30 min before analysis. To detect the GPI-anchored proteins on the cell wall, 50 nM fluorochrome (Alexa 488)–labeled inactivated aerolysin (FLAER) (Pinewood Scientific Services, Victoria, Canada) was used to stain different samples of M. oryzae for 30 min before observation. For invasive hyphae staining, the conidia suspension was dropped onto the barley leaves and incubated with full humidity, then ConA-FITC or FLAER was added into the conidia droplets at 30 h for staining and observed after 30 min. Both ConA-FITC and FLAER staining samples were observed under a confocal laser scanning microscope (Leica SP8, Leica Microsystems, Germany). The fluorophore was excited at 488 nm, and the emission was detected at 530 nm (530±10 nm), with a same gain value of 800. To observe the cell lengths of hyphal tips, 10 μg/mL clacofluor white (CFW) (Sigma-Aldrich, USA) was used to stain hyphal cell walls and septa for 10 min in the dark, and the hyphal tips were observed under a fluorescence microscope (Nikon Ni90 microscope, Japan) after being rinsed with PBS buffer. The vacuoles of different tissues were stained with CMAC (7-amino-4-chloromethylcoumarin) (Sigma-Aldrich, USA) for 10 min in the dark, and observed under a confocal laser scanning microscope (Leica SP8, Leica Microsystems, Germany).

Infection assays

To test the virulence of different fungal strains, 1-week-old barley leaves (Hordeum vulgare cv. E9) were sprayed with conidial suspensions (1 × 105 conidia/mL) in 0.025% Tween 20. The inoculated plants were incubated at 28°C with full humidity, and the disease lesions were observed at 5 dpi. The infection process was identified in the host cells by inoculating the lower barley leaves with a conidia suspension (1 × 105 conidia/mL) and incubated in a dark, moist chamber at 28°C. The lower barley epidermis was torn down to observe the infection processes at 24 hpi. The effects of 5 μg/mL Tunicamycin (Sigma-Aldrich, USA) or 10 mM Dithiothreitol (DTT, Sigma-Aldrich, USA) at different developmental stages were performed by different treatment methods. The mycelia were treated with either TM or DTT after 24 h incubation in liquid complete medium (CM), and then added with Tunicamycin or DTT, continued to incubate for 12 h. At last, the mycelia dry weight was evaluated and the mycelial morphology was observed by CFW staining. The conidia were produced by spreading mycelial fragments onto the OTA plates containing TM or DTT for conidiation, and the conidiophores were observed after 12h. The effect on appressorium was performed by adding Tunicamycin into the conidia droplets on hydrophobic surface, and the appressorium formation was observed at 12 hpi. Effect on invasive hyphae was performed by adding Tunicamycin into the conidia droplets on barley leaves at 18 hpi, and the invasive hyphal growth was observed at 24 hpi.

Preparation of protein samples

To perform quantitative N-glycoproteomics analysis, the mycelia and conidia were harvested as mentioned above and were stored at −80°C before processing. For appressoria collection, high concentration of the conidia suspension (2×106 conidia mL-1) were sprayed onto a 21 cm×29.7 cm sized hydrophobic plastic surface and incubated in dark, moist plates at 28°C. After 6 h, 12 h or 24 h, the appressoria were collected by scraping plastics surface and filtered with six layers lens wiping paper. Lysis buffer containing 45 mL TCA:acetone (1:9) and 65 mM DTT was added to the frozen samples, followed by refrigerated centrifugation, washing, boiling and sonication. The total protein was quantified using the BCA method.

Protein trypsin digestion

Total proteins were processed according to the “filter-aided sample preparation” (FASP) method as reported. A total of 1.2 mg of protein for each sample was dissolved in a solution containing 8 M urea, 0.1 M NH4HCO3 and Tris-HCl pH 8.0 in a 10-kDa ultrafiltration tube. The samples were subsequently alkylated for 30 min with 50 mM iodoacetamide in the dark and then centrifuged for 10 min at 14000 g. Samples were diluted with UA buffer (8 M urea, 150 mM Tris-HCl, pH 8.0) and washed with 25 mM NH4HCO3 three times. The proteins were incubated with trypsin (proteins/trypsin, 50:1, w/w) at 37°C for 18 h. The digestive reaction was stopped by adding formic acid, and the peptides were desalted on a solid phase extraction C18 cartridge (Waters, MA).

Lectin enrichment and deglycosylation

After digestion, peptides were eluted in the lectin binding buffer (1 mM CaCl2, 1 mM MnCl2, 0.5 M NaCl, 20 mM Tris-HCl [pH 7.3]) and subsequently transferred to a 30 kDa filtration unit. Lectin mixture containing concanavalin A (ConA), wheat germ agglutinin (WGA), and Ricinus communis agglutinin (RCA120) was added to the top of the filters. The non-bound peptides were eluted, and the captured peptides were washed with the binding buffer after adding PNGase F. Deglycosylation was performed in H218O for 3 h at 37°C and the deglycosylated peptides were eluted and analyzed.

Tandem mass spectrometry analysis

The deglycosylated peptides were resuspended in 0.1% FA, then analyzed with LC−MS/MS. LC-MS/MS was performed using a Q-Exactive mass spectrometer coupled with an Easy nLC (Thermo Fisher Scientific, MA). Testing samples were loaded onto a 100 μm × 20 mm C18 precolumn (Thermo Fisher Scientific, CA) and chromatographic separation was performed on a 75 μm × 10 cm C18 nanocolumn. A high-performance liquid chromatography gradient was achieved using 0−55% buffer B (0.1% FA and 95% acetonitrile) and buffer A (0.1% FA) at a flow rate of 300 nL/min for more than 90 min. A Q-Exactive mass spectrometer was used for the analysis. The MS data was acquired with a precursor ion range of 300−1800 in positive ion mode. Resolution was set to 70 000 at m/z 200, the dynamic exclusion 25 s, maximum ion injection time 20 ms and automatic gain control target 3 × 106. Higher-energy collisional dissociation at a normalized collision energy of 27 eV was used to acquire the 10 most intense ions for MS2 scans with a resolution of 17500 at m/z 200 and 60 ms maximum IT.

Data analysis

The label-free quantitative analysis of N-glycosylated peptides was performed using the MaxQuant software version 1.3.0.5 according to a previous study [63]. Tandem mass spectra were searched against the M. oryzae proteome database (http://fungi.ensembl.org/Magnaporthe_oryzae/Info/Index). Ten LC−MS/MS raw files were obtained from two replicates of five samples. The search parameters were set as follows: enzyme cleavage was trypsin; the maximum missed cleavage sites were two; variable modifications were oxidation on methionine and deamidation (18O) in asparagine; peptide mass tolerance was 10 ppm; false discovery rate (FDR) thresholds were 0.01.

Bioinformatics analysis

The Gene Ontology (GO) annotation of the N-glycoproteome was performed using the UniProt-GOA database (http://www.ebi.ac.uk/GOA/). The ‘biological processes’, ‘molecular functions’ and ‘cellular components’ categories were used to classify N-glycosylated proteins based on GO annotation. The Kyoto Encyclopedia of Genes and Genomes (KEGG) database was used to annotate the N-glycosylated protein pathways. KAAS, an online service tool, was used to annotate the KEGG database description of the N-glycosylated proteins (http://www.genome.jp/tools/kaas/). Resulting annotations were used to map the KEGG pathway database. The STRING database (https://string-db.org/) and Cytoscape software (version 3.4.0) were used to identify N-glycoprotein interaction networks [6465]. The principal component analysis (PCA) was performed using the psych R-package with standard settings. The heatmap was created with the NMF R-package. Different samples were compared using the cor function of the stats R-package reporting a Pearson correlation used on complete observations only. To create groups of same expression profiles, the fold change was calculated to a specific sample. Any fold change larger than 2 with p<0.05 (Welch t-test) was reported as significantly up- or down-regulated.

Western blotting

Mycelia samples were incubated and collected from liquid CM medium cultures. Appressoria samples were collected from the hydrophobic surface sprayed by conidia with a high concentration (2×106 spores/mL). To detect N-glycosylation level, total proteins of different samples were extracted and mixed with SDS-PAGE loading buffer, denatured at 95°C for 5 min and then subjected to 12% SDS-PAGE for western blot analysis using ConA-HRP (1:10,000) as an antibody. To generate GFP-fused constructs, the promoter and coding regions of Gls1 was amplified and fused with GFP, then cloned into pKN(S6 Table). In the resulting vectors, the GFP fusion constructs were expressed under the control of the native promoter [48]. These vectors were transformed to protoplasts of P131, the Δalg3 mutant or the Δgls1 mutant to obtain the GFP-tagged strains. Total proteins were extracted to perform western blot using an anti-GFP antibody (1:5,000; Abmart) as the primary antibody. For N-glycosite mutation, the GFP:Gls1N497Q fusion construct was constructed and transformed into the Δgls1 mutant.

Supporting information

S1 Fig. Expression profile of the N-glycosylation pathway genes during development.

The phase specific expression of these genes was quantified by quantitative real-time PCR with synthesis of cDNA from each sample including mycelia, conidia, germ tubes, appressoria and invasive hyphae at indicated time points. Relative abundance was normalized by MoTub1. Three independent biological experiments with three replicates in each were performed. MY: mycelia; CO: conidia; AP: appressoria; IH: invasive hyphae.

https://doi.org/10.1371/journal.ppat.1008355.s001

(TIF)

S2 Fig. Phenotypes of the complement strains.

(A) Colony diameters of the wild-type (WT) and different complement strains. (B) Conidiation of the WT and different complement strains. (C) Invasive hyphae (IH) formed by the same set of strains in barley epidermal cells at 24 h post-inoculation (hpi). Bar, 20 μm. (D) Formed ratio of IH in barley epidermal cells at 24 hpi.

https://doi.org/10.1371/journal.ppat.1008355.s002

(TIF)

S3 Fig. Diagram representation of N-glycosylation functions at M. oryzae different developmental stages.

https://doi.org/10.1371/journal.ppat.1008355.s003

(TIF)

S1 Table. List of all identified N-glycosites in M. oryzae.

https://doi.org/10.1371/journal.ppat.1008355.s004

(XLSX)

S2 Table. List of all identified N-proteins in M. oryzae.

https://doi.org/10.1371/journal.ppat.1008355.s005

(XLSX)

S3 Table. Differentially expressed N-glycosylation sites during different developmental stages.

https://doi.org/10.1371/journal.ppat.1008355.s006

(XLSX)

S4 Table. Differentially expressed N-glycosylation sites for different processes.

https://doi.org/10.1371/journal.ppat.1008355.s007

(XLSX)

S5 Table. Fungal strains used in this study.

https://doi.org/10.1371/journal.ppat.1008355.s008

(XLSX)

Acknowledgments

We thank Dr. Lindsay Triplett in Connecticut Agricultural Experiment Station (CAES) of USA for her critical reading of the manuscript. We also thank Shanghai Applied Protein Technology, Co. for providing technical support.

References

  1. 1. Apweiler R, Hermjakob H, Sharon N (1999) On the frequency of protein glycosylation, as deduced from analysis of the SWISS-PROT database. Biochim Biophys Acta 1473: 4–8. pmid:10580125
  2. 2. Lis H, Sharon N (1993) Protein glycosylation. Eur J Biochem 218 (1): 1–27. pmid:8243456
  3. 3. Baas S, Sharrow M, Kotu V, Middleton M, Nguyen K, Flanagan-Steet H, Aoki K, Tiemeyer M (2011) Sugar-free frosting, a homolog of SAD kinase, drives neural-specific glycan expression in the Drosophila embryo. Development 138 (3): 553–563. pmid:21205799
  4. 4. Rudd PM, Elliott T, Cresswell P, Wilson IA, Dwek RA (2001) Glycosylation and the immune system. Science 291: 5512–5370.
  5. 5. Spiro RG (2002) Protein glycosylation: nature, distribution, enzymatic formation, and disease implications of glycopeptide bonds. Glycobiology 12 (4): 43R–56R. pmid:12042244
  6. 6. Kelleher DJ, Gilmore R (2006) An evolving view of the eukaryotic oligosaccharyltransferase. Glycobiology 16: 47–62.
  7. 7. Rayon C, Lerouge P, Faye L (1998) The protein N-glycosylation in plants. J Exp Bot 49: 1463–1472.
  8. 8. Pattison RJ, Amtmann A (2009) N-glycan production in the endoplasmic reticulum of plants. Trends Plant Sci 14: 92–99. pmid:19162525
  9. 9. Zattas D, Hochstrasser M (2015) Ubiquitin-dependent protein degradation at the yeast endoplasmic reticulum and nuclear envelope. Crit Rev Biochem Mol Biol 50: 1–17. pmid:25231236
  10. 10. Poulain D, Jouault T (2004) Candida albicans cell wall glycans, host receptors and responses: Elements for a decisive crosstalk. Curr Opin Microbiol 7: 342–349. pmid:15358252
  11. 11. Mora-Montes HM, Bates S, Netea MG, Castillo L, Brand A, Buurman ET, Díaz-Jiménez DF, Motteram J, Lovegrove A, Pirie E, Marsh J, Devonshire J, van de Meene A, Hammond-Kosack K, Rudd JJ (2011) Aberrant protein N-glycosylation impacts upon infection-related growth transitions of the haploid plant-pathogenic fungus Mycosphaerella graminicola. Mol Microbiol 81: 415–433. pmid:21623954
  12. 12. Hiller E, Zavrel M, Hauser N, Sohn K, Burger-Kentischer A, Lemuth K, Rupp S (2011) Adaptation, adhesion and invasion during interaction of Candida albicans with the host-Focus on the function of cell wall proteins. Int J Med Microbiol 301: 384–389. pmid:21571590
  13. 13. Chen XL, Shi T, Yang J, Chen D, Xu XW, Xu JR, Talbot NJ, Peng YL (2014) N-glycosylation of effector proteins by an alpha-1, 3-mannosyltransferase is required to evade host innate immunity by the rice blast fungus. Plant Cell 26: 1360–1376. pmid:24642938
  14. 14. Fernández-Álvarez A, Elías-Villalobos A, Jiménez-Martín A, Marín-Menguiano M, Ibeas JI (2013) Endoplasmic reticulum glucosidases and protein quality control factors cooperate to establish biotrophy in Ustilago maydis. Plant Cell 25: 4676–4690. pmid:24280385
  15. 15. Schirawski J, Böhnert HU, Steinberg G, Snetselaar K, Adamikowa L, Kahmann R (2005) Endoplasmic reticulum glucosidase II is required for pathogenicity of Ustilago maydis. Plant Cell 17: 3532–3543. pmid:16272431
  16. 16. Motteram J, Lovegrove A, Pirie E, Marsh J, Devonshire J, van de Meene A, Hammond-Kosack K, Rudd JJ (2011) Aberrant protein N-glycosylation impacts upon infection-related growth transitions of the haploid plant-pathogenic fungus Mycosphaerella graminicola. Mol Microbiol 81(2): 415–433. pmid:21623954
  17. 17. Zielinska DF, Gnad F, Wisniewski JR, Mann M (2010) Precision mapping of an in vivo N-glycoproteome reveals rigid topological and sequence constraints Cell 141: 897–907. pmid:20510933
  18. 18. Zielinska DF, Gnad F, Schropp K, Wiśniewski JR, Mann M (2012) Mapping N-glycosylation sites across seven evolutionarily distant species reveals a divergent substrate proteome despite a common core machinery. Mol Cell 46: 542–548. pmid:22633491
  19. 19. Liu T, Qian WJ, Gritsenko MA, Camp DG, Monroe ME, Moore RJ, Smith RD (2005) Human plasma N-glycoproteome analysis by immunoaffinity subtraction, hydrazide chemistry, and mass spectrometry J Proteome Res 4: 2070–2080. pmid:16335952
  20. 20. Zhang H, Li XJ, Martin DB, Aebersold R (2003) Aebersold Identification and quantification of N-linked glycoproteins using hydrazide chemistry, stable isotope labeling and mass spectrometry. Nature Biotechnol 21: 660–666.
  21. 21. Zhang H, Yi EC, Li XJ, Mallick P, Kelly-Spratt KS, Masselon CD, Camp DG, Smith RD, Kaji H, Shikanai T, Sasaki-Sawa A, Wen H, Fujita M, Suzuki Y, Sugahara D, Sawaki H, Yamauchi Y, Shinkawa T, Taoka M, Takahashi N, Isobe T, Narimatsu H (2012) Large-scale identification of N-glycosylated proteins of mouse tissues and construction of a glycoprotein database, GlycoProtDB. J Proteome Res 11: 4553–4566. pmid:22823882
  22. 22. Dickman M, Kahmann R, Ellis J, Foster GD (2012) The Top 10 fungal pathogens in molecular plant pathology. Mol Plant Pathol 13(4): 414–430. pmid:22471698
  23. 23. Howard RJ, Ferrari MA, Roach DH, Money NP (1991) Penetration of hard substrates by a fungus employing enormous turgor pressures. Proc Nati Acad Sci USA 88: 11281–11284.
  24. 24. de Jong JC, McCormack BJ, Smirnoff N, Talbot NJ (1997) Glycerol generates turgor in rice blast. Nature 389: 244–245.
  25. 25. Brodsky RA, Mukhina GL, Li S, Nelson KL, Chiurazzi PL, Buckley JT and Borowitz MJ (2000) Improved detection and characterization of paroxysmal nocturnal hemoglobinuria using fluorescent aerolysin. Am J Clin Pathol 114: 459–466. pmid:10989647
  26. 26. Baenziger JU, Fiete D (1979) Structural determinants of concanavalin A specificity for oligosaccharides. J Biol Chem 254: 2400–2407. pmid:85625
  27. 27. Hong Y, Ohishi K, Inoue N, Kang JY, Shime H, Horiguchi Y, van der Goot FG, Sugimoto N Kinoshita T (2002) Requirement of N-glycan on GPI anchored proteins for efficient binding of aerolysin but not Clostridium septicum α-toxin. EMBO J 21: 5047–5056. pmid:12356721
  28. 28. Elbein AD (1987) Inhibitors of the biosynthesis and processing of N-linked oligosaccharide chains. Ann Rev Biochem 56: 497–534. pmid:3304143
  29. 29. Deutsch EW, Csordas A, Sun Z, Jarnuczak A, Perez-Riverol Y, Ternent T, Campbell DS, Bernal-Llinares M, Okuda S, Kawano S, Moritz RL, Carver JJ, Wang M, Ishihama Y, Bandeira N, Hermjakob H, Vizcaíno JA (2017) The ProteomeXchange Consortium in 2017: supporting the cultural change in proteomics public data deposition. Nucleic Acids Res 54(D1): D1100–D1106.
  30. 30. Perez-Riverol Y, Csordas A, Bai J, Bernal-Llinares M, Hewapathirana S, Kundu DJ, Inuganti A, Griss J, Mayer G, Eisenacher M, Pérez E, Uszkoreit J, Pfeuffer J, Sachsenberg T, Yilmaz S, Tiwary S, Cox J, Audain E, Walzer M, Jarnuczak AF, Ternent T, Brazma A, Vizcaíno JA (2019) The PRIDE database and related tools and resources in 2019: improving support for quantification data. Nucleic Acids Res 47(D1): D442–D450. pmid:30395289
  31. 31. Chou MF, Schwartz D (2011) Biological sequence motif discovery using motif-x. Curr Prot Bioinform Chapter 13: Unit 13.15–24.
  32. 32. Liu T, Chen G, Min H, Lin F (2009) MoFLP1, encoding a novel fungal fasciclin-like protein, is involved in conidiation and pathogenicity in Magnaporthe oryzae. J Zhejiang Univ SCI B 10: 434–444. pmid:19489109
  33. 33. Lo WS, Dranginis AM (1998) The cell surface flocculin Flo11 is required for pseudohyphae formation and invasion by Saccharomyces cerevisiae. Mol Biol Cell 9: 161–171. pmid:9436998
  34. 34. Kong LA, Yang J, Li GT, Qi LL, Zhang YJ, Wang CF, Zhao WS, Xu JR, Peng YL. 2012. Different chitin synthase genes are required for various developmental and plant infection processes in the rice blast fungus Magnaporthe oryzae. PLoS Pathog 8: e1002526. pmid:22346755
  35. 35. Kamakura T, Yamaguchi S, Saitoh K, Teraoka T, Yamaguchi I (2002) A novel gene, CBP1, encoding a putative extracellular chitin‐binding protein, may play an important role in the hydrophobic surface sensing of Magnaporthe grisea during appressorium differentiation. Mol Plant-Microbe In 15: 437–444.
  36. 36. Samalova M, Melida H, Vilaplana F, Bulone V, Soanes DM, Talbot NJ, Gurr SJ (2017) The β-1,3-glucanosyltransferases (Gels) affect the structure of the rice blast fungal cell wall during appressorium-mediated plant infection. Cell Microbiol 19(3): e12659.
  37. 37. Badaruddin M, Holcombe LJ, Wilson RA, Wang ZY, Kershaw MJ, Talbot NJ (2013) Glycogen metabolic genes are involved in trehalose-6-phosphate synthase-mediated regulation of pathogenicity by the rice blast fungus Magnaporthe oryzae. PLoS Pathog 9: e1003604. pmid:24098112
  38. 38. Foster AJ, Jenkinson JM, Talbot NJ (2003) Trehalose synthesis and metabolism are required at different stages of plant infection by Magnaporthe grisea. EMBO J 22: 225–235. pmid:12514128
  39. 39. Guo M, Tan L, Nie X, Zhu X, Pan Y, Gao Z (2016) The Pmt2p-mediated protein O-Mannosylation is required for morphogenesis, adhesive properties, cell wall integrity and full virulence of Magnaporthe oryzae. Front Microbiol 7: 630. pmid:27199956
  40. 40. Pan Y, Pan R, Tan L, Zhang Z, Guo M (2018) Pleiotropic roles of O-mannosyltransferase MoPmt4 in development and pathogenicity of Magnaporthe oryzae. Cur Genet 65: 223–239.
  41. 41. Rapoport TA (2007) Protein translocation across the eukaryotic endoplasmic reticulum and bacterial plasma membranes. Nature 450: 663–669. pmid:18046402
  42. 42. Ellgaard L, Helenius A (2003) Quality control in the endoplasmic reticulum. Nat Rev Mol Cell Biol 4: 181–191. pmid:12612637
  43. 43. Hammond C, Braakman I, Helenius A (1994) Role of N-linked oligosaccharide recognition, glucose trimming, and calnexin in glycoprotein folding and quality control. Proc Nati Acad Sci, USA 91: 913–917.
  44. 44. Määttänen P, Gehring K, Bergeron JJ, Thomas DY (2010) Protein quality control in the ER: The recognition of misfolded proteins. Semin Cell Dev Biol 21: 500–511. pmid:20347046
  45. 45. Dean N, Pelham HR (1990) Recycling of proteins from the Golgi compartment to the ER in yeast. J Cell Biol 111(2): 369–377. pmid:2199456
  46. 46. Stewart A, Deacon JW (1995) Vital fluorochromes as tracers for fungal growth studies. Biotech Histochem 70(2): 57–65. pmid:7578589
  47. 47. Breidenbach MA, Palaniappan KK, Pitcher AA, Bertozzi CR (2012) Mapping yeast N-glycosites with isotopically recoded glycans. Mol Cell Proteomics 11(6): M111.015339. pmid:22261724
  48. 48. Poljak K, Selevsek N, Ngwa E, Grossmann J, Losfeld ME, Aebi M (2018) Quantitative profiling of N-linked glycosylation machinery in yeast Saccharomyces cerevisiae. Mol Cell Proteomics 17: 18–30. pmid:28993419
  49. 49. Rubio MV, Zubieta MP, Cairo JPLF, Calzado F, Leme AFP, Squina FM, Prade RA, Damásio ARL (2016) Mapping N-linked glycosylation of carbohydrate-active enzymes in the secretome of Aspergillus nidulans grown on lignocellulose. Biotechnol Biofuels 9(1): 168.
  50. 50. Yu L, He H, Hu Z, Ma Z (2016) Comprehensive quantification of N-glycoproteome in Fusarium graminearum reveals intensive glycosylation changes against fungicide. J Proteomics 142: 82–90. pmid:27180282
  51. 51. Kaji H, Kamiie J, Kawakami H, Kido K, Yamauchi Y, Shinkawa T, Taoka M, Takahashi N, Isobe T (2007) Proteomics reveals N-linked glycoprotein diversity in Caenorhabditis elegans and suggests an atypical translocation mechanism for integral membrane proteins. Mol Cell Proteomics 6: 2100–2109. pmid:17761667
  52. 52. Koles K, Lim JM, Aoki K, Porterfield M, Tiemeyer M, Wells L, Panin V (2007) Identification of N-glycosylated proteins from the central nervous system of Drosophila melanogaster. Glycobiology 17(12): 1388–1403. pmid:17893096
  53. 53. Song W, Mentink RA, Henquet MG, Cordewener JH, van Dijk AD, Bosch D, America AH, van der Krol AR (2013) N-glycan occupancy of Arabidopsis N-glycoproteins. J Proteomics 93: 343–355. pmid:23994444
  54. 54. Boersema PJ, Geiger T, Wisniewski JR, Mann M (2013) Quantification of the N-glycosylated secretome by super-SILAC during breast cancer progression and in human blood samples. Mol Cell Proteomics 12(1): 158–171. pmid:23090970
  55. 55. Fang P, Wang XJ, Xue Y, Liu MQ, Zeng WF, Zhang Y, Zhang L, Gao X, Yan GQ, Yao J, Shen HL, Yang PY (2016) In-depth mapping of the mouse brain N-glycoproteome reveals widespread N-glycosylation of diverse brain proteins. Oncotarget 7(25): 38796–38809. pmid:27259237
  56. 56. Picariello G, Ferranti P, Mamone G, Roepstorff P, Addeo F (2008) Identification of N-linked glycoproteins in human milk by hydrophilic interaction liquid chromatography and mass spectrometry. Proteomics 8(18): 3833–3847. pmid:18780401
  57. 57. Franck WL, Gokce E, Oh Y, Muddiman DC, Dean RA (2013) Temporal analysis of the magnaporthe oryzae proteome during conidial germination and cyclic AMP (cAMP)-mediated appressorium formation. Mol Cell Proteomics 12(8): 2249–2265. pmid:23665591
  58. 58. Franck WL, Gokce E, Randall SM, Oh Y, Eyre A1, Muddiman DC, Dean RA (2015) Phosphoproteome analysis links protein phosphorylation to cellular remodeling and metabolic adaptation during Magnaporthe oryzae appressorium development. J Proteome Res 14(6): 2408–2424. pmid:25926025
  59. 59. Oh Y, Donofrio N, Pan H, Coughlan S, Brown DE, Meng S, Mitchell T, Dean RA (2008) Transcriptome analysis reveals new insight into appressorium formation and function in the rice blast fungus Magnaporthe oryzae. Genome Biol 9(5): R85. pmid:18492280
  60. 60. Ahn N, Kim S, Choi W, Im KH, Lee YH (2004) Extracellular matrix protein gene, EMP1, is required for appressorium formation and pathogenicity of the rice blast fungus, Magnaporthe grisea. Mol Cells. 17(1): 166–173. pmid:15055545
  61. 61. Schoffelmeer EA, Vossen JH, van Doorn AA, Cornelissen BJ, Haring MA (2001) FEM1, a Fusarium oxysporum glycoprotein that is covalently linked to the cell wall matrix and is conserved in filamentous fungi. Mol Genet Genomics. 265(1): 143–152. pmid:11370861
  62. 62. Neubert P, Halim A, Zauser M, Essig A, Joshi HJ, Zatorska E, Larsen IS, Loibl M, Castells-Ballester J, Aebi M, Clausen H, Strahl S (2016) Mapping the O-Mannose Glycoproteome in Saccharomyces cerevisiae. Mol Cell Proteomics 15(4): 1323–1337. pmid:26764011
  63. 63. Tyanova S, Temu T, Carlson A, Sinitcyn P, Mann M, Cox J (2015) Visualization of LC-MS/MS proteomics data in MaxQuant. Proteomics 15(8): 1453–1456. pmid:25644178
  64. 64. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T (2003) Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 13(11): 2498–2504. pmid:14597658
  65. 65. Roth A, Santos A, Tsafou KP, Kuhn M, Bork P, Jensen LJ, von Mering C (2015) STRING v10: protein–protein interaction networks, integrated over the tree of life. Nucleic Acids Res 43: D447–D452. pmid:25352553