Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Predicted 3D model of the M protein of Porcine Epidemic Diarrhea Virus and analysis of its immunogenic potential

  • Alan Rodríguez-Enríquez ,

    Contributed equally to this work with: Alan Rodríguez-Enríquez, Irma Herrera-Camacho, José Francisco Rivera-Benítez, Nora Hilda Rosas-Murrieta

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Software, Visualization, Writing – original draft

    Affiliations Laboratory of Biochemistry and Molecular Biology, Chemistry Center, Institute of Science., Benemérita Universidad Autónoma de Puebla, Puebla, México, Postgraduate in Chemical Sciences, Benemérita Universidad Autónoma de Puebla, Puebla, México

  • Irma Herrera-Camacho ,

    Contributed equally to this work with: Alan Rodríguez-Enríquez, Irma Herrera-Camacho, José Francisco Rivera-Benítez, Nora Hilda Rosas-Murrieta

    Roles Conceptualization, Funding acquisition, Project administration, Supervision

    Affiliation Laboratory of Biochemistry and Molecular Biology, Chemistry Center, Institute of Science., Benemérita Universidad Autónoma de Puebla, Puebla, México

  • Lourdes Millán-Pérez-Peña ,

    Roles Validation, Writing – review & editing

    ‡ LMPP, JRL and GSL also contributed equally to this work.

    Affiliation Laboratory of Biochemistry and Molecular Biology, Chemistry Center, Institute of Science., Benemérita Universidad Autónoma de Puebla, Puebla, México

  • Julio Reyes-Leyva ,

    Roles Conceptualization, Supervision, Validation, Writing – review & editing

    ‡ LMPP, JRL and GSL also contributed equally to this work.

    Affiliation Molecular Biology and Virology Laboratory, Centro de Investigación Biomédica de Oriente, Instituto Mexicano del Seguro Social (IMSS), Metepec, México, México

  • Gerardo Santos-López ,

    Roles Conceptualization, Investigation, Supervision, Validation, Writing – review & editing

    ‡ LMPP, JRL and GSL also contributed equally to this work.

    Affiliation Molecular Biology and Virology Laboratory, Centro de Investigación Biomédica de Oriente, Instituto Mexicano del Seguro Social (IMSS), Metepec, México, México

  • José Francisco Rivera-Benítez ,

    Contributed equally to this work with: Alan Rodríguez-Enríquez, Irma Herrera-Camacho, José Francisco Rivera-Benítez, Nora Hilda Rosas-Murrieta

    Roles Conceptualization, Funding acquisition, Supervision, Validation, Writing – review & editing

    Affiliation Centro Nacional de Investigación Disciplinaria en Salud Animal e Inocuidad, Instituto Nacional de Investigaciones Forestales, Agrícolas y Pecuarias, Ciudad de México, México

  • Nora Hilda Rosas-Murrieta

    Contributed equally to this work with: Alan Rodríguez-Enríquez, Irma Herrera-Camacho, José Francisco Rivera-Benítez, Nora Hilda Rosas-Murrieta

    Roles Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    nora.rosas@correo.buap.mx

    Affiliation Laboratory of Biochemistry and Molecular Biology, Chemistry Center, Institute of Science., Benemérita Universidad Autónoma de Puebla, Puebla, México

Abstract

The membrane protein M of the Porcine Epidemic Diarrhea Virus (PEDV) is the most abundant component of the viral envelope. The M protein plays a central role in the morphogenesis and assembly of the virus through protein interactions of the M-M, M-Spike (S) and M-nucleocapsid (N) type. The M protein is known to induce protective antibodies in pigs and to participate in the antagonistic response of the cellular antiviral system coordinated by the type I and type III interferon pathways. The 3D structure of the PEDV M protein is still unknown. The present work exposes a predicted 3D model of the M protein generated using the Robetta protocol. The M protein model is organized into a transmembrane and a globular region. The obtained 3D model of the PEDV M protein was compared with 3D models of the SARS-CoV-2 M protein created using neural networks and with initial machine learning-based models created using trRosetta. The 3D model of the present study predicted four linear B-cell epitopes (RSVNASSGTG and KHGDYSAVSNPSALT peptides are noteworthy), six discontinuous B-cell epitopes, forty weak binding and fourteen strong binding T-cell epitopes in the CV777 M protein. A high degree of conservation of the epitopes predicted in the PEDV M protein was observed among different PEDV strains isolated in different countries. The data suggest that the M protein could be a potential candidate for the development of new treatments or strategies that activate protective cellular mechanisms against viral diseases.

Introduction

The Porcine Epidemic Diarrhea Virus (PEDV) is the causative agent of porcine epidemic diarrhea, a highly contagious disease that affects farm animals. The disease is diagnosed by clinical symptomatology such as vomiting, diarrhea, dehydration, and anorexia. The infection affects pigs of all ages, with piglets less than one week old being the most affected (100% morbidity and mortality). Although the virus does not affect human health, it causes significant economic losses to the pig breeding industry [1]. Current treatment is primarily aimed at diarrhea controlling. Control strategies have been oriented to the establishment in farms of rigorous biosecurity programs, adequate zootechnical management and vaccination [2]. Although these strategies have been effective in controlling disease, specific treatments are needed, which would require molecular knowledge of each stage of the viral replication cycle, as well as information on the biochemical characteristics of all the viral proteins. PEDV belongs to the order Nidovirales, family Coronaviridae, subfamily Coronavirinae, genus Alphacoronavirus. Its genome is a single-stranded, positive-sense RNA molecule that encodes 16 non-structural proteins (nsp 1–16) and 5 structural proteins, S, M, N, E and ORF3 [2]. Coronavirus M proteins share the same structural characteristics: three TM domains with N-exo-C-endo orientation flanked by a short glycosylated NH2-terminal domain on the virion surface and a long carboxy-terminal globular domain within the virion [3, 4]. The 3D structure of the PEDV M protein is still unknown. The functions described for the PEDV M protein include assembly and budding of virions, inhibition of innate immune response by blocking type I and III interferon pathways in infected cells and induction of protective antibodies in pigs [59]. The M protein performs its functions by interacting with other viral proteins. It is known that M proteins interact with each other and with spike and nucleocapsid proteins during virus assembly. Interestingly, M-M interactions form the overall scaffold for the viral envelope [3]. The indispensable role played by the M protein in the viral replication cycle can be explained in part by the lower genetic variability of the M gene compared to the S gene [10]. Therefore, the PEDV M protein could be a good candidate for developing specific antiviral drugs and as a target for vaccine design. The three-dimensional structure of the PEDV M protein has not been reported yet. The structure of some crystallized proteins of PEDV can be found in the PDB database, including the S protein [11], 3CLpro protease [12], NSP9 [13], papain-like protease (PDB: 6NOZ) and NSP1 (PDB 5XBC). Due to the importance of the M protein in the viral infection cycle, the present work proposes a 3D model of the M protein generated using the Robetta protocol. An immunoinformatics analysis was also carried out to determine the potential antigenic regions of the M protein.

Material and methods

Sequences

For the generation of the 3D model of the M protein, the sequence of the M protein of the PEDV CV777 virulent strain (Uniprot ID: P59771) (226 residues) and the M protein from strain PEDV/MEX/MICH/01/2013 (GenBank number: MH006957.1) isolated in Michoacán, Mexico, called 2013MMV were used. Different sequences of the M protein were retrieved from the NCBI Taxonomy Browser page (https://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi). The sequence of the SARS-CoV-2 M protein of the AlphaFold and Feig lab models corresponds to Uniprot ID: P0DTC5.

Cloning sequence of the M gen and phylogenetic analyses

The nucleotide sequence of the 2013MMV was subcloned in the vector pPICZαB from the vector pETSumo and it was called 2013MMV. The recombinant DNA was sequenced, 225 amino acids were obtained by in silico translation, Met residue number 1 belongs to the cloning vector. Phylogenetic analyses were conducted in MEGA7 using the Neighbor-joining method [14, 15] using 87 sequences including the two amino acid sequences aforementioned and others from different strains isolated in the United States, Europe and Asia. Evolutionary distances were calculated using the Poisson correction method [16] and are shown as the number of amino acid substitutions per site.

Programs used to generate 3D protein models and evaluation

The 3D model of the PEDV M protein was generated using Robetta (comparative modeling). CAMEO project (https://www.cameo3d.org/) [17, 18]. The 3D models generated in Robetta were constructed from several templates by Dali server (http://ekhidna2.biocenter.helsinki.fi/dali/) [19]. The 3D M protein models were placed in the membrane using the OPM (https://opm.phar.umich.edu/) by PPM server (http://opm.phar.umich.edu/server.php) to infer its spatial position in the membrane [20]. Images were visualized with FirstGlance in Jmol by PPM server [21]. The quality of the generated models was evaluated using the online servers of QMEAN and QMEANDisCo (https://swissmodel.expasy.org/qmean) [22, 23], PROCHECK (https://saves.mbi.ucla.edu/) [24], MolProbity (http://molprobity.manchester.ac.uk/) [25] and ProSA-web (https://prosa.services.came.sbg.ac.at/prosa.php) [26].

SARS-CoV-2 M protein models

The 3D models of the SARS-CoV-2 M protein made by Feig lab (https://feig.bch.msu.edu/) and AlphaFold (https://deepmind.com/research/open-source/computational-predictions-of-protein-structures-associated-with-COVID-19) were used to compare the characteristics of these models with the PEDV M protein model generated by Robetta.

Superimposition of 3D protein models

The superimposition of 3D models and estimation of RMSD values was carried out using the UCSF Chimera program (https://www.cgl.ucsf.edu/chimera/) [27].

Continuous and discontinuous B-cell epitope prediction

Continuous epitopes in the PEDV M protein were identified using the tools of Immune Data Base (https://www.iedb.org/), Bepipred Linear Epitope Prediction 1.0 and 2.0 http://tools.iedb.org/bcell/). Bepipred 2.0 [28] was used with a threshold of 0.5 (sensitivity = 0.58564, specificity = 0.57158). The Bepipred 1.0 Linear Epitope Prediction tool [29] was used with a threshold of 0.35 (sensitivity = 0.49, specificity = 0.75). The predicted epitopes were evaluated using Chou & Fasman Beta-Turn Prediction, Emini Surface Accessibility Prediction, Kolaskar and Tongaonkar Antigenicity, and Parker Hydrophilicity Prediction with a threshold of 1.0. The conservation analysis was carried out using the IEDB Epitope Conservancy Analysis tool (http://tools.iedb.org/conservancy/) [30]. Discontinuous B-cell epitopes were predicted using the online servers DiscoTope (http://tools.iedb.org/discotope/) [31] and ElliPro (http://tools.iedb.org/ellipro/) [32]. ElliPro with a minimum score of 0.5 and a maximum distance of 6 Ǻ. Regarding DiscoTope (version 1.1), a threshold of -7.7 (sensitivity = 0.47, specificity = 0.75) was used.

Prediction of cytotoxic T-cell epitopes

Cytotoxic T-cell epitopes were identified using the NetMHCpan 4.1 server [33]. The swine leucocyte alleles (SLA-1:0101; SLA-1:0401; SLA-1:0801) were used, those alleles are widely distributed in swine populations [34, 35]. The search for peptides covered all nonamers, using 0.5% as a threshold for strong binding and 2% for weak binding.

Results

Conservation of the amino acid sequence of PEDV M proteins

To generate the 3D model of PEDV M protein, the amino acid sequence was selected based on a comparison of amino acid sequences from different strains using the M protein sequence of the CV777 prototype virulent strain as a reference. A high degree of conservation was observed among 87 PEDV strains from Mexico, United States of America, Europe, and Asia. The identity between the different sequences analyzed was recorded in the range of 97.35 to 99.56% corresponding to the minimum and maximum identity values related to the CV777 M protein. The amino acid sequences of M protein were clustered into three groups related to G2a, G2b, G1 strains of PEDV (Fig 1). Thus, the high identity value between the compared sequences allowed the selection of the M protein of the PEDV CV777 strain (called M protein CV777 in this work, UniProt ID P59771) to build the 3D model.

thumbnail
Fig 1. Phylogenetic tree built with 87 sequences of the M protein of the PEDV from Mexico, United States of America, Europe and Asia.

The amino acid sequences of M protein were clustered into three groups related to G2a, G2b, G1 strains of PEDV. The arrows signal the two sequences chosen to generate the 3D model of the M protein. The evolutionary history was inferred using the Neighbor-Joining method. Details and references are provided in Material and Methods section.

https://doi.org/10.1371/journal.pone.0263582.g001

3D modelling of the PEDV M protein

The predicted 3D model of the PEDV M protein was done using Robetta server on the CAMEO project (Continuous Automated Model EvaluatiOn) [18] using the RosettaCM comparative modeling [36]. The model 1 of the CV777 M protein was selected based on its QMEAN value of -0.86 and its predicted protein architecture, including the transmembrane and globular domains. Also, a 3D model of the M protein from PEDV/MEX/MICH/01/2013 (GenBank number: MH006957.1) called 2013MMV was predicted in this work. Like the CV777 strain, the model 1 was selected for the further analysis. 2013MMV is a Mexican strain isolated in 2013 in Michoacán, México; it has 6 amino acid differences compared to the CV777 M protein, is one of the strains isolated in cell culture that we keep under guard and that was genetically characterized in its entirety by our research group. Fig 2A and 2B show the predicted 3D models of the CV777 M protein and the 2013MMV M protein, respectively. Both models have a short N-terminal ectodomain, three α-helices corresponding to the three successive transmembrane domains, and a long globular C-terminal domain mainly formed by antiparallel β-strands on the inside of the virion. The 3D models generated in Robetta were constructed from various 3D structures deposited in the PDB and were selected by the Dali server, with the SARS-CoV-2 protein ORF3 (6xdc) showing the greatest structural similarity. Then, the 3D models of the CV777 and 2013MMV M proteins were superimposed using the UCSF Chimera program. The main differences were observed in the last 26 amino acids of the 2013MMV M protein on the globular C-terminal domain, which were folded in different spatial orientations. The RMSD value was 12.195 Å (Fig 2C). The theoretical positioning of the CV777 M protein (Fig 2D) and 2013MMV M protein (Fig 2E) in a lipid bilayer was analyzed using the PPM server in OPM. Both 3D M protein models were included in the membrane by three alpha helical transmembrane segments. The general positioning and orientation of both proteins in the lipid bilayer is maintained, but the spatial orientation of the 2013MMV M protein changes in the extracellular region, in the arrangement of the alpha helices in the transmembrane as well as in the globular region. To the 3D model of CV777 M protein, PPM server calculated a value of depth/hydrophobic thickness of 18.3 ± 2.0 Å with a ΔGtransfer to the membrane of -15.1 kcal/mol and a tilt angle of 13±7°. Similarly, a value of depth/hydrophobic thickness of 20.8 ±1.9 Å, with a ΔGtransfer of -18.8 kcal/mol and tilt angle de 25±0° was calculated for the 3D model of 2013MMV M protein.

thumbnail
Fig 2. Predicted 3D models of the CV777 M and 2013MMV M proteins, generated with Robetta.

A) CV777 M protein model. B) 2013MMV M protein model. The transmembrane region is shown in red. The regions outside the viral membrane are shown in yellow. The globular region is shown in green. The blue arrows point to the N-terminal end and the white arrows point to the C-terminal end. C) Overlapping of the 3D models of CV777 M and 2013MMV M proteins. The red areas correspond to the highest RMSD value, while the green areas correspond to the lowest RMSD value. The white arrows indicate the sites with the highest RMSD, corresponding to the loops and to the N- and C-terminal ends. D) 3D model of CV777 M protein on a theoretical lipid layer obtained from the OPM server. The program added red pseudoatoms to mark the extracellular hydrophobic boundary of the lipid bilayer. E) 3D model of 2013MMV M protein on a theoretical lipid bilayer obtained from the OPM server. In this case, the server added red and blue pseudoatoms to mark the extracellular and intracellular hydrophobic boundaries of the lipid bilayer, respectively.

https://doi.org/10.1371/journal.pone.0263582.g002

Validation of the predicted 3D model of the M protein

The predicted 3D models of the CV777 M protein and the 2013MMV M protein were evaluated with different programs. QMEANDisCo (a composite scoring function to derive the absolute quality of the model) values were 0.59 and 0.57, respectively. Global score must be in a range must be in a range between 0 and 1 with one being good. The PROCHECK program determined that the models of CV777 M protein and 2013MM M protein had values of 89.2% and 88.1% of phi and psi angles in the most favored regions of the Ramachandran plot, respectively. According to the PROCHECK server, a good model must contain more than 90% of the phi and psi angles in the most favored regions. But both values are close to 90%. The evaluation of the 3D models of CV777 M protein and 2013MMV M protein in MolProbity yielded scores of 1.34 and 1.37 respectively, indicating an acceptable stereochemistry. Finally, the 3D models were evaluated in ProSA web, CV777 M protein model (Fig 3A and 3B) and the 2013MMV M protein model (Fig 3C and 3D) had Z (overall quality score) values of -5.26 and -5.88, respectively (Fig 3A and 3C). Z-score of the 3D models of M protein is within the range of scores typically found for native proteins of 225 and 226 amino acid residues in length deposited in the PDB. In the plot of residue scores (Knowledge-based energy) of the 3D models of M protein, the 100 residues of the transmembrane region at the N-terminus have the highest energy level. But none of the amino acids had high positive values and were not penalized by the program (Fig 3B and 3D).

thumbnail
Fig 3. Evaluation of the overall model quality of 3D models of the CV777 M protein and 2013MMV M protein using ProSA web.

A) Z-score plot of the 3D model of CV777 M protein. B) Plot of residue scores (Knowledge-based energy) of the 3D model of CV777 M protein. C) Z-score plot of the 3D model of 2013MMV M protein. D) Plot of residue scores of the 2013MMV M protein. The black dot in A and C represents the 3D model of the M protein.

https://doi.org/10.1371/journal.pone.0263582.g003

Comparison of predicted M protein 3D models from PEDV and SARS-CoV-2 M protein models

As a result of the SARS-CoV-2 pandemic, DeepMind (AlphaFold) [37, 38] and Michael Feig’s laboratory (Feig lab) [39] generated 3D models of the M protein of the SARS-CoV-2 virus. AlphaFold uses a neural network to predict the distances between pairs of residues in a protein [37]. Feig lab generated initial machine learning-based models using trRosetta. The models were then subjected to molecular-dynamics simulation-based refinement protocol to maximize model accuracy [39]. Although, SARS-CoV-2 M protein has 33% amino acid sequence identity with CV777 M protein and 94% coverage percentage. 3D models of SARS-CoV-2 and PEDV M proteins displayed a similar global folding (S1 Fig). The SARS-CoV-2 M protein models were evaluated with the programs used with the PEDV M proteins. Then, the results were compared with those obtained from 3D models of the CV777 and 2013MMV M proteins generated by Robetta (S1 Table). The quality parameters of the PEDV M protein models was comparable with those of the validated models from SARS-CoV-2 M protein. To analyze the differences in the theoretical 3D structure of the M proteins, the models of the M proteins from PEDV and SARS-CoV-2 were superimposed. The RMSD values between the CV777 M protein and SARS-CoV-2 M protein from AlphaFold (S1A Fig) and the CV777 M protein and SARS-CoV-2 M protein from Feig lab models were 6.0 and 14.1 Å, respectively (S1B Fig). The RMSD values between 3D model of 2013MMV M protein with the SARS-CoV-2 M protein model from AlphaFold (S1C Fig) and with SARS-CoV-2 M protein model from Feig lab (S1D Fig) were 8.120 and 15.716 Å, respectively. While the superimposition of the SARS-CoV-2 M protein models from AlphaFold and Feig lab showed an RMSD of 12.6 Å (S1E Fig). The main difference between all the models was observed in the α-helix on the transmembrane region and the spatial arrangement of the globular region. The data obtained demonstrate that the 3D models of CV777 and 2013MMV M proteins generated by Robetta have good overall quality similar to the quality of the models of the SARS-CoV-2 M protein obtained by AlphaFold and Feig lab.

Identification of linear B cell epitopes in the PEDV M protein model.

The epitopes in the predicted 3D models of the PEDV M protein were analyzed. The B cell epitopes of the CV777 M and 2013MMV M proteins were predicted using servers BepiPred 1.0 (linear epitope prediction) and Bepipred 2.0 (sequential B cell epitope predictor). Both programs predicted four peptides in the CV777 M protein model; the B cell peptides RSVNASSGTG and KHGDYSAVSNPSAVLT were predicted by both servers. The two peptides were above the threshold level of 1.0 (Table 1). In the 2013MMV M protein model, 3 and 4 peptides were predicted by both servers, respectively. The KHGDYSAVSNPSSVLTD epitope was above the threshold level of 1.0 and was predicted by both servers. The peptides identified in the 3D models of the PEDV M proteins were used to perform a conservation analysis (%) with 31 M protein sequences from different PEDV strains isolated in Mexico and different strains isolated in the United States and other countries. A high level of conservation of the epitopes identified in the M proteins of 2013MMV and CV777 was observed among the analyzed sequences. However, the sequences of some epitopes from the 2013MMV M protein were more highly conserved than those from the CV777 M protein (Table 1).

Prediction of discontinuous B-cell epitopes from the PEDV M protein model.

The prediction of discontinuous epitopes from M protein 3D models from PEDV and SARS-CoV-2 was performed using the ElliPro and DiscoTope servers. ElliPro predicted 6 epitopes in the CV777 M protein model, 4 in the 2013MMV M protein model, 4 in the SARS-CoV2 M protein model from Alpha fold and 5 in the SARS-CoV2 M protein model from Feig lab (Fig 4) (S2 Table). The DiscoTope program predicted 15 residues in the CV777 model and 14 residues in the 2013MMV M protein model with potential to form an epitope, while 14 and 24 residues were predicted in the SARS-CoV-2 M protein model from AlphaFold and the SARS- CoV-2 M protein model from Feig lab, respectively (Fig 5) (S3 Table). In the Figs 4 and 5, the predicted epitopes were located in protein regions with similar folding. In both figures the colored regions correspond to discontinuous epitopes, ordered from the highest (red) to the lowest score (orange).

thumbnail
Fig 4. Discontinuous epitopes predicted in ElliPro from CV777, 2013MMV and SARS-CoV-2 M protein models.

A) CV777 M protein. B) 2013MMV M protein. C) SARS-CoV-2 M protein from AlphaFold. D) SARS-CoV-2 M protein from Feig lab. Predicted discontinuous epitopes are highlighted in the 3D model with different colors to represent the score. The epitope with the highest score is represented in red up to the lowest value in orange. The score decreases from the epitopes marked in green, blue, yellow, and pink. Arrows with the same color indicate discontinuous epitopes located in equivalent regions in the models. Colored residues in each epitope are indicated in S2 Table.

https://doi.org/10.1371/journal.pone.0263582.g004

thumbnail
Fig 5. Discontinuous epitopes predicted in DiscoTope from CV777 and 2013MMV M protein and SARS-CoV-2 protein M models.

A) CV777 M protein. B) 2013MMV M protein. C) SARS-CoV-2 M protein from AlphaFold. D) SARS-CoV-2 M protein from Feig lab. Predicted discontinuous epitopes are highlighted in the 3D model with different colors to represent the score. The epitope with the highest score is represented in red up to the lowest value in orange. The score decreases from the epitopes marked in green, blue, yellow and pink. Arrows with the same color indicate discontinuous epitopes located in equivalent regions in the models. An additional color (light blue) show residues that were not predicted in ElliPro. Colored residues in each epitope are indicated in S3 Table.

https://doi.org/10.1371/journal.pone.0263582.g005

Prediction of T-cell epitopes for the PEDV M protein

Prediction of T-cell epitopes was performed using the NetMHCpan 4.1 server (DTU Bioinformatics). Three alleles were used for the prediction, SLA-1: 0101, SLA-1: 0401 and SLA-1:0801. In the CV777 M protein, 40 weak-binding and 14 strong-binding peptides were predicted (Table 2). In the 2013MMV M protein, 36 weak-binding and 16 strong-binding peptides were predicted. The immunogenicity of the strong binding peptides of both M proteins was predicted using the IEDB class 1 immunogenicity server, which yielded 10 epitopes, 2 of them repeated in different alleles. The study provides theoretical data on the potential of the M protein to be considered as an immunogen in the development of a new vaccine for the control of PEDV.

thumbnail
Table 2. Continuous T cell epitopes of CV777 and 2013MMV M protein.

https://doi.org/10.1371/journal.pone.0263582.t002

Discussion

The porcine industry is affected each year by the loss of pigs due to viral infections, with porcine epidemic diarrhea being one of the most worrying diseases for producers. The disease, which is highly contagious, is caused by PEDV. PEDV is an etiological agent associated with infection outbreaks throughout the world, including Mexico. Although this disease is dealt with using biosecurity programs, adequate zootechnical management practices and vaccines, the occurrence of mutations in the viral genome motivates the continuous search for new control strategies. This requires molecular knowledge of the potential viral triggers of the host’s immune response. The M protein fulfills several important viral functions in different regions of the viral particle structure. Knowledge of these functions could help develop new strategies to control the disease. However, structure of the M protein of PEDV is unknown. Thus, the present work generated a 3D model of the M protein and compared it to validated models of SARS-CoV-2. The potential regions inducing an immune response were evaluated in the 3D model; 4 promising sites were found that should be subjected to further study. The 3D model was constructed based on the amino acid sequence of the M protein of CV777 and on that of a Mexican strain isolated in Michoacán, Mexico, showing changes of amino acids compared to the prototype. This could help to determine if the 3D structure of the protein is significantly affected when changes in the amino acid sequence occur, especially in the globular region of the protein where the residues associated with protein functions are found. To confirm that the amino acid sequences selected to obtain the 3D model represent the M protein of PEDV, a comparison was made with strains from different outbreaks around the world. The sequences were clustered into three groups related to G2a, G2b, G1 strains of PEDV. The analysis included the genogroups reported for the PEDV. There are different genogroups of PEDV, G1 and G2 (classic strain and variant strains), which can be divided into G1a and G1b, G2a and G2b. The G1a genogroup includes classic strains such as the virulent CV777 prototype strain and other ancient PEDV strains. The G1b genogroup contains strains isolated in Asia with deletions in different genes, mainly in the genes encoding to the nsp3 and ORF3 proteins [40]. Genogroup G2 differs from G1 mainly due to amino acid changes in the NTD domain of protein S. Within the genogroup G2a there are circulating Asian strains reported in reemerging cases of PEDV and in the genogroup G2b there are circulating strains related to outbreaks in the United States and other parts of the world [41]. The data obtained support the construction of the 3D model from the sequences of CV777 strain and the Mexican sequence. The present work proposes for the first time a predicted 3D model of the M protein of PEDV. The model was generated by Robetta using comparative modelling, using the ORF3 protein of SARS-CoV-2 as one of the main modeling templates. The structure of ORF3 (PDB: 6XDC) was solved in 2021 using Cryo-EM and has the same topology in the globular endodomain M as the PEDV M protein [42]. The 3D model of the M protein shown in Fig 2 shows a topology consistent with the architecture reported for the coronavirus M protein. It is organized into two transmembrane and globular domains, with 3 α-helices that would cross the membrane, with loop regions inside and outside the viral envelope and a globular domain formed mainly by antiparallel β-strands. Interestingly, the 3D model preserves the topology reported in Uniprot for the CV777 M protein: virion surface (positions 1–11), transmembrane (12–32, 21 residues of length), intravirion (33–41, 9 residues), transmembrane (42–62, 21 residues), virion surface (63–75, 13 residues), transmembrane (76–96, 21 residues), intravirion (97–226, 130 residues) (https://www.uniprot.org/uniprot/P59771).

Analysis of the positioning in the lipid bilayer of the theoretical 3D structure of M proteins by the PPM server showed the overall conservation of the theoretical topology, but the amino acid changes in the 2013MMV M protein change the spatial arrangement in the extracellular region, the tilt angle of the transmembrane alpha helices and the globular region. The PPM server added red and blue pseudoatoms to mark the extracellular and cytoplasmic hydrophobic boundaries of the lipid bilayer, respectively (Fig 2D and 2E). Although PPM was only able to calculate both membrane limits in the positioning of the 2013MMV M protein but not in the CV777 M protein, in the last one only the extracellular limit was calculated. This interesting finding will be analyzed later by in silico and experimental studies to evaluate the potential effect on the recognized functions of the M protein in the conformation of the virion and on some processes in cells infected by PEDV.

The M protein is the most abundant structural protein in the viral envelope of PEDV and other coronaviruses and it is known to play a central role in the assembly of the virus. Homodimerization of the M protein through the transmembrane region has been reported [43]. In Mouse Hepatitis Virus (MHV), the interaction of the M protein with the E protein is the minimum requirement for the assembly of the viral particle [43], Kuo and Masters reported an interaction between the M and N proteins of MHV through the C-terminal end of the M protein [44]. Interestingly, it was shown that, in the absence of the N protein, the MHV M protein can interact with synthetic RNAs containing a viral packaging signal that favors the specific packaging of RNAs in coronavirus-like particles [45]. A recent study showed that the PEDV M protein can interact with S and E viral proteins, with ORF3 and with 218 host proteins. Some of them are associated with 131 signaling pathways and 10 biological processes [46]. The M protein interacts with the translation initiation factor 3 (eIF3L) whose downregulation significantly increases viral production. The translation initiation factor 3 could act as a negative regulator of PEDV replication [46]. It has been shown that the M protein of three coronaviruses, SARS, MHV and the feline coronavirus (FCoV), adopts 2 conformations, called long and compact, that influence the curvature of the viral envelope. The long conformation (M-long) provides rigidity and uniformity to the viral envelope [47]. It has also been demonstrated that the M protein forms homodimers through contact with region corresponding to the endodomain, and that the M-long conformation interacts mainly with N and S proteins in a stoichiometry of the 8M2: 4N: 1S3 type [47]. The multimerization of the M protein is a conserved process among coronaviruses [47]. Recently, Lim Heo and Michael Feig generated models of homodimers of the M protein, including 4 homodimers of the M protein interacting with a trimer in the transmembrane region of the S protein of SARS-CoV-2. The models are deposited in GitHub (https://github.com/feiglab/sars-cov-2-proteins) [39]. Knowing the interactions of the M protein, as well as knowing its 3D structure, or even having a good 3D model thereof, would allow making a deep analysis of the intracellular processes this protein carries out, which would in turn allow the design of drugs and other alternatives that could stop the M protein from performing its functions as a method for controlling infections caused by PEDV and other coronaviruses. The lack of efficient control strategies to reduce the spread of the PEDV and, therefore, the death of piglets, keeps alive the interest in the development of new control methods. However, the constantly changing viral genome makes difficult to achieve this goal. Different studies have reported the existence of different recombination events between PEDV strains [4851]. A recombination event between a PEDV strain and one of the transmissible gastroenteritis virus (TGEV) was reported in Russia and Italy [52, 53]. A recombination event between a highly virulent PEDV strain and an attenuated PEDV strain has also been reported; this event generated a new virulent strain [54]. Developing an efficient control strategy has been hindered by the occurrence of mutations and recombinations among PEDV strains in the coding region of protein S. Therefore, it is important to study the immunological and functional potential of other structural proteins of the virus. The M protein is a good candidate due to its low genetic variability, its structure, and the functions it might perform. It is particularly interesting to analyze the potential of the M protein to induce the activation of the immune system. It has been shown that in other coronaviruses, such as MHV, the immune system produces monoclonal antibodies directed to the M protein; these antibodies protect mice from viral infection [55]. The immunization of 8 kittens with the recombinant M protein of the feline infectious peritonitis virus (FIPV) allowed 3 of them to survive the viral infection [56]. B-cell epitopes have also been found in the M protein of SARS-CoV as well as the avian infectious bronchitis virus [57, 58]. It has also been experimentally demonstrated that some peptides of the SARS-CoV M protein are highly reactive with sera from the convalescent phase of SARS patients. High titers of antibodies against the M protein were also elicited in immunized mice and rabbits, which suggests that the M protein is a highly immunogenic component in inactivated SARS vaccine preparations [59]. An experimental study identified a B cell epitope in the PEDV M protein that corresponds to the 195WAFYVR200 motif [8] covering the C-terminus of M protein. In the present study, the prediction of B cell epitopes of the M protein of PEDV was carried out and although there is agreement in the results for most of the programs used, it is noteworthy that the epitope 195WAFYVR200 found by Zhang [8] was not predicted. Some epitopes in the globular region at the C-terminus of M protein were located before or after the 195WAFYVR200 sequence. Further research of the immunogenic potential of the M protein could involve expressing the M protein of coronaviruses in eukaryotic and prokaryotic systems to analyze possible changes in the conformation of the protein. It would also be necessary to analyze the contact regions of the M protein with the antibodies produced by pigs immunized with the PEDV M protein [8].

The in silico results of the present study suggest that the M protein is a potential candidate for the activation of the cellular immune response against to the PEDV infection. Finally, a high degree of conservation of each of the B and T cell epitopes predicted in this study was also demonstrated by comparing different strains of PEDV circulating in Mexico and other parts of the world.

Conclusion

Predicted 3D models of the CV777 M and 2013MMV M proteins were obtained. In silico identification of continuous and discontinuous B cell epitopes and T cell epitopes on the M protein was also performed. The comparison between the 3D models of PEDV M proteins obtained in this study and the M protein models from SARS-CoV-2 obtained by the AlphaFold group and the Feig lab showed a high conservation of the 3D structure in both viruses, as well as the presence of discontinuous epitopes in the same regions in both proteins. The 3D model of the PEDV M protein generated in the present study could be a useful tool to carry out further research on the development of vaccines or drugs to control PED.

Supporting information

S1 Fig. Superimposition of the 3D M protein models from PEDV and SARS-CoV-2.

https://doi.org/10.1371/journal.pone.0263582.s001

(PDF)

S1 Table. Comparison of the quality of the 3D models of the M proteins from PEDV and SARS-CoV-2.

https://doi.org/10.1371/journal.pone.0263582.s002

(PDF)

S2 Table. Prediction of discontinuous epitopes with ElliPro in the 3D M protein models.

https://doi.org/10.1371/journal.pone.0263582.s003

(PDF)

S3 Table. Prediction of discontinuous epitopes with DiscoTope in the 3D M protein models.

https://doi.org/10.1371/journal.pone.0263582.s004

(PDF)

References

  1. 1. Reveles-Félix S, Carreón-Nápoles R, Mendoza-Elvira S, Quintero-Ramírez V, García-Sánchez J, Martínez-Bautista R, et al. Emerging strains of porcine epidemic diarrhoea virus (PEDv) in Mexico. Transbound Emerg Dis. 2020;67(2):1035–1041. pmid:31733175
  2. 2. Jung K, Saif LJ. Porcine epidemic diarrhea virus infection: etiology, epidemiology, pathogenesis and immunoprophylaxis. Vet J. 2015;204(2):134–43. pmid:25841898
  3. 3. Arndt AL, Larson BJ, Hogue BG. A conserved domain in the coronavirus membrane protein tail is important for virus assembly. J Virol. 2010;84(21):11418–28. pmid:20719948
  4. 4. Armstrong J, Niemann H, Smeekens S, Rottier P, Warren G. Sequence and topology of a model intracellular membrane protein, E1 glycoprotein, from a coronavirus. Nature. 1984;308(5961):751–2. pmid:6325918
  5. 5. Zhang Q, Shi K, Yoo D. Suppression of type I interferon production by porcine epidemic diarrhea virus and degradation of CREB-binding protein by nsp1. Virology. 2016; 489:252–68. pmid:26773386
  6. 6. Zhang Q, Ke H, Blikslager A, Fujita T, Yoo D. Type III interferon restriction by porcine epidemic diarrhea virus and the role of viral protein nsp1 in IRF1 Signaling. J Virol. 2018; 92(4):e01677–17. pmid:29187542
  7. 7. Yang S, Li L, Yin S, Shang Y, Zafar Khan MU, He X, et al. Single-domain antibodies as promising experimental tools in imaging and isolation of porcine epidemic diarrhea virus. Appl Microbiol Biotechnol. 2018;102(20):8931–8942. pmid:30143837
  8. 8. Zhang Z, Chen J, Shi H, Chen X, Shi D, Feng L, et al. Identification of a conserved linear B-cell epitope in the M protein of porcine epidemic diarrhea virus. Virol J. 2012;9:225. pmid:23025700
  9. 9. Li S, Yang J, Zhu Z, Zheng H. Porcine epidemic diarrhea virus and the host innate immune response. Pathogens. 2020;9(5):367. pmid:32403318
  10. 10. Sato T, Takeyama N, Katsumata A, Tuchiya K, Kodama T, Kusanagi Ki. Mutations in the spike gene of porcine epidemic diarrhea virus associated with growth adaptation in vitro and attenuation of virulence in vivo. Virus Genes. 2011;43(1):72–8. pmid:21559974
  11. 11. Wrapp D, McLellan JS. The 3.1-angstrom cryo-electron microscopy structure of the porcine epidemic diarrhea virus spike protein in the prefusion conformation. J Virol. 2019; 93(23):e00923–19. pmid:31534041
  12. 12. Ye G, Wang X, Tong X, Shi Y, Fu ZF, Peng G. Structural basis for inhibiting porcine epidemic diarrhea virus replication with the 3C-Like protease inhibitor GC376. Viruses. 2020;12(2):240. pmid:32098094
  13. 13. Zeng Z, Deng F, Shi K, Ye G, Wang G, Fang L, et al. Dimerization of Coronavirus nsp9 with diverse modes enhances its nucleic acid binding affinity. J Virol. 2018; 92(17):e00692–18. pmid:29925659
  14. 14. Kumar S, Stecher G, Tamura K. MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33(7):1870–4. pmid:27004904
  15. 15. Saitou N, Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987;4(4):406–25. pmid:3447015
  16. 16. Zuckerkandl E, Pauling L. Evolutionary divergence and convergence in proteins. In: Bryson V, Vogel HJ, editors. Evolving Genes and Proteins: Academic Press; 1965. pp. 97–166.
  17. 17. Haas J, Roth S, Arnold K, Kiefer F, Schmidt T, Bordoli L, et al. The Protein Model Portal—a comprehensive resource for protein structure and model information. Database (Oxford). 2013;2013:bat031. pmid:23624946
  18. 18. Haas J, Gumienny R, Barbato A, Ackermann F, Tauriello G, Bertoni M, et al. Introducing "best single template" models as reference baseline for the Continuous Automated Model Evaluation (CAMEO). Proteins. 2019;87(12):1378–1387. pmid:31571280
  19. 19. Holm L. Using Dali for Protein Structure Comparison. Methods Mol Biol. 2020;2112:29–42. pmid:32006276
  20. 20. Lomize MA, Pogozheva ID, Joo H, Mosberg HI, Lomize AL. OPM database and PPM web server: resources for positioning of proteins in membranes. Nucleic Acids Res. 2012;40(Database issue):D370–6. pmid:21890895
  21. 21. Herráez A. Biomolecules in the computer: Jmol to the rescue. Biochem Mol Biol Educ. 2006;(4):255–61. pmid:21638687
  22. 22. Benkert P, Biasini M, Schwede T. Toward the estimation of the absolute quality of individual protein structure models. Bioinformatics. 2011;27(3):343–50. pmid:21134891
  23. 23. Studer G, Rempfer C, Waterhouse AM, Gumienny R, Haas J, Schwede T. QMEANDisCo-distance constraints applied on model quality estimation. Bioinformatics. 2020; 36(6):1765–1771. pmid:31697312
  24. 24. Hodsdon ME, Ponder JW, Cistola DP. The NMR solution structure of intestinal fatty acid-binding protein complexed with palmitate: application of a novel distance geometry algorithm. J Mol Biol. 1996;264(3):585–602. pmid:8969307
  25. 25. Williams CJ, Headd JJ, Moriarty NW, Prisant MG, Videau LL, Deis LN, et al. MolProbity: more and better reference data for improved all-atom structure validation. Protein Sci. 2018;27(1):293–315. pmid:29067766
  26. 26. Wiederstein M, Sippl MJ. ProSA-web: interactive web service for the recognition of errors in three-dimensional structures of proteins. Nucleic Acids Res. 2007;35(Web Server issue):W407–10. pmid:17517781
  27. 27. Pettersen EF, Goddard TD, Huang CC, Couch GS, Greenblatt DM, Meng EC, et al. UCSF Chimera—a visualization system for exploratory research and analysis. J Comput Chem. 2004;25(13):1605–12. pmid:15264254
  28. 28. Jespersen MC, Peters B, Nielsen M, Marcatili P. BepiPred-2.0: improving sequence-based B-cell epitope prediction using conformational epitopes. Nucleic Acids Res. 2017; 45(W1):W24–W29. pmid:28472356
  29. 29. Larsen JE, Lund O, Nielsen M. Improved method for predicting linear B-cell epitopes. Immunome Res. 2006;2:2. pmid:16635264
  30. 30. Bui HH, Sidney J, Li W, Fusseder N, Sette A. Development of an epitope conservancy analysis tool to facilitate the design of epitope-based diagnostics and vaccines. BMC Bioinformatics. 2007;8:361. pmid:17897458
  31. 31. Haste Andersen P, Nielsen M, Lund O. Prediction of residues in discontinuous B-cell epitopes using protein 3D structures. Protein Sci. 2006;15(11):2558–67. pmid:17001032
  32. 32. Ponomarenko J, Bui HH, Li W, Fusseder N, Bourne PE, Sette A, et al. ElliPro: a new structure-based tool for the prediction of antibody epitopes. BMC Bioinformatics. 2008;9:514. pmid:19055730
  33. 33. Reynisson B, Alvarez B, Paul S, Peters B, Nielsen M. NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data. Nucleic Acids Res. 2020;48(W1):W449–W454. pmid:32406916
  34. 34. Sørensen MR, Ilsøe M, Strube ML, Bishop R, Erbs G, Hartmann SB, et al. Sequence-based genotyping of expressed swine leukocyte antigen class I alleles by next-generation sequencing reveal novel swine leukocyte antigen class I haplotypes and alleles in belgian, danish, and kenyan fattening pigs and Göttingen minipigs. Front Immunol. 2017;8:701. pmid:28670315
  35. 35. Zhang N, Qi J, Feng S, Gao F, Liu J, Pan X, et al. Crystal structure of swine major histocompatibility complex class I SLA-1 0401 and identification of 2009 pandemic swine-origin influenza A H1N1 virus cytotoxic T lymphocyte epitope peptides. J Virol. 2011; 85(22):11709–24. pmid:21900158
  36. 36. Song Y, DiMaio F, Wang RYR, Kim D, Miles C, Brunette Tj, et al. High-resolution comparative modeling with RosettaCM. Comparative Study Structure. 2013; 21(10):1735–42. pmid:24035711
  37. 37. Jumper J, Tunyasuvunakool K, Kohli P, Hassabis D, the AlphaFold Team, “Computational predictions of protein structures associated with COVID-19”, Version 3, DeepMind website, 4 August 2020, https://deepmind.com/research/open-source/computational-predictions-of-protein-structures-associated-with-COVID-19.
  38. 38. Senior AW, Evans R, Jumper J, Kirkpatrick J, Sifre L, Green T, et al. Improved protein structure prediction using potentials from deep learning. Nature. 2020;577(7792):706–710. pmid:31942072
  39. 39. Heo L, Feig M. Modeling of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) proteins by machine learning and physics-based refinement. BioRxiv [Preprint]. 2020 bioRxiv 008904 [Posted 2020 March 28]. Available from: https://www.biorxiv.org/content/10.1101/2020.03.25.008904 pmid:32511334
  40. 40. Vlasova AN, Marthaler D, Wang Q, Culhane MR, Rossow KD, Rovira A, et al. Distinct characteristics and complex evolution of PEDV strains, North America, May 2013-February 2014. Emerg Infect Dis. 2014;20(10):1620–8. pmid:25279722
  41. 41. Huang YW, Dickerman AW, Piñeyro P, Li L, Fang L, Kiehne R, et al. Origin, evolution, and genotyping of emergent porcine epidemic diarrhea virus strains in the United States. mBio. 2013;4(5):e00737–13. pmid:24129257
  42. 42. Kern DM, Sorum B, Mali SS, Hoel CM, Sridharan S, Remis JP, et al. Cryo-EM structure of SARS-CoV-2 ORF3a in lipid nanodiscs. Nat Struct Mol Biol. 2021;28(7):573–582. pmid:34158638
  43. 43. de Haan CA, Vennema H, Rottier PJ. Assembly of the coronavirus envelope: homotypic interactions between the M proteins. J Virol. 2000;74(11):4967–78. pmid:10799570
  44. 44. Kuo L, Masters PS. Genetic evidence for a structural interaction between the carboxy termini of the membrane and nucleocapsid proteins of mouse hepatitis virus. J Virol. 2002; 76(10):4987–99. pmid:11967315
  45. 45. Narayanan K, Chen CJ, Maeda J, Makino S. Nucleocapsid-independent specific viral RNA packaging via viral envelope protein and viral RNA signal. J Virol. 2003;77(5):2922–7. pmid:12584316
  46. 46. Wang R, Yu R, Chen B, Si F, Wang J, Xie C, et al. Identification of host cell proteins that interact with the M protein of porcine epidemic diarrhea virus. Vet Microbiol. 2020; 246:108729. pmid:32605758
  47. 47. Neuman BW, Kiss G, Kunding AH, Bhella D, Baksh MF, Connelly S, et al. A structural analysis of M protein in coronavirus assembly and morphology. J Struct Biol. 2011; 174(1):11–22. pmid:21130884
  48. 48. Hsueh FC, Lin CN, Chiou HY, Chia MY, Chiou MT, Haga T, et al. Updated phylogenetic analysis of the spike gene and identification of a novel recombinant porcine epidemic diarrhoea virus strain in Taiwan. Transbound Emerg Dis. 2020;67(1):417–430. pmid:31538715
  49. 49. Chen N, Li S, Zhou R, Zhu M, He S, Ye M, et al. Two novel porcine epidemic diarrhea virus (PEDV) recombinants from a natural recombinant and distinct subtypes of PEDV variants. Virus Res. 2017;242:90–95. pmid:28947336
  50. 50. Tian PF, Jin YL, Xing G, Qv LL, Huang YW, Zhou JY. Evidence of recombinant strains of porcine epidemic diarrhea virus, United States, 2013. Emerg Infect Dis. 2014; 20(10):1735–8. pmid:25272273
  51. 51. Chung HC, Lee JH, Nguyen VG, Huynh TML, Lee GE, Moon HJ, et al. New emergence pattern with variant porcine epidemic diarrhea viruses, South Korea, 2012–2015. Virus Res. 2016;226:14–19. pmid:27345861
  52. 52. Nefedeva M, Titov I, Malogolovkin A. Molecular characteristics of a novel recombinant of porcine epidemic diarrhea virus. Arch Virol. 2019;164(4):1199–1204. pmid:30725181
  53. 53. Boniotti MB, Papetti A, Lavazza A, Alborali G, Sozzi E, Chiapponi C, et al. Porcine Epidemic Diarrhea Virus and Discovery of a Recombinant Swine Enteric Coronavirus, Italy. Emerg Infect Dis. 2016;22(1):83–7. pmid:26689738
  54. 54. Wang H, Zhang L, Shang Y, Tan R, Ji M, Yue X, et al. Emergence and evolution of highly pathogenic porcine epidemic diarrhea virus by natural recombination of a low pathogenic vaccine isolate and a highly pathogenic strain in the spike gene. Virus Evol. 2020; 6(2):veaa049. pmid:32913664
  55. 55. Fleming JO, Shubin RA, Sussman MA, Casteel N, Stohlman SA. Monoclonal antibodies to the matrix (E1) glycoprotein of mouse hepatitis virus protect mice from encephalitis. Virology. 1989;168(1):162–7. pmid:2535900
  56. 56. Vennema H, de Groot RJ, Harbour DA, Horzinek MC, Spaan WJ. Primary structure of the membrane and nucleocapsid protein genes of feline infectious peritonitis virus and immunogenicity of recombinant vaccinia viruses in kittens. Virology. 1991; 181(1):327–35. pmid:1847259
  57. 57. Qian C, Qin D, Tang Q, Zeng Y, Tang G, Lu C. Identification of a B-cell antigenic epitope at the N-terminus of SARS-CoV M protein and characterization of monoclonal antibody against the protein. Virus Genes. 2006;33(2):147–56. pmid:16972028
  58. 58. Xing J, Liu S, Han Z, Shao Y, Li H, Kong X. Identification of a novel linear B-cell epitope in the M protein of avian infectious bronchitis coronaviruses. J Microbiol. 2009; 47(5):589–99. pmid:19851732
  59. 59. He Y, Zhou Y, Siddiqui P, Niu J, Jiang S. Identification of immunodominant epitopes on the membrane protein of the severe acute respiratory syndrome-associated coronavirus. J Clin Microbiol. 2005;43(8):3718–26. pmid:16081901