Figures
Abstract
To facilitate gene expression study and obtain accurate qRT-PCR analysis, normalization relative to stable expressed housekeeping genes is required. In this study, expression profiles of 11 candidate reference genes, including actin (Actin), elongation factor 1 α (EF1A), TATA-box-binding protein (TATA), ribosomal protein L12 (RPL12), β-tubulin (Tubulin), NADH dehydrogenase (NADH), vacuolar-type H+-ATPase (v-ATPase), succinate dehydrogenase B (SDHB), 28S ribosomal RNA (28S), 16S ribosomal RNA (16S), and 18S ribosomal RNA (18S) from the pea aphid Acyrthosiphon pisum, under different developmental stages and temperature conditions, were investigated. A total of four analytical tools, geNorm, Normfinder, BestKeeper, and the ΔCt method, were used to evaluate the suitability of these genes as endogenous controls. According to RefFinder, a web-based software tool which integrates all four above-mentioned algorithms to compare and rank the reference genes, SDHB, 16S, and NADH were the three most stable house-keeping genes under different developmental stages and temperatures. This work is intended to establish a standardized qRT-PCR protocol in pea aphid and serves as a starting point for the genomics and functional genomics research in this emerging insect model.
Citation: Yang C, Pan H, Liu Y, Zhou X (2014) Selection of Reference Genes for Expression Analysis Using Quantitative Real-Time PCR in the Pea Aphid, Acyrthosiphon pisum (Harris) (Hemiptera, Aphidiae). PLoS ONE 9(11): e110454. https://doi.org/10.1371/journal.pone.0110454
Editor: Yi Li, Wuhan Bioengineering Institute, China
Received: June 18, 2014; Accepted: September 18, 2014; Published: November 25, 2014
Copyright: © 2014 Yang et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The authors confirm that all data underlying the findings are fully available without restriction. All relevant data are within the paper and its Supporting Information files.
Funding: This research was supported by a start-up fund from the University of Kentucky to XGZ, a grant from USDA BRAG grant (Award Agreement No.: 3048108827) to XGZ, and a Special Fund for Agroscience Research in the Public Interest (Award Agreement No.: 201303028) to YL. The granting agencies have no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no conflict of interest exists.
Introduction
Quantitative real-time PCR (qRT-PCR) is a rapid and reliable method for the detection and quantification of gene expression levels during different biological processes [1]. Although qRT-PCR is often described as the gold standard, there are still some limitations of this assay such as RNA quality and quantity, reverse transcription and normalization, and efficiency of PCR reaction can influence threshold cycle (Ct)values [2], [3]. A common technique in qRT-PCR is to normalize data by measuring in parallel the expression of a reference gene from the same samples. Using housekeeping genes as a reference is the most widely adopted approach [1]. Housekeeping genes are believed to possess inherent stable and constitutive expression irrespective of physiological conditions in different samples or treatments under investigation [4], [5]. Several reports have demonstrated that some commonly used reference genes differentially expressed under different treatments or conditions [4]–[8]. In fact, no reference genes are stably expressed and suitable for the entire cell and tissue, and various experimental conditions [4]–[8].
The pea aphid, Acyrthosiphon pisum (Harris) (Hemiptera, Aphidiae), is an important cosmopolitan pest. It feeds on a wide range of legume plants(family Fabaceae) worldwide, including pea, clover, alfalfa, and broad bean, and is considered as the aphid species of major agronomical importance [9]. More importantly, it can transmit over 30plant viruses [10]. In addition, A. pisumis an emerging model organism for the studies of insect-plant interactions, especially after the release of its genome in 2010 [11]. With the advent of omics tools, there is an unprecedented opportunity to investigate the genetic basis of its physiological and biological functions [12], [13]. There have been demonstrated needs for the systematic validation of references genes in qRT-PCR analysis, normalization procedures have yet received any attention for this emerging insect model.
The objective of this study was to address an important aspect of gene expression studies in the pea aphid, Acyrthosiphon pisum, as well as in other insects which is the selection of appropriate references genes with stable expression under different experimental conditions. Here, the expression profiles of 11 candidate reference genes, including actin (Actin), elongation factor 1 α (EF1A), TATA-box-binding protein (TATA), ribosomal protein L12 (RPL12), β-tubulin (Tubulin), NADH dehydrogenase (NADH), vacuolar-type H+-ATPase (v-ATPase), succinate dehydrogenase B (SDHB), 28S ribosomal RNA (28S), 16S ribosomal RNA (16S), and 18S ribosomal RNA (18S) from the pea aphid genome [11], were examined under different developmental stages and temperatures. As a result, different sets of reference genes were recommended accordingly.
Materials and Methods
Insects
Pea aphid, Acyrthosiphon pisum (Harris) (Hemiptera, Aphidiae)colony was kindly provided by Dr. John Obrycki (University of Kentucky). Aphids were maintained at 20–28°C on seedlings of fava bean, Viciafaba (Fabales, Fabaceae)in a greenhouse.
Samples preparation
Fifteen adult females were allowed to lay the offspring for 24 h on fava bean leaves resting on wet filter paper in a petri dish (9 cm diameter). Then10 adults as one replicate and 20nymphs (less than 24 h old) as one replicate, respectively were exposed to 10°C, 22°C, and 30°C, respectively for 2 d in a climate chamber with a photoperiod of 14: 10 (L: D) and 50% relative humidity. All collected samples were preserved in 1.5 ml centrifuge tubes and stored at −80°C after being frozen in liquid nitrogen. Each treatment was repeated three times independently, therefore, there are 18 biological samples in total.
Total RNA extraction and cDNA synthesis
Total RNA was extracted using TRIzol (Invitrogen, Carlsbad, CA) following the manufacturer's instruction. First-strand cDNA was synthesized from 1 µg of total RNA using the M-MLV reverse transcription kit (Invitrogen, Carlsbad, CA) according the manufacturer's recommendations.
Reference gene selection and primer design
Eleven commonly used reference genes were selected (Table 1). PCR amplifications were performed in 50 µl reactions containing 10 µl 5×PCR Buffer (Mg2+ Plus), 1 µldNTP mix (10 mM of each nucleotide), 5 µl of each primer (10µM each), and 0.25 µl of Go Taq(5 u/µl) (Promega). The PCR parameters were as follows: one cycle of 94°C for 3 min; 35 cycles of 94°C for 30 s, 59°C for 45 s, and 72°C for 1 min; a final cycle of 72°C for 10 min. Amplicons of the expected size were purified and cloned into the pCR4-TOPO vector (Invitrogen, Carlsbad, CA) for sequencing confirmation.
Quantitative real-time PCR
Gene-specific primers (Table 1) were used in PCR reactions (20 µl) containing 7µl of ddH2O, 10 µl of 2×SYBR Green MasterMix (Bio-Rad), 1 µl of each specific primer (10 µM), and 1 µl of first-strand cDNA template. The qPCR program included an initial denaturation for 3 min at 95°C followed by 40 cycles of denaturation at 95°C for 10 s, annealing for 30 s at 55°C, and extension for 30 s at 72°C. For melting curve analysis, a dissociation step cycle (55°C for 10 s, and then 0.5°C for 10 s until 95°C) was added. The reactions were set up in 96-well format Microseal PCR plates (Bio-Rad) in triplicates. All experiments were replicated in triplicate.
Reactions were performed in a MyiQ single Color Real-Time PCR Detection System (Bio-Rad). Existence of single peaks in melting curve analysis was used to confirm gene-specific amplification and rule out non-specific amplification and primer-dimer generation. The qRT-PCR was determined for each gene using slope analysis with a linear regression model. Relative standard curves for the transcripts were generated with serial dilutions of cDNA (1/5, 1/25, 1/125, 1/625, and 1/3125). The corresponding qRT-PCR efficiencies (E) were calculated according to the equation: E = (10[-1/slope] -1)×100.
Stability of gene expression
All biological replicates were used to calculate the average Ct value. The stability of the ten housekeeping genes were evaluated by algorithms geNorm [1], NormFinder [14], BestKeeper [15], and the comparative ΔCt method [16]. Finally, we compared and ranked the tested candidates based on a web-based analysis tool RefFinder (http://www.leonxie.com/referencegene.php).
Results
Transcriptional profiling of candidate reference genes
First, 11 candidate reference genes were investigated by reverse transcription polymerase chain reaction (RT-PCR). All genes tested were expressed in pea aphid, and visualized as a single amplicon with expected size on a 1.5% agarose gel. All amplicons were sequenced and displayed 100% identity with their corresponding sequences. Furthermore, gene-specific amplification of these genes was confirmed by a single peak in real-time melting-curve analysis. A standard curve was generated for each gene, using five-fold serial dilution of the pooled cDNAs. The correlation coefficient and PCR efficiency for each standard curve were shown in Table 1.
We calculated the mean and the standard derivation (SD) of the Ct values for all the samples together. v-ATPasehad the most variable expression levels reflected in its high SD values. On the contrary, 28S had the least variable expression levels reflected in its low SD values. In addition, TATA (Ctavg = 26.05) had the highest Ct values and was therefore the least expressed among the gene candidates. 18S (Ctavg = 10.72) had the lowest Ct values and was therefore the mostly expressed among the gene candidates. (Figure 1).
The expression level of candidate reference genes are documented in Ct-value. The median is represented by the line in the box. The interquartile rang is bordered by the upper and lower edges, which indicate the 75th and 25th percentiles, respectively.
Quantitative analysis of reference candidates based on geNorm
To determine the minimal number of genes required for normalization, we computed the V-value by geNorm. Starting with two genes, the software sequentially adds another gene and recalculates the normalization factor ratio. If the added gene does not increase the normalization factor ratio above the proposed 0.15 cut-off value, then the original pair of genes is enough for normalization. However, if the new ratio is above 0.15, then more genes should be included. The first V-value <0.15 was after V2/3 (Figure 2B). This means that two reference genes were enough for reliable normalization under the developmental stages and temperature conditions.
(A) Ranking of the 11 housekeeping genes based on the stability value (M). A lower stability value indicates more stable expression. (B)Pairwise variation (V) analysis of the candidate reference genes. The geNorm first calculates an expression stability value (M) for each gene and then compares the pair-wise variation (V) of this gene with the others. A threshold of V<0.15 was suggested for valid normalization. geNorm starts by a gene pair, and tests whether the inclusion of a 3rd gene adds significant variation. The pair-wise variation (Vn/Vn+1) was analyzed between the normalization factors NFn and NFn+1 by the geNorm software to determine the optimal number of references genes required for qRT-PCR data normalization.
Determining the best reference candidates based on geNorm
GeNorm bases its ranking on the geometric mean of the SD of each transformed gene set of pair combinations (M-value). The lower the M-value is, the higher the ranking. SDHB and 16S were co-ranked as the most stable genes (M = 0.382). The overall order based on geNorm from most stable to least stable reference genes was: SDHB = 16S, NADH,18S, RPL12,28S, Tublin, Actin, EF1A, v-ATPase, TATA (Figure 2A, Table 2).
Determining the best reference candidates based on ΔCt method
Gene ranking using the ΔCt method relies on relative pair-wise comparisons. Using raw Ct values, the average SD of each gene set is inversely proportional to gene stability. As shown in Tables S1 and 5, SDHB (0.57) was the top-ranked gene. The overall order from most stable to least stable reference genes based on the ΔCt method was: SDHB, NADH,16S, RPL12,18S, Actin, EF1A,28S, Tublin, v-ATPase, TATA(Table 2).
Determining the best reference candidates based on NormFinder
SDHB (0.194) was the gene with the least variation in expression levels; thus SDHB would be the most reliable reference gene. The overall order from most stable to least stable reference genes based on NormFinder was: SDHB, NADH,16S, RPL12,18S, Actin, EF1A,28S, Tublin, v-ATPase, TATA (Table 2).
Determining the best reference candidates based on BestKeeper
BestKeeper provided a two-way ranking: Pearson's correlation coefficient and BestKeeper computed SD values. The stability of a gene is directly proportional to the [r] value, while it is inversely proportional to the SD value. SDHB(r = 0.931) and 16S (r = 0.852) had the highest[r] value, whereas 28S(SD = 0.321) and EF1A (SD = 0.371) had the least variable expression levels across all the samples (Table S2, 2).
Comprehensive ranking of best reference genes using RefFinder
All software programs except the SD value based on BestKeeper indentified SDHB as the most stable gene (Table S2). According to RefFinder, the overall order from the most stable to the least stable reference genes was:SDHB,16S, NADH,28S, RPL12,18S, EF1A, Actin, Tublin, v-ATPase, TATA. Among them, v-ATPAase and TATA both had GM values higher than 10.0 (Table 2), these two candidates had the lowest ranking and less suitable to serve as reliable reference genes for normalizing gene expression.
Discussion
qRT-PCR quantification requires robust normalization by reference genes to offset confounding variations in experimental data. Most gene expression studies in the literature use a single endogenous control; this will profoundly influence the statistical outcome and may lead to inaccurate data interpretation [17]. Currently, the reference genes studies of many insects have been accomplished including whitefly, diamondback moth, brown planthopper, beet armyworm, oriental leafworm moth, Colorado potato beetle, and oriental fruit fly [4]–[8], [18], [19]. Here, the expression profiles of 11 candidate reference genes from the pea aphid were evaluated under different developmental stages and temperature conditions. Our finding is the first step toward establishing a standardized qRT-PCR analysis for this research model.
BestKeeper ranked the genes different from the other analysis methods used (Table 2). Unlike Genorm and NormFinder, BestKeeper is not specifically built to construct a hierarchy of reference genes. Instead, BestKeeper is intent to establish the best possible referencing point using an averaged expression of multiple housekeeping genes. Therefore, based on the needs, different analytical tools should be considered. There has been ongoing discussion about the optimal number of reference genes required for qRT-PCR analysis. The fact is that multiple reference genes are increasingly used to analyze gene expression under various experimental conditions in a given experiment, because one reference gene is usually insufficient to normalize the expression results of target genes [20]. This can decreased the probability of biased normalization. Our results demonstrated that the use of two reference genes can be sufficient to normalize the expression data and provides amore conservative estimation of target gene expression (Figure 2). As a result, we strongly suggest that two internal references are necessary for studying gene expression in pea aphid under different developmental stages and temperature conditions.
This is the first study to evaluate candidate reference genes for gene expression analyses in the pea aphid. Based on the comprehensive analysis, SDHB, 16S, and NADH were the three most stable house-keeping genes under different developmental stages and temperature conditions. This study not only sheds light on establishing a standardized qRT-PCR procedure in pea aphid, but also lays a solid foundation for the genomics and functional genomics research in this insect.
Supporting Information
Table S1.
Summary of mean and SD values of gene pairwise comparison using the ΔCt method for 11 gene candidates.
https://doi.org/10.1371/journal.pone.0110454.s001
(DOCX)
Table S2.
Ranking of 11 reference gene candidates based on BestKeeper. Two criteria are considered: Pearson's correlation coefficient and BestKeeper computed SD values. The stability of a gene is directly proportional to the [r] value, while it is inversely proportional to the SD value.
https://doi.org/10.1371/journal.pone.0110454.s002
(DOCX)
Acknowledgments
The authors are grateful to anonymous reviewers and the editor for their constructive criticisms. Special thanks go to Dr. Xun Zhu for his assistance with the data analysis. The information reported in this paper (No. 14-08-063) is part of a project of the Kentucky Agricultural Experiment Station and is published with the approval of the Director.
Author Contributions
Conceived and designed the experiments: HPP YL XGZ. Performed the experiments: CXY HPP. Analyzed the data: HPP. Contributed reagents/materials/analysis tools: XGZ. Wrote the paper: HPP CXY XGZ.
References
- 1. Vandesompele J, De Preter K, Pattyn F, Poppe B, Van Roy N, et al. (2002) Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes. Genome Biol 3 research0034.
- 2. Strube C, Buschbaum S, Wolken S, Schnieder T (2008) Evaluation of reference genes for quantitative real-time PCR to investigate protein disulfide isomerase transcription pattern in the bovine lungworm Dictyocaulus viviparus. Gene 425:36–43.
- 3. Bustin SA, Benes V, Nolan T, Pfaffl MW (2005) Quantitative real-time RT-PCR-a perspective. J Mol Endocrinol 34:597–601.
- 4. Li RM, Xie W, Wang SL, Wu QJ, Yang NN, et al. (2013) Reference gene selection for qRT-PCR analysis in the sweetpotato whitefly, Bemisia tabaci (Hemiptera: Aleyrodidae). PLoS One 8:e53006.
- 5. Zhu X, Yuan M, Shakeel M, Zhang YJ, Wang SL, et al. (2014) Selection and evaluation of reference genes for expression analysis using qRT-PCR in the beet armyworm Spodoptera exigua (Hübner)(Lepidoptera: Noctuidae). PLoS One 9:e84730.
- 6. Fu W, Xie W, Zhang Z, Wang SL, Wu QJ, et al. (2014) Exploring valid reference genes for quantitative real-time PCR analysis in Plutella xylostella (Lepidoptera: Plutellidae). Int J Biol Sci 9:792–802.
- 7. Shi XQ, Guo WC, Wan PJ, Zhou LT, Ren XL, et al. (2013) Validation of reference genes for expression analysis by quantitative real-time PCR in Leptinotarsa decemlineata (Say). BMC Res Notes 6:93.
- 8. Yuan M, Lu YH, Zhu X, Wan H, Shakeel M, et al. (2014) Selection and evaluation of potential reference genes for gene expression analysis in the Brown Planthopper, Nilaparvata lugens (Hemiptera: Delphacidae) using reverse-transcription quantitative PCR. PLoS One 9:e86503.
- 9.
Van Emden HF, Harrington R</emph> (Eds.) (2007) Aphids as crop pests. CABI.
- 10.
Blackman RL, Eastop VF (2000) Aphids of the World's Crops – An Identification and Information Guide. John Wiley and Sons, Inc., New York, NY, 466 pp.
- 11. International Aphid Genomics Consortium (2010) Genome sequence of the pea aphid Acyrthosiphon pisum. PLoS Biol 8:e1000313.
- 12. Walsh TK, Brisson JA, Robertson HM, Gordon K, Jaubert-Possamai S, et al. (2010) A functional DNA methylation system in the pea aphid, Acyrthosiphon pisum. Insect Mol Biol 19:215–228.
- 13. Hansen AK, Moran NA (2011) Aphid genome expression reveals host–symbiont cooperation in the production of amino acids. P Natl Acad Sci USA 108:2849–2854.
- 14. Andersen CL, Jensen JL, Ørntoft TF (2004) Normalization of real-time quantitative reverse transcription-PCR data: a model-based variance estimation approach to identify genes suited for normalization, applied to bladder and colon cancer data sets. Cancer Res 64:5245–5250.
- 15. Pfaffl MW, Tichopad A, Prgomet C, Neuvians TP (2004) Determination of stable housekeeping genes, differentially regulated target genes and sample integrity: BestKeeper–Excel-based tool using pair-wise correlations. Biotechnol Lett 26:509–515.
- 16. Silver N, Best S, Jiang J, Thein SL (2006) Selection of housekeeping genes for gene expression studies in human reticulocytes using real-time PCR. BMC Mol Biol 7:33.
- 17. Ferguson BS, Nam H, Hopkins RG, Morrison RF (2010) Impact of reference gene selection for target gene normalization on experimental outcome using real-time qRT-PCR in adipocytes. PLoS One 5:e15208.
- 18. Shen GM, Jiang HB, Wang XN, Wang JJ (2010) Evaluation of endogenous references for gene expression profiling in different tissues of the oriental fruit fly Bactrocera dorsalis (Diptera: Tephritidae). BMC Mol Biol 11:76.
- 19. Lu YH, Yuan M, Gao XW, Kang TH, Zhan S, et al. (2013) Identification and validation of reference genes for gene expression analysis using quantitative PCR in Spodoptera litura (Lepidoptera: Noctuidae). PLoS One 8:e68059.
- 20. Veazey KJ, Golding MC (2011) Golding selection of stable reference genes for quantitative RT-PCR comparisons of mouse embryonic and extra-embryonic stem cells. PLoS One 6:27592.