Technical Analysis of cDNA Microarrays

Cinda P. Scott; Jeff VanWye; M. Danielle McDonald; Douglas L. Crawford

doi:10.1371/journal.pone.0004486

Abstract

Background

There is extensive variation in gene expression among individuals within and between populations. Accurate measures of the variation in mRNA expression using microarrays can be confounded by technical variation, which includes variation in RNA isolation procedures, day of hybridization and methods used to amplify and dye label RNA for hybridization.

Methodology/Principal Findings

In this manuscript we analyze the relationship between the amount of mRNA and the fluorescent signal from microarray hybridizations, demonstrating that for a wide range of mRNA concentrations the fluorescent signal is a linear function of the amount of mRNA. Additionally, the separate isolation, labeling or hybridization of RNA does not add significant amounts of variation in microarray measures of gene expression. Single versus double rounds of amplification for labeling do have small but significant effects on 10% of genes, although this source of technical variation is easy to avoid. To examine both technical and stochastic biological variation, mRNA expression was measured from the same five individuals over a six-week time course.

Conclusion

There were few, if any, meaningful differences in gene expression among time points. Thus, microarray measures using standard laboratory procedures can be precise and quantitative and are not subject to significant random biological noise.

Citation: Scott CP, VanWye J, McDonald MD, Crawford DL (2009) Technical Analysis of cDNA Microarrays. PLoS ONE 4(2): e4486. https://doi.org/10.1371/journal.pone.0004486

Editor: Grzegorz Kudla, University of Edinburgh, United Kingdom

Received: September 24, 2008; Accepted: November 20, 2008; Published: February 16, 2009

Copyright: © 2009 Scott et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Funding: This work was supported by NSF/OCE 0221879; NIH/NHLBI HL65470-01A2, and NSF-NIEHS P50ES12736 and NSF OCE-0432368. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Microarrays simultaneously quantify several hundred to thousands of genes on a single glass slide and their use has greatly expanded the breadth of quantified gene expression [1]–[10]. Yet the preparation of RNA affects the precision of microarray measures and therefore the ability to accurately quantify the content of an RNA sample [11]. Additionally, differences in microarray platforms, laboratory procedures and post-quantification analyses affect the precision among arrays [12]–[15]. Thus, technical variation can substantially affect the interpretation of microarrays.

For the teleost fish Fundulus heteroclitus, variation among individuals in mRNA expression is extensive: >60% of genes have significant differences in expression among individuals within a population [1], [9], [16], [17]. Many of these differences in gene expression are associated with variation in cardiac metabolism [9]. However, the accuracy and biological relevance of these differences in expression depend on the technical variation inherent to microarray processing [1].

Accurate microarray quantification is best realized when there is a linear relationship between fluorescence and RNA concentration. This linear relationship fails when the dynamic range of the microarray is exceeded. For any microarray, there are two parameters that define its dynamic range: the range of fluorescence that can be measured and the range of RNA concentrations that can bind to a specific array feature. These two components of the dynamic range reflect the two types of saturation that can occur on a microarray: photomultiplier tube (PMT) saturation and biological saturation. A linear relationship between fluorescence and RNA concentration can only occur if the cDNA on the microarray captures proportional amounts of RNA and if the PMT is not saturated.

The PMT measures the number of photons emitted by the fluorescently labeled RNA when it is excited by the lasers. PMT saturation occurs when the PMT signal exceeds the range of the analog-to-digital (A/D) converter. The A/D converter can only convert the PMT signal into a value less than or equal to 2¹⁶ − 1, or 65,535, and thus differences in fluorescence among signals at this ceiling are not discernible [18]. This type of saturation can be avoided by reducing the PMT voltage and laser power. Alternatively, the specific activity of the mRNA (number of fluorescent molecules per message) can be reduced. However, reducing the PMT voltage, the power of the lasers, or the amount of labeling does not address the question of whether or not a particular cDNA on a microarray is biologically saturated.
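The 16-bit ceiling described above can be illustrated with a minimal sketch. The function names and the example signal values are hypothetical and are not taken from the scanner software; the only figure drawn from the text is the ceiling of 65,535.

```python
# Illustrative sketch: a 16-bit A/D converter cannot report values above 2**16 - 1.
ADC_MAX = 2**16 - 1  # 65,535, the ceiling stated in the text


def digitize(true_signal):
    """Clip a hypothetical analog signal to the 16-bit range of the converter."""
    return min(int(round(true_signal)), ADC_MAX)


def is_saturated(pixel_value):
    """A value at the ceiling carries no information about how far above it the signal was."""
    return pixel_value >= ADC_MAX


# Two hypothetical spots that differ twofold in true signal
# become indistinguishable once both are clipped.
spot_a = digitize(80_000)
spot_b = digitize(160_000)
print(spot_a, spot_b, is_saturated(spot_a))
```

Once both spots read 65,535, the twofold difference between them is lost, which is why the PMT voltage, laser power, or specific activity has to be lowered before scanning rather than corrected for afterwards.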

Biological saturation occurs when the amount of mRNA that can hybridize to the DNA on a microarray reaches the maximum binding capacity of the printed DNA. If biological saturation is reached, then the amount of an mRNA will be underestimated and differences among arrays or experiments cannot be appropriately determined. To avoid biological saturation, the target RNA must be present in quantities less than the amount that the cDNA on the microarray slide can bind. To determine the range and linear response of increasing amounts of mRNA, we hybridized a 500-fold concentration range of labeled RNA from cardiac tissue to the F. heteroclitus 384 cDNA metabolic microarray.

Sources of technical variation, other than PMT and biological saturation, come from the methods used to fluorescently label the mRNA, the day on which the RNA is processed and varying amounts of available tissue [19], [20]. One of the most common approaches to fluorescently label mRNA for microarray studies is to amplify the RNA by synthesizing cDNA with a T7 RNA polymerase binding site. RNA is then synthesized in vitro by using the T7 RNA polymerase to incorporate amino allyls, followed by covalent binding of fluorescent molecules to the incorporated amino allyls [21]. For small amounts of starting mRNA, the synthesis of RNA using T7 can be repeated to double the amplification. To understand the effect of a single round versus a double round of linear amplification, we compared the quantification of RNA using both methods.

The day and process used to isolate mRNA are two additional sources of technical variation. Variation in the preparation of mRNA could alter its quality, affecting how well the RNA amplifies, how well it is fluorescently labeled, and the signal observed on the microarray. The day on which a tissue is sampled is not strictly technical but can introduce a second type of variation: biological variation. That is, isolating tissues on different days could introduce technical variation because of the precision of dissection and the quality of tissue or RNA preparation. However, because tissues are sampled on different days, the organisms may be biologically different (under more or less stress, healthier, or just one day older). To examine technical variation due only to RNA isolation, a single blood sample was divided into four, RNA was separately isolated from each sample, and gene expression was quantified. Biological variation was examined in a separate experiment where five fish were bled every two weeks for a total of six weeks in order to collect four separate samples from each individual. Gene expression was quantified for these four temporally separate samples.

These experiments indicate that for a wide range of experimental conditions, microarray experiments using the Fundulus array are both accurate and precise.

Materials and Methods

Organism

Fundulus heteroclitus were caught from wild populations in Wiscasset, Maine, USA (43°57′41″N, 69°42′45″W) by minnow trap. Fish were transported to the Rosenstiel School of Marine and Atmospheric Science at the University of Miami and acclimated to 20°C and 15ppt for approximately 6 months.

Blood Sampling

Fundulus heteroclitus (N = 20) were anesthetized with MS222 (0.1 g·l⁻¹) and given tags with subdermal latex markers. Whole blood samples from each fish were taken every two weeks by caudal puncture using a 50 µl Hamilton syringe rinsed with heparinized saline (50 i.u.·ml⁻¹). Samples were immediately frozen in liquid N₂ and stored at −80°C. Only individuals that had all four serial samples taken (N = 5) were used in the present study.

RNA isolation and amino allyl labeling

Total RNA was isolated using 4.5 M guanidinium thiocyanate, 2% N-lauroylsarcosine, 50 mM EDTA, 25 mM Tris-HCl, 0.1 M β-Mercaptoethanol and 2% Antifoam A. The extracted RNA was further purified using a Qiagen RNeasy Mini kit in accordance with the manufacturer's protocols. The quantity and quality of the RNA were determined using a spectrophotometer (Nanodrop, ND-1000 V3.2.1) and a bioanalyzer (Agilent 2100). RNA was then converted into amino allyl labeled RNA (aRNA) using the Ambion Amino Allyl MessageAmp II aRNA Amplification kit. This method converts poly-A RNA into cDNA with a T7 RNA polymerase binding site; T7 is then used to synthesize new strands of RNA (in vitro transcription) [22]. During this in vitro transcription of aRNA, an amino allyl UTP (aaUTP) is incorporated into the elongating strand. aaUTP incorporation allows for the coupling of Cy3 or Cy5 dyes (GE biosciences) onto aRNA for microarray hybridization.

Dye labeled aRNA aliquots for each hybridization (typically 30 pmol each of Cy3 and Cy5) were vacuum dried together and resuspended in 15 µl hybridization buffer (final concentration of each labeled sample = 2 pmol/µl). Hybridization buffer consisted of 5× SSPE, 1% SDS, 50% formamide, 1 mg/ml polyA, 1 mg/ml sheared herring sperm carrier DNA, and 1 mg/ml BSA. Slides were washed in sodium borohydride solution in order to reduce autofluorescence. Following rinsing, slides were boiled for 2 minutes and spin-dried in a centrifuge at 800 rpm for 3 minutes. Samples (15 µl) were heated to 90°C for 2 minutes, quick cooled to 42°C, applied to the slide (hybridization zone area was 350 mm²), and covered with a cover slip. Slides were placed in an airtight chamber humidified with paper soaked in 5× SSPE and incubated 24–48 hours at 42°C.

Microarrays

mRNA expression was measured using microarrays where each array had four spatially separated replicates per gene. The 384 F. heteroclitus cDNA microarrays were printed using 55 control genes and 329 cDNAs which encode essential proteins for cellular metabolism (Table 1). The annotation of genes and related pathways used FunnyBase [23] and these were manually compared to KEGG pathway designations. Because many genes belong to more than one pathway, central metabolic pathways were preferentially used if the gene coded for a protein that was a catabolic or anabolic enzyme (versus acting in a signaling pathway that affected metabolism). Controls include DNA spots labeled with Cy5 (positive control for position and gridding) and Ctenophore cDNA as negative controls.

Table 1. 384 Array Metabolic Pathways.

https://doi.org/10.1371/journal.pone.0004486.t001

Microarrays were created by printing cDNAs amplified with amine-linked primers onto 3-D Link Activated slides (Surmodics Inc., Eden Prairie, MN) at the University of Miami's microarray facility. All printed cDNAs were re-sequenced from the same source used for printing. The microarray slides were scanned using ScanArray Express. The raw TIFF-image data was quantified using Imagene (v5).

All experiments used a loop design for hybridization of dye labeled aRNA [24], [25]. In a loop design, each individual is labeled with both Cy3 and Cy5. Each dye labeled sample is then hybridized on a different array with another individual [26]. Thus, each individual is hybridized to two arrays with four replicates per array for a total of eight technical replicates per individual. This experimental design is a more efficient use of resources, providing more data per array, and is thus statistically more powerful than a reference design.
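The pairing logic of a loop design can be sketched as follows. This is a minimal illustration under the assumption that the loop is a simple closed ring in which each individual's Cy3 sample shares an array with the next individual's Cy5 sample; the sample names are hypothetical and the actual pairings used in the study are not reproduced here.

```python
from collections import Counter

SPOTS_PER_GENE_PER_ARRAY = 4  # four spatially separated replicates, as stated in the text


def loop_design(individuals):
    """Pair each individual (Cy3) with the next one in the ring (Cy5), one pair per array."""
    n = len(individuals)
    return [(individuals[i], individuals[(i + 1) % n]) for i in range(n)]


individuals = ["fish_A", "fish_B", "fish_C", "fish_D"]  # hypothetical sample names
arrays = loop_design(individuals)

# Count how many arrays each individual appears on, across both dye channels.
appearances = Counter(name for pair in arrays for name in pair)
replicates = {name: count * SPOTS_PER_GENE_PER_ARRAY for name, count in appearances.items()}

for cy3_sample, cy5_sample in arrays:
    print(f"array: Cy3={cy3_sample}  Cy5={cy5_sample}")
print(replicates)
```

In this ring every individual appears on exactly two arrays, once in each dye, which gives the eight technical replicates per gene per individual described above without spending any array on a common reference sample.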

To test for the relationship between fluorescence and the quantity of RNA, five concentrations of fluorescently labeled RNA were used: 1.2 to 700 pmol of Cy3 or Cy5 labeled mRNA, where pmol refers to the amount of incorporated dye (Table 2). In a 15 µl hybridization using the 384 cDNA array, this corresponds to 0.09 to 47 µM of Cy dye. Cy5 dye labeled RNA was used at concentrations 18% less than Cy3 because the Cy5 dye is a more efficient fluorophore (greater fluorescence per photon) than the Cy3 dye. The average of eight fluorescence values for each gene was normalized to the original concentration of RNA added.

Table 2. Concentrations of Cy3 and Cy5 dye labeled RNA used for hybridization.

https://doi.org/10.1371/journal.pone.0004486.t002

Criteria for Inclusion

For a gene to be included in an analysis, the average signal among all arrays and dyes had to exceed background but not exceed 95% of PMT saturation (65,535). Background signal was determined as the amount of fluorescence in negative control array elements. Not all genes met these criteria and therefore were not included in the analysis.
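These inclusion criteria can be expressed as a simple filter. The sketch below assumes that the background threshold is the mean of the negative control signals plus two standard deviations, following the description given in the Results; the function names and all signal values are hypothetical.

```python
from statistics import mean, stdev

PMT_SATURATION = 65_535                # 16-bit ceiling stated in the text
UPPER_LIMIT = 0.95 * PMT_SATURATION    # 95% of PMT saturation


def background_threshold(negative_controls, n_sd=2.0):
    """Assumed threshold: mean of the negative controls plus n_sd sample standard deviations."""
    return mean(negative_controls) + n_sd * stdev(negative_controls)


def passes_inclusion(signals, threshold):
    """Keep a gene only if its average signal is above background and below the upper limit."""
    average = mean(signals)
    return threshold < average <= UPPER_LIMIT


# Hypothetical fluorescence values for negative controls and three genes.
negative_controls = [100.0, 120.0, 90.0, 110.0]
threshold = background_threshold(negative_controls)

genes = {
    "dim_gene": [110.0, 125.0, 120.0, 115.0],            # near background
    "usable_gene": [5200.0, 4900.0, 5100.0, 5000.0],     # inside the usable range
    "bright_gene": [64000.0, 65535.0, 65000.0, 65535.0], # at or near saturation
}
included = [name for name, signals in genes.items() if passes_inclusion(signals, threshold)]
print(included)
```

Only the gene whose average falls between the two limits is retained; the dim gene is excluded as indistinguishable from background and the bright gene as too close to PMT saturation.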

Statistics

To adjust for systematic variation, gene expression values were first sum normalized, log2 transformed, and then loess normalized using Microarray Data Analysis System Software (MIDAS) [12], [27] and SAS JMP Genomics v.6.0.2. For every gene, eight fluorescence values were captured: four Cy3 values and four Cy5 values. Analysis of variance (ANOVA) was performed using SAS JMP Genomics v.6.0.2. To look for differences between single and double rounds of amplification the following ANOVA model was applied: y_ijkl = μ + A_i + D_j + T_k + R_l + ε_ijkl, where μ is the sample mean, A_i is the effect of the i^th array (i = 1–18), D_j is the effect of the j^th dye (Cy3 or Cy5), T_k is the effect of the number of rounds of amplification (single or double, k = 1–2), R_l is the effect of the day on which samples were prepared (l = 1–3), and ε_ijkl is the stochastic error. The number of rounds of amplification (single or double) and dye (channel) were treated as fixed effects, and array and the day on which samples were prepared were treated as random effects. Replicate blood samples and repeated measures of the same five individuals were analyzed with a separate ANOVA for each individual. The ANOVA model for this comparison was as follows: y_mnp = μ + A_m + D_n + T_p + ε_mnp, where μ is the sample mean, A_m is the effect of the m^th array (m = 1–4 for both replicate and repetitive samples), D_n is the effect of the n^th dye (Cy3 or Cy5), T_p is the treatment (sample) effect and ε_mnp is the stochastic error. Sample, representing either one of four temporal samples from an individual or one of four replicate blood samples, and dye (channel) were treated as fixed effects. Array was treated as a random effect. Significant differences were evaluated with a p-value cut-off of 0.01.
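The first two normalization steps can be illustrated with a short sketch. It assumes that sum normalization means rescaling each dye channel so that the two channel totals are equal, which may differ in detail from the MIDAS implementation; the loess step and the ANOVA models are not reproduced, and the intensity values are hypothetical.

```python
import math


def sum_normalize(cy3, cy5):
    """Rescale both channels so each sums to the average of the two channel totals."""
    target = (sum(cy3) + sum(cy5)) / 2
    scale_cy3 = target / sum(cy3)
    scale_cy5 = target / sum(cy5)
    return [v * scale_cy3 for v in cy3], [v * scale_cy5 for v in cy5]


def log2_transform(values):
    """Log2 transform so that a twofold change corresponds to a difference of 1."""
    return [math.log2(v) for v in values]


# Hypothetical raw intensities: the Cy5 channel is uniformly twice as bright as Cy3,
# which is a purely technical (dye or scanner) difference.
cy3_raw = [1200.0, 800.0, 15000.0, 400.0]
cy5_raw = [2400.0, 1600.0, 30000.0, 800.0]

cy3_norm, cy5_norm = sum_normalize(cy3_raw, cy5_raw)
log_ratios = [r - g for r, g in zip(log2_transform(cy5_norm), log2_transform(cy3_norm))]
print(log_ratios)
```

After rescaling, the uniform twofold dye difference disappears and the log2 ratios are all close to zero, so any remaining differences can be attributed to the samples rather than to overall channel brightness.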

Results

Biosaturation

The concentration of fluorescently labeled RNA (0.09 to 47 µM of Cy dye) represents 0.1×, 1×, 5×, 10×, 50× the concentration of RNA typically used on F. heteroclitus cDNA microarrays [9], [26], [28]–[30] (Table 2, MIAME GSE12858). Among the 329 metabolic genes on the array, 212 of these genes met our criteria of being less than 95% of the PMT saturation and more than two standard deviations above the negative controls (Ctenophore cDNA with no similarity to vertebrate genes).

The linear relationship between the amount of RNA and relative fluorescence is shown in Figure 1. To remove the gene specific differences in expression, the fluorescence at each concentration was divided by the mean fluorescence for that specific gene (Fig. 1). The linear relationship between the amount of total fluorescent RNA added and the measures of gene specific fluorescence was determined for each gene. Most genes (176/212 or 83%) had an R² > 0.95 and 78 genes had a nearly perfect R² (>0.995; Fig. 1B; Table 3). Examining the 18 genes with the lowest R² values (less than 0.8) revealed a non-linear relationship that can be explained by an apparent saturation at the 50× concentration of RNA (Fig. 1C). The non-linear relationship disappears if the fluorescence values for the 50× concentration of RNA are removed and the 0.1× to 10× values are plotted (Fig. 1D–F). In the 100-fold range (0.1× to 10×) only three genes (1.4%) had R² values less than 0.8 (Table 3). Examination of the higher concentrations (1.0× to 50×) revealed 19 genes (9%) with R² less than 0.8 (Table 3). These data suggest that for most genes there is a linear relationship over a 500-fold range of RNA; however, some cDNAs on the microarray will reach biological saturation at the highest RNA concentration.
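The per-gene calculation described above can be sketched as follows: fluorescence is divided by the gene specific mean, and R² is computed from a simple linear regression of relative fluorescence on RNA concentration. The fluorescence values are hypothetical and are chosen only to contrast a gene that stays linear with one that plateaus at 50×; they are not data from the study.

```python
def relative_fluorescence(values):
    """Divide each value by the gene specific mean, as described in the text."""
    gene_mean = sum(values) / len(values)
    return [v / gene_mean for v in values]


def r_squared(x, y):
    """Coefficient of determination for a simple linear regression of y on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    return sxy ** 2 / (sxx * syy)


concentrations = [0.1, 1.0, 5.0, 10.0, 50.0]  # multiples of the 1x RNA concentration

# Hypothetical genes: one linear over the full range, one that plateaus at 50x.
linear_gene = [20.0, 200.0, 1000.0, 2000.0, 10000.0]
saturating_gene = [20.0, 200.0, 1000.0, 2000.0, 4000.0]

r2_linear = r_squared(concentrations, relative_fluorescence(linear_gene))
r2_full = r_squared(concentrations, relative_fluorescence(saturating_gene))
r2_trimmed = r_squared(concentrations[:4], relative_fluorescence(saturating_gene[:4]))
print(r2_linear, r2_full, r2_trimmed)
```

Dropping the 50× point restores a near perfect fit for the saturating gene, which mirrors the comparison between the 0.1× to 50× and 0.1× to 10× ranges in Figure 1.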

Figure 1. Linear relationship of RNA concentration to relative fluorescence.

Graphs show linear relationship between concentrations of RNA (0.1–50×, A–C, and 0.1–10×, D–F) and relative fluorescence. Relative fluorescence is a normalized measure of fluorescence divided by the gene specific mean. 1× RNA is equal to 0.9 pmol/µl. Shown are the RNA concentrations versus fluorescence for 0.1 to 50× (A–C) and for 0.1× to 10× (D–F); for all genes (A and D), for the 78 genes with the highest R² values (B and E), and for the 18 with lowest R² values (C and F).

https://doi.org/10.1371/journal.pone.0004486.g001

Table 3. Number of genes and corresponding R² for various ranges of RNA concentrations.

https://doi.org/10.1371/journal.pone.0004486.t003

Variation in RNA preparation

To determine how RNA preparation affects variation, cardiac RNA from three individuals was combined, evenly divided, and then amino allyl and dye labeled on three separate days using single and double rounds of amplification (MIAME, GSE12858). Only 110 genes met our criteria for inclusion because many genes were below the low cut-off (Ctenophore negative control cDNAs). In this experiment fewer genes met our criteria of above background and below saturation because the sample RNA was divided for separate labeling using either single or double rounds of amplification. An ANOVA was performed to measure differences between single and double rounds of amino allyl labeled RNA amplification. Twelve of the 110 genes (11%) used in this analysis were significantly different between single and double rounds of amplification at p<0.01. The majority of genes (59%) had a higher fluorescence signal when only one round of amplification was performed.

Consistency of Quantitative Determination

In teleost fish, red blood cell (RBC) nuclei are transcriptionally active [31], [32], and these cells can be sampled without sacrificing the fish. Thus, to assess the consistency of microarray determinations, two experiments were performed on blood gene expression: 1) to examine technical variation, a single sample of blood was divided into four samples, and RNA isolation, amino allyl and dye labeling, hybridization and quantitative analyses were performed on each sample; and 2) to examine biological variation, blood from each of five individuals was sampled four times over a 6-week period and RNA was isolated from each sample (two weeks between samples; MIAME, TBA).

A one-way ANOVA was used to test for the technical variation in gene expression between the four RNA samples isolated from a single blood sample (Fig. 2). Among all 252 genes (eight replicates per gene per sample) only 6 genes were significantly different for the four isolates at a critical p-value of 0.01. Three false positives are expected at a p-value of 0.01 and thus, with only 6 significant differences (Fig. 2), there is little evidence that separate RNA isolation, labeling and hybridization has much effect on measures of gene expression. The lack of differences is not due to high technical variation: the CV (standard deviation/mean) among the eight replicates was 4%, and only three genes had a CV of >10%. Nor was it due to the low p-value of 0.01 versus 0.05 (Fig. 2); the number of significant differences simply reflects the p-values.
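The two quantities used in this argument, the expected number of false positives and the coefficient of variation, are simple to compute. The sketch below uses the gene count and critical p-value from the text; the replicate values are hypothetical, and the use of the sample (n − 1) standard deviation is an assumption, since the text does not say which form was used.

```python
from statistics import mean, stdev


def expected_false_positives(n_tests, alpha):
    """Number of tests expected to be significant by chance alone when no true effect exists."""
    return n_tests * alpha


def coefficient_of_variation(values):
    """CV = standard deviation / mean (sample standard deviation assumed here)."""
    return stdev(values) / mean(values)


# 252 genes tested at a critical p-value of 0.01, as in the text.
n_expected = expected_false_positives(252, 0.01)  # 2.52, which the text rounds to three

# Hypothetical log2 fluorescence values for the eight technical replicates of one gene.
replicates = [10.1, 9.8, 10.4, 9.9, 10.2, 10.0, 9.7, 10.3]
cv_percent = 100 * coefficient_of_variation(replicates)

print(n_expected, round(cv_percent, 1))
```

A low CV matters for the interpretation: if the replicates were noisy, a small number of significant genes could simply reflect low statistical power rather than a genuine absence of differences among RNA isolations.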

Figure 2. Gene expression for Single Blood isolate.

Heat map for a single blood isolate that was divided into four. RNA was purified, labeled and hybridized separately for each sample. Red is greater and green is less than the average gene specific fluorescence. First column (P) is the p-value from a one-way ANOVA. Only 6 genes (2.4%) out of 252 are significant at a critical p-value of 0.01. P-values (−log₁₀) shown in the heat map are from an ANOVA for significant differences among samples using the 8 replicates for each separate RNA isolation. Color bar gives fold difference for log₂ gene expression (e.g., 2 = 4×) and negative log₁₀ p-value (e.g., 2 = p-value of 0.01).

https://doi.org/10.1371/journal.pone.0004486.g002

Random biological variation can contribute to differences in expression. We tested for random biological variation by bleeding the same five individuals four times with two weeks between bleedings (Fig. 3, MIAME, TBA). For each of the 304 genes that met our criteria, an ANOVA tested for differences in expression among the four different time periods for each individual (four sample periods with eight replicates per gene per sample period). Among the four temporal samples, there were between one and seven genes that had a significant difference in expression at a p-value of 0.01 (Fig. 3). Only one individual had more than the expected number of false positives at the critical p-value: individual-00 had 7 (2%) significant genes at p-value 0.01 for 304 genes.

Figure 3. Individuals sampled over time.

Heat map for one individual (00) sampled 4 times over a total of 6 weeks. Numbers above the heat map are time points (0, 2, 4 & 6 weeks) and the “P” is for p-value (−log₁₀). P-values are from the ANOVA that tested for differences among separate blood isolations within an individual (4 isolations and 8 replicates per isolation). For gene expression, red is greater and green is lower expression than the mean expression for each gene. Table provides number of significant genes and percent (rounded up) out of the total of 304. Color bar gives fold difference for log₂ gene expression (e.g., 2 = 4×) and negative log₁₀ p-value (e.g., 2 = p-value of 0.01).

https://doi.org/10.1371/journal.pone.0004486.g003

Discussion

Understanding sources of variation in gene expression is important for determining the biological importance of measured differences in mRNA expression. The analyses of technical variation in the metabolic F. heteroclitus cDNA microarray suggest that measures of gene expression using the F. heteroclitus 384 cDNA microarray are quantitative and precise. This conclusion is based on the observation that there is a linear increase in fluorescence with increasing mRNA (Fig. 1), and that there is little additional variation due to RNA processing (Fig. 2) or the day on which RNA is isolated (Fig. 3).

There is a linear increase in fluorescence with increasing mRNA for 98.5% of genes between 0.1× and 10× concentrations (0.09 pmol/µl to 9.3 pmol/µl) and 95% of genes between 0.1× and 50× (0.09 pmol/µl to 47 pmol/µl). The linear relationship between RNA and fluorescence is quite strong for RNA concentrations of 0.1× to 10×, with an average R² of 0.97, and most genes (88%) have R² values greater than 0.95 for these four concentrations. The genes most affected by biological saturation do not have high fluorescence; if anything, their fluorescence is lower than average (genes with R²<0.8 for 1× to 50× have a mean that is 60% of the mean for all other genes). The two possible explanations for biological saturation with low fluorescence are that the synthesis of amino allyl labeled RNA for these genes is strongly truncated or that there is less DNA printed on the array for these genes. Truncation of amino allyl labeling would produce many more short probes with few labels per probe. Thus, to produce a similar fluorescence many more molecules would be necessary and these would saturate the DNA on the array. These problems can be avoided by using moderate amounts of probe (<10 pmol/µl). We typically avoid this problem by using 0.7 to 2 pmol/µl. Using concentrations of RNA up to 50× (47 pmol/µl) is feasible, but our data suggest that at such a high concentration some genes will biologically saturate the cDNA on the array; such concentrations should therefore be avoided.

If RNA samples are amino allyl labeled using one round of T7-RNA synthesis [22] versus two rounds of T7-RNA synthesis, 11% of genes have significant differences in fluorescence at a p-value of 0.01. Although this difference in gene expression for single versus double labeling is not large, it may be unacceptably high. Thus, we suggest that a researcher use either single or double labeling procedures, but not both, within any one experiment. Slightly more than half (59%) of genes with a significant difference between single and double labeling were greater for single labeling. The greater fluorescence for single labeling than for double labeling would occur if cDNA or RNA synthesis was truncated with each round of labeling. Truncation would occur if the synthesis of cDNA or RNA were incomplete, forming shorter nucleotide sequences with less fluorescence per RNA.

We used blood to test the effect of different RNA isolations, amino allyl labeling and hybridizations. The first experiment used a single blood isolation that was divided into four equal samples. There are few differences in expression, 2.4% at a p-value of 0.01 (i.e., six versus the expected three false positives). If a Bonferroni correction were applied, none of these genes would be significant. Therefore, technical errors do not necessarily contribute significant amounts of variation. Similar conclusions were made about microarray results among laboratories: many different laboratories yielded similar results using different varieties of platforms [13]–[15], [33]–[36]. However, a few laboratories yielded different results. Together these data suggest that good experimental practice can minimize the effect of technical variation.
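The effect of a Bonferroni correction on this comparison can be sketched as follows. The correction divides the critical p-value by the number of tests; the individual p-values below are hypothetical placeholders, since the per-gene p-values are not listed in the text.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold that keeps the family-wise error rate at alpha."""
    return alpha / n_tests


def significant_after_bonferroni(p_values, alpha=0.01):
    """Return the p-values that remain significant after the Bonferroni correction."""
    threshold = bonferroni_threshold(alpha, len(p_values))
    return [p for p in p_values if p < threshold]


# Hypothetical p-values for 252 genes: six fall below 0.01, the rest are unremarkable.
nominally_significant = [0.009, 0.007, 0.005, 0.004, 0.002, 0.001]
p_values = nominally_significant + [0.5] * 246

uncorrected = [p for p in p_values if p < 0.01]
survivors = significant_after_bonferroni(p_values, alpha=0.01)

print(len(uncorrected), len(survivors), bonferroni_threshold(0.01, 252))
```

With 252 tests the corrected threshold is roughly 4 × 10⁻⁵, so p-values that are only modestly below 0.01, as in this hypothetical set, no longer count as significant.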

In a separate experiment, five individuals were bled once every two weeks during a six-week period, resulting in four serial blood samples from each individual. Any differences in expression among sampling times could be due to technical variation, of which there is very little as shown by the previous experiment, or biological variation. That is, although fish appeared healthy, had normal blood glucose, and the stress hormone, cortisol, did not vary significantly (p>0.1), gene expression could vary significantly for unknown biological reasons. Yet, for the five individuals there are few, if any, meaningful differences in gene expression (only one individual had more than the expected number of false positives, Fig. 3). These data confirm the observation that technical errors do not necessarily affect microarray measures. Importantly, these data also suggest that for a tissue or blood sample there is little stochastic variation in gene expression. These data are in contrast to other publications suggesting that mRNA expression is noisy and has large stochastic variation [37], [38]. The important distinction is that for a single cell, transcription is pulsatile, occurring in bursts [37], [38], and for an individual cell this creates large stochastic variation in mRNA expression. However, our results demonstrate that for millions of cells, this variation is not apparent across a 6-week time course. We suggest that if there is a large stochastic variation in each cell, sampling of millions of cells masks this variation such that the amount of expression from any one gene is stable over time.
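The averaging argument can be illustrated with a toy simulation. This is a sketch under strong simplifying assumptions that are not taken from the study: each cell either contains a burst of 100 transcripts (with probability 0.1) or none, cells are independent, and a pooled sample contains 10,000 cells rather than millions so that the example runs quickly.

```python
import random
from statistics import mean, stdev


def cell_transcripts(rng, burst_prob=0.1, burst_size=100):
    """Toy model of bursty transcription: a cell holds either a full burst or nothing."""
    return burst_size if rng.random() < burst_prob else 0


def coefficient_of_variation(values):
    """CV = sample standard deviation / mean."""
    return stdev(values) / mean(values)


rng = random.Random(42)  # fixed seed so the sketch is repeatable

# Variation among individual cells.
single_cells = [cell_transcripts(rng) for _ in range(2000)]

# Variation among pooled samples, each averaging over many cells.
CELLS_PER_SAMPLE = 10_000
pooled_samples = [
    sum(cell_transcripts(rng) for _ in range(CELLS_PER_SAMPLE)) / CELLS_PER_SAMPLE
    for _ in range(20)
]

single_cv = coefficient_of_variation(single_cells)
pooled_cv = coefficient_of_variation(pooled_samples)
print(single_cv, pooled_cv)
```

Under these assumptions the variation among pooled samples shrinks roughly with the square root of the number of cells per sample, so expression that is highly erratic in any one cell appears stable in a tissue or blood sample.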

The microarrays used here have array elements for essential metabolic genes (Table 1) and are similar to the array elements used in previous work demonstrating larger inter-individual variation in gene expression [1], [9], [16], [17]. While the data presented here address the sources of variation in many microarray experiments, the lack of temporal variation in gene expression in our study may only reflect the expression of the metabolic genes. However, these results are similar to studies of gene expression in humans where the same individuals were sampled over a time period of 24 hours to four weeks [39]–[41]. These studies also found relatively stable expression of a more diverse set of genes when the same individuals were sampled over time. Thus, although there are good biochemical or molecular reasons to expect stochastic variation in gene expression, this variation is not necessarily observed using routine sampling methods.

Microarrays are a useful technology for observing differences in gene expression and data extracted from microarrays can be reliably reproduced. With reasonable care, any experiment involving microarrays is capable of obtaining biological data that is not masked by technical variation thereby providing a true representation of the transcriptome under a particular set of conditions. However, caution is required before making conclusions about the biological nature of the data until the sources of technical variation are understood.

Acknowledgments

We thank Dr. M. F. Oleksiak for production of the Fundulus heteroclitus EST collection, statistical analysis, and creative thought.

Author Contributions

Conceived and designed the experiments: CPS JDV MDM DLC. Performed the experiments: CPS JDV MDM DLC. Analyzed the data: CPS DLC. Contributed reagents/materials/analysis tools: DLC. Wrote the paper: CPS DLC.

References

1. Oleksiak MF, Churchill GA, Crawford DL (2002) Variation in gene expression within and among natural populations. Nature Genetics 32: 261–266.
2. de Koning DJ, Haley CS (2005) Genetical genomics in humans and model organisms. Trends in Genetics 21: 377–381.
3. Townsend JP, Cavalieri D, Hartl DL (2003) Population genetic variation in genome-wide gene expression. Molecular Biology and Evolution 20: 955–963.
4. Schadt EE, Monks SA, Drake TA, Lusis AJ, Che N, et al. (2003) Genetics of gene expression surveyed in maize, mouse and man. Nature 422: 297–302.
5. Gibson G, Riley-Berger R, Harshman L, Kopp A, Vacha S, et al. (2004) Extensive sex-specific nonadditivity of gene expression in Drosophila melanogaster. Genetics 167: 1791–1799.
6. Brem RB, Yvert G, Clinton R, Kruglyak L (2002) Genetic dissection of transcriptional regulation in budding yeast. Science 296: 752–755.
7. Enard W, Khaitovich P, Klose J, Zollner S, Heissig F, et al. (2002) Intra- and interspecific variation in primate gene expression patterns. Science 296: 340–343.
8. Monks SA, Leonardson A, Zhu H, Cundiff P, Pietrusiak P, et al. (2004) Genetic inheritance of gene expression in human cell lines. American Journal of Human Genetics 75: 1094–1105.
9. Oleksiak MF, Roach JL, Crawford DL (2005) Natural variation in cardiac metabolism and gene expression in Fundulus heteroclitus. Nature Genetics 37: 67–72.
10. Yvert G, Brem RB, Whittle J, Akey JM, Foss E, et al. (2003) Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factors. Nature Genetics 35: 57–64.
11. Baugh LR, Hill AA, Brown EL, Hunter CP (2001) Quantitative analysis of mRNA amplification by in vitro transcription. Nucleic Acids Research 29:
12. Quackenbush J (2002) Microarray data normalization and transformation. Nature Genetics 32: 496–501.
13. Irizarry RA, Warren D, Spencer F, Kim IF, Biswal S, et al. (2005) Multiple-laboratory comparison of microarray platforms. Nature Methods 2: 345–349.
14. Larkin JE, Frank BC, Gavras H, Sultana R, Quackenbush J (2005) Independence and reproducibility across microarray platforms. Nature Methods 2: 337–343.
15. Bloom G, Yang IV, Boulware D, Kwong KY, Coppola D, et al. (2004) Multi-platform, multi-site, microarray-based human tumor classification. American Journal of Pathology 164: 9–16.
16. Whitehead A, Crawford DL (2006) Variation within and among species in gene expression: raw material for evolution. Molecular Ecology 15: 1197–1211.
17. Whitehead A, Crawford DL (2006) Neutral and adaptive variation in gene expression. Proceedings of the National Academy of Sciences of the United States of America 103: 5425–5430.
18. Yang YH, Buckley MJ, Dudoit S, Speed TP (2002) Comparison of methods for image analysis on cDNA microarray data. Journal of Computational and Graphical Statistics 11: 108–136.
19. Gold D, Coombes K, Medhane D, Ramaswamy A, Ju ZL, et al. (2004) A comparative analysis of data generated using two different target preparation methods for hybridization to high-density oligonucleotide microarrays. BMC Genomics 5: -.
20. van Haaften RIM, Schroen B, Janssen BJA, van Erk A, Debets JJM, et al. (2006) Biologically relevant effects of mRNA amplification on gene expression profiles. BMC Bioinformatics 7: -.
21. Van Gelder RN, von Zastrow ME, Yool A, Dement WC, Barchas JD, et al. (1990) Amplified RNA synthesized from limited quantities of heterogeneous cDNA. Proceedings of the National Academy of Sciences of the United States of America 87: 1663–1667.
22. Eberwine J (1996) Amplification of mRNA populations using aRNA generated from immobilized oligo(dT)-T7 primed cDNA. Biotechniques 20: 584–&.
23. Paschall JE, Oleksiak MF, VanWye JD, Roach JL, Whitehead JA, et al. (2004) FunnyBase: a Systems Level Functional Annotation of Fundulus ESTs for the Analysis of Gene Expression. BMC Genomics 5: 96.
24. Kerr MK, Churchill GA (2001) Experimental design for gene expression microarrays. Biostatistics 2: 183–201.
25. Kerr MK, Churchill GA (2001) Statistical design and the analysis of gene expression microarray data. Genetical Research 77: 123–128.
26. Oleksiak MF, Churchill GA, Crawford DL (2002) Variation in gene expression within and among natural populations. Nature Genetics 32: 261–266.
27. Dudoit S, Gentleman RC, Quackenbush J (2003) Open source software for the analysis of microarray data. Biotechniques 45–51.
28. Whitehead A, Crawford D (2005) Variation in tissue-specific gene expression among natural populations. Genome Biology 6: R13.11–13.14.
29. Whitehead A, Crawford DL (2006) Neutral and adaptive variation in gene expression. Proc Natl Acad Sci U S A 103: 5425–5430.
30. Crawford DL, Oleksiak MF (2007) The biological importance of measuring individual variation. J Exp Biol 210: 1613–1621.
31. Currie S, Tufts BL (1997) Synthesis of stress protein 70 (Hsp70) in rainbow trout (Oncorhynchus mykiss) red blood cells. Journal of Experimental Biology 200: 607–614.
32. Koldkjaer P, Pottinger TG, Perry SF, Cossins AR (2004) Seasonality of the red blood cell stress response in rainbow trout (Oncorhynchus mykiss). Journal of Experimental Biology 207: 357–367.
33. Bammler T, Beyer RP, Bhattacharya S, Boorman GA, Boyles A, et al. (2005) Standardizing global gene expression analysis between laboratories and across platforms. Nature Methods 2: 351–356.
34. Tan PK, Downey TJ, Spitznagel EL, Xu P, Fu D, et al. (2003) Evaluation of gene expression measurements from commercial microarray platforms. Nucleic Acids Research 31: 5676–5684.
35. Yauk CL, Berndt ML, Williams A, Douglas GR (2004) Comprehensive comparison of six microarray technologies. Nucleic Acids Research 32: -.
36. Beissbarth T, Fellenberg K, Brors B, Arribas-Prat R, Boer JM, et al. (2000) Processing and quality control of DNA array hybridization data. Bioinformatics 16: 1014–1022.
37. Raj A, Peskin CS, Tranchina D, Vargas DY, Tyagi S (2006) Stochastic mRNA synthesis in mammalian cells. PLoS Biology 4: 1707–1719.
38. Blake WJ, Kaern M, Cantor CR, Collins JJ (2003) Noise in eukaryotic gene expression. Nature 422: 633–637.
39. Cobb JP, Mindrinos MN, Miller-Graziano C, Calvano SE, Baker HV, et al. (2005) Application of genome-wide expression analysis to human health and disease. Proc Natl Acad Sci U S A 102: 4801–4806.
40. Eady JJ, Wortley GM, Wormstone YM, Hughes JC, Astley SB, et al. (2005) Variation in gene expression profiles of peripheral blood mononuclear cells from healthy volunteers. Physiol Genomics 22: 402–411.
41. Whitney AR, Diehn M, Popper SJ, Alizadeh AA, Boldrick JC, et al. (2003) Individuality and variation in gene expression patterns in human blood. Proceedings of the National Academy of Sciences of the United States of America 100: 1896–1901.
