Discovery of a 29-Gene Panel in Peripheral Blood Mononuclear Cells for the Detection of Colorectal Cancer and Adenomas Using High Throughput Real-Time PCR

Laura Ciarloni; Sahar Hosseinian; Sylvain Monnier-Benoit; Natsuko Imaizumi; Gian Dorta; Curzio Ruegg; On behalf of the DGNP-COL-0310 Study Group

doi:10.1371/journal.pone.0123904

Abstract

Colorectal cancer (CRC) is the second leading cause of cancer-related death in developed countries. Early detection of CRC leads to decreased CRC mortality. A blood-based CRC screening test is highly desirable due to limited invasiveness and high acceptance rate among patients compared to currently used fecal occult blood testing and colonoscopy. Here we describe the discovery and validation of a 29-gene panel in peripheral blood mononuclear cells (PBMC) for the detection of CRC and adenomatous polyps (AP). Blood samples were prospectively collected from a multicenter, case-control clinical study. First, we profiled 93 samples with 667 candidate and 3 reference genes by high throughput real-time PCR (OpenArray system). After analysis, 160 genes were retained and tested again on 51 additional samples. Low expressed and unstable genes were discarded resulting in a final dataset of 144 samples profiled with 140 genes. To define which genes, alone or in combinations had the highest potential to discriminate AP and/or CRC from controls, data were analyzed by a combination of univariate and multivariate methods. A list of 29 potentially discriminant genes was compiled and evaluated for its predictive accuracy by penalized logistic regression and bootstrap. This method discriminated AP >1cm and CRC from controls with a sensitivity of 59% and 75%, respectively, with 91% specificity. The behavior of the 29-gene panel was validated with a LightCycler 480 real-time PCR platform, commonly adopted by clinical laboratories. In this work we identified a 29-gene panel expressed in PBMC that can be used for developing a novel minimally-invasive test for accurate detection of AP and CRC using a standard real-time PCR platform.

Citation: Ciarloni L, Hosseinian S, Monnier-Benoit S, Imaizumi N, Dorta G, Ruegg C, et al. (2015) Discovery of a 29-Gene Panel in Peripheral Blood Mononuclear Cells for the Detection of Colorectal Cancer and Adenomas Using High Throughput Real-Time PCR. PLoS ONE 10(4): e0123904. https://doi.org/10.1371/journal.pone.0123904

Academic Editor: Antonio Moschetta, IRCCS Istituto Oncologico Giovanni Paolo II, ITALY

Received: December 8, 2014; Accepted: February 27, 2015; Published: April 13, 2015

Copyright: © 2015 Ciarloni et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited

Data Availability: All relevant data are within the paper and its Supporting Information files.

Funding: This study was supported by Diagnoplex SA. The funder provided support in the form of salaries for authors LC, NI, SMB, SH, but did not have any additional role in data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the author contributions’ section.

Competing interests: LC SH SMB NI were employees of Diagnoplex SA at the time of the study. Also, they own stock options in Diagnoplex SA. CR is a co-founder, advisory board member and owns stocks and shares of Diagnoplex SA. GD has served as speaker, consultant and advisory board member for Diagnoplex SA, and has received research funding from Diagnoplex SA. This does not alter our adherence to PLOS ONE policies on sharing data and materials.

Introduction

Colorectal cancer (CRC) is the third most common cancer and second leading cause of cancer-related death among men and women in Europe [1]. Importantly, CRC is often curable, when diagnosed at early stages. Moreover, detection and removal of adenomatous polyps (AP) prevents CRC formation and decreases mortality due to CRC. Several countries have already adopted screening modalities for CRC and clinical practice guidelines recommend that average risk individuals begin regular screening at 50 years of age [2, 3]

Colonoscopy is the “gold standard” for AP and CRC diagnosis, however it is not the preferred method for mass screening because of its cost, invasiveness, low compliance and limited accessibility. Currently recommended non-invasive methods for mass screening include immunochemical and guaiac fecal occult blood testing (iFOBT, gFOBT). Yet, compliance with fecal tests is still suboptimal in countries with an FOBT screening program [4, 5, 6]. Therefore, there is still a large unmet need calling for a non- or minimally-invasive, compliant, cost-effective and accurate screening test to detect AP and CRC at early stages. A blood-based screening test is highly attractive due to its minimal invasiveness and high acceptance among patients. In particular we and others have reported signatures derived from peripheral blood mononuclear cells (PBMC) gene expression profiles associated with digestive [7, 8, 9, 10], breast [11], renal [12, 13], pulmonary [14] and bladder cancers [15]. These tests are conceptually different from classical tumor biomarker tests, as they are based on the detection of the host response to tumor-derived signals [16, 17] rather than on markers originating from the tumor itself.

When searching for differences in gene expression and identification of RNA transcripts to be subsequently used as potential biomarkers, commonly used methods include microarray-based DNA hybridization platforms or RNA-based sequencing techniques [18, 19]. Although powerful, these methods are complex, time consuming and expensive, and generate high volume of data that require specialized bioinformatics tools and competencies for their analysis. Moreover, identified genes of interest require further validation by more accurate and sensitive methods such real-time qPCR, before they can be translated into clinically useful tests [20]. In alternative to these techniques, high throughput real-time qPCR platforms demonstrated to perform well when candidate gene selection was driven by solid scientific evidence, in spite of the fact that they allow the analysis of only a fraction of the transcriptome [21]. Importantly, biomarker discoveries based on qPCR platforms have the major advantage that a further validation step in view of their clinical use is not required. Furthermore, they are substantially less expensive than whole genome approaches, allowing the analysis of a larger sample set and thus increasing the statistical power of the study.

Here we report the discovery and characterization of a 29-gene panel in PBMC for the detection of colorectal adenomas and carcinomas using a nanoliter high throughput qPCR platform (OpenArray) [22]. To this purpose we used samples prospectively collected from a multicenter, case-control study in which patients were referred for colonoscopy or scheduled for surgery for CRC removal. We also demonstrated that the gene panel could be easily transferred and implemented into a medical laboratory-friendly assay, which is a key step in the development of a new cost-effective, simple blood-based colorectal cancer screening test.

Material and Methods

Patients

A case-control study (DGNP-COL-0310), including three South Korean and six Swiss centers which enrolled 1665 subjects older than 50 years that were referred for colonoscopy by general practitioner or were scheduled for surgery, was conducted from June 2010 to April 2013. The study was specifically conceived and designed for the development and validation of a new test for CRC screening. The biomarker discovery phase took place during the first half of patient recruitment. For this purpose, a subset of 144 subjects, allocated to control, CRC and AP groups (Table 1), was randomly selected, and used for gene expression profiling by high throughput qPCR.

Download:

Table 1. Characteristics of the study population.

https://doi.org/10.1371/journal.pone.0123904.t001

Subjects had no first-degree family history of CRC or a known CRC predisposition, previous history of polyps or cancer including CRC, no hepatobiliary, genitourinary, autoimmune and inflammatory disorders, including inflammatory bowel diseases, infectious diseases and fever within 4 weeks before colonoscopy. Chronic diseases common in the old population, such as diabetes, hypertension, hypercholesterolemia, heart failure, were not considered as exclusion criteria. A control subject was defined as an individual without any past and present history of colorectal lesions or diseases (e.g. small adenomas, hyperplastic polyps, cancer). The AP group included subjects diagnosed with an adenoma larger than 1 cm, based on the endoscopic measurement. The CRC group included patients with carcinoma at all four TNM stages. Final diagnosis was based on colonoscopy and histopathological evaluation.

Ethical approval

The study protocol (DGNP-COL-0310) was approved by the competent review boards and ethics committees for research on human subjects of Canton Bern, Switzerland (Kantonale Etikkommission Berne, No KEK 139/10), Canton St. Gallen, Switzerland (Ethikkommission des Kantons St. Gallen, No. EKSG 10/091/1B), Canton Vaud, Switzerland (Commission Cantonale d’éthique de la recherche sur l’être human, No. VD 77/10), Canton Basel, Switzerland (Ethikkommission beider Basel, No. EKBB 242/10), of the Severance Hospital, Yonsei University College of Medicine, South Korea (No. 4-2010-0128) and by the Institutional Bioethics Review Board of Seoul National University Hospital, South Korea (IRB No. H-1004-020-315). Written informed consent was obtained from all study participants adhering to the local ethical guidelines.

Blood collection and processing

Peripheral blood from all subjects was drawn either up to 30 days before or up to 12 weeks after colonoscopy and prior to any polyp or cancer resection or pre-operative chemotherapy. Blood samples were collected into 4x4 ml BD Vacutainer CPT tubes (Becton Dickinson, Franklin Lakes, NJ). Filled CPT tubes were kept at room temperature and PBMC separation performed within 6 hours according to manufacturer’s instructions. PBMC pellets were resuspended in RNAlater Solution (Life Technologies, Carlsbad, CA) and stored at -80°C.

RNA preparation

Automated purification of total RNA was performed on QIAcube by RNeasy Mini kit (Qiagen, Venlo, Netherlands) and included a DNase treatment. RNA concentration was measured by Nanodrop spectrophotometer (Thermo Scientific, Waltham, MA) and RNA integrity was analyzed by Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA). Samples with a RIN < 7 were considered of poor quality and discarded. On average RNA showed a RIN of 9±0.5. Isolated total RNA was aliquoted and stored at -80°C. In order to meet the high RNA concentration required by the RT protocol, RNA samples were systematically precipitated following a standard 100% ethanol/3M sodium acetate method.

Screening design and dataset generation

In order to find relevant blood biomarkers for CRC, we performed gene expression screening on 144 samples derived from patients with AP or CRC and control subjects. The screening was conceived in 2 phases (Fig 1). First, we profiled 93 samples with a large gene panel. The panel included 667 candidate, of which 42 biomarkers previously identified by our laboratory [8] and 625 new candidate selected from the literature (S1 Table). Three reference genes for qPCR data normalization were also added. The literature search focused on genes, molecular pathways and biological processes considered to be relevant in the tumor-host response such as inflammation and immune response, tumor invasion and metastasis, hematopoiesis, signal transduction pathways (in particular NF-kB pathway), chemokines and cytokines (in particular IL-1, IL-2), extracellular matrix proteins, adhesion molecules and cell surface markers. For each candidate gene, a TaqMan assay was selected from a commercial repository (Life technologies, Carlsbad, CA) (S1 Table) and distributed in three 224-assay Open Array plates, each measuring 12 samples in parallel. Hence, to fully profile 12 samples, three 224-assay plates, thermocycled in parallel, were used. To check for inter-plate variability and reproducibility, the reference gene RPLP0 was assayed for each sample on all 3 plates. RPLP0 standard deviation (SD) analysis of the 3 sample replicates showed a median SD of 0.21 Ct, with an inter-quartile range of 0.14–0.34, indicating that sample measurement was accurate and highly reproducible across the 3-plate series. In this phase we focused on catching the broadest subject biological variability rather than minimizing the technical one, therefore systematic sample replicates were not performed, to allow the profiling of a maximum number of samples.

Download:

Fig 1. Study design.

In a first screening phase performed on the OpenArray system, 670 genes were profiled on 93 samples. Out of these, 163 genes were selected and further tested in phase 2 on additional 51 samples. The final dataset included 144 samples profiled with 163 genes. A 29-gene panel was compiled based on highest power to discriminate AP/CRC from controls by univariate and multivariate analysis. Finally, the 29-gene panel was validated with a LightCycler480 platform, commonly used in clinical laboratories.

https://doi.org/10.1371/journal.pone.0123904.g001

Out of 670 genes analyzed, 133 showed no expression or a poor PCR amplification. The remaining 534 candidate genes and 3 reference genes were overall well expressed with a median Ct of 22.94 (inter-quartile range: 21.28–25.24) and a median SD of 0.7 (inter-quartile range: 0.59–1.04) (S2 Table). Gene profiles passed through a light filtering step in which they had to satisfy at least one of the following criteria for at least one of the discrimination analysis (i.e. control versus CRC or control versus AP): a p-value less than 0.1 or a fold-change greater than 1.5 (linear scale). In addition, biomarkers previously identified in our laboratory [8] were retained in this phase regardless their p-value or fold change in order to be assessed in a larger sample set and using a multivariate statistical approach. In total, 160 genes were selected, together with the three reference genes, for the second phase of the screening. This time the 163 genes, measured by 168 assays (5 genes with very low expression had a second assay to validate the measure obtained), were allocated in a single plate (S3 Table). Additional 51 samples were profiled in duplicate with this reduced gene panel to increase the sample size and the statistical power in subsequent analyses. Also, 40 samples already profiled in the phase 1, were re-analyzed to ensure the reproducibility of the measurements across phase 1 and phase 2 and the different plate format (224- vs. 168-assay format). The expression levels obtained in both phases for these 40 samples were highly correlated (R² = 0.993) (S1 Fig), prompting us to combine the 93 and 51 samples, profiled for the 163 genes, into a single final dataset (Fig 1). To compile the dataset, the mean values of the 40 samples measured in duplicates were used.

Nanoliter high throughput qPCR

Nine hundred ng of total RNA was reverse transcribed in 20 μl volume using the high-Capacity cDNA Reverse Transcription kit (Life Technologies, Carlsbad, CA) with random primers, according to the manufacturer’s instructions.

Gene expression profiling was performed using the OpenArray system [22] (Life Technologies, Carlsbad, CA), a nanoliter high throughput real-time PCR platform, allowing 3,072 reactions in a single plate. PCR reactions were performed according to the TaqMan OpenArray real-time plates protocol. Briefly, PCR reaction mixtures containing 2.5 μl GeneAmp Fast PCR Master Mix (Applied Biosystems, Carlsbad, CA), 1 μl TaqMan OpenArray Remix (Applied Biosystems, Carlsbad, CA), 0.3 μl RNase-free water, and 1.2 μl cDNA, were loaded automatically in single or duplicate reaction into the OpenArray plates using an OpenArray AccuFill instrument according to the manufacturer’s protocols. The thermal cycling protocol consisted of 40 cycles at 95°C for 15 seconds and 60°C for 1 minute. At the end of each run, the images, collected before and during the PCR run, were visually inspected to check for sample misloading or plates reading problems. Ct values were computed by the OpenArray analysis software using automatic thresholding with the Ct confidence minimum signal set at 300. Values below the minimum signal setting indicated a failed reaction or no amplification and missing values appeared in the exported data. Most of the time a failed reaction was due to expression levels beyond the limit of detection. Since the maximum Ct detected was inferior to 36, these values were replaced by the arbitrary Ct value of 36. Missing values judged to be caused by technical reasons were replaced by the median Ct values of the concerned gene across all samples. A reference RNA (Xpress Ref Human Universal total RNA) (Qiagen, Venlo, Netherlands) was present as positive control at least once in every PCR run in order to ensure process and reagent stability as well as reproducibility over different PCR runs. Negative controls containing water instead of cDNA were used to rule out possible DNA contamination.

Real-time qPCR on 384-well plates

Two hundred ng of total RNA was reverse transcribed into cDNA using SuperScript VILO cDNA Synthesis Kit (Invitrogen, Carlsbad, CA) according to manufacturer’s instructions.

RealTime ready Custom RT-qPCR assays (Roche, Basel, Switzerland) based on Universal ProbeLibrary (UPL) technology (S4 Table), were pre-loaded on 384-well plates by the manufacturer. Real-time PCR analysis on 384-well plates was performed on the Lightcycler 480 instrument (Roche, Basel, Switzerland) on 144 samples. PCR reactions were carried out in duplicates in 384-well plates in a total volume of 10 μl. Each well was loaded by an automated pipettor (MICROLAB STARLet, Hamilton Robotics, Reno, NV) with 5 μl of RealTime Ready DNA Probes Master Mix (Roche, Basel, Switzerland) and the cDNA equivalent of 2.5 ng of total RNA. The qPCR program consisted of 2 seconds at 95°C and 30 seconds at 60°C for 40 cycles. Positive and negative controls were generated with each retrotranscription batch and were included in every qPCR run for each target assay. The negative control contained neither RNA nor cDNA to confirm no contamination occurred. The positive control was made with a standardized quantity of Human Universal Reference RNA (Clontech, Mountain View, CA) aliquoted and stored at -80°C. For qPCR run validation, the negative control yielded no amplification or a Crossing point (Cp) value up or equal to 35, and the positive control a Cp value, for each target gene, that fell within a pre-determined range. Cp values were automatically calculated with the LightCycler 480 analysis software according to the 2^nd derivative maximum method [23].

Statistical analysis

PCR data derived from the OpenArray and the LC480 platforms were normalized by the ΔCT method using the mean of three housekeeping genes RPLP0, NACA and TPT1. These genes were selected because they were the most stable in 3 PBMC-related microarray dataset available from the GEO database [24] and also in qPCR analysis performed by us (data not shown).

Gene expression fold change, was defined as with , where is the normalized data.

Wilcoxon rank test [25] was applied to the normalized gene expression data in order to define genes significantly differentially expressed between groups. In addition, in phase 2 screening, Wilcoxon rank test was applied to 500 randomly selected datasets (bootstrap) and significance was set to genes appearing significant (p-value<0.05) in at least 250 bootstraps out of 500.

The multivariate analysis used for feature selection in phase 2 screening included the following methods: K-top scoring pair, a parameter-free, feature selection algorithm [26], and penalized logistic regression method with different algorithms [27, 28].

To evaluate the predictive accuracy of the 29-gene panel, penalized logistic regression models were fitted on the dataset and validated by non-overlapped bootstrap method [29]. Five hundred random datasets were drawn with replacement from dataset; each bootstrap had the same size as the training set. The model was re-fitted at each bootstrap and validated on the out-of-bag samples. The specificity and sensitivity average values over 500 bootstraps were calculated and Receiver Operating Characteristics (ROC) curves were generated by plotting the sensitivity against the false positive rate (1—specificity). Area under the curve (AUC) was calculated.

The Pearson correlation coefficient was used to evaluate linear correlation between genes’ measurements by two instruments.

The R statistics environment was used for statistical analyses.

Results

Definition of a 29-gene panel for colorectal cancer and adenoma detection

The dataset generated from phase 2 screening (163 genes and 144 samples) was analyzed and filtered for low expression and unstable genes across the two phases, and 20 genes were further discarded, reducing the number of candidate genes to 140. The data were explored in order to define which genes, alone or in combinations, had the highest power to discriminate CRC, AP and AP together with early stage CRC (AP+CRC I-II) from the control group (S3 Table). In addition, the CRC group was compared to AP group, to identify specific genes able to differentiate between CRC and AP. In general, most of the genes appeared to be up-regulated in the CRC and AP groups when compared to the control group. The observed gene expression fold changes were relatively modest, not exceeding a factor of 2.3 (log2 = 1.22) (Fig 2). When a filter based on a FC>1.3 and p-value <0.05 was applied to all group comparisons (CRC/Con, AP/Con, AP+CRCI-II/Con, CRC/AP), we found 28 genes that satisfied both criteria (S3 Table). Among those, 14 discriminate CRC and 8 AP from control group (Fig 2), two of which were common to both conditions (CES1 and IL1B). Seven were specific only for separating AP from CRC and 1 for discriminating AP +CRC I-II. Genes were confirmed to be significant when statistical testing was applied to 500 randomly generated datasets (bootstrap, data not shown).

Download:

Fig 2. Differential gene expression analysis of 140 genes.

The volcano plots summarize the gene expression fold-changes (FC) (x-axis) and the p-values (y-axis) for the comparisons: A, CRC versus control or, B, AP versus control. P-value cutoff was fixed at 0.05 (horizontal line) and fold-change threshold at ±0.38 (equivalent to ± 1.3 in linear scale, vertical line). A FC equal to 1 means that the gene is expressed in the group of interest, on average, twice as much than in the control group.

https://doi.org/10.1371/journal.pone.0123904.g002

Multivariate analysis was applied to the dataset to discriminate CRC, AP, AP+CRC from control group. It included KTS-pair [26] and five different algorithms based on penalized logistic regression method [27, 28, 30] for variable selections and model fitting. Genes were ranked according to the frequency of selection by the method used which is summarized by the multivariate score. Thirty-eight genes with a score of at least two were retained to compile the final gene list (S3 Table).

The univariate and multivariate gene lists were then merged, resulting in a final list of 29 genes (Table 2). Interestingly, most of the univariate top-scoring genes appeared to be also the top-scoring genes in the multivariate analysis. Four genes (MAP2K3, MAPK6, CD63, ITGB5), excluded by the filter of the univariate analysis because FC <1.3 but statistically significant (p value<0.05), were “rescued” by the multivariate analysis. Of the other 10 genes not statistically significant and integrated in the final lists thanks to the multivariate approach (GATA2, LTF, MMP9, CXCL10, MSL1, RHOC, FXYD5), the first three showed a FC >1.3.

Download:

Table 2. 29-gene panel for colorectal cancer and adenoma detection.

https://doi.org/10.1371/journal.pone.0123904.t002

Functional analysis conducted with the Ingenuity Pathway Analysis package (IPA; www.qiagen.com/ingenuity), revealed that the panel was enriched in genes involved in leukocytes migration and chemotaxis (CCR1, CXCL10, CXCL11, CXCR3, IL1B, IL8, ITGA2, MMP9, S100A8) (Table 3). These cellular functions are tightly related to immune cell trafficking and inflammation. Highly represented were also genes involved in cell proliferation and differentiation (BCL3, CD63, EGR1, GATA2, JUN, LTF, MAPK6, MMP11, NME1, PPARG, TNFSF13B), reflecting a possible role in hematopoiesis.

Download:

Table 3. Functional analysis of the 29-gene panel.

https://doi.org/10.1371/journal.pone.0123904.t003

Validation of the 29-gene panel

To evaluate the clinical relevance of our 29-gene panel, in particular its predictive accuracy, penalized logistic regression was applied to the dataset and fitted models were validated by non-overlapped bootstrap method. Models could discriminate CRC or AP >1cm from controls with an average sensitivity of 75% and 59%, respectively. The average specificity, defined as the number of controls correctly classified over the total number of controls, was 91%. ROC analysis determined an AUC of 0.88 (0.83–0.92, 95% CI) and of 0.85 (0.78–0.91, 95%CI) for CRC or AP detection, respectively (Fig 3). When the same approach was applied to the 15 top-ranked genes for CRC or AP discrimination by univariate analysis only (Table 2), predictive accuracy drastically decreased. When specificity was set at 91%, CRC were detected with a sensitivity of 65% and AP with a sensitivity of 37%, with an AUC of 0.86 (0.77–0.84, 95% CI) and 0.77 (0.70–0.82, 95% CI), respectively. This result supported our choice of integrating univariate and multivariate approach for gene selection: genes that otherwise would have been discarded because not meeting the fixed p-value and FC criteria, were indeed valuable for CRC and AP detection as deemed by multivariate methods.

Download:

Fig 3. Receiving Operating Characteristics (ROC) analysis.

A. Summary of the false and true positive rates of the 29-gene panel in classifying CRC cases. B. Summary of the false and true positive rates of the 29-gene panel in classifying AP cases. Analyses were performed using 500 bootstrap validations. The boxplots represent the distribution of the 500 bootstraps. The black line represents the average values over 500 bootstraps for clinical specificity and sensitivity.

https://doi.org/10.1371/journal.pone.0123904.g003

Assay migration to a 384-well plate qPCR platform

The OpenArray platform demonstrated to be valuable for high throughput gene expression profiling and biomarker discovery. However, this platform is not suitable in a routine clinical laboratory setting, in which easiness of use, flexibility and low costs are preferred. With the aim of developing the 29-gene panel into a widely used CRC screening test, we evaluated the panel behavior on a qPCR platform that is commonly adopted by clinical laboratories. The 144 samples were profiled with the 29-gene panel using a LightCycler 480 (LC480) instrument and a set of commercially available probe-based assays (Universal ProbeLibrary, Roche), preloaded on 384-well plates.

Correlation and linear regression analysis showed that gene expression levels measured on the two platforms were highly comparable (correlation coefficient: 0.933) (Fig 4A). In general, gene expression showed similar variance across the samples (Fig 4B). However, a group of lowly expressed genes displayed measurements with smaller standard deviations on the LC480 platform than on the OpenArray one, suggesting that the assays on the LC480 instrument are more accurate. When we repeated the differential gene expression analysis between controls and CRC groups with the LC480 dataset, we found that relative abundance and statistical significance for the 29 genes were similar or with the similar trend to what observed on the OpenArray dataset (Fig 4C and 4D). However, for five genes (PTGES, MMP11, IL8, CCR1, S100A8) statistical significance was lost when measured on the LC480.

Download:

Fig 4. Validation of the 29-gene panel on the LightCycler480 qPCR platform.

Scatter plots comparing analyses performed for each gene on the datasets generated on the LC480 and OpenArray platforms. The following variables have been used: A. Mean normalized expression values (ΔCp and ΔCt) (R²: 0.933), B. Mean standard deviations (SD) relative to each target gene measured, C. Gene expression fold changes between the CRC and the control group (linear absolute values), D. p-values from statistical testing between the CRC and the control group (log transformed). Lines represent a p-value<0.05. Gene names have been overlapped to the graphs.

https://doi.org/10.1371/journal.pone.0123904.g004

Discussion and Conclusions

In this work we have identified a 29-gene panel expressed in PBMC, capable of discriminating individuals with AP >1cm or CRC from healthy individuals. Penalized logistic regression analysis correctly classified 75%, 59% (sensitivity) and 91% (specificity) of CRC, AP and controls, respectively.

The approach we used is different compared to existing screening tests for CRC, as it is based on PBMC genes expression profiles. This leverages the well-established concept of tumor-host interaction and contribution of bone marrow-derived cells to tumor progression [16, 17]. It is of interest the fact that AP, considered premalignant lesions, are also detected, albeit with a lower sensitivity, by the 29-gene panel. Indeed, inflammation is associated with neoplastic colonic polyp formation [31] and inflammatory bowel diseases, in particular colitis ulcerosa, are a risk factor for CRC [32]. Importantly, non-steroidal anti-inflammatory drugs protect against CRC development [33], prevent adenoma formation in experimental models of familial adenomatous polyposis coli [34] and reverse gene expression changes in the normal colon to adenoma sequence [35]. This reinforces the notion that the proposed approach might be developed for AP and early CRC detection as more effective alternative to fecal occult blood-based tests. The pool of 670 candidate genes used for the screening included many host genes and pathways involved in inflammation, immune response and tumor progression. Importantly, these genes were not chosen based on a preceding genome-wide screen, but based on existing knowledge (i.e. literature and own data) and hypotheses (i.e. role of inflammation in cancer progression). The majority of these 29 genes of the panel are mediators/regulators of inflammation, cell motility, cell survival, cell signaling and proliferation (Table 2 and 3). This is consistent with the notion that tumor-mobilized bone marrow-derived circulating myelomoncytic cells are in a state of activation in response to tumor-released factors. We have previously shown that tumor-released PlGF and KitL are capable of modulating the differentiation program of CD11b⁺ cells mobilized from the bone marrow in response to the growing primary tumor, thereby generating pro-angiogenic or pro-metastatic CD11b⁺ cells, respectively [36, 37].

We recognize that genes associated to inflammatory processes might be similarly modulated in inflammatory conditions/diseases other than CRC or AP, possibly leading to reduced signature specificity. However, the weight of possibly “non-specific” genes is moderated by the information carried by other genes in the panel. We are planning to evaluate the gene panel in an independent test set of samples collected during this clinical study, including several inflammatory conditions, in particular inflammatory bowel diseases and to use this sample set to optimize a predictive algorithm highly specific for CRC and AP detection.

Technically, our discovery strategy leveraged the recent development of accurate and sensitive high-throughput qPCR platforms based on microfluidic technologies, such as the OpenArray [22]. The apparent disadvantage of this approach, the screening of only a limited part of the transcriptome, is overcome by several advantages. In particular, the high analytical sensitivity typical of qPCR, reached with relatively low amounts of native, non-amplified RNA, together with a direct transcript measure, will make further analytical validation steps unnecessary. In addition, the rapidity of sample processing due to the high throughput technology, with the analysis of more than hundred samples per day, and the easiness of data mining and bioinformatic analysis, greatly balanced the advantages of a genome-wide approach, like microarray and RNA-seq. With the aim of translating our gene panel into an assay easily implementable as a routine lab test, we performed a technology transfer from the high throughput platform to a standard 384-well plate qPCR platform using plates preloaded with specific target gene assays. As shown, the gene expression profiles and differential analysis results obtained from the two platforms were comparable. We therefore concluded that the LC480 platform is suitable to implement a CRC screening assay based on the 29-gene panel. Nevertheless, it would be interesting to reanalyze the samples with a genome wide approach (e.g. RNAseq) and to compare the discriminatory power of the newly identified genes to the 29-gene panel described here. Additional genes issued from such an approach might be eventually added to the current signature to improve sensitivity and specificity of the test.

The findings reported here are in line with the recently reported results obtained from a pilot monocentric study in which the feasibility of the use of a PBMC-derived signature to detect CRC and AP was demonstrated [8]. The significance of the study was limited, however, by the small sample size and by the low number of CRC samples compared to AP. The present study was designed to independently validate this pilot study in a multicenter, case-control study. The two studies also differed from a technical point of view as numerous changes were introduced in the assay procedures, in particular blood collection, PBMC isolation and the qPCR chemistry, which made necessary a fully new development and assessment.

In spite of the major clinical and technical differences in these studies, 8 genes were found common to the 29-gene panel reported here and the 42-gene panel previously reported [8]. Four of those, would have been excluded from the 29-gene panel without the decision of pushing to the phase 2 of the screening, genes identified by Nichita et al. but not significant in phase 1. Moreover, the predictive accuracy reported in the two studies is very similar, thereby demonstrating the consistency and robustness of our approach. The fact that the two panels overlap only for a fraction of genes could be explained by the fact that the biomarker discovery reported by Nichita et al. focused predominantly on adenoma samples rather than CRC. However, it is already known that different gene signatures may carry the same biological information as it was reported in gene-expression signatures in breast cancer [38].

In conclusion, we have discovered and characterized a 29-gene panel in PBMC for the detection of colorectal adenomas and CRC. The signature can discriminate AP >1cm and CRC from controls with an average sensitivity of 59% and 75%, respectively, and a specificity of 91%. We also demonstrated that the gene panel could be easily transferred and implemented into a medical laboratory-friendly assay, which is a key step in the development of a new cost-effective, simple blood-based colorectal cancer screening test. The identified signature will be the basis for developing a decisional algorithm that will be validated for its prospective discriminatory value on the remaining samples collected in this case-control multicenter study. The availability of a larger number of AP>1 cm and early stage CRC samples, as well as the presence of patients with inflammatory diseases or other type of tumors, will allow to finely tune the algorithm to be highly sensitive and specific to precancerous and early stage cancerous lesion.

Supporting Information

S1 Fig. Linear regression between mean Ct values obtained from the two screening phases.

Forty samples were analyzed in both phase 1 and phase 2 and Ct values compared. Ct1 refers to Ct values obtained in the phase 1 and Ct2 in the phase 2. Mean Ct values obtained in the two phases were highly correlated (R² = 0.993).

https://doi.org/10.1371/journal.pone.0123904.s001

(TIFF)

S1 Table. Annotated list with the 670 genes tested in the screening.

For each of the gene is reported the official gene symbol, the full gene name, the RNA reference sequence database ID (RefSeq), the TaqMan assay ID (Life Technologies, Carlsbad, CA) used during the screening as well as, if any, the alternative assay used in phase 2 (TaqMan assay 2), and at last the genes selected for phase 2 screening.

https://doi.org/10.1371/journal.pone.0123904.s002

(XLSX)

S2 Table. Summary of the univariate analysis performed during phase 1 screening.

In this phase 93 PBMC samples, equally distributed among controls (Con), adenomatous polyps >1cm (AP) and CRCs stage I-IV (stage I-II, n = 17), were profiled with 670 genes. The table displays only those 534 genes showing expression levels above the limit of detection for more than 50% of the samples or a good PCR amplification. Mean, standard deviation (SD) and median values are reported for four main analysis groups (Con, AP, CRC and AP plus CRC (AP-CRC)) and for two subgroups: CRC stages I-II only (CRCI-II) and AP plus CRCI-II (AP-CRCI-II). During the univariate analysis, the disease groups/subgroups were compared to the control one and the AP group was evaluated against the CRC one. For each gene and for each group analysis, p-values derived from Student’s t-test and Wilcoxon rank sum test and fold-change (FC) linear values, derived from the mean or median, are reported. In addition, the table indicates if the gene was selected for the phase 2 screening (163 genes) and in the final 29-gene panel.

https://doi.org/10.1371/journal.pone.0123904.s003

(XLSX)

S3 Table. Summary of the univariate and multivariate analysis performed during phase 2 screening.

In this phase 144 PBMCs samples, including 50 controls (Con), 46 adenomatous polyps >1cm (AP) and 48 CRCs stage I-IV (stage I-II, n = 24), were profiled with 163 genes. Univariate and multivariate analysis were conducted only on the 140 genes showing stable and reproducible measurements between the two phases. For the univariate analysis, the same settings explained in S2 Table were applied, and the results were reported similarly. Multivariate analysis was applied only for the discrimination of the main groups (CRC, AP, AP-CRC) from control one. It included KTS-pair and five different penalized logistic regression algorithms for variable selections and model fitting. Selection of a given gene by one multivariate method, for one particular group analysis, is indicated by 1, whereas non-selection is indicated by 0. The frequency by which each gene appeared during the analysis is summarized by the Multivariate Score, calculated by summing the gene presence in all combinations/fitted models for all group analyses. The score could range from 0 to 18 and genes with a score of at least two were retained to compile the final gene list. (nd: not determined, NA: not applicable).

https://doi.org/10.1371/journal.pone.0123904.s004

(XLSX)

S4 Table. Final 29-gene panel.

RealTime ready Custom RT-qPCR assays (Roche, Basel, Switzerland) used to validate the gene panel on the LightCycler 480 instrument are reported, including forward and reverse primer sequences and the associated UPL probe ID. The assays were pre-loaded on 384-well plates. Reference genes used for PCR values normalization are marked with *.

https://doi.org/10.1371/journal.pone.0123904.s005

(PDF)

Acknowledgments

Members of the DGNP-COL-0310 Study Group: Seoul, South Korea: Si Young Song—Severance Hospital, Division of Gastroenterology; Joo Sung Kim—Seoul National University Hospital, Department of Internal Medicine; Seung-Jae Myung—Asan Medical Center, Division of Gastroenterology. Switzerland: Jan Borovicka—St.Gallen Cantonal Hospital, Gastroenterology and Hepatology; Rémy Meier – Liestal Cantonal Hospital, Gastroenterology, Hepatology and Nutrition; Boudewijn van der Weg, Thun Hospital, Gastroenterology; Christoph Beglinger, Basel University Hospital, Department of Biomedicine; Cédric Vallet, La Côte Hospital (Morges), Division of Surgery.

The authors wish to thank Josiane Wyneger and Nathalie Rochat for excellent technical assistance in performing the experiments. We would also like to thank Dr. Stavros Therianos for inspiration and support in this study. The work was performed by Diagnoplex SA. As of December 2014 all IP and technologies that emerged from this study have been transferred to Novigenix SA. This change is reflected in the double affiliation of some of the authors.

Author Contributions

Conceived and designed the experiments: LC SH SMB NI GD CR. Performed the experiments: LC SMB SH. Analyzed the data: LC SMB SH. Contributed reagents/materials/analysis tools: SH GD. Wrote the paper: LC SH SMB CR. Writing clinical protocol: NI GD.

References

1. Jemal A, Siegel R, Xu J, Ward E. Cancer statistics, 2010. CA Cancer J Clin. 2010;60: 277–300. pmid:20610543
- View Article
- PubMed/NCBI
- Google Scholar
2. Levin B, Lieberman DA, McFarland B, Andrews KS, Brooks D, Bond J, et al. Screening and surveillance for the early detection of colorectal cancer and adenomatous polyps, 2008: a joint guideline from the American Cancer Society, the US Multi-Society Task Force on Colorectal Cancer, and the American College of Radiology. Gastroenterology. 2008;134: 1570–1595. pmid:18384785
- View Article
- PubMed/NCBI
- Google Scholar
3. von Karsa L, Patnick J, Segnan N, Atkin W, Halloran S, Lansdorp-Vogelaar I, et al. European guidelines for quality assurance in colorectal cancer screening and diagnosis: overview and introduction to the full supplement publication. Endoscopy. 2013;45: 51–59. pmid:23212726
- View Article
- PubMed/NCBI
- Google Scholar
4. Parente F, Boemo C, Ardizzoia A, Costa M, Carzaniga P, Ilardo A, et al. Outcomes and cost evaluation of the first two rounds of a colorectal cancer screening program based on immunochemical fecal occult blood test in northern Italy. Endoscopy. 2013;45: 27–34. pmid:23254404
- View Article
- PubMed/NCBI
- Google Scholar
5. Jean-Jacques M, Kaleba EO, Gatta JL, Gracia G, Ryan ER, Choucair BN. Program to improve colorectal cancer screening in a low-income, racially diverse population: a randomized controlled trial. Ann Fam Med. 2012;10: 412–417. pmid:22966104
- View Article
- PubMed/NCBI
- Google Scholar
6. von Wagner C, Baio G, Raine R, Snowball J, Morris S, Atkin W, et al. Inequalities in participation in an organized national colorectal cancer screening programme: results from the first 2.6 million invitations in England. Int J Epidemiol. 2011;40: 712–718. pmid:21330344
- View Article
- PubMed/NCBI
- Google Scholar
7. Han M, Liew CT, Zhang HW, Chao S, Zheng R, Yip KT, et al. Novel blood-based, five-gene biomarker set for the detection of colorectal cancer. Clin Cancer Res. 2008;14: 455–460. pmid:18203981
- View Article
- PubMed/NCBI
- Google Scholar
8. Nichita C, Ciarloni L, Monnier-Benoit S, Hosseinian S, Dorta G, Ruegg C. A novel gene expression signature in peripheral blood mononuclear cells for early detection of colorectal cancer. Aliment Pharmacol Ther. 2014;39: 507–517. pmid:24428642
- View Article
- PubMed/NCBI
- Google Scholar
9. Marshall KW, Mohr S, Khettabi FE, Nossova N, Chao S, Bao W, et al. A blood-based biomarker panel for stratifying current risk for colorectal cancer. Int J Cancer. 2010;126: 1177–1186. pmid:19795455
- View Article
- PubMed/NCBI
- Google Scholar
10. Honda M, Sakai Y, Yamashita T, Sakai A, Mizukoshi E, Nakamoto Y, et al. Differential gene expression profiling in blood from patients with digestive system cancers. Biochem Biophys Res Commun. 2010;400: 7–15. pmid:20682290
- View Article
- PubMed/NCBI
- Google Scholar
11. Sharma P, Sahni NS, Tibshirani R, Skaane P, Urdal P, Berghagen H, et al. Early detection of breast cancer based on gene-expression patterns in peripheral blood cells. Breast Cancer Res. 2005;7: R634–644. pmid:16168108
- View Article
- PubMed/NCBI
- Google Scholar
12. Twine NC, Stover JA, Marshall B, Dukart G, Hidalgo M, Stadler W, et al. Disease-associated expression profiles in peripheral blood mononuclear cells from patients with advanced renal cell carcinoma. Cancer Res. 2003;63: 6069–6075. pmid:14522937
- View Article
- PubMed/NCBI
- Google Scholar
13. Burczynski ME, Twine NC, Dukart G, Marshall B, Hidalgo M, Stadler WM, et al. Transcriptional profiles in peripheral blood mononuclear cells prognostic of clinical outcomes in patients with advanced renal cell carcinoma. Clin Cancer Res. 2005;11: 1181–1189. pmid:15709187
- View Article
- PubMed/NCBI
- Google Scholar
14. Showe MK, Vachani A, Kossenkov AV, Yousef M, Nichols C, Nikonova EV, et al. Gene expression profiles in peripheral blood mononuclear cells can distinguish patients with non-small cell lung cancer from patients with nonmalignant lung disease. Cancer Res. 2009;69: 9202–9210. pmid:19951989
- View Article
- PubMed/NCBI
- Google Scholar
15. Osman I, Bajorin DF, Sun TT, Zhong H, Douglas D, Scattergood J, et al. Novel blood biomarkers of human urinary bladder cancer. Clin Cancer Res. 2006;12: 3374–3380. pmid:16740760
- View Article
- PubMed/NCBI
- Google Scholar
16. Lorusso G, Ruegg C. The tumor microenvironment and its contribution to tumor evolution toward metastasis. Histochem Cell Biol. 2008;130: 1091–1103. pmid:18987874
- View Article
- PubMed/NCBI
- Google Scholar
17. Coussens LM, Werb Z. Inflammation and cancer. Nature. 2002;420: 860–867. pmid:12490959
- View Article
- PubMed/NCBI
- Google Scholar
18. Quackenbush J. Microarray analysis and tumor classification. N Engl J Med. 2006;354: 2463–2472. pmid:16760446
- View Article
- PubMed/NCBI
- Google Scholar
19. Xuan J, Yu Y, Qing T, Guo L, Shi L. Next-generation sequencing in the clinic: promises and challenges. Cancer Lett. 2013;340: 284–295. pmid:23174106
- View Article
- PubMed/NCBI
- Google Scholar
20. Bernard PS, Wittwer CT. Real-time PCR technology for cancer diagnostics. Clin Chem. 2002;48: 1178–1185. pmid:12142370
- View Article
- PubMed/NCBI
- Google Scholar
21. Espinosa E, Redondo A, Vara JA, Zamora P, Casado E, Cejas P, et al. High-throughput techniques in breast cancer: a clinical perspective. Eur J Cancer. 2006;42: 598–607. pmid:16431104
- View Article
- PubMed/NCBI
- Google Scholar
22. Henriquez-Hernandez LA, Valenciano A, Herrera-Ramos E, Lloret M, Riveros-Perez A, Lara PC. High-throughput genotyping system as a robust and useful tool in oncology: experience from a single institution. Biologicals. 2013;41: 424–429. pmid:24103542
- View Article
- PubMed/NCBI
- Google Scholar
23. Rasmussen R (2001) Quantification on the LightCycler. In: Meyer S, Wittwer C, Nakagawara K, editors. Rapid Cycle Real-time PCR, Methods and Applications. Heidelberg: Springer Press. pp. 21–34.
24. Popovici V, Goldstein DR, Antonov J, Jaggi R, Delorenzi M, Wirapati P. Selecting control genes for RT-QPCR using public microarray data. BMC Bioinformatics. 2009;10: 42. pmid:19187545
- View Article
- PubMed/NCBI
- Google Scholar
25. Wilcoxon F. Individual comparisons of grouped data by ranking methods. J Econ Entomol. 1946;39: 269. pmid:20983181
- View Article
- PubMed/NCBI
- Google Scholar
26. Tan AC, Naiman DQ, Xu L, Winslow RL, Geman D. Simple decision rules for classifying human cancers from gene expression profiles. Bioinformatics. 2005;21: 3896–3904. pmid:16105897
- View Article
- PubMed/NCBI
- Google Scholar
27. Park MY, Hastie T. L1 regularization path algorithm for generalized linear models. 2007;69: 659–677.
- View Article
- Google Scholar
28. Friedman J, Hastie T, Tibshirani R. Regularization Paths for Generalized Linear Models via Coordinate Descent. J Stat Softw. 2010;33: 1–22. pmid:20808728
- View Article
- PubMed/NCBI
- Google Scholar
29. Steyerberg EW, Harrell FE Jr., Borsboom GJ, Eijkemans MJ, Vergouwe Y, Habbema JD. Internal validation of predictive models: efficiency of some procedures for logistic regression analysis. J Clin Epidemiol. 2001;54: 774–781. pmid:11470385
- View Article
- PubMed/NCBI
- Google Scholar
30. Goeman JJ. L1 penalized estimation in the Cox proportional hazards model. Biom J. 2010;52: 70–84. pmid:19937997
- View Article
- PubMed/NCBI
- Google Scholar
31. Bilinski C, Burleson J, Forouhar F. Inflammation associated with neoplastic colonic polyps. Ann Clin Lab Sci. 2012;42: 266–270. pmid:22964614
- View Article
- PubMed/NCBI
- Google Scholar
32. Grivennikov SI. Inflammation and colorectal cancer: colitis-associated neoplasia. Semin Immunopathol. 2013;35: 229–244. pmid:23161445
- View Article
- PubMed/NCBI
- Google Scholar
33. Brown JR, DuBois RN. COX-2: a molecular target for colorectal cancer prevention. J Clin Oncol. 2005;23: 2840–2855. pmid:15837998
- View Article
- PubMed/NCBI
- Google Scholar
34. Oshima M, Murai N, Kargman S, Arguello M, Luk P, Kwong E, et al. Chemoprevention of intestinal polyposis in the Apcdelta716 mouse by rofecoxib, a specific cyclooxygenase-2 inhibitor. Cancer Res. 2001;61: 1733–1740. pmid:11245490
- View Article
- PubMed/NCBI
- Google Scholar
35. Galamb O, Spisak S, Sipos F, Toth K, Solymosi N, Wichmann B, et al. Reversal of gene expression changes in the colorectal normal-adenoma pathway by NS398 selective COX2 inhibitor. Br J Cancer. 2010;102: 765–773. pmid:20087348
- View Article
- PubMed/NCBI
- Google Scholar
36. Laurent J, Hull EF, Touvrey C, Kuonen F, Lan Q, Lorusso G, et al. Proangiogenic factor PlGF programs CD11b(+) myelomonocytes in breast cancer during differentiation of their hematopoietic progenitors. Cancer Res. 2011;71: 3781–3791. pmid:21507936
- View Article
- PubMed/NCBI
- Google Scholar
37. Kuonen F, Laurent J, Secondini C, Lorusso G, Stehle JC, Rausch T, et al. Inhibition of the Kit ligand/c-Kit axis attenuates metastasis in a mouse model mimicking local breast cancer relapse after radiotherapy. Clin Cancer Res. 2012;18: 4365–4374. pmid:22711708
- View Article
- PubMed/NCBI
- Google Scholar
38. Sotiriou C, Pusztai L. Gene-expression signatures in breast cancer. N Engl J Med. 2009;360: 790–800. pmid:19228622
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Jemal A, Siegel R, Xu J, Ward E. Cancer statistics, 2010. CA Cancer J Clin. 2010;60: 277–300. pmid:20610543
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Levin B, Lieberman DA, McFarland B, Andrews KS, Brooks D, Bond J, et al. Screening and surveillance for the early detection of colorectal cancer and adenomatous polyps, 2008: a joint guideline from the American Cancer Society, the US Multi-Society Task Force on Colorectal Cancer, and the American College of Radiology. Gastroenterology. 2008;134: 1570–1595. pmid:18384785
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. von Karsa L, Patnick J, Segnan N, Atkin W, Halloran S, Lansdorp-Vogelaar I, et al. European guidelines for quality assurance in colorectal cancer screening and diagnosis: overview and introduction to the full supplement publication. Endoscopy. 2013;45: 51–59. pmid:23212726
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Parente F, Boemo C, Ardizzoia A, Costa M, Carzaniga P, Ilardo A, et al. Outcomes and cost evaluation of the first two rounds of a colorectal cancer screening program based on immunochemical fecal occult blood test in northern Italy. Endoscopy. 2013;45: 27–34. pmid:23254404
View Article
PubMed/NCBI
Google Scholar

[14] View Article

[15] PubMed/NCBI

[16] Google Scholar

[ref5] 5. Jean-Jacques M, Kaleba EO, Gatta JL, Gracia G, Ryan ER, Choucair BN. Program to improve colorectal cancer screening in a low-income, racially diverse population: a randomized controlled trial. Ann Fam Med. 2012;10: 412–417. pmid:22966104
View Article
PubMed/NCBI
Google Scholar

[18] View Article

[19] PubMed/NCBI

[20] Google Scholar

[ref6] 6. von Wagner C, Baio G, Raine R, Snowball J, Morris S, Atkin W, et al. Inequalities in participation in an organized national colorectal cancer screening programme: results from the first 2.6 million invitations in England. Int J Epidemiol. 2011;40: 712–718. pmid:21330344
View Article
PubMed/NCBI
Google Scholar

[22] View Article

[23] PubMed/NCBI

[24] Google Scholar

[ref7] 7. Han M, Liew CT, Zhang HW, Chao S, Zheng R, Yip KT, et al. Novel blood-based, five-gene biomarker set for the detection of colorectal cancer. Clin Cancer Res. 2008;14: 455–460. pmid:18203981
View Article
PubMed/NCBI
Google Scholar

[26] View Article

[27] PubMed/NCBI

[28] Google Scholar

[ref8] 8. Nichita C, Ciarloni L, Monnier-Benoit S, Hosseinian S, Dorta G, Ruegg C. A novel gene expression signature in peripheral blood mononuclear cells for early detection of colorectal cancer. Aliment Pharmacol Ther. 2014;39: 507–517. pmid:24428642
View Article
PubMed/NCBI
Google Scholar

[30] View Article

[31] PubMed/NCBI

[32] Google Scholar

[ref9] 9. Marshall KW, Mohr S, Khettabi FE, Nossova N, Chao S, Bao W, et al. A blood-based biomarker panel for stratifying current risk for colorectal cancer. Int J Cancer. 2010;126: 1177–1186. pmid:19795455
View Article
PubMed/NCBI
Google Scholar

[34] View Article

[35] PubMed/NCBI

[36] Google Scholar

[ref10] 10. Honda M, Sakai Y, Yamashita T, Sakai A, Mizukoshi E, Nakamoto Y, et al. Differential gene expression profiling in blood from patients with digestive system cancers. Biochem Biophys Res Commun. 2010;400: 7–15. pmid:20682290
View Article
PubMed/NCBI
Google Scholar

[38] View Article

[39] PubMed/NCBI

[40] Google Scholar

[ref11] 11. Sharma P, Sahni NS, Tibshirani R, Skaane P, Urdal P, Berghagen H, et al. Early detection of breast cancer based on gene-expression patterns in peripheral blood cells. Breast Cancer Res. 2005;7: R634–644. pmid:16168108
View Article
PubMed/NCBI
Google Scholar

[42] View Article

[43] PubMed/NCBI

[44] Google Scholar

[ref12] 12. Twine NC, Stover JA, Marshall B, Dukart G, Hidalgo M, Stadler W, et al. Disease-associated expression profiles in peripheral blood mononuclear cells from patients with advanced renal cell carcinoma. Cancer Res. 2003;63: 6069–6075. pmid:14522937
View Article
PubMed/NCBI
Google Scholar

[46] View Article

[47] PubMed/NCBI

[48] Google Scholar

[ref13] 13. Burczynski ME, Twine NC, Dukart G, Marshall B, Hidalgo M, Stadler WM, et al. Transcriptional profiles in peripheral blood mononuclear cells prognostic of clinical outcomes in patients with advanced renal cell carcinoma. Clin Cancer Res. 2005;11: 1181–1189. pmid:15709187
View Article
PubMed/NCBI
Google Scholar

[50] View Article

[51] PubMed/NCBI

[52] Google Scholar

[ref14] 14. Showe MK, Vachani A, Kossenkov AV, Yousef M, Nichols C, Nikonova EV, et al. Gene expression profiles in peripheral blood mononuclear cells can distinguish patients with non-small cell lung cancer from patients with nonmalignant lung disease. Cancer Res. 2009;69: 9202–9210. pmid:19951989
View Article
PubMed/NCBI
Google Scholar

[54] View Article

[55] PubMed/NCBI

[56] Google Scholar

[ref15] 15. Osman I, Bajorin DF, Sun TT, Zhong H, Douglas D, Scattergood J, et al. Novel blood biomarkers of human urinary bladder cancer. Clin Cancer Res. 2006;12: 3374–3380. pmid:16740760
View Article
PubMed/NCBI
Google Scholar

[58] View Article

[59] PubMed/NCBI

[60] Google Scholar

[ref16] 16. Lorusso G, Ruegg C. The tumor microenvironment and its contribution to tumor evolution toward metastasis. Histochem Cell Biol. 2008;130: 1091–1103. pmid:18987874
View Article
PubMed/NCBI
Google Scholar

[62] View Article

[63] PubMed/NCBI

[64] Google Scholar

[ref17] 17. Coussens LM, Werb Z. Inflammation and cancer. Nature. 2002;420: 860–867. pmid:12490959
View Article
PubMed/NCBI
Google Scholar

[66] View Article

[67] PubMed/NCBI

[68] Google Scholar

[ref18] 18. Quackenbush J. Microarray analysis and tumor classification. N Engl J Med. 2006;354: 2463–2472. pmid:16760446
View Article
PubMed/NCBI
Google Scholar

[70] View Article

[71] PubMed/NCBI

[72] Google Scholar

[ref19] 19. Xuan J, Yu Y, Qing T, Guo L, Shi L. Next-generation sequencing in the clinic: promises and challenges. Cancer Lett. 2013;340: 284–295. pmid:23174106
View Article
PubMed/NCBI
Google Scholar

[74] View Article

[75] PubMed/NCBI

[76] Google Scholar

[ref20] 20. Bernard PS, Wittwer CT. Real-time PCR technology for cancer diagnostics. Clin Chem. 2002;48: 1178–1185. pmid:12142370
View Article
PubMed/NCBI
Google Scholar

[78] View Article

[79] PubMed/NCBI

[80] Google Scholar

[ref21] 21. Espinosa E, Redondo A, Vara JA, Zamora P, Casado E, Cejas P, et al. High-throughput techniques in breast cancer: a clinical perspective. Eur J Cancer. 2006;42: 598–607. pmid:16431104
View Article
PubMed/NCBI
Google Scholar

[82] View Article

[83] PubMed/NCBI

[84] Google Scholar

[ref22] 22. Henriquez-Hernandez LA, Valenciano A, Herrera-Ramos E, Lloret M, Riveros-Perez A, Lara PC. High-throughput genotyping system as a robust and useful tool in oncology: experience from a single institution. Biologicals. 2013;41: 424–429. pmid:24103542
View Article
PubMed/NCBI
Google Scholar

[86] View Article

[87] PubMed/NCBI

[88] Google Scholar

[ref23] 23. Rasmussen R (2001) Quantification on the LightCycler. In: Meyer S, Wittwer C, Nakagawara K, editors. Rapid Cycle Real-time PCR, Methods and Applications. Heidelberg: Springer Press. pp. 21–34.

[ref24] 24. Popovici V, Goldstein DR, Antonov J, Jaggi R, Delorenzi M, Wirapati P. Selecting control genes for RT-QPCR using public microarray data. BMC Bioinformatics. 2009;10: 42. pmid:19187545
View Article
PubMed/NCBI
Google Scholar

[91] View Article

[92] PubMed/NCBI

[93] Google Scholar

[ref25] 25. Wilcoxon F. Individual comparisons of grouped data by ranking methods. J Econ Entomol. 1946;39: 269. pmid:20983181
View Article
PubMed/NCBI
Google Scholar

[95] View Article

[96] PubMed/NCBI

[97] Google Scholar

[ref26] 26. Tan AC, Naiman DQ, Xu L, Winslow RL, Geman D. Simple decision rules for classifying human cancers from gene expression profiles. Bioinformatics. 2005;21: 3896–3904. pmid:16105897
View Article
PubMed/NCBI
Google Scholar

[99] View Article

[100] PubMed/NCBI

[101] Google Scholar

[ref27] 27. Park MY, Hastie T. L1 regularization path algorithm for generalized linear models. 2007;69: 659–677.
View Article
Google Scholar

[103] View Article

[104] Google Scholar

[ref28] 28. Friedman J, Hastie T, Tibshirani R. Regularization Paths for Generalized Linear Models via Coordinate Descent. J Stat Softw. 2010;33: 1–22. pmid:20808728
View Article
PubMed/NCBI
Google Scholar

[106] View Article

[107] PubMed/NCBI

[108] Google Scholar

[ref29] 29. Steyerberg EW, Harrell FE Jr., Borsboom GJ, Eijkemans MJ, Vergouwe Y, Habbema JD. Internal validation of predictive models: efficiency of some procedures for logistic regression analysis. J Clin Epidemiol. 2001;54: 774–781. pmid:11470385
View Article
PubMed/NCBI
Google Scholar

[110] View Article

[111] PubMed/NCBI

[112] Google Scholar

[ref30] 30. Goeman JJ. L1 penalized estimation in the Cox proportional hazards model. Biom J. 2010;52: 70–84. pmid:19937997
View Article
PubMed/NCBI
Google Scholar

[114] View Article

[115] PubMed/NCBI

[116] Google Scholar

[ref31] 31. Bilinski C, Burleson J, Forouhar F. Inflammation associated with neoplastic colonic polyps. Ann Clin Lab Sci. 2012;42: 266–270. pmid:22964614
View Article
PubMed/NCBI
Google Scholar

[118] View Article

[119] PubMed/NCBI

[120] Google Scholar

[ref32] 32. Grivennikov SI. Inflammation and colorectal cancer: colitis-associated neoplasia. Semin Immunopathol. 2013;35: 229–244. pmid:23161445
View Article
PubMed/NCBI
Google Scholar

[122] View Article

[123] PubMed/NCBI

[124] Google Scholar

[ref33] 33. Brown JR, DuBois RN. COX-2: a molecular target for colorectal cancer prevention. J Clin Oncol. 2005;23: 2840–2855. pmid:15837998
View Article
PubMed/NCBI
Google Scholar

[126] View Article

[127] PubMed/NCBI

[128] Google Scholar

[ref34] 34. Oshima M, Murai N, Kargman S, Arguello M, Luk P, Kwong E, et al. Chemoprevention of intestinal polyposis in the Apcdelta716 mouse by rofecoxib, a specific cyclooxygenase-2 inhibitor. Cancer Res. 2001;61: 1733–1740. pmid:11245490
View Article
PubMed/NCBI
Google Scholar

[130] View Article

[131] PubMed/NCBI

[132] Google Scholar

[ref35] 35. Galamb O, Spisak S, Sipos F, Toth K, Solymosi N, Wichmann B, et al. Reversal of gene expression changes in the colorectal normal-adenoma pathway by NS398 selective COX2 inhibitor. Br J Cancer. 2010;102: 765–773. pmid:20087348
View Article
PubMed/NCBI
Google Scholar

[134] View Article

[135] PubMed/NCBI

[136] Google Scholar

[ref36] 36. Laurent J, Hull EF, Touvrey C, Kuonen F, Lan Q, Lorusso G, et al. Proangiogenic factor PlGF programs CD11b(+) myelomonocytes in breast cancer during differentiation of their hematopoietic progenitors. Cancer Res. 2011;71: 3781–3791. pmid:21507936
View Article
PubMed/NCBI
Google Scholar

[138] View Article

[139] PubMed/NCBI

[140] Google Scholar

[ref37] 37. Kuonen F, Laurent J, Secondini C, Lorusso G, Stehle JC, Rausch T, et al. Inhibition of the Kit ligand/c-Kit axis attenuates metastasis in a mouse model mimicking local breast cancer relapse after radiotherapy. Clin Cancer Res. 2012;18: 4365–4374. pmid:22711708
View Article
PubMed/NCBI
Google Scholar

[142] View Article

[143] PubMed/NCBI

[144] Google Scholar

[ref38] 38. Sotiriou C, Pusztai L. Gene-expression signatures in breast cancer. N Engl J Med. 2009;360: 790–800. pmid:19228622
View Article
PubMed/NCBI
Google Scholar

[146] View Article

[147] PubMed/NCBI

[148] Google Scholar

Figures

Abstract

Introduction

Material and Methods

Patients

Ethical approval

Blood collection and processing

RNA preparation

Screening design and dataset generation

Nanoliter high throughput qPCR

Real-time qPCR on 384-well plates

Statistical analysis

Results

Definition of a 29-gene panel for colorectal cancer and adenoma detection

Validation of the 29-gene panel

Assay migration to a 384-well plate qPCR platform

Discussion and Conclusions

Supporting Information

S1 Fig. Linear regression between mean Ct values obtained from the two screening phases.

S1 Table. Annotated list with the 670 genes tested in the screening.

S2 Table. Summary of the univariate analysis performed during phase 1 screening.

S3 Table. Summary of the univariate and multivariate analysis performed during phase 2 screening.

S4 Table. Final 29-gene panel.

Acknowledgments

Author Contributions

References