De novo leaf and root transcriptome analysis to identify putative genes involved in triterpenoid saponins biosynthesis in Hedera helix L.

Huapeng Sun; Fang Li; Zijian Xu; Mengli Sun; Hanqing Cong; Fei Qiao; Xiaohong Zhong

doi:10.1371/journal.pone.0182243

Abstract

Hedera helix L. is an important traditional medicinal plant in Europe. The main active components are triterpenoid saponins, but none of the potential enzymes involved in triterpenoid saponins biosynthesis have been discovered and annotated. Here is reported the first study of global transcriptome analyses using the Illumina HiSeq^™ 2500 platform for H. helix. In total, over 24 million clean reads were produced and 96,333 unigenes were assembled, with an average length of 1385 nt; more than 79,085 unigenes had at least one significant match to an existing gene model. Differentially Expressed Gene analysis identified 6,222 and 7,012 unigenes which were expressed either higher or lower in leaf samples when compared with roots. After functional annotation and classification, two pathways and 410 unigenes related to triterpenoid saponins biosynthesis were discovered. The accuracy of these de novo sequences was validated by RT-qPCR analysis and a RACE clone. These data will enrich our knowledge of triterpenoid saponin biosynthesis and provide a theoretical foundation for molecular research on H. helix.

Citation: Sun H, Li F, Xu Z, Sun M, Cong H, Qiao F, et al. (2017) De novo leaf and root transcriptome analysis to identify putative genes involved in triterpenoid saponins biosynthesis in Hedera helix L. PLoS ONE 12(8): e0182243. https://doi.org/10.1371/journal.pone.0182243

Editor: Turgay Unver, Dokuz Eylul Universitesi, TURKEY

Received: April 4, 2017; Accepted: July 14, 2017; Published: August 3, 2017

Copyright: © 2017 Sun et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the paper and its Supporting Information files. All raw Illumina reads data are available from the NCBI SRA database (accession number SRR5369456).

Funding: The authors received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Hedera helix L. is a perennially evergreen climbing vine that belongs to the Araliaceae family and is collected in the European Pharmacopoeia and British Pharmacopoeia as a traditional medicinal plant [1,2]. It occurs in Europe and is widely cultivated around the world. Functional components in H. helix with high medicinal value are extracted from fresh leaves and stems (European Pharmacopoeia 7.0, 2010). Clinical studies have revealed its pharmacological effect for treating cough [3], asthma [4], bronchitis [3,5,6] and other respiratory diseases. A recent pharmacology study also showed that the extracts have anti-inflammatory, antimicrobial, anti-oxidative, antitumour, anti-mutagenicity, antispasmodic, anti-leishmania and hepatoprotective activities [1]. Due to its efficacy, it had been approved by many European countries for clinical applications [7]. The extracts of H. helix can be made into several medicinal dosage forms, such as syrups and tablets. In Poland, preparations with its extracts as the main components are commercially available, including Hedelix syrups, Prospan, PiniHelix, Helical, Hederasal and Hederoin tablets [1,8].

Using modern biotechnology to dissect biosynthesis pathways and manipulate the functional genes involved in biosynthesis of the active components in medicinal plants are two main development directions for the modernization of herbal medicines. The main active components in H. helix are hederacoside C [9,10] and α-hederin [11,12], which belong to the oleanane-type triterpenoid saponins [13]. Many studies describe the biosynthesis pathway of triterpenoid saponins in three parts [14,15]. Part I is upstream isopentenyl diphosphate (IPP) and dimethylallyl pyrophosphate (DMAPP) biosynthesis [16,17], part II is the midstream carbocyclic of triterpenoid biosynthesis [18], and part III is the downstream modification pathway of complex functional groups in triterpenoid carbocyclic [19]. Genes upstream and midstream of the pathway have been intensively studied [20,21], but the specific downstream genes involved in catalysing triterpenoid saponin biosynthesis have yet to be revealed. Since genomic information for H. helix is lacking and no genes in the triterpenoid saponin pathway in this plant have been reported, transcriptome analysis using next-generation sequencing technology provides a convenient and economical tool. It is also powerful for identifying various transcripts and functional genes and for providing quantitative estimates of gene expression. Next-generation sequencing technology can also generate forty to eighty million bp of high-quality sequences per sample. It has been widely used in medicinal plants such as Panax ginseng [20], Panax notoginseng [21], Asparagus racemosus [22], Cephalotaxus hainanensis [23], Pseudostellaria heterophylla [24], Sinopodophyllum hexandrum [25], Solanum elaeagnifolium [26].

In this study, we aimed to screen the functional genes involved in triterpenoid saponin biosynthesis in H. helix using de novo transcriptome sequencing. Since hederacoside C, hederacoside B, hederacoside D and α-hederin are rich in leaves and hardly detectable in roots (S1 Fig), the transcriptomes between leaves and roots were compared to dissect the molecular basis of this tissue-specific accumulation. Interestingly, 2 pathways and 410 unigenes related to triterpenoid saponins biosynthesis were discovered in the present study, which enriched the database of molecular information for H. helix.

Materials and methods

Plant materials

One-year-old H. helix were cultivated from scions, and scion woods were collected from the Hunan Research Institute of Vine Plant and the green house in National Center for Citrus Improvement, Hunan Agricultural University, Hunan Province, China on April 11, 2104. Leaf and root tissues were collected randomly from these 1-year-old plants for transcriptome analysis on May 15, 2015. Tissues were rinsed in water, cut into small pieces, frozen in liquid nitrogen immediately and stored at −80°C for further analyses.

RNA extraction, cDNA library construction and transcriptome sequencing

Leaf and root samples were harvested from five plants for RNA extraction and three biological replicates (L1 and R1, L2 and R2, L3 and R3) were performed. First, total RNA was extracted using RNA plant Plus Reagent (Tiangen, Beijing, China) according to the manufacturer’s instructions for polysaccharides & polyphenolics-rich samples. The total RNA purity was checked using a NanoDrop^®2000 (Thermo, CA, USA) and the total RNA concentration and integrity were assessed using the RNA Nano 6000 Assay Kit of the Agilent Bioanalyzer 2100 system (Agilent Technologies, CA, USA).

The cDNA libraries were generated using the NEB Next^® Ultra^™ RNA Library Prep Kit for Illumina^® (NE, USA) following the manufacturer’s recommendations, and index codes were added to attribute sequences to each sample. Poly-(A) mRNA was isolated from total RNA using Oligo-(dT) magnetic beads and fragmented in fragmentation buffer. First-strand cDNA was synthesized using a random hexamer primer and M-MuLV Reverse Transcriptase (RNase H-). Second-strand cDNA was synthesized using buffer, dNTPs, RNaseH, and DNA polymerase I. The remaining overhangs were converted into blunt ends via exonuclease/polymerase activities. After adenylation of the 3’ ends of DNA fragments, NEBNext Adaptor with a hairpin loop structure was ligated to prepare for hybridization. To select cDNA fragments preferentially 250–300 bp in length, the library fragments were purified with the AMPure XP system (Beckman Coulter, Beverly, USA). Then, 3 μL of USER Enzyme (NEB, USA) was used with size-selected, adaptor-ligated cDNA at 37°C for 15 min. PCR was performed with Q5 Hot Start HiFi DNA polymerase, Universal PCR primers and Index (X) Primer. PCR products were purified (AMPure XP system) and library quality was assessed on an Agilent Bioanalyzer 2100 system.

Clustering of the index-coded samples was performed using a cBot Cluster Generation System using the TruSeq PE Cluster Kit v4-cBot-HS (Illumina) according to the manufacturer’s instructions. After cluster generation, the library preparations were sequenced on an Illumina Hiseq 2500 platform, and paired-end reads were generated.

De novo assembly, unigene annotation and functional classification

Raw reads obtained from HiSeq-2500 sequencing were filtered to exclude reads containing adaptors, reads with more than 5% unknown nucleotides, and low-quality reads. At the same time, the Q20, Q30 and GC-content of the clean data were calculated. All downstream analyses were based on clean data of high quality. De novo assembly of the transcriptome was performed with Trinity [27]. After assembly of clean reads with Trinity and removal of redundant sequences using the TGI Clustering Tool (TGICL) [28], clusters (prefixed with CL) and singletons (prefixed with Unigene) were finally obtained.

All assembled unique gene sequences were aligned against the non-redundant (Nr) protein database (http://www.ncbi.nlm.nih.gov/), SwissProt (http://www.expasy.ch/sprot/), Kyoto Encyclopedia of Genes and Genomes (KEGG) [29] (http://www.genome.jp/kegg/) and Clusters of Orthologous Groups (COG) (http://www.ncbi.nlm.nih.gov/cog/) databases using BLASTx algorithms with a threshold of E<1.0E-5, and the protein functional annotation information was searched against the Nt database using BLASTn algorithms with a threshold of E-value<0.00001 [30]. Gene ontology (GO) (http://www.geneontology.org) terms functional annotation was performed based on the best hits from Nr annotation using Blast2go software (http://www.blast2go.de/), and WEGO (http://wego.genomics.org.cn/cgi-bin/wego/index.pl) software was used for further resultant GO ID and classification [31,32]. All assembled unigenes were searched using the Gene ID listed in S1 Table.

Differentially expressed unigene analysis

The expression levels of unigenes were established by in-house Perl scripts for each leaf and root sample; clean data were mapped back onto the assembled transcriptome and read-counts for each gene were obtained from the mapping results. To calculate the gene expression levels, the Fragments Per kb per Million fragments (FPKM) method was used [33]. The expression difference was identified by Perl scripts, in which Fisher’s exact test and the likelihood ratio were proposed to identify differentially expressed genes, and the P-value and false discovery rate (FDR) for each gene were calculated. Differentially expressed genes were required to have thresholds of “log2 ratio≥1” and “FDR<0.001” [34]. Next, GO and KEGG analysis were again performed on the DEGs.

Quantitative real-time PCR analysis and RACE clone

The total RNA for RT-qPCR analysis was extracted using a RNAprep Pure Kit (Tiangen, Beijing, China), and 1.0 μg RNA was used for reverse transcription with a Fast Quant RT Kit (Tiangen, Beijing, China) in a 20-mL reaction volume according to the manufacturer's instructions. Gene-specific primer pairs (S2 and S3 Tables) were designed with Beacon Designer 8 software based on transcriptome-assembled data and synthesized by a commercial supplier (Sangon, Shanghai, China). The F-box gene was used as an internal control [35]. RT-qPCR was performed in 96-well plates in a Bio-Rad CFX96 real-time PCR system (Bio-Rad, CA, USA) with a SYBR Green-based PCR assay. The final volume for each reaction was 20 mL with the following components: 2 mL diluted cDNA template (1 mg/mL), 10 mL SYBR Green Mix (Bio-Rad, CA, USA), 2.5 mL forward primer (2.5 mM), 2.5 mL reverse primer (2.5 mM) and 3 mL ddH₂O. The reaction was conducted under the following conditions: 95°C for 3 min, followed by 40 cycles of denaturation at 95°C for 10 s and annealing/extension at 56°C for 30 s. The melting curve was obtained by heating the amplicon from 65°C to 95°C at increments of 0.5°C per 5 s. Each RT-qPCR analysis was performed with three biological replicates. The relative quantification of gene expression was computed using the 2^−ΔΔCt method. The full-length cDNAs of the two genes were cloned by RACE with a SMARTer RACE 5’/3’ Amplification kit (Clontech, CA, USA) according to the user manual. The RACE fragments were amplified by specific primers, ligated into the pRACE vector (Clontech, CA, USA) and sequenced. Then, the RACE fragments were used for the subsequent verification and analysis of full-length cDNAs.

Results

Reads generation and De novo assembly

Six cDNA libraries prepared from three leaves and three roots tissue of H. helix were sequenced using the Illumina HiSeq^™2500 platform. A total of 138,709,902 and 139,062,838 raw reads were generated, and after removing adapters and filtering the low-quality sequences, 123,162,610 and 122,589,364 clean reads were generated for the leaf (L1, L2, L3) and root (R1, R2, R3) libraries (Table 1). Since there is no reference genome sequence in H. helix, all clean reads from these six libraries were de novo assembled into contigs using the Trinity software, and reads were mapped back to contigs, redundancy was removed and the reads were assembled further using TGICL [28]. Finally, 96,333 unigenes were obtained with a mean length of 1385 nucleotides (nt) (N50 1927 nt). A detailed summary of the sequencing and assembly results is shown in Table 2 and the length distribution of all unigenes is shown in Fig 1.

Download:

Table 1. Summary of data output quality of various libraries.

https://doi.org/10.1371/journal.pone.0182243.t001

Download:

Table 2. Summary of assembly results of H. helix.

https://doi.org/10.1371/journal.pone.0182243.t002

Download:

Fig 1. Length distribution frequency of unigenes in H. helix.

https://doi.org/10.1371/journal.pone.0182243.g001

Functional annotation and classification

All assembled unigenes were first searched against the Nr protein, Swissprot protein, KEGG, GO and COG databases using the Blastx program with E-value threshold of 1E-5. Of the 96,333 unique sequences, 79,085 unigenes (82.1%) had at least one significant match to an existing gene model. The detailed results are shown in Table 3 and S1 Table. Based on the Nr database, the E-value distribution showed that approximately 54.5% of the mapped unigenes ranged from 1E-5 to 1E-100 (Fig 2A). As shown in the similarity distribution, approximately 24.9% unigene sequences have a similarity above 80%, while 67.2% of the hits have a similarity from 40% to 80% (Fig 2B). Furthermore, 31.9% of the H. helix unigenes sequences are homologous to genes of Vitis vinifera; the others are 8.3% to Theobroma cacao, 7.8% to Solanum tuberosum, 5.1% to Lycopersicon esculentum, 5.0% to Amygdalus persica, 4.7% to Populus balsamifera subsp. trichocarpa and 4.3% to Mimulus guttatus, respectively (Fig 2C).

Download:

Table 3. Summary of functional annotations of H. helix unigenes.

https://doi.org/10.1371/journal.pone.0182243.t003

Download:

Fig 2. Gene similarity of unigenes against the Nr database.

(A) E-value distribution of top BLAST hits for each unigene (E-value of 1.0E-5). (B) Similarity distribution of best BLAST hits for each unigene. (C) Distribution of BLAST results by species shown as percentage of total homologous sequences (E-value ≤1.0E-5). All plant proteins in the NCBI Nr database were used for homology search and the best hit of each sequence was used for analysis.

https://doi.org/10.1371/journal.pone.0182243.g002

GO analysis is an international standard system of gene function classification, that has three main ontologies to describe molecular functions, cellular components and biological processes. Based on the Nr annotation, we used GO analysis to classify functions and understand the general distribution of the unigenes of H. helix. Among the 75,773 annotated unigenes, 50,479 sequences were categorized into 56 GO functional groups, as shown in Fig 3 and S4 Table. The biological process category has the highest number of unigenes among the three categories, followed by the cellular components category and molecular function category. From the overall analysis, there are 15 items clustering more than 5,047 unigenes. Within the biological processes category, “cellular process” and “metabolic process” have the highest number of unigenes. Meanwhile, “cell”, “cell part” and “organelle” were enriched in the cellular components category. Genes encoding “binding” proteins and proteins related to “catalytic activity” were the largest proportion in the molecular function category.

Download:

Fig 3. Comparison of Gene ontology (GO) classifications of H. helix.

Results are summarized into three main GO categories (biological process, cellular component, molecular function) and 44 sub-categories. The x-axis indicates subcategories; right y-axis indicates number of genes in a category; and left y-axis indicates percentage of a specific category of genes in the main category.

https://doi.org/10.1371/journal.pone.0182243.g003

Using COG classification to further evaluate the completeness and effectiveness of the H. helix annotation, a total of 32,443 annotated sequences were assigned to 25 COG categories. As shown in Fig 4 and S5 Table, the cluster of “General function prediction” represents the largest group (11,547, 35.6%), followed by “Transcription” (6,459, 19.9%), “Replication, recombination and repair” (5,560, 17.1%), “Signal transduction mechanisms” (4,791 14.8%) and “Posttranslational modification, protein turnover, chaperones” (4,643 14.3%). The smallest group was “Nuclear structure”, with only 8 sequences.

Download:

Fig 4. COG function classification of H. helix transcriptome.

A total of 33,205 unigenes showed significant homology (E-value ≤1.0E-5) to genes in one of the 25 categories (A-W, Y and Z) in the NCBI COGs database.

https://doi.org/10.1371/journal.pone.0182243.g004

For KEGG analysis, a total of 96,333 annotated unigenes were mapped to identify active pathways in H. helix with a cut-off of E-value<0.00001. Of these, 47,100 with significant matches from the database were assigned to 128 KEGG pathways (S6 Table). The largest category was metabolic pathways (10,484, 22.3%), followed by biosynthesis of secondary metabolites (4,795, 10.2%), plant-pathogen interactions (2,805, 6.0%), plant hormone signal transduction (2,623, 5.6%) and spliceosome (2,074, 4.4%). These annotations provide a further understanding of the transcriptome data and their functions and pathways in H. helix.

Differentially Expressed Gene analysis

The Differentially Expressed Genes (DEG) of the six transcriptome libraries were used to discover the unigenes with significant differences in expression. In this study, the expression of unigenes was calculated by FPKM. The different analysis methods were as follows: R1 and L1, R2 and L2, R3 and L3, were compared, and the DEGs found were used to comment on all three replicates for GO classification and KEGG pathway analysis. In total, 6,222 unigenes were more expressed higher and 7,012 unigenes were expressed lower in leaf samples than root samples (S7 and S8 Tables). According to the expression differences between the roots (R) and Leaves (L) shown in Fig 5, GO and KEGG analysis were performed again based on the Nr annotation. Of the 13,234 DEGs, 8,771 were assigned to 52 GO categories, and of these unigenes, 4,243 of the 6,222 expressed higher unigenes and 4,528 of the 7,012 down-expressed lower unigenes were assigned to at least one of the GO terms (biological process, cellular component, or molecular function). After KEGG analysis was performed again, 3,657 of the 6,222 expressed higher unigenes were annotated into 123 pathways and 4,324 of the 7,012 expressed lower unigenes were annotated into 122 pathways. Meanwhile, the top 20 statistically concentrated pathways were analysed based on the false discovery rate (FDR)≤0.001 and these results can identify the main biochemical metabolic pathways and signal transduction pathways that the DEGs attended, as seen in S2 Fig.

Download:

Fig 5. Differentially expressed gene analysis of six libraries in H. helix.

(A) Expressed higher unigenes in leaf samples. (B) Expressed lower unigenes in leaf samples.

https://doi.org/10.1371/journal.pone.0182243.g005

Identification of triterpene saponin biosynthesis genes

In addition to the KEGG analysis and unigenes functional annotation results, several triterpene saponin biosynthesis genes were discovered and identified. In the KEGG analysis results of 96,333 unigenes, two pathways related to triterpene saponin biosynthesis in Hedera helix L., i.e., the terpenoid backbone biosynthesis and sesquiterpenoid and triterpenoid biosynthesis pathways, were assigned; these two pathways contained the MVA and MEP pathways upstream and carbocyclic biosynthesis midstream. From the annotation results, 339 unigenes were mapped to the terpenoid backbone biosynthesis pathway and 71 unigenes were mapped to the sesquiterpenoid and triterpenoid biosynthesis pathway. Among these unigenes, the genes encoding key enzymes involved in triterpene saponin biosynthesis are listed in full in Table 4: 6 putative genes (AACT, HMGS, HMGR, MVK, PMVK, MVD) and 8 putative genes (DXS, DXR, ispD, ispE, ispF, ispG, ispH, IDI) were discovered in the MVA and MEP pathway, respectively, and 6 putative genes (GPS, GGPS, FPS, SS, SE, β-AS) were discovered for carbocyclic biosynthesis. In most cases, more than one unigene can be annotated to the same enzyme encoded gene or gene family [36,37]; this phenomenon was observed in this study. As shown in Table 4, most genes had multiple alleles or paralogues in the transcriptome, such as 1-deoxy-D-xylulose-5-phosphate synthase (DXS, EC: 2.2.1.7), geranylgeranyl diphosphate synthase (GGPS, EC: 2.5.1.29) and squalene synthase (SS, EC: 2.5.1.21), these genes were annotated with nine paralogues. Meanwhile, in the above pathways related to triterpene saponin biosynthesis, we also identified genes with different expression levels in leaf compared to roots using combined DEG analysis and FPKM calculation results. There are 13 unigenes belonging to 8 putative genes expressed higher and 8 unigenes belonging to 4 putative genes expressed lower in leaves (Table 4). In addition, the modification of the triterpenoid carbocyclic is a putative downstream pathway of triterpene saponin biosynthesis in H. helix; specifically, CYP450s and GTs played a major role in this part. In this study, a total of 269 and 197 unigenes were identified for the CYP450 family and GT family, respectively (S9 and S10 Tables).

Download:

Table 4. Discovery and expression of unigenes involved in triterpenesaponin biosynthesis in Hedera helix L.

https://doi.org/10.1371/journal.pone.0182243.t004

Validation by RT-qPCR and RACE Clone

To confirm the accuracy of the Illumina paired-end sequencing and FPKM calculated results, we selected 12 unigenes and used RT-qPCR to determine their relative expression level in the leaf and root tissues of H. helix. All 12 unigenes were triterpene saponin biosynthesis genes and contained 5 higher-expressed unigenes unigenes (DXS_CL1741, iSPF_Unigene5248, iSPH_CL3923, SS_CL11265, SE_CL6504), 3 lower-expressed unigenes unigenes (HMGR_CL84, MVK_CL10135, DXR_CL4453) and 4 unchanged unigenes (AACT_CL10734, HMGS_CL4843, iSPE_Unigene29532) as calculated by FPKM. The RT-qPCR and FPKM results were compared and are presented in Fig 6, the expression levels are similar.

Download:

Fig 6. RT-qPCR validation of selected unigenes involved in triterpene saponin biosynthesis.

Columns indicate relative expression obtained by RT-qPCR (left y-axis); lines indicating the expression level were calculated by FPKM method (right y-axis). All data are presented as mean value of three repeats.

https://doi.org/10.1371/journal.pone.0182243.g006

To further validate the de novo assembly results, we also cloned the full-length cDNA of 2 genes (HMGR accession No. KX056076, SE accession No. KU942524) from the H. helix leaf using the RACE method according to fragments of the transcriptome (HMGR_CL84, SE_CL6504). The full-length cDNA sequence of HMGR is 2215 bp and includes a complete open reading frame of 1731 bp, encoding a polypeptide of 577 amino acids (S3 Fig). A phylogenetic tree shows that the HMGR gene and SE gene in H. helix are highly homologous with other species. The predicted molecular weight of the HMGR putative protein is 61.98kD, with a theoretical pI of 7.52. Similarly, the full-length cDNA sequence of SE is 2040 bp and includes a complete open reading frame of 1615 bp, encoding a polypeptide of 537 amino acids (S4 Fig). A phylogenetic tree was built to investigate the evolutionary relationship with other plants and found higher homology between them. The putative protein of SE had a 58.69-kD predicted molecular weight with a theoretical pI of 8.95. Both the full-length cDNA sequences of HMGR and SE had a Poly (A) signal sequence.

Discussion

As it is a traditional medicinal plant in Europe, previous studies of H. helix have mainly focused on its pharmacology, efficacy and active components [1,3,7]; molecular biology research is less common, and genomic and transcriptomic data are still not available in NCBI database for this plant. As the first study using next-generation sequencing technology to obtain transcript coverage of H. helix, our deep sequencing provides important, fundamental molecular data. In the current study, the clean reads and de novo assembled unigenes were obtained using the Illumina HiseqTM 2500 platform (3 replicates of leaf and root tissues, 6 libraries in total). Their N50 length and average length indicated that all sequencing results were assembled effectively and with high quality [37,38]. In transcriptome analysis without a reference genome, unigenes were mapped in a public database, and unigenes with a high matching rate by the homology principle were annotated and classified. The results indicate that the transcriptomes of H. helix have great accuracy and integrity and can be used for functional genes studies or other molecular biology studies on H. helix.

As triterpenoid saponins are the main functional components in H. helix [4,9,10], here we aimed to expand knowledge of their biosynthesis pathway and screen related functional genes using transcriptome sequencing. The HPLC results clearly show that triterpenoid saponin contents are significantly different in leaves and roots (S1 Fig); therefore, choosing the leaf and root for comparative transcriptome analysis will greatly facilitate dissection of the genes involved in organ-specific secondary metabolite biosynthesis of triterpenoid saponins. This approach is widely used for mining and identifying novel genes in secondary metabolite biosynthesis [22, 39]. In this study, with de novo assembling and deciphering, the terpenoid biosynthesis backbone and sesquiterpenoid and triterpenoid biosynthesis pathway were addressed, and 20 functional genes were found. Although the two pathways are common pathways in the biosynthesis of terpenoids and sterols reported in Panax notoginseng [36] and Solanum elaeagnifolium [26], our results provide the first accurate and comprehensive gene information for H. helix.

CYP450s [19, 40, 41] and GTs [42,43], which modify the triterpene carbon ring, play key roles in the biosynthesis of various natural products. Moreover, CYP450s catalyse an irreversible oxygenating reaction and are multifunctional enzymes with a very complicated functional classification. Therefore, they have attracted special attention in metabolic engineering. However, in the synthesis pathway of terpenoid-based natural products, only 19 CYP450s gene functions have been validated [40].

https://doi.org/10.1371/journal.pone.0182243.s014

(XLS)

References

1. Lutsenko Y, Bylka W, MatlawskaI , Darmohray R. (2010) Hedera helix as a medicinal plant. Herba. Polonica 56: 83–96.
- View Article
- Google Scholar
2. Landgrebe H, Matusch R, Marburg FR, Hecker M. (1999) Effectiveness and use of an old medicinal plant. Pharm Zeitung Nr 35: 11–15.
- View Article
- Google Scholar
3. Fazio S, Pouso J, Dolinsky D, Fernandez A, Hernandez M, Clavier G, et al. (2009) Tolerance, safety and efficacy of Hedera helix extract in inflammatory bronchial diseases under clinical practice conditions: a prospective, open, multicentre post marketing study in 9657 patients. Phytomedicine 16: 17–24. pmid:16860549
- View Article
- PubMed/NCBI
- Google Scholar
4. Hofmann D, Hecker M, Völp A. (2003) Efficacy of dry extract of ivy leaves in children with bronchial asthma-a review of randomized controlled trials. Phytomedicine 10: 213–220. pmid:12725580
- View Article
- PubMed/NCBI
- Google Scholar
5. Marzian O. (2007) Treatment of acute bronchitis in children and adolescents. Non-interventional post marketing surveillance study confirms the benefit and safety of a syrup made of extracts from thyme and ivy leaves. MMW Fortschr Med 149: 69–74. pmid:17619603
- View Article
- PubMed/NCBI
- Google Scholar
6. Cwientzek U, Ottillinger B, Arenberger P. (2011) Acute bronchitis therapy with ivy leaves extracts in a two-arm study. A double-blind, randomised study vs. another ivy leaves extract. Phytomedicine 18: 1105–1109. pmid:21802921
- View Article
- PubMed/NCBI
- Google Scholar
7. Holzinger F, Chenot JF. (2011) Systematic review of clinical trials assessing the effectiveness of Ivy leaf (Hedera Helix) for acute upper respiratory tract infections. Evid-based Compl Alt 1: 1–9.
- View Article
- Google Scholar
8. Stauss-Grabo M, Atiye S, Warnke A, Wedemeyer R, Donath F, Blume H. (2011) Observational study on the tolerability and safety of film-coated tablets containing ivy extract (Prospan^® Cough Tablets) in the treatment of colds accompanied by coughing. Phytomedicine 18: 433–436. pmid:21211950
- View Article
- PubMed/NCBI
- Google Scholar
9. Khdair A, Mohammad MK, Tawaha K, Al-Hamarsheh E, AlKhatib HS, Al-Khalidi B, et al. (2010) A validated RP HPLC-PAD method for the determination of hederacoside C in Ivy-Thyme cough syrup. Int J Anal Chem pmid:20862201
- View Article
- PubMed/NCBI
- Google Scholar
10. Hussien SA and Awad ZJ. (2014) Isolation and characterization of triterpenoid saponin Hederacoside C present in the leaves of Hedera helix L. Cultivated in Iraq. Iraq J Pharm Sci 23: 33–41.
- View Article
- Google Scholar
11. Prescott AK, Rigby LP, Veitch NC, Simmonds SJ. (2014) The haploinsufficiency profile of a-hederin suggests a caspofungin-like antifungal mode of action. Phytochemistry 101: 116–120. pmid:24569176
- View Article
- PubMed/NCBI
- Google Scholar
12. Mendel M, Chłopecka M, Dziekan N, Wiechetek M. (2013) Participation of extracellular calcium in α-hederin-induced contractions of rat isolated stomach strips. J Ethnopharmacol 146: 423–426. pmid:23274745
- View Article
- PubMed/NCBI
- Google Scholar
13. Vincken JP, Heng L, de Groot A, Gruppena H. (2007) Saponins, classification and occurrence in the plant kingdom. Phytochemistry 68: 275–297. pmid:17141815
- View Article
- PubMed/NCBI
- Google Scholar
14. Haralampidis K, Trojanowska M, Osbourn AE. (2002) Biosynthesis of triterpenoid saponins in plants. AdvBiochemEngBiot 75: 31–49.
- View Article
- Google Scholar
15. Choi DW, Jung JD, Ha YI, Park HW, In DS, Chung HJ, et al. (2005) Analysis of transcripts in methyl jasmonate-treated ginseng hairy roots to identify genes involved in the biosynthesis of ginsenosides and other secondary metabolites. Plant cell rep 23: 557–566. pmid:15538577
- View Article
- PubMed/NCBI
- Google Scholar
16. Rohdich F, Lauw S, Kaiser J, Feicht R, Kohler P, Bacher A, et al. (2006) Isoprenoid biosynthesis in plants-2C-methyl-D-erythritol-4-phosphate synthase (IspC protein) of Arabidopsis thaliana. Febs J 273: 4446–4458. pmid:16972937
- View Article
- PubMed/NCBI
- Google Scholar
17. Leopoldini M, Malaj N, Toscano M, Sindona G, Russo N. (2010) On the inhibitor effects of bergamot juice flavonoids binding to the 3-hydroxy-3-methylglutaryl- CoA reductase (HMGR) enzyme. J Agr Food Chem 58: 10768–10773.
- View Article
- Google Scholar
18. Aharoni A, Jongsma MA, Kim TY, Ri MB, Giri AP, Verstappen FWA, et al. (2006) Metabolic engineering of terpenoid biosynthesis in plants. Phytochem Rev 5: 49–58.
- View Article
- Google Scholar
19. Carelli M, Biazzi E, Panara F, Tava A, Scaramelli L, Porceddu A, et al. (2011) Medicago truncatula CYP716A12 is a multifunctional oxidase involved in the biosynthesis of hemolytic saponins. Plant Cell 23: 3070–3081. pmid:21821776
- View Article
- PubMed/NCBI
- Google Scholar
20. Jayakodi M, Lee SC, Park HS, Jang WJ, Lee YS, Choi BS, et al. (2014) Transcriptome profiling and comparative analysis of Panax ginseng adventitious roots. J Ginseng Res 38: 278–288. pmid:25379008
- View Article
- PubMed/NCBI
- Google Scholar
21. Liu MH, Yang BR, Cheung WF, Yang KY, Zhou HF, Kwok JSL, et al. (2015) Transcriptome analysis of leaves, roots and flowers of Panax notoginseng identifies genes involved in ginsenoside and alkaloid biosynthesis. BMC Genomics 16: 265. pmid:25886736
- View Article
- PubMed/NCBI
- Google Scholar
22. Upadhyay S, Phukan UJ, Mishra S, Shukla RK. (2014) De novo leaf and root transcriptome analysis identified novel genes involved in Steroidal sapogenin biosynthesis in Asparagus racemosus. BMC Genomics 15: 746. pmid:25174837
- View Article
- PubMed/NCBI
- Google Scholar
23. Qiao F, Cong HQ, Jiang XF, Wang RX, Yin JM, Qian D, et al. (2014) De Novo Characterization of a Cephalotaxus hainanensis Transcriptome and Genes Related to Paclitaxel Biosynthesis. PLoS ONE pmid:25203398
- View Article
- PubMed/NCBI
- Google Scholar
24. Li J, Zhen W, Long DK, Ding L, Gong AH, Xiao CH, et al. (2016) De Novo Sequencing and Assembly Analysis of the Pseudostellaria heterophylla Transcriptome. PLoS ONE pmid:27764127
- View Article
- PubMed/NCBI
- Google Scholar
25. Kumari A, Singh HR, Jha A, Swarnkar MK, Shankar R, Kumar S. (2014) Transcriptome sequencing of rhizome tissue of Sinopodophyllum hexandrumat two temperatures. BMC Genomics 15: 871. pmid:25287271
- View Article
- PubMed/NCBI
- Google Scholar
26. Tsaballa A, Nikolaidis A, Trikka F, Ignea C, Kampranis SC, Makris AM, et al. (2015) Use of the de novo transcriptome analysis of silver-leaf nightshade (Solanum elaeagnifolium) to identify gene expression changes associated with wounding and terpene biosynthesis. BMC Genomics 16: 504. pmid:26149407
- View Article
- PubMed/NCBI
- Google Scholar
27. Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, et al. (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29: 644–652. pmid:21572440
- View Article
- PubMed/NCBI
- Google Scholar
28. Perteal G, Huang XQ, Liang F, Antonescu V, Sultanal R, Karamycheval S, et al. (2003) TIGR gene indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics 19: 651–652. pmid:12651724
- View Article
- PubMed/NCBI
- Google Scholar
29. Kanehisal M, Araki M, Goto S, Hattori M, Hirakawal M, Itoh M, et al. (2008) KEGG for linking genomes to life and the environment. Nucleic Acids Res 36: 480–484.
- View Article
- Google Scholar
30. Altschul SF, Madden TL, Schäfferl AA, Zhang JH, Zhang Z, Miller W, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–3402. pmid:9254694
- View Article
- PubMed/NCBI
- Google Scholar
31. Gotz S, Garcia-Gomez JM, Terol J, Williams TD, Nagaraj SH, Nueda MJ, et al. (2008) High-throughput functional annotation and data mining with the Blast2GO suite. Nucleic Acids Res 36: 3420–3435. pmid:18445632
- View Article
- PubMed/NCBI
- Google Scholar
32. Young MD, Wakefield MJ, Smyth GK, Oshlack A. (2010) Gene ontology analysis for RNA-seq: accounting for selection bias. Genome Biology 11: R14. pmid:20132535
- View Article
- PubMed/NCBI
- Google Scholar
33. Mortazavil A, Williams BA, McCuel K, Schaeffer L, Wold B. (2008) Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat. Methods 5: 621–628. pmid:18516045
- View Article
- PubMed/NCBI
- Google Scholar
34. Audic S, Claverie JM. (1997) The significance of digital gene expression profiles. Genome Res 7: 986–995. pmid:9331369
- View Article
- PubMed/NCBI
- Google Scholar
35. Sun HP, Li F, Ruan QM, Zhong XH. (2016) Identification and validation of reference genes for quantitative real-time PCR studies in Hedera helix L. Plant PhysiolBioch 108: 286–294.
- View Article
- Google Scholar
36. Liu MH, Yang BR, Cheung WF, Yang KY, Zhou HF, Kwok JSL, et al. (2015) Transcriptome analysis of leaves, roots and flowers of Panax notoginseng identifies genes involved in ginsenoside and alkaloid biosynthesis. BMC Genomics 16: 265. pmid:25886736
- View Article
- PubMed/NCBI
- Google Scholar
37. Zhang XD, Allan AC, Li CX, Wang YZ, Yao QY. (2015) De Novo assembly and characterization of the transcriptome of the Chinese medicinal herb, Gentiana rigescens. Int J MolSci 16: 11550–11573.
- View Article
- Google Scholar
38. Tian XJ, Long Y, Wang J, Zhang JW, Wang YY, Li WM, et al. (2015) De novo transcriptome assembly of common wild rice (Oryza rufipogon Griff.) and discovery of drought-response genes in root tissue based on transcriptomic data. PLoS ONE pmid:26134138
- View Article
- PubMed/NCBI
- Google Scholar
39. Gurkok T, Turktas M, Parmaksiz I,Unver T. (2015) Transcriptome Profiling of Alkaloid Biosynthesis in Elicitor Induced Opium Poppy. Plant Mol Biol Rep 33(3): 673–688.
- View Article
- Google Scholar
40. Rasool S, Mohamed R. (2016) Plant cytochrome P450s: nomenclature and involvement in natural product biosynthesis. Protoplasma 253(5):1197–1209. pmid:26364028
- View Article
- PubMed/NCBI
- Google Scholar
41. Weitzel C, Simonsen HT. (2015) Cytochrome P450-enzymes involved in the biosynthesis of mono-and sesquiterpenes. Phytochem Rev 14: 7–24.
- View Article
- Google Scholar
42. Achnine L, Huhman DV, Farag MA, Sumner LW, Blount JW, Dixon RA. (2005) Genomics based selection and functional characterization of triterpene glycosyltransferases from the model legume Medicago truncatula. Plant J 41: 875–887. pmid:15743451
- View Article
- PubMed/NCBI
- Google Scholar
43. Zheng X, Xu H, Ma X, Zhan R, Chen W. (2014) Triterpenoid saponin biosynthetic pathway profiling and candidate gene mining of the Ilex asprella root using RNA-Seq. Int J MolSci 15: 5970–5987.
- View Article
- Google Scholar
44. Han JY, Kim MJ, Ban YW, Hwang HS, Choi YE. (2013) The involvement of β-amyrin 28-oxidase (CYP716A52v2) in oleanan-type ginsenoside biosynthesis in Panax ginseng. Plant Cell Physiol 54: 2034–2046. pmid:24092881
- View Article
- PubMed/NCBI
- Google Scholar
45. Han JY, Kim HJ, Kwon YS, Choi YE. (2011) The Cyt P450 enzyme CYP716A47 catalyzes the formation of protopanaxadiol from dammarenediol-II during ginsenoside biosynthesis in Panax ginseng. Plant Cell Physiol 52: 2062–2073. pmid:22039120
- View Article
- PubMed/NCBI
- Google Scholar
46. Irmler S, Schroder G, St-Pierre B, Crouch NP, Hotze M, Schmidt J, et al. (2000) Indole alkaloid biosynthesis in Catharanthus roseus: new enzyme activities and identification of cytochrome P450 CYP72A1 as secologanin synthase. Plant J 24: 797–804. pmid:11135113
- View Article
- PubMed/NCBI
- Google Scholar
47. Takahashi S, Yeo YS, Zhao Y, O’Maille PE, Greenhagen BT, Noel JP, et al. (2007) Functional characterization of premnaspirodiene oxygenase a cytochrome P450 catalysing region and stereo-specific hydroxylation of diverse sesquiterpene substrates. J Biol Chem 282: 31744–31754. pmid:17715131
- View Article
- PubMed/NCBI
- Google Scholar
48. Achnine L, Huhman DV, Farag MA, Sumner LW, Blount JW, Dixon RA. (2005) Genomics based selection and functional characterization of triterpene glycosyltransferases from the model legume Medicago truncatula. Plant J 41: 875–887. pmid:15743451
- View Article
- PubMed/NCBI
- Google Scholar
49. Wang GY, Keasling JD. (2002) Amplification of HMG-CoA reductase production enhances carotenoid accumulation in Neurospora crassa. MetabEng 4: 193–201.
- View Article
- Google Scholar
50. Dai Zb, Cui Gh, Zhou SF, Zhang XN, Huang LQ. (2011) Cloning and characterization of a novel 3-hydroxy-3-methylglutaryl coenzyme A reductase gene from Salvia miltiorrhiza involved in diterpenoid tanshinone accumulation. J Plant Physiol 168: 148–157. pmid:20637524
- View Article
- PubMed/NCBI
- Google Scholar
51. Suzuki H, Achnine L, Xu R, Matsuda SPT, Dixon RA. (2002) A genomics approach to the early stages of triterpene saponin biosynthesis in Medicago truncatula. Plant J 32: 1033–1048. pmid:12492844
- View Article
- PubMed/NCBI
- Google Scholar
52. Han JY, In JG, Kwon YS, Choi YE. (2010) Regulation of ginsenoside and phytosterol biosynthesis by RNA interferences of squalene epoxidase gene in Panax ginseng. Phytochemistry 71: 36–46. pmid:19857882
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Lutsenko Y, Bylka W, MatlawskaI , Darmohray R. (2010) Hedera helix as a medicinal plant. Herba. Polonica 56: 83–96.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Landgrebe H, Matusch R, Marburg FR, Hecker M. (1999) Effectiveness and use of an old medicinal plant. Pharm Zeitung Nr 35: 11–15.
View Article
Google Scholar

[5] View Article

[6] Google Scholar

[ref3] 3. Fazio S, Pouso J, Dolinsky D, Fernandez A, Hernandez M, Clavier G, et al. (2009) Tolerance, safety and efficacy of Hedera helix extract in inflammatory bronchial diseases under clinical practice conditions: a prospective, open, multicentre post marketing study in 9657 patients. Phytomedicine 16: 17–24. pmid:16860549
View Article
PubMed/NCBI
Google Scholar

[8] View Article

[9] PubMed/NCBI

[10] Google Scholar

[ref4] 4. Hofmann D, Hecker M, Völp A. (2003) Efficacy of dry extract of ivy leaves in children with bronchial asthma-a review of randomized controlled trials. Phytomedicine 10: 213–220. pmid:12725580
View Article
PubMed/NCBI
Google Scholar

[12] View Article

[13] PubMed/NCBI

[14] Google Scholar

[ref5] 5. Marzian O. (2007) Treatment of acute bronchitis in children and adolescents. Non-interventional post marketing surveillance study confirms the benefit and safety of a syrup made of extracts from thyme and ivy leaves. MMW Fortschr Med 149: 69–74. pmid:17619603
View Article
PubMed/NCBI
Google Scholar

[16] View Article

[17] PubMed/NCBI

[18] Google Scholar

[ref6] 6. Cwientzek U, Ottillinger B, Arenberger P. (2011) Acute bronchitis therapy with ivy leaves extracts in a two-arm study. A double-blind, randomised study vs. another ivy leaves extract. Phytomedicine 18: 1105–1109. pmid:21802921
View Article
PubMed/NCBI
Google Scholar

[20] View Article

[21] PubMed/NCBI

[22] Google Scholar

[ref7] 7. Holzinger F, Chenot JF. (2011) Systematic review of clinical trials assessing the effectiveness of Ivy leaf (Hedera Helix) for acute upper respiratory tract infections. Evid-based Compl Alt 1: 1–9.
View Article
Google Scholar

[24] View Article

[25] Google Scholar

[ref8] 8. Stauss-Grabo M, Atiye S, Warnke A, Wedemeyer R, Donath F, Blume H. (2011) Observational study on the tolerability and safety of film-coated tablets containing ivy extract (Prospan^® Cough Tablets) in the treatment of colds accompanied by coughing. Phytomedicine 18: 433–436. pmid:21211950
View Article
PubMed/NCBI
Google Scholar

[27] View Article

[28] PubMed/NCBI

[29] Google Scholar

[ref9] 9. Khdair A, Mohammad MK, Tawaha K, Al-Hamarsheh E, AlKhatib HS, Al-Khalidi B, et al. (2010) A validated RP HPLC-PAD method for the determination of hederacoside C in Ivy-Thyme cough syrup. Int J Anal Chem pmid:20862201
View Article
PubMed/NCBI
Google Scholar

[31] View Article

[32] PubMed/NCBI

[33] Google Scholar

[ref10] 10. Hussien SA and Awad ZJ. (2014) Isolation and characterization of triterpenoid saponin Hederacoside C present in the leaves of Hedera helix L. Cultivated in Iraq. Iraq J Pharm Sci 23: 33–41.
View Article
Google Scholar

[35] View Article

[36] Google Scholar

[ref11] 11. Prescott AK, Rigby LP, Veitch NC, Simmonds SJ. (2014) The haploinsufficiency profile of a-hederin suggests a caspofungin-like antifungal mode of action. Phytochemistry 101: 116–120. pmid:24569176
View Article
PubMed/NCBI
Google Scholar

[38] View Article

[39] PubMed/NCBI

[40] Google Scholar

[ref12] 12. Mendel M, Chłopecka M, Dziekan N, Wiechetek M. (2013) Participation of extracellular calcium in α-hederin-induced contractions of rat isolated stomach strips. J Ethnopharmacol 146: 423–426. pmid:23274745
View Article
PubMed/NCBI
Google Scholar

[42] View Article

[43] PubMed/NCBI

[44] Google Scholar

[ref13] 13. Vincken JP, Heng L, de Groot A, Gruppena H. (2007) Saponins, classification and occurrence in the plant kingdom. Phytochemistry 68: 275–297. pmid:17141815
View Article
PubMed/NCBI
Google Scholar

[46] View Article

[47] PubMed/NCBI

[48] Google Scholar

[ref14] 14. Haralampidis K, Trojanowska M, Osbourn AE. (2002) Biosynthesis of triterpenoid saponins in plants. AdvBiochemEngBiot 75: 31–49.
View Article
Google Scholar

[50] View Article

[51] Google Scholar

[ref15] 15. Choi DW, Jung JD, Ha YI, Park HW, In DS, Chung HJ, et al. (2005) Analysis of transcripts in methyl jasmonate-treated ginseng hairy roots to identify genes involved in the biosynthesis of ginsenosides and other secondary metabolites. Plant cell rep 23: 557–566. pmid:15538577
View Article
PubMed/NCBI
Google Scholar

[53] View Article

[54] PubMed/NCBI

[55] Google Scholar

[ref16] 16. Rohdich F, Lauw S, Kaiser J, Feicht R, Kohler P, Bacher A, et al. (2006) Isoprenoid biosynthesis in plants-2C-methyl-D-erythritol-4-phosphate synthase (IspC protein) of Arabidopsis thaliana. Febs J 273: 4446–4458. pmid:16972937
View Article
PubMed/NCBI
Google Scholar

[57] View Article

[58] PubMed/NCBI

[59] Google Scholar

[ref17] 17. Leopoldini M, Malaj N, Toscano M, Sindona G, Russo N. (2010) On the inhibitor effects of bergamot juice flavonoids binding to the 3-hydroxy-3-methylglutaryl- CoA reductase (HMGR) enzyme. J Agr Food Chem 58: 10768–10773.
View Article
Google Scholar

[61] View Article

[62] Google Scholar

[ref18] 18. Aharoni A, Jongsma MA, Kim TY, Ri MB, Giri AP, Verstappen FWA, et al. (2006) Metabolic engineering of terpenoid biosynthesis in plants. Phytochem Rev 5: 49–58.
View Article
Google Scholar

[64] View Article

[65] Google Scholar

[ref19] 19. Carelli M, Biazzi E, Panara F, Tava A, Scaramelli L, Porceddu A, et al. (2011) Medicago truncatula CYP716A12 is a multifunctional oxidase involved in the biosynthesis of hemolytic saponins. Plant Cell 23: 3070–3081. pmid:21821776
View Article
PubMed/NCBI
Google Scholar

[67] View Article

[68] PubMed/NCBI

[69] Google Scholar

[ref20] 20. Jayakodi M, Lee SC, Park HS, Jang WJ, Lee YS, Choi BS, et al. (2014) Transcriptome profiling and comparative analysis of Panax ginseng adventitious roots. J Ginseng Res 38: 278–288. pmid:25379008
View Article
PubMed/NCBI
Google Scholar

[71] View Article

[72] PubMed/NCBI

[73] Google Scholar

[ref21] 21. Liu MH, Yang BR, Cheung WF, Yang KY, Zhou HF, Kwok JSL, et al. (2015) Transcriptome analysis of leaves, roots and flowers of Panax notoginseng identifies genes involved in ginsenoside and alkaloid biosynthesis. BMC Genomics 16: 265. pmid:25886736
View Article
PubMed/NCBI
Google Scholar

[75] View Article

[76] PubMed/NCBI

[77] Google Scholar

[ref22] 22. Upadhyay S, Phukan UJ, Mishra S, Shukla RK. (2014) De novo leaf and root transcriptome analysis identified novel genes involved in Steroidal sapogenin biosynthesis in Asparagus racemosus. BMC Genomics 15: 746. pmid:25174837
View Article
PubMed/NCBI
Google Scholar

[79] View Article

[80] PubMed/NCBI

[81] Google Scholar

[ref23] 23. Qiao F, Cong HQ, Jiang XF, Wang RX, Yin JM, Qian D, et al. (2014) De Novo Characterization of a Cephalotaxus hainanensis Transcriptome and Genes Related to Paclitaxel Biosynthesis. PLoS ONE pmid:25203398
View Article
PubMed/NCBI
Google Scholar

[83] View Article

[84] PubMed/NCBI

[85] Google Scholar

[ref24] 24. Li J, Zhen W, Long DK, Ding L, Gong AH, Xiao CH, et al. (2016) De Novo Sequencing and Assembly Analysis of the Pseudostellaria heterophylla Transcriptome. PLoS ONE pmid:27764127
View Article
PubMed/NCBI
Google Scholar

[87] View Article

[88] PubMed/NCBI

[89] Google Scholar

[ref25] 25. Kumari A, Singh HR, Jha A, Swarnkar MK, Shankar R, Kumar S. (2014) Transcriptome sequencing of rhizome tissue of Sinopodophyllum hexandrumat two temperatures. BMC Genomics 15: 871. pmid:25287271
View Article
PubMed/NCBI
Google Scholar

[91] View Article

[92] PubMed/NCBI

[93] Google Scholar

[ref26] 26. Tsaballa A, Nikolaidis A, Trikka F, Ignea C, Kampranis SC, Makris AM, et al. (2015) Use of the de novo transcriptome analysis of silver-leaf nightshade (Solanum elaeagnifolium) to identify gene expression changes associated with wounding and terpene biosynthesis. BMC Genomics 16: 504. pmid:26149407
View Article
PubMed/NCBI
Google Scholar

[95] View Article

[96] PubMed/NCBI

[97] Google Scholar

[ref27] 27. Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, et al. (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29: 644–652. pmid:21572440
View Article
PubMed/NCBI
Google Scholar

[99] View Article

[100] PubMed/NCBI

[101] Google Scholar

[ref28] 28. Perteal G, Huang XQ, Liang F, Antonescu V, Sultanal R, Karamycheval S, et al. (2003) TIGR gene indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics 19: 651–652. pmid:12651724
View Article
PubMed/NCBI
Google Scholar

[103] View Article

[104] PubMed/NCBI

[105] Google Scholar

[ref29] 29. Kanehisal M, Araki M, Goto S, Hattori M, Hirakawal M, Itoh M, et al. (2008) KEGG for linking genomes to life and the environment. Nucleic Acids Res 36: 480–484.
View Article
Google Scholar

[107] View Article

[108] Google Scholar

[ref30] 30. Altschul SF, Madden TL, Schäfferl AA, Zhang JH, Zhang Z, Miller W, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–3402. pmid:9254694
View Article
PubMed/NCBI
Google Scholar

[110] View Article

[111] PubMed/NCBI

[112] Google Scholar

[ref31] 31. Gotz S, Garcia-Gomez JM, Terol J, Williams TD, Nagaraj SH, Nueda MJ, et al. (2008) High-throughput functional annotation and data mining with the Blast2GO suite. Nucleic Acids Res 36: 3420–3435. pmid:18445632
View Article
PubMed/NCBI
Google Scholar

[114] View Article

[115] PubMed/NCBI

[116] Google Scholar

[ref32] 32. Young MD, Wakefield MJ, Smyth GK, Oshlack A. (2010) Gene ontology analysis for RNA-seq: accounting for selection bias. Genome Biology 11: R14. pmid:20132535
View Article
PubMed/NCBI
Google Scholar

[118] View Article

[119] PubMed/NCBI

[120] Google Scholar

[ref33] 33. Mortazavil A, Williams BA, McCuel K, Schaeffer L, Wold B. (2008) Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat. Methods 5: 621–628. pmid:18516045
View Article
PubMed/NCBI
Google Scholar

[122] View Article

[123] PubMed/NCBI

[124] Google Scholar

[ref34] 34. Audic S, Claverie JM. (1997) The significance of digital gene expression profiles. Genome Res 7: 986–995. pmid:9331369
View Article
PubMed/NCBI
Google Scholar

[126] View Article

[127] PubMed/NCBI

[128] Google Scholar

[ref35] 35. Sun HP, Li F, Ruan QM, Zhong XH. (2016) Identification and validation of reference genes for quantitative real-time PCR studies in Hedera helix L. Plant PhysiolBioch 108: 286–294.
View Article
Google Scholar

[130] View Article

[131] Google Scholar

[ref36] 36. Liu MH, Yang BR, Cheung WF, Yang KY, Zhou HF, Kwok JSL, et al. (2015) Transcriptome analysis of leaves, roots and flowers of Panax notoginseng identifies genes involved in ginsenoside and alkaloid biosynthesis. BMC Genomics 16: 265. pmid:25886736
View Article
PubMed/NCBI
Google Scholar

[133] View Article

[134] PubMed/NCBI

[135] Google Scholar

[ref37] 37. Zhang XD, Allan AC, Li CX, Wang YZ, Yao QY. (2015) De Novo assembly and characterization of the transcriptome of the Chinese medicinal herb, Gentiana rigescens. Int J MolSci 16: 11550–11573.
View Article
Google Scholar

[137] View Article

[138] Google Scholar

[ref38] 38. Tian XJ, Long Y, Wang J, Zhang JW, Wang YY, Li WM, et al. (2015) De novo transcriptome assembly of common wild rice (Oryza rufipogon Griff.) and discovery of drought-response genes in root tissue based on transcriptomic data. PLoS ONE pmid:26134138
View Article
PubMed/NCBI
Google Scholar

[140] View Article

[141] PubMed/NCBI

[142] Google Scholar

[ref39] 39. Gurkok T, Turktas M, Parmaksiz I,Unver T. (2015) Transcriptome Profiling of Alkaloid Biosynthesis in Elicitor Induced Opium Poppy. Plant Mol Biol Rep 33(3): 673–688.
View Article
Google Scholar

[144] View Article

[145] Google Scholar

[ref40] 40. Rasool S, Mohamed R. (2016) Plant cytochrome P450s: nomenclature and involvement in natural product biosynthesis. Protoplasma 253(5):1197–1209. pmid:26364028
View Article
PubMed/NCBI
Google Scholar

[147] View Article

[148] PubMed/NCBI

[149] Google Scholar

[ref41] 41. Weitzel C, Simonsen HT. (2015) Cytochrome P450-enzymes involved in the biosynthesis of mono-and sesquiterpenes. Phytochem Rev 14: 7–24.
View Article
Google Scholar

[151] View Article

[152] Google Scholar

[ref42] 42. Achnine L, Huhman DV, Farag MA, Sumner LW, Blount JW, Dixon RA. (2005) Genomics based selection and functional characterization of triterpene glycosyltransferases from the model legume Medicago truncatula. Plant J 41: 875–887. pmid:15743451
View Article
PubMed/NCBI
Google Scholar

[154] View Article

[155] PubMed/NCBI

[156] Google Scholar

[ref43] 43. Zheng X, Xu H, Ma X, Zhan R, Chen W. (2014) Triterpenoid saponin biosynthetic pathway profiling and candidate gene mining of the Ilex asprella root using RNA-Seq. Int J MolSci 15: 5970–5987.
View Article
Google Scholar

[158] View Article

[159] Google Scholar

[ref44] 44. Han JY, Kim MJ, Ban YW, Hwang HS, Choi YE. (2013) The involvement of β-amyrin 28-oxidase (CYP716A52v2) in oleanan-type ginsenoside biosynthesis in Panax ginseng. Plant Cell Physiol 54: 2034–2046. pmid:24092881
View Article
PubMed/NCBI
Google Scholar

[161] View Article

[162] PubMed/NCBI

[163] Google Scholar

[ref45] 45. Han JY, Kim HJ, Kwon YS, Choi YE. (2011) The Cyt P450 enzyme CYP716A47 catalyzes the formation of protopanaxadiol from dammarenediol-II during ginsenoside biosynthesis in Panax ginseng. Plant Cell Physiol 52: 2062–2073. pmid:22039120
View Article
PubMed/NCBI
Google Scholar

[165] View Article

[166] PubMed/NCBI

[167] Google Scholar

[ref46] 46. Irmler S, Schroder G, St-Pierre B, Crouch NP, Hotze M, Schmidt J, et al. (2000) Indole alkaloid biosynthesis in Catharanthus roseus: new enzyme activities and identification of cytochrome P450 CYP72A1 as secologanin synthase. Plant J 24: 797–804. pmid:11135113
View Article
PubMed/NCBI
Google Scholar

[169] View Article

[170] PubMed/NCBI

[171] Google Scholar

[ref47] 47. Takahashi S, Yeo YS, Zhao Y, O’Maille PE, Greenhagen BT, Noel JP, et al. (2007) Functional characterization of premnaspirodiene oxygenase a cytochrome P450 catalysing region and stereo-specific hydroxylation of diverse sesquiterpene substrates. J Biol Chem 282: 31744–31754. pmid:17715131
View Article
PubMed/NCBI
Google Scholar

[173] View Article

[174] PubMed/NCBI

[175] Google Scholar

[ref48] 48. Achnine L, Huhman DV, Farag MA, Sumner LW, Blount JW, Dixon RA. (2005) Genomics based selection and functional characterization of triterpene glycosyltransferases from the model legume Medicago truncatula. Plant J 41: 875–887. pmid:15743451
View Article
PubMed/NCBI
Google Scholar

[177] View Article

[178] PubMed/NCBI

[179] Google Scholar

[ref49] 49. Wang GY, Keasling JD. (2002) Amplification of HMG-CoA reductase production enhances carotenoid accumulation in Neurospora crassa. MetabEng 4: 193–201.
View Article
Google Scholar

[181] View Article

[182] Google Scholar

[ref50] 50. Dai Zb, Cui Gh, Zhou SF, Zhang XN, Huang LQ. (2011) Cloning and characterization of a novel 3-hydroxy-3-methylglutaryl coenzyme A reductase gene from Salvia miltiorrhiza involved in diterpenoid tanshinone accumulation. J Plant Physiol 168: 148–157. pmid:20637524
View Article
PubMed/NCBI
Google Scholar

[184] View Article

[185] PubMed/NCBI

[186] Google Scholar

[ref51] 51. Suzuki H, Achnine L, Xu R, Matsuda SPT, Dixon RA. (2002) A genomics approach to the early stages of triterpene saponin biosynthesis in Medicago truncatula. Plant J 32: 1033–1048. pmid:12492844
View Article
PubMed/NCBI
Google Scholar

[188] View Article

[189] PubMed/NCBI

[190] Google Scholar

[ref52] 52. Han JY, In JG, Kwon YS, Choi YE. (2010) Regulation of ginsenoside and phytosterol biosynthesis by RNA interferences of squalene epoxidase gene in Panax ginseng. Phytochemistry 71: 36–46. pmid:19857882
View Article
PubMed/NCBI
Google Scholar

[192] View Article

[193] PubMed/NCBI

[194] Google Scholar

Figures

Abstract

Introduction

Materials and methods

Plant materials

RNA extraction, cDNA library construction and transcriptome sequencing

De novo assembly, unigene annotation and functional classification

Differentially expressed unigene analysis

Quantitative real-time PCR analysis and RACE clone

Results

Reads generation and De novo assembly

Functional annotation and classification

Differentially Expressed Gene analysis

Identification of triterpene saponin biosynthesis genes

Validation by RT-qPCR and RACE Clone

Discussion

Conclusions

Supporting information

S1 Fig. HPLC chromatogram of different tissue samples of H. helix.

S2 Fig. Scatter plot of top 20 KEGG pathway enrichment for DEGs.

S3 Fig. The amino acid sequence of HMGR gene in H. helix.

S4 Fig. The amino acid sequence of SE gene in H. helix.

S1 Table. Annotation summary.

S2 Table. Primers for RT-qPCR analysis.

S3 Table. Primer sequences for RACE clone.

S4 Table. All-Unigene GO analysis.

S5 Table. All-Unigene COG classification.

S6 Table. All-Unigene KEGG analysis.

S7 Table. Expressed higher DEGs in leaf.

S8 Table. Expressed lower DEGs in leaf.

S9 Table. Annotation_CYP450s.

S10 Table. Annotation_GTs.

References