Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

De Novo Assembly of the Peanut (Arachis hypogaea L.) Seed Transcriptome Revealed Candidate Unigenes for Oil Accumulation Pathways

Abstract

Peanuts are one of the most important edible oil crops in the world. In order to survey key genes controlling peanut oil accumulation, we analyzed the seed transcriptome in different developmental stages of high- and low-oil peanut varieties. About 54 million high quality clean reads were generated, which corresponded to 4.85 Gb total nucleotides. These reads were assembled into 59,236 unique sequences. Differential mRNA processing events were detected for most of the peanut Unigenes and found that 15.8% and 18.0% of the Unigenes were differentially expressed between high- and low-oil varieties at 30 DAF and 50 DAF, respectively. Over 1,500 Unigenes involved in lipid metabolism were identified, classified, and found to participate in FA synthesis and TAG assembly. There were seven possible metabolic pathways involved in the accumulation of oil during seed development. This dataset provides more sequence resource for peanut plant and will serve as the foundation to understand the mechanisms of oil accumulation in oil crops.

Introduction

Peanuts are one of the most important edible oil crops in the world. Peanut oil has a yellowish luster, is fragrant in flavor and rich in oleic acid and linoleic acid. Peanuts have a high oil content (on average, 48-56%) compared to other oil crops. The yield of peanut oil per unit area of land is of paramount importance in supplying oil for food and non-food applications as well. Currently, there is an immediate need for the genetic enhancement and metabolic engineering of peanuts to create improved varieties for agricultural purposes including oil yield improvements. It is essential to identify candidate genes that participate in the storage lipid biosynthesis during peanut seed development, and this will serve as a foundation to create improved varieties through genetic engineering.

The quality and quantity of the storage lipids synthesized in developing seeds depends on a number of enzymes that participate in FA synthesis and TAG assembly through the Kennedy pathway [1]. Although some of the individual enzymes and genes involved in these processes are known [25], and changes in TAG quality or yield have been made by genetic manipulation [69], little is known about the overall expression and regulation profiles of genes involved in peanut lipid biosynthesis [1012]. To investigate the molecular basis of storage lipid accumulation during seed development it is necessary to understand the physiological mechanisms that affect the oil content in seeds and identify the rate-limiting enzymes that regulate storage lipid biosynthesis [13,14]. During the past several years, a large number of ESTs has been generated by several cDNA sequencing studies [1520]. Currently, about 250,000 ESTs were deposited in the NCBI Genbank. In addition to ESTs, microarray analyse has been employed for gene discovery and global gene expression profiling analyses [21]. Recently, the next generation sequence (NGS) technologies, with high throughput at a substantially reduced unit data cost, have been applied in peanuts [2224]. There are approximately 600 genes known to be involved in lipid metabolism of Arabidopsis [25], but far less lipid metabolism genes are currently known from peanuts. More genes involved in lipid synthesis need to be discovered and their various functions need to be documented in peanuts to further understand the regulatory mechanisms controlling the accumulation of storage lipids.

In this study, various histological and molecular surveys were conducted from high- and low-oil peanut sublines to understand the transcriptional network that is associated with the development of lipid metabolism. Illumina HiSeq™ 2000 sequencing technology was used to profile the gene expression patterns from developing seeds and the Unigenes involved in lipid metabolism were explored to understand the major metabolic activities in peanut seeds. This dataset would provide important resource to understand the molecular basis of lipid biosynthesis and identify the rate-limiting enzymes that regulate storage lipid biosynthesis during seed development in peanuts.

Results

Morphology and oil accumulation of developing peanut seeds

The growth of peanut seeds occurred over the 70 days post fertilization in peanut plants grown in field condition. The major storage reserves accumulated from 20 DAF -70 DAF, with maximum rates of accumulation between 40 and 60 DAF. The major accumulation of storage lipids started at about 30 DAF (Figure 1A) and was indicated by an increase in total fatty acid content. Seeds at approximately 20 DAF contained 20% oil and the oil content increased to over 40% by 60 DAF. The accumulation of total lipid of U606 showed similar trend with that of U12, but the oil content of U606 seeds was increased significantly by 13% compared to that of U12 seeds at the mature stage.

thumbnail
Figure 1. The development period of peanut seeds.

(A) Time courses of oil accumulation in developing peanut seeds. Values are the average of two independent measurements, and the same seed material used. (B) The development period of U12 seeds. (C) The development period of U606 seeds. (D) and (E) A confocal section of a U606 seed at 30-40 DAF and 50-60 DAF and stained with Nile red. (F) and (G) A confocal section of a U12 seed at 30-40 DAF and 50-60 DAF and stained with Nile red. Bars = 25 µm.

https://doi.org/10.1371/journal.pone.0073767.g001

It was rationalized that the mRNA changes and their regulating processes would precede the appearance of the enzymes and their products. This study focused on the period between 30 DAF and 50 DAF because this period preceded the rapid increase in storage product synthesis, and the seed biosynthetic pathways were also at their maximum activity (Figure 1B-1C). The development of oil bodies of two peanut cultivars was observed with the Nile Red confocal method (Figure 1D–1G). A few oil bodies were distributed in the center of the storage cells in the cotyledons at 30-40 DAF, and the volume and number of oil bodies per cell dramatically increased during 50-60 DAF. At 60 DAF, all oil bodies showed a globular shape, and localized in the middle of the cells. As compared with the high-oil cultivar U606, the low-oil U12 had a smaller number of uniformly sized oil bodies in storage cells. In mature seeds, the oil bodies were distributed in the central region of the storage cells and were mostly with an elliptical or irregular shape.

De novo assembly of peanut seed transcriptomes

To profile the seed transcriptome, mRNA from seeds of cultivar U606 and U12 at 30 and 50 DAF were used to construct four cDNA libraries (named 606-3, 606-5, 12-3 and 12-5) and sequences were generated on a Illumina Genome Analyzer II platform. After removing the adaptor and low quality reads, about 54 million high quality reads were obtained from these four libraries totaling 4.8 Gb nucleotides. The average read size, Q20 percentage (sequencing error rate, 1%), and GC percentage for each library was 90 bp, >98%, and >46%, respectively. Clean reads from each library were used for assembly separately. And 58,693, 49,700, 59,819 and 52,142 Unigenes were generated from the 606-3, 606-5, 12-3 and 12-5 libraries, respectively (Figure S1). Currently, there is no complete peanut genome available, therefore these unigene data sets were further clustered to generate the final 59,236 Unigenes with a mean length of 751 nt (Table S1). Further analysis was based on this final unigene data set.

Functional annotations of unigenes

Functional annotation provided protein functional annotation, clusters of orthologous groups (COG) annotation and gene ontology (GO) annotation of Unigenes. The unigene sequences were first aligned by blastx to protein databases including nr, Swiss-Prot, KEGG and COG (e-value < 0.00001), which retrieved proteins with the highest sequence similarity with the given Unigenes along with their protein functional annotations. Using this approach, 42,774 Unigenes (72.2% of all Unigenes) returned an above cut-off BLAST result. For Swiss-Prot analysis, up to 28,605 Unigenes (48.3% of all Unigenes) had no annotation hits.

The gene ontology (GO) assignments were used to categorize the functions of the predicted peanut Unigenes. Based on the sequence homologies found, 15,468 sequences were categorized into 45 functional groups (Figure 2). In each of the three main categories of the GO classification (biological processes, cellular components and molecular functions), the Unigenes attributed to “metabolic process”, “cell” and “catalytic activity” sub-categories were found in the highest percentages. A high percentage of Unigenes from the sub-categories of “cellular process”, “organelle” and “binding” were also found, but only a few Unigenes from the subsections of “biological adhesion”, “extracellular region part” and “translation regulator activity” were discovered.

thumbnail
Figure 2. A histogram of unigene ontology classification.

The results are summarized in three main categories: biological processes, cellular components and molecular functions. The right y-axis indicates the number of Unigenes in a particular category. The left y-axis indicates the percentage of a specific category of Unigenes in that main category.

https://doi.org/10.1371/journal.pone.0073767.g002

To further evaluate the completeness of our transcriptome libraries and the effectiveness of the annotation processes, we searched the previously annotated sequences for Unigenes involved in COG classifications. In total, out of the 42,562 nr hits, 14,497 sequences had a COG classification (Figure 3, Table S2). Among the 25 COG categories, the cluster for “general function prediction only” represented the largest group (4,730; 32.6%) followed by ‘‘transcription’’ (3,037; 20.9%) and “replication, recombination and repair” (2,519; 17.4%). The categories of “extracellular structures” (7; 0.05%) and “nuclear structure” (3; 0.02%) represented the smallest groups. The lipid transported metabolism group had 654 Unigenes, which accounted for 4.5% of the total Unigenes.

thumbnail
Figure 3. A histogram of clusters of orthologous groups (COG) classification.

Out of 42,562 nr hits, 14,497 Unigenes have a COG classification within the 25 categories.

(A) RNA processing and modification; (B) Chromatin structure and dynamics; (C) Energy production and conversion; (D) Cell cycle control, cell division, chromosome partitioning; (E) Amino acid transport and metabolism; (F) Nucleotide transport and metabolism; (G) Carbohydrate transport and metabolism; (H) Coenzyme transport and metabolism; (I) Lipid transport and metabolism; (J) Translation, ribosomal structure and biogenesis; (K) Transcription; (L) Replication, recombination and repair; (M) Cell wall/membrane/envelope biogenesis; (N) Cell motility; (O) Posttranslational modification, protein turnover, chaperones; (P) Inorganic ion transport and metabolism; (Q) Secondary metabolites biosynthesis, transport and catabolism; (R) General function prediction only; (S) Function unknown; (T) Signal transduction mechanisms; (U) Intracellular trafficking, secretion, and vesicular transport; (V) Defense mechanisms; (W) Extracellular structures; (Y) Nuclear structure; (Z) Cytoskeleton.

https://doi.org/10.1371/journal.pone.0073767.g003

To identify the biological pathways that were active in the peanut plant, the 42,774 annotated sequences were mapped to the reference canonical pathways in Kyoto Encyclopedia of Genes and Genomes (KEGG). In total, 23,282 sequences were assigned to 126 KEGG pathways (Table S3). The five best represented pathways by the unique sequences were “metabolic pathways” with 5,360 members, “biosynthesis of secondary metabolites” with 2,312 members, “plant hormone signal transduction” with 1,493 members, “plant–pathogen interaction” with 1432 members and “endocytosis” with 1077 members. More than 900 members were also mapped to each of the “glycerophospholipid metabolism” (934), “spliceosome” (905 members), “RNA transport” (852) and “ether lipid metabolism” (805) subclass. These annotations provide a valuable resource for investigating specific processes, functions and pathways of the identified Unigenes.

Gene expression profiles of developing seeds from the high- and low-oil peanut

To profile unigene expression for each library, clean reads from each library were mapped to 59,236 unigene. Reads mapped to multi- Unigenes were excluded. For this analysis, we artificially included Unigenes which have more than 10 mapped reads or of which more than 50% in length were covered by mapped reads. By this criterion, 55,021 out of 59,236 Unigenes were included in this analysis, among which 31,985 Unigenes were expressed in all of the four samples (Figure 4A). There were 2851 Unigenes only expressed in high-oil peanut samples (U606 libraries), and 5859 Unigenes were detected only in low-oil peanut samples (U12 libraries).

thumbnail
Figure 4. A comparison of differential unigene expression profiles among the different peanut tissues.

(A) Distribution of Unigenes from mRNA-seq from the different seed tissues; (B) Expression level in 606-3vs 12-3; (C) Expression level in 606-5vs 12-5; (D) The number of upregulated or downregulated Unigenes.

https://doi.org/10.1371/journal.pone.0073767.g004

Among the 41,201 Unigenes detected between the U606-3 and U12-3 samples, we found 6,521 were differentially expressed, with more up-regulated Unigenes (4,411) than down-regulated ones (2,110) (Figure 4B, D, Table S4). And 6,265 out of 34,888 Unigenes were differentially expressed between 606–5 and 12-5 samples with more up-regulated Unigenes (4,150) than down-regulated ones (2,115) as well (Figure 4C, D, Table S5).

Lipid pathway-specific unigenes and their functional categories

Mature peanut seeds consist of approximately 52% oil, mostly in the form of triacylglycerols (TAGs) within oil bodies. KEGG was used to automatically annotate peanut Unigenes that coded for orthologues involved in lipid metabolism. About 1,500 Unigenes were found to be involved in lipid metabolism (Table 1, Table S6). The lipid Unigenes covered most of the known cellular activities of acyl lipid metabolism, including many that were poorly characterized or recently described. There were nine cellular activities of lipid metabolism (Table S6-11). To our knowledge, this is one of the most complete and extensive efforts for lipid genes annotation of peanut plants.

PasswaySymbolEnzymeUnigenes
Fatty acid biosynthesisDESA1acyl-[acyl-carrier-protein] desaturase8
fabIenoyl-[acyl-carrier protein] reductase I3
fabZhydroxymyristoyl ACP dehydrase3
FATAfatty acyl-ACP thioesterase A4
FATBfatty acyl-ACP thioesterase B5
accCacetyl-CoA carboxylase, biotin carboxylase subunit12
FAD2omega-6 fatty acid desaturase (delta-12 desaturase)13
PPTpalmitoyl-protein thioesterase6
Fatty acid metabolismPAAGenoyl-CoA hydratase3
ACADMacyl-CoA dehydrogenase2
DCImethylglutaconyl-CoA hydratase1
ATOBacetyl-CoA C-acetyltransferase6
ACAA1acetyl-CoA acyltransferase7
ACSLlong-chain acyl-CoA synthetase33
ACOXacyl-CoA oxidase28
Glycerolipid metabolismPDATphospholipid:diacylglycerol acyltransferase10
DGAT1diacylglycerol O-acyltransferase5
GPAT3_4glycerol-3-phosphate O-acyltransferase8
GPATglycerol-3-phosphate acyltransferase7
OLEoleosin13

Table 1. The main Enzymes related to oil accumulation and metabolism in peanut seeds.

CSV
Download CSV

EXPANDER was used to annotate and assign Unigenes to different functional categories. The Unigenes were then grouped by developmental dynamics using the K-Means clustering algorithm and nine clusters were identified (K1-K9; Figure 5). The Unigenes that were located within the same cluster had the same or similar expression patterns during seed development. Cluster 9, covering 443 Unigenes, and cluster 1, which included 179 Unigenes, showed a major flat-rise pattern, and together with cluster 6, which included 262 Unigenes, represented up-regulated clusters. Clusters 2, 3 and 8 represented down-regulated clusters, and included 93, 128 and 110 Unigenes, respectively. Cluster 7 contained 47 Unigenes and had no recognizable pattern or they have different patterns. Together, these data reveal that the major expression profiles are produced in part by highly dynamic, coordinated and localized transitions in mRNA abundance.

thumbnail
Figure 5. K-means clustering (A) the expression profile (B) of the Unigenes involved in lipid metabolism.

The nine clusters are presented in B (K1 - K9). The error bars represent the standard deviation.

https://doi.org/10.1371/journal.pone.0073767.g005

Construction of metabolic pathways related to oil accumulation in peanut seeds

KAAS [26] and KOBAS [27] were used to automatically annotate the peanut Unigenes that coded for known orthologues of plant enzymes involved in fatty acid biosynthesis, fatty acid metabolism, glycerolipid metabolism and glyceropholipid metabolism pathways. These data have been integrated and compiled to propose schematic metabolic pathways that lead to oil accumulation in peanuts seeds (Figure 6).

thumbnail
Figure 6. The contrapuntal patterns of unigene expression in developing peanut seeds.

Peanut enzymes found in these pathways. The icons beside each enzyme name show the total number of peanut Unigenes corresponding to the enzyme in the seeds. FAB, fatty acid biosynthesis; FAM, fatty acid metabolism; GLM, glycerolipid metabolism.

https://doi.org/10.1371/journal.pone.0073767.g006

These results suggested that fatty acid biosynthesis, fatty acid metabolism, glycerolipid metabolism, glyceropholipid metabolism and tricarboxylic acid cycle (TCA) were all activated in the seed development process. The FAs played a fundamental role in oil mobilization and their oxidation produced 2-carbon compounds, which ultimately provided substrates for the TCA cycle. Many of the enzymes involved in fatty acid biosynthesis, fatty acid metabolism, glycerolipid metabolism and glyceropholipid metabolism pathways were up-regulated or down-regulated. ACC, FatA, FatB, SAD and FAD2 which are specific and critical enzymes in fatty acid biosynthesis were up- or down-regulated more than 10 fold (P<0.01) between the high- and low-oil peanut seeds.

Discussion

We have assembled 59,236 expressed unigene sequences from actively growing cultivated peanut seeds using Illumina RNA-seq technology and de novo short read assembly. Blastx predicted 42,562 CDSs and ESTScan predicted 2,449 CDSs. There were 16,462 Unigenes that do not show homology to any genes of any plant species in UniProt using the current search methodology. Predictably, a small portion of these Unigenes were likely to represent peanut specific genes. Further studies of these Unigenes will provide new insights into peanut specific programs.

There were approximately 1,500 Unigenes involved in the lipid metabolic pathway of developing peanut seeds, and many were found to participate in fatty acid biosynthesis, fatty acid metabolism, glycerolipid metabolism, glycerophospholipid metabolism, etc. Based on our data and other extensive data, a sketch map of the metabolic processes of oil mobilization in peanuts was produced (Figure 6). Several other pathways, such as oxidation, glyoxylate cycle, glycolysis, citric acid cycle, gluconeogenesis, pentose phosphate pathway, were involved in the accumulation process. These results will aid in further understanding of the accumulation of oil in seeds.

Many enzymes, such as ACC, FAD2, DGAT, were involved in the activation to accumulate the oleosin and the lipid membrane of the oil body. Thirteen Unigenes were found to code for FAD2 catalyzing unsaturated oleoyl-ACP to linoleoyl-ACP. One unigene (unigene29528) was expressed as 1715 RPKM in 12-3, 8.5 folds to 606-3, and dramatically decreased at 50 DAF. Another unigene (unigene18953) had the same expression profile. There were five Unigenes that were expressed in higher levels in U606 than in U12. Fourteen Unigenes were found to code for 3-oxoacyl- (acyl-carrier-protein) synthase and 8 Unigenes coded for stearoyl-ACP desaturase. All of these Unigenes may have lead to a higher O/L in U12 (Table S12).

We also found five Unigenes that coded for diacylglycerol acyltransferase, which catalyzed the final step in the TAG biosynthesis pathway [6]. One of them (unigene25061) was expressed as 55 RPKM in 12-5, 2.3 folds to 606-5. Another unigene (unigene36685) was expressed as 12 RPKM in 12-3, 3 folds to 606-3. There were two Unigenes in the high-oil peanut that were expressed in higher levels than the low-oil peanut.

We found 13 Unigenes that coded for oleosin, which has an important role in the formation of oil bodies and the regulation of the size of oil bodies [2830]. Interestingly, 10 Unigenes were more active than all of the others and all of these Unigenes were expressed in lower levels in 12-3, and in a much high level in 606-5. These Unigenes were expressed in both 606-3 and 606-5, which indicated that the high-oil peanuts had a sustained and stable expression of the team contributed to a goal. The earlier expressed Unigenes might be more important than the later expressed ones. All these Unigenes have the coordination effect, ultimately lead to the difference between high-oil and low-oil peanut.

In conclusion, we have developed a comprehensive lipid unigene resource for peanut plants that represents the developmental stages of specific seed tissues. The datasets developed in this study enhance the genomic resource database for peanuts. These resources can contribute to gene discovery and help to develop molecular markers for oil breeding studies. Additionally, the unigene set developed here might later contribute to the annotation and assembly of the whole peanut genome sequence.

Materials and Methods

Plant materials and tissue collection

Two peanut subline cultivars originated from the same male (Yuhua15) and female (Huayu17) parents, U606 and U12, were used. These two subline cultivars differed greatly in oil content (U606 > 58% and U12 < 45%) and fatty acid composition in mature seeds (Table S12). Days to first flower for these two cultivars are approximately 50 days. The entire growth period was approximately 120 days. Flowers at the lower position were marked and the elongating aerial pegs were later tied with colored tags. Seed tissues at specific developmental stages (20 days after flowering [DAF], 30 DAF, 40 DAF, 50 DAF, 60 DAF and 70 DAF), were harvested, quickly frozen in liquid nitrogen and stored at -80°C for RNA extraction during the 2010 growing season from May to September in Henan province.

Lipid analysis

Seeds from each line at the six different stages were obtained from randomized growth trials. The lipid levels for each of the seeds were determined by using at least three replicates for each seed and time period. The oil content of seeds at different stages was determined by the Soxhlet extraction method. The fatty acid composition of peanut oil was analyzed by gas chromatography-mass spectrometry (Foss NIRSystems 5000).

Microscope analyses

For laser scanning confocal microscopy, the developing seeds were cut into slices about at 50 µm thick, washed three times with a 1X PBS buffer and then infiltrated with an aqueous solution of Nile red for 15 min in the dark to visualize neutral lipids and washed again several times with 1X PBS buffer. The images were acquired with a Zeiss LSM 510 confocal laser scanning microscope.

RNA, poly (A) RNA isolation, purification and amplification

Five seeds from the 30 DAF and 50 DAF stages were ground in liquid N2, and 2-5 g of the mixed sample was used for RNA isolation as described previously [31]. The total RNA was purified using the RNAeasy Mini RNA kit (Qiagen, Valencia, CA, USA). On-column DNase digestion was performed according to the manufacturer’s protocol. The Poly(A) RNA was isolated from purified total RNA using two rounds of purification with oligo d (T) beads (Dynabeads). RNA was amplified through one round using the Ribo Amp HS plus RNA amplification kit (Molecular Devices, Sunnyvale, CA, USA) following the manufacturer’s protocol. At least 100 ng of the starting total RNA was used for the amplification of each replicate. The integrity and quality of the total RNA, mRNA and amplified RNA from the tissue on the glass slides was checked by a Bioanalyzer 2100 (Agilent, Santa Clara, CA, USA). The quality cut-offs for the Bioanalyzer ananlysis is RIN (RNA Integrity Number) >7.0.

Library preparation and Illumina sequence analyses

To construct the RNA-seq libraries, 100 ng of the fragmented ploy (A) RNA from four segments and 100 ng of the amplified RNA were used according to the manufacturer’s recommendations (Illumina). The first-strand cDNA was generated using random hexamer-primed reverse transcription. A second strand of cDNA was synthesized with the ligation of the adaptor, and the cDNA fragments of approximately 200 bp were isolated by gel electrophoresis. The cDNA fragments were amplified by 15 cycles of PCR and sequenced on the Illumina Genome Analyzer II.

De novo unigene assembly and annotation

Reads were assembled using Trinity software with a Kmer = 25 [32]. Unigenes were defined as sequences that were without Ns and could not be extended on either end. Unigene sequences were aligned using BlastX (e-value <0.00001). Sequence orientations were determined according to the best hit in the database. Orientation and CDS of sequences which had no hits in Blast were predicted using ESTScan [33]. Original transcript sequences (5'–3') were provided if their orientations could be determined.

Unigene expression levels were calculated using the RPKM method (Reads Per kb per Million reads) [34]. The RPKM method can eliminate the influence of different gene lengths and sequencing discrepancies on the gene expression calculation. Therefore, the calculated gene expression can be directly used to compare differences in gene expression levels between samples. Functional annotations of Unigenes were performed using protein sequence similarities, the KEGG Pathway and COG and Gene Ontology (GO) databases using BlastX (e-value <0.00001). Furthermore, we obtained GO functional annotation with Nr annotation using the Blast2GO program [35]. We then used WEGO software [36] to gain the final GO functional classification for all Unigenes in order to understand the distribution of unigene functions of the peanut species. This Transcriptome Shotgun Assembly project has been deposited at DDBJ/EMBL/GenBank under the accession GAER00000000. The version described in this paper is the first version, GAER01000000.

The quantitative expression data of each unigene at various developmental stages were subjected to unigene expression clustering analysis using the k-mean clustering method provided within the software Expression Analyzer and DisplayER (EXPANDER) [37]. Input data were first standardized with mean 0 and variance 1, fixed norm and then pursued clustering with k-mean method. To view the expression patterns of clusters by a graph, the option of mean patterns with error bars operated by the EXPANDER was chosen that allows each cluster to be displayed in a separate panel with error bars representing standard deviations.

Supporting Information

Figure S1.

The distribution of Unigenes based on length.

https://doi.org/10.1371/journal.pone.0073767.s001

(TIF)

Table S1.

The distribution of contigs and Unigenes in each library.

https://doi.org/10.1371/journal.pone.0073767.s002

(DOC)

Table S2.

A histogram of clusters of orthologous groups (COG) classification

https://doi.org/10.1371/journal.pone.0073767.s003

(DOC)

Table S3.

Unigene Metabolic Pathway Analysis.

https://doi.org/10.1371/journal.pone.0073767.s004

(DOC)

Table S4.

The list of 606-3 vs. 12-3 differentially expressed Unigenes (FDR<=0.001 and |log2Ratio| > =1).

https://doi.org/10.1371/journal.pone.0073767.s005

(XLS)

Table S5.

The list of 606-5 vs. 12-5 differentially expressed Unigenes (FDR<=0.001 and |log2Ratio| > =1).

https://doi.org/10.1371/journal.pone.0073767.s006

(XLS)

Table S6.

Enzymes related to lipid metabolism in peanut seeds.

https://doi.org/10.1371/journal.pone.0073767.s007

(XLS)

Table S7.

Peanut Unigenes in fatty acid biosynthesis and elongation pathway.

https://doi.org/10.1371/journal.pone.0073767.s008

(XLS)

Table S8.

Peanut Unigenes in fatty acid metabolism pathway.

https://doi.org/10.1371/journal.pone.0073767.s009

(XLS)

Table S9.

Peanut Unigenes in glycerolipid metabolism pathway.

https://doi.org/10.1371/journal.pone.0073767.s010

(XLS)

Table S10.

Peanut Unigenes in glycerophospholipid metabolism pathway.

https://doi.org/10.1371/journal.pone.0073767.s011

(XLS)

Table S11.

Peanut Unigenes in linoleic and alpha-linolenic acid pathway.

https://doi.org/10.1371/journal.pone.0073767.s012

(XLS)

Table S12.

Fatty acid composition of the total lipid content in mature seeds.

https://doi.org/10.1371/journal.pone.0073767.s013

(DOC)

Author Contributions

Conceived and designed the experiments: DY WZ SC JZ. Performed the experiments: YW XL. Analyzed the data: XZ. Contributed reagents/materials/analysis tools: HL. Wrote the manuscript: DY.

References

  1. 1. Murphy DJ (2005) Plant lipids: biology, utilization and manipulation. Boca Raton: CRC Press. pp. 162-225.
  2. 2. Yin D, Cui D, Jia B (2007) Construction of a high-efficient expression vector of Delta12 fatty acid desaturase in peanut and its prokaryotical expression. J Genet Genomics 34: 81-88. doi:https://doi.org/10.1016/S1673-8527(07)60009-5. PubMed: 17469780.
  3. 3. Jung S, Swift D, Sengoku E, Patel M, Teulé F et al. (2000) The high oleate trait in the cultivated peanut (Arachis hypogaea L.). I. Isolation and characterization of two genes encoding microsomal oleoyl-PC desaturases. Mol Gen Genet 263: 796-805. doi:https://doi.org/10.1007/s004380000244. PubMed: 10905347.
  4. 4. Yu S, Pan L, Yang Q, Min P, Ren Z et al. (2008) Comparison of the Delta(12) fatty acid desaturase gene between high-oleic and normal-oleic peanut genotypes. J Genet Genomics 35: 679-685. doi:https://doi.org/10.1016/S1673-8527(08)60090-9. PubMed: 19022202.
  5. 5. Yu S, Chen M, Yang Q, Zhang H (2009) Cloning, characterization and expression analysis of a stearoyl-ACP desaturase gene from Arachis hypogaea. In Proc BMEI. pp. 1-6.
  6. 6. Saha S, Enugutti B, Rajakumari B, Rajasekharan R (2006) Cytosolic Triacylglycerol Biosynthetic Pathway in Oilseeds. Molecular Cloning and Expression of Peanut Cytosolic Diacylglycerol Acyltransferase. Plant Physiol 141: 1533-1543.
  7. 7. Yin D, Deng S, Zhan K, Cui D (2007) High-oleic peanut oils produced by HpRNA-mediated gene silencing of oleate desaturase. Plant Mol Biol Rep 25: 154-163. doi:https://doi.org/10.1007/s11105-007-0017-0.
  8. 8. Zheng P, Allen WB, Roesler K, Williams ME, Zhang S et al. (2008) A phenylalanine in DGAT is a key determinant of oil content and composition in maize. Nat Genet 40: 367-372. doi:https://doi.org/10.1038/ng.85. PubMed: 18278045.
  9. 9. Li C, Wu K, Fu G, Li Y, Lin X, et al. (2009) Regulation of oleosin expression in developing peanut (Arachis hypogaea L.) embryos through nucleosome loss and histone modifications. J Exp Bot 60: 4371-4382. doi:https://doi.org/10.1093/jxb/erp275. PubMed: 19737778.
  10. 10. Wang H, Guo J, Lambert KN, Lin Y (2007) Developmental control of Arabidopsis seed oil biosynthesis. Planta 226: 773-783. doi:https://doi.org/10.1007/s00425-007-0524-0. PubMed: 17522888.
  11. 11. Chen M, Mooney BP, Hajduch M, Joshi T, Zhou M et al. (2009) System Analysis of an Arabidopsis Mutant Altered in de Novo Fatty Acid Synthesis Reveals Diverse Changes in Seed Composition and Metabolism. Plant Physiol 150: 27-41. doi:https://doi.org/10.1104/pp.108.134882. PubMed: 19279196.
  12. 12. Weselake RJ, Taylor DC, Rahman MH, Shah S, Laroche A et al. (2009) Increasing the flow of carbon into seed oil. Biotechnol Adv 27: 866-878. doi:https://doi.org/10.1016/j.biotechadv.2009.07.001. PubMed: 19625012.
  13. 13. Cahoon EB, Shockey JM, Dietrich CR, Gidda SK, Mullen RT et al. (2007) Engineering oilseeds for sustainable production of industrial and nutritional feedstocks: solving bottlenecks in fatty acid flux. Curr Opin Plant Biol 10: 236-244. doi:https://doi.org/10.1016/j.pbi.2007.04.005. PubMed: 17434788.
  14. 14. Bates PD, Durrett TP, Ohlrogge JB, Pollard M (2009) Analysis of Acyl Fluxes through Multiple Pathways of Triacylglycerol Synthesisin Developing Soybean Embryos. Plant Physiol 150: 55-72. doi:https://doi.org/10.1104/pp.109.137737. PubMed: 19329563.
  15. 15. Jayashree B, Ferguson M, Ilut D, Doyle J, Crouch JH (2005) Analysis of genomic sequences from peanut. Electron J Biotechnol 8: 226-237. doi:https://doi.org/10.2225/vol8-issue3-fulltext-3.
  16. 16. Luo M, Dang P, Guo B, He G, Holbrook C et al. (2005) Generation of expressed sequence tags (ESTs) for gene discovery and marker development in cultivated peanut. Crop Sci 45: 346-353. doi:https://doi.org/10.2135/cropsci2005.0346.
  17. 17. Guo B, Chen X, Dang P, Scully BT, Liang X et al. (2008) Peanut gene expression profiling in developing seeds at different reproduction stages during Aspergillus parasiticus infection. BMC Dev Biol 8: 12-18. doi:https://doi.org/10.1186/1471-213X-8-12. PubMed: 18248674.
  18. 18. Guo B, Chen X, Hong Y, Liang X, Dang P et al. (2009) Analysis of gene expression profiles in leaf tissues of cultivated peanuts and development of EST-SSR markers and gene discovery. Int J Plants Genomics, 2009: 715605. doi:https://doi.org/10.1155/2009/715605. PubMed: 19584933. PubMed: 19584933.
  19. 19. Bi YP, Liu W, Xia H, Su L, Zhao CZ et al. (2010) EST sequencing and gene expression profiling of cultivated peanut (Arachis hypogaea L.). Genome 53: 832-839. doi:https://doi.org/10.1139/G10-074. PubMed: 20962890.
  20. 20. Tirumalaraju SV, Jain M, Gallo M (2011) Differential gene expression in roots of nematode-resistant and -susceptible peanut (Arachis hypogaea) cultivars in response to early stages of peanut root-knot nematode (Meloidogyne Arenaria) parasitization. J Plant Physiol 168: 481-492. doi:https://doi.org/10.1016/j.jplph.2010.08.006. PubMed: 20863592.
  21. 21. Payton P, Kottapalli KR, Rowland D, Faircloth W, Guo B et al. (2009) Gene expression profiling in peanut using high density oligonucleotide microarrays. BMC Genomics 10: 265-275. doi:https://doi.org/10.1186/1471-2164-10-265. PubMed: 19523230.
  22. 22. Pandey MK, Monyo E, Ozias-Akins P, Liang X, Guimarães P et al. (2012) Advances in Arachis genomics for peanut improvement. Biotechnol Adv 30: 639-651. doi:https://doi.org/10.1038/nbt.2283. PubMed: 22094114.
  23. 23. Zhang J, Liang S, Duan J, Wang J, Chen S et al. (2012) De novo assembly and Characterisation of theTranscriptome during seed development, and generation of genic-SSR markers in Peanut (Arachis hypogaea L.). BMC Genomics 13: 90-95. doi:https://doi.org/10.1186/1471-2164-13-90. PubMed: 22409576.
  24. 24. Chen X, Zhu W, Azam S, Li HY, Zhu F et al. (2013) Deep sequencing analysis of the transcriptomes of peanut aerial and subterranean young pods identifies candidate genes related to early embryo abortion. Plant Biotechnol J 11: 115-127. doi:https://doi.org/10.1111/pbi.12018. PubMed: 23130888.
  25. 25. Beisson F, Koo AJ, Ruuska S, Schwender J, Pollard M et al. (2003) Arabidopsis genes involved in acyl lipid metabolism. A 2003 census of the candidates, a study of the distribution of expressed sequence tags in organs, and a web-based database. Plant Physiol 132: 681-697. doi:https://doi.org/10.1104/pp.103.022988. PubMed: 12805597.
  26. 26. Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA et al. (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29: 644-652. doi:https://doi.org/10.1038/nbt.1883. PubMed: 21572440.
  27. 27. Wu J, Mao X, Cai T, Luo J, Wei L (2006) KOBA Sserver: a web-based platform for automated annotation and pathway identification. Nucleic Acids Res 34: 720-724. doi:https://doi.org/10.1093/nar/gkl167.
  28. 28. Siloto RM, Findlay K, Lopez-Villalobos A, Yeung EC, Nykiforuk CL et al. (2006) The accumulation of oleosins determines the size of seed oil bodies in Arabidopsis. Plant Cell 18: 1961-1974. doi:https://doi.org/10.1105/tpc.106.041269. PubMed: 16877495.
  29. 29. Shimada TL, Shimada T, Takahashi H, Fukao Y, Hara-Nishimura I (2008) A novel role for oleosins in freezing tolerance of oilseeds in Arabidopsis thaliana. Plant J 55: 798-809. doi:https://doi.org/10.1111/j.1365-313X.2008.03553.x. PubMed: 18485063.
  30. 30. Shimada TL, Hara-Nishimura I (2010) Oil-Body-Membrane proteins and their physiological functions in plants. Biol Pharm Bull 33: 360-363. doi:https://doi.org/10.1248/bpb.33.360. PubMed: 20190393.
  31. 31. Yin D, Liu H, Zhang X, Cui D (2011) A protocol for extraction of high-quality RNA and DNA from peanut plant tissues. Mol Biotechnol 49: 187-191. doi:https://doi.org/10.1007/s12033-011-9391-9. PubMed: 21416202.
  32. 32. Guo B, Chen ZY, Lee RD, Scully BT (2008) Drought stress and preharvest aflatoxin contamination in agricultural commodity: genetics, genomics and proteomics. J Integr Plant Biol 50: 1281-1291. doi:https://doi.org/10.1111/j.1744-7909.2008.00739.x. PubMed: 19017115.
  33. 33. Iseli C, Jongeneel CV, Bucher P (1999) ESTScan: a program for detecting, evaluating, and reconstructing potential coding regions in EST sequences. Proc Int Conf Intell Syst Mol Biol: 138-148. PubMed: 10786296.
  34. 34. Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M (2007) KAAS: anautomatic genome annotation and pathway reconstruction server. Nucleic Acids Res 35: 182-185. doi:https://doi.org/10.1093/nar/gkm321.
  35. 35. Conesa A, Gotz S, Garcia-Gomez JM, Terol J, Talon M et al. (2005) Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21: 3674-3676. doi:https://doi.org/10.1093/bioinformatics/bti610. PubMed: 16081474.
  36. 36. Ye J, Fang L, Zheng H, Zhang Y, Chen J et al. (2006) WEGO: a web tool for plotting GO annotations. Nucleic Acids Res 34: 293-297. doi:https://doi.org/10.1093/nar/gkl031. PubMed: 16845012.
  37. 37. Shamir R, Maron-Katz A, Tanay A, Linhart C, Steinfeld I et al. (2005) EXPANDER-an integrative program suite for microarray data analysis. BMC Bioinformatics 6: 232-244. doi:https://doi.org/10.1186/1471-2105-6-232. PubMed: 16176576.