Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Long non-coding RNAs and mRNAs profiling during spleen development in pig

  • Tiandong Che ,

    Contributed equally to this work with: Tiandong Che, Diyan Li

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Resources, Software, Visualization, Writing – original draft

    Affiliation Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan, China

  • Diyan Li ,

    Contributed equally to this work with: Tiandong Che, Diyan Li

    Roles Conceptualization, Methodology, Project administration, Supervision, Visualization, Writing – review & editing

    mingzhou.li@sicau.edu.cn (ML); diyanli@sicau.edu.cn (DL)

    Affiliation Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan, China

  • Long Jin,

    Roles Investigation, Methodology, Resources

    Affiliation Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan, China

  • Yuhua Fu,

    Roles Formal analysis, Investigation, Methodology, Software

    Affiliation School of Computer Science and Technology, Wuhan University of Technology, Wuhan, Hubei, China

  • Yingkai Liu,

    Roles Investigation, Resources

    Affiliation Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan, China

  • Pengliang Liu,

    Roles Investigation, Resources

    Affiliation Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan, China

  • Yixin Wang,

    Roles Investigation, Resources

    Affiliation Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan, China

  • Qianzi Tang,

    Roles Formal analysis, Methodology, Resources, Visualization

    Affiliation Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan, China

  • Jideng Ma,

    Roles Investigation, Resources, Supervision

    Affiliation Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan, China

  • Xun Wang,

    Roles Investigation, Resources

    Affiliation Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan, China

  • Anan Jiang,

    Roles Project administration, Supervision

    Affiliation Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan, China

  • Xuewei Li,

    Roles Project administration, Supervision

    Affiliation Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan, China

  • Mingzhou Li

    Roles Conceptualization, Methodology, Project administration, Resources, Supervision, Writing – review & editing

    mingzhou.li@sicau.edu.cn (ML); diyanli@sicau.edu.cn (DL)

    Affiliation Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan, China

Abstract

Genome-wide transcriptomic studies in humans and mice have become extensive and mature. However, a comprehensive and systematic understanding of protein-coding genes and long non-coding RNAs (lncRNAs) expressed during pig spleen development has not been achieved. LncRNAs are known to participate in regulatory networks for an array of biological processes. Here, we constructed 18 RNA libraries from developing fetal pig spleen (55 days before birth), postnatal pig spleens (0, 30, 180 days and 2 years after birth), and the samples from the 2-year-old Wild Boar. A total of 15,040 lncRNA transcripts were identified among these samples. We found that the temporal expression pattern of lncRNAs was more restricted than observed for protein-coding genes. Time-series analysis showed two large modules for protein-coding genes and lncRNAs. The up-regulated module was enriched for genes related to immune and inflammatory function, while the down-regulated module was enriched for cell proliferation processes such as cell division and DNA replication. Co-expression networks indicated the functional relatedness between protein-coding genes and lncRNAs, which were enriched for similar functions over the series of time points examined. We identified numerous differentially expressed protein-coding genes and lncRNAs in all five developmental stages. Notably, ceruloplasmin precursor (CP), a protein-coding gene participating in antioxidant and iron transport processes, was differentially expressed in all stages. This study provides the first catalog of the developing pig spleen, and contributes to a fuller understanding of the molecular mechanisms underpinning mammalian spleen development.

Introduction

The developmental complexity of organisms arises from elaborate gene regulation rather than an increase in the number of protein-coding genes [1]. Systematic transcriptome analyses by deep sequencing of human cell lines have revealed that ~75% of the human genome is transcribed into primary transcripts, but less than 3% of the genome accounts for protein-coding transcripts [2]. This means that the vast majority of the mammalian genome generates non-coding RNAs (ncRNAs). Long non-coding RNAs (lncRNAs) are defined as ncRNAs with a length longer than 200 nt. It has been proposed that lncRNAs may serve as versatile regulators of diverse aspects of biology in physiological and pathological contexts [3,4]. At present, a large number of lncRNAs has been discovered in mammals [59], and many play an important role in biological processes such as X-chromosome inactivation [10], genomic imprinting [11], disease [3], cell differentiation and development [3,12].

Pigs (Sus scrofa) are domesticated in multiple centers across Eurasia [13,14], where they are important for agricultural economics. To date, over ten genomes of pig breeds with distinct phenotypes have been published [1519], constituting a huge resource to understand whole genome characteristics of this species. Moreover, pigs have been employed as a biomedical model for research on human diseases [2022]. Spleen, the secondary lymphoid organ, has functions in iron metabolism, immunization and hematopoiesis [23]. However, very little transcriptomic research on pig spleen has been published. To improve the robustness and resilience of pigs against pathogens through selection, a better understanding of transcriptomic factors involved in the immune response is required [24].

Here, we report the systematic identification and characterization of lncRNAs in pig spleen during five developmental stages using an Illumina HiSeq platform. A total of 15,040 lncRNAs were identified. To the best of our knowledge, no other report describing spleen lncRNAs and their biological function in pigs is currently available. Our results not only provide a useful resource for better understanding the regulatory functions of lncRNAs in pig and annotation of the pig genome, but also they contribute to better comprehension of mammalian spleen development.

Materials and methods

Animals and sample collection

Three healthy female pigs (Rongchang pig, a Chinese indigenous breed in Rongchang, Chongqing) were examined at each of the five developmental stages in this study, including E55d (55 days before birth), B0d (just after birth), W30d (weaned for 30 days), A180d (180 days of age) and A2Y (2 years of age). In addition, three 2-year old female Wild Boars (WB) were included, which were collected from Ya’an, Sichuan. The spleen was rapidly dissected from each carcass and immediately frozen in liquid nitrogen. All samples were stored at −80°C until total RNA extraction. Animals were humanely killed to ameliorate suffering by intravenous injection with 2% pentobarbital sodium (25 mg/Kg). All experimental procedures and sample collection in this study were approved by the Institutional Animal Care and Use Committee (IACUC) of Sichuan Agricultural University, under permit No. DKY-B20141401.

RNA-seq

Total RNA was isolated using a standard Trizol (Invitrogen) protocol. Genomic DNA was removed using DNaseI. Eighteen strand-specific cDNA libraries for paired-end 150 bp sequencing were prepared using dUTP protocols. Libraries were sequenced on Illumina’s HiSeq platform. More than 10 Gb of high quality data were obtained per library.

Identification of lncRNAs

To obtain high quality lncRNAs, we replaced the heterosome of the reference genome with the newest version [25]. The filtered clean reads were mapped to the new reference genome using Tophat version 2.0.11 [26]. Transcripts were assembled by Stringtie version 1.3.3 [27]. Transcripts with lengths < 200 nt were filtered. Next, assembled transcripts from each sample were merged into a consensus transcriptome using previously published custom Python scripts [8]. Cuffcompare version 2.2.1 [28] was used to remove the transcripts that were annotated in the reference as “c” and “=“ (“c” for partial match, and “=“ for full match). Remaining transcripts that contained a known protein-coding domain were removed by Hmmscan [29] and BLASTX. The Coding Potential Calculator (CPC) [30] was used to assess the coding potential of the remaining transcripts, and transcripts with CPC score > 0 were removed. Finally, the remaining transcripts with FPKM > 0 at least in one biological replicate were annotated as lncRNAs.

Classification of lncRNAs

LncRNAs were classified based on their genomic characterization by FEELnc [31]. The resulting set of lncRNAs was subdivided into five categories: (1) no overlap with other loci, classified as intergenic lncRNAs (lincRNAs); (2) overlap with sense intron; (3) overlap with antisense intron; (4) overlap with sense exon; and (5) overlap with antisense exon. The last four classes include two conditions: (a) the lncRNA contains the intron or exon; (b) the lncRNA is contained within the intron or exon.

Expression analysis

Stringtie version 1.3.3 were applied to quantify protein-coding genes and lncRNAs expression, and obtained FPKM expression values (denoted as fragments per kb of transcript per Mb of mapped reads). FPKM > 0.1 was used to filter the expressed protein-coding genes [32]. Log2 transformed values of (FPKM+1) were used in subsequent analyses. Pearson correlations were calculated across developmental stages. Principal Component Analysis (PCA) was carried out using R.

The Shannon entropy (H) [33] is calculated as: where Pt/g is the relative expression of a gene g in a stage t relative to its expression given in N stages. This value has units of bits ranging from zero, indicating genes expressed in a single stage, to log2(N), indicating genes expressed uniformly in all developmental stages considered. DESeq2 [34] was applied to detect differentially expressed protein-coding genes and lncRNAs based on the read count produced from FPKM by Stringtie version 1.3.3. Benjamini-adjusted P values ≤ 0.05 were identified as DEGs.

Time-series analysis

Time-series analysis was performed by STEM (Short Time-series Expression Miner) [35]. Significantly enriched model profiles are indicated by different colors (Bonferroni-adjusted P values ≤ 0.05). Model profiles of the same color belong to the same cluster of profiles.

Co-expression networks

A co-expression network was constructed across the five developmental stages by R package WGCNA [36] to analyze sets of protein-coding genes and lncRNAs that were significant for each time point, as evaluated by the R package DESeq2 with a time-series model.

PSI estimation

Values for PSI (percent spliced in) were calculated as previously described [37]. The PSI value of each annotated exon was obtained at every developmental stage. PCA and Pearson correlations were performed to analyze expression.

Functional enrichment analyses for genes

The DAVID (Database for Annotation, Visualization and Integrated Discovery) web server was used to perform functional enrichment analysis of Gene Ontology (GO) and KEGG pathway categories. Genes were mapped to their respective human orthologous and the resulting list was submitted to DAVID for enrichment analysis of significant overrepresentation of GO biological processes (GO-BP), molecular function (GO-MF) terminologies, and KEGG pathway categories. During the analysis, the whole gene set was treated as the background. Only terms with Benjamini-adjusted P values ≤ 0.05 were considered significant.

Results

Identification of lncRNAs in pig spleen

To systematically identify lncRNAs and their spatiotemporal expression profiles during spleen development in pig, we constructed 18 cDNA libraries for strand-specific, paired-end 150 bp sequencing on Illumina’s HiSeq platform, representing five important developmental stages: E55d (55 days before birth), B0d (just after birth), W30d (weaned for 30 days), A180d (180 days old) and A2Y (2 years old, including Wild Boar). On average, we obtained about ~37.03 Gb of high quality data per stage (S1 Table). We developed a highly stringent step-wise protocol to discard transcripts not possessing all the characteristics of lncRNAs (S1 Fig). We identified putative lncRNAs by considering their homology with known proteins, containment of a known protein-coding domain, and coding potential. In total, we obtained a “high-confidence” set of 15,040 lncRNAs in pig spleen, each of which was expressed in at least one biological replicate (FPKM > 0) (S2 Table). We found that 13,047 lncRNAs existed in WB. Among them, 195 lncRNAs were specific to WB, which were not identified in other domestic pig samples in our study.

Genomic characterization and classification of lncRNAs

Compared with protein-coding genes, lncRNAs are shorter in length, have fewer exons, and are expressed at lower levels (S2A–S2C and S3 Figs). These phenomenons all exist in domestic pigs and WBs, which are consistent with previous studies in mammals [3840]. According to their genomic location, results of FEELnc analysis for the best lncRNA-mRNA partner interaction of each of 12,188 identified lncRNAs could be partitioned into five groups: 9,063 intergenic lncRNAs (lincRNAs) without any gene overlap, 1,195 sense exon-overlapping lncRNAs, 568 antisense exon-overlapping lncRNAs, 268 sense intron-overlapping lncRNAs, and 1,094 antisense intron-overlapping lncRNAs (S2D Fig). Functional enrichment analysis was performed on genes whose exon or intron overlapped with lncRNAs. The results showed that T cell activation involved in immune response (GO:0002286), humoral immune response (GO:0006959), adaptive immune response (GO:0002250), and blood coagulation (GO:0007596) were significantly enriched for genes with intron-overlapping lncRNAs (S3 Table).

Expression profiles of protein-coding genes and lncRNAs

Based on expression profiles (S2 and S4 Tables), we observed that samples not only could be separated by different developmental stages, but also could be distinguished between A2Y domestic pig and WB, indicating that both protein-coding genes and lncRNAs are expressed in a stage-specific manner (Fig 1 and S4 Fig). To follow the expression dynamics of lncRNAs and protein-coding genes as development proceeds, we calculated Pearson correlations between each pair of samples based on the expression profiles of all five developmental stages. This resulted in the discovery of two interesting phenomenon. First, comparison of independently clustered expression profiles of samples revealed that both protein-coding genes and lncRNAs could be grouped into three broad classes (Fig 2A): (1) samples from the embryonic development stage (E55d) were congregated, (2) samples from the suckling period (B0d and W30d) were clustered, and (3) samples from the late stage of development (A180d and A2Y) formed the last group.

thumbnail
Fig 1. Expression profile and PCA of protein-coding genes.

(A) Heat map showing the expression profile of protein-coding genes. The top panel is the tree constructed by Pearson correlation. (B) Two-way PCA plot of protein-coding genes based on expression profile.

https://doi.org/10.1371/journal.pone.0193552.g001

thumbnail
Fig 2. Temporal expression profiles of protein-coding genes and lncRNAs.

(A) Dynamic changes in expression profiles of protein-coding genes and lncRNAs. The top panel shows protein-coding genes and the bottom panel shows lncRNAs. Values represent the pairwise Pearson correlation. Correlation between every two samples was calculated by log2-transformed (FPKM+1) gene expression values. Three main expression patterns can be distinguished. (B) Distributions of Shannon entropy-based temporal specificity scores were calculated for distinct classes of lncRNAs and protein-coding genes.

https://doi.org/10.1371/journal.pone.0193552.g002

Second, the power of correlation between two consecutive stages was weaker for lncRNAs than for protein-coding genes. This observation implies that lncRNAs have a more restricted temporal expression than protein-coding genes. To further validate this hypothesis, we calculated the Shannon entropy (H) value as a measure of the specificity of gene expression across developmental stages. All three classes of lncRNAs (lincRNAs, and intron-overlapping and exon-overlapping lncRNAs) showed increased temporal specificity compared with protein-coding genes (Fig 2B).

Time-series analysis and co-expression network of protein-coding genes and lncRNAs

To explore the dynamic expression pattern of protein-coding genes and lncRNAs, we adopted a time-series analysis by STEM (Short Time-series Expression Miner). According to their dynamic expression patterns across the five development stages, we found that 12,913 protein-coding genes and 15,040 lncRNAs were classified into six and eight cluster profiles, respectively, which included 12 and 11 significantly enriched model profiles (colored in figure, Bonferroni adjusted P values ≤ 0.05), respectively (Fig 3A, S5 and S6 Tables). To our surprise, eight model profiles existed in both protein-coding genes and lncRNAs, which indicated that their expression patterns during developmental stages were highly correlated. This implies functional relatedness or a regulatory relationship between protein-coding genes and lncRNAs. Combining gene number and significance level, we used the genes in red (increased expression level with time) and green (decreased expression level with time) modules to perform functional enrichment analysis. Interestingly, genes in green modules were mainly enriched for cell division (GO:0051301) and DNA replication (GO:0006260) (Fig 3B and S7 Table), while genes in red modules were mainly enriched for immune response (GO:0006955) and inflammatory response (GO:0006954) (Fig 3C and S8 Table). This finding is consistent with the process of spleen development.

thumbnail
Fig 3. Time-series modules and co-expression network of lncRNAs and protein-coding genes.

(A) Time-series modules of protein-coding genes and lncRNAs. The top panel shows protein-coding genes and the second panel shows lncRNAs. Numbers in the top left corner indicate module number. Numbers in lower left corners indicate numbers of protein-coding genes or lncRNAs in each module. The same color was used to represent each cluster. Functional categories of genes in green (B) and red modules (C). Benjamini adjusted P values were transformed by ‒log10. (D) Heat map showing the largest two co-expression networks of protein-coding genes. Values represent log2(FPKM+1) of each gene in each sample minus the mean value of each gene across all samples.

https://doi.org/10.1371/journal.pone.0193552.g003

To explore the functional relatedness between protein-coding genes and lncRNAs, we used co-expression analysis. First, we screened out the set of protein-coding genes and lncRNAs that were related with the factor of time by DESeq2. The differentially expressed protein-coding genes and lncRNAs between A2Y domestic pig and WB were also included. Next, we built the co-expression network based on this set of protein-coding genes and lncRNAs by WGCNA. FPKM > 0.1 was used as the cutoff to filter the protein-coding genes and lncRNAs. A total of 5,474 protein-coding genes and 4,223 lncRNAs were used to construct the co-expression network. Finally, we observed 11 modules and only considered the two largest modules and the overlapping genes (Fig 3D and S9 Table), which accounted for 73.23% of total genes in the 11 modules. Functional enrichment analysis showed that the co-expressed genes in the two largest modules were enriched in a variety of biological processes. Some of them were related to immune response (GO:0006955), inflammatory response (GO:0006954), protein binding (GO:0005515), cell proliferation (GO:0008283) and DNA replication (GO:0006260) (S10 Table).

Identification of differentially expressed protein-coding genes and lncRNAs

To explore the biological function of each stage, we performed pairwise comparisons between the five developmental stages. First, we identified differentially expressed protein-coding genes and lncRNAs between one stage and each of the other four stages (S11 and S12 Tables). Next, we merged the former protein-coding genes and lncRNAs into a non-redundant set for each stage. A Venn diagram was constructed using the non-redundant set (Fig 4A and 4B). Only one protein-coding gene and lncRNA was differentially expressed in all five developmental stages. This protein-coding gene, CP, encodes ceruloplasmin precursor, which has ferroxidase activity to oxidize Fe2+ to Fe3+ without releasing radical oxygen species, suggesting it may be related to antioxidant processes [41]. Interesting, it is also involved in iron transport across the cell membrane, which is consistent with the main biological function of spleen [23]. In addition, we also found that the expression level of CP was almost zero at E55d and reached a maximum at B0d. Afterwards, it gradually declined and remained stable throughout the process of growth and development (Fig 4C). Compared with CP, expression of the lncRNA (TU78568) gradually increased across all five stages (Fig 4C), suggesting it may play an important role in spleen development. However, it is disappointing that this lncRNA is located on the scaffold (GL895479.1), where there is no protein-coding gene in the adjacent position.

thumbnail
Fig 4. Differentially expressed protein-coding genes and lncRNAs, and PCA of PSI values.

Venn diagram of common differentially expressed protein-coding genes (A) and lncRNAs (B) in five developmental stages. (C) Dynamic expression profiles of CP and TU78568. (D) Two-way PCA plot of protein-coding genes based on PSI values.

https://doi.org/10.1371/journal.pone.0193552.g004

Functional enrichment analysis was performed on stage-specific differentially expressed protein-coding genes. Genes at E55d were significantly enriched for immune response (GO:0006955) and inflammatory response (GO:0006954) (S13 Table). No significant enrichment for biological functions was observed during the other four stages, which may result from the lack of gene number. Next, non-redundant differentially expressed protein-coding genes for each stage were analyzed for functional enrichment. E55d and B0d were enriched for terms related to immunity and inflammation (S14 Table). Differentially expressed protein-coding genes between WB and A2Y domestic pig were enriched for terms related to protein binding and hematopoietic cell lineages (S15 Table).

Splicing landscape of protein-coding genes

To explore alternative splicing levels of protein-coding genes across developmental stages, we performed analyses to calculate “percent spliced in” (PSI) values for exons, similar to analysis performed for gene expression levels. Consistent with the results of expression profiling, alternative splicing could also separate samples by different developmental stages, including domestic pig and WB (Fig 4D), indicating that splicing levels have spatiotemporal specificity as well. Although the power of alternative splicing levels to distinguish populations was not stronger than gene expression profiles, individual differences were more distinguishable at the level of splicing.

Discussion

In this study, we developed a highly stringent filtering pipeline to identify lncRNAs (S1 Fig). In total, we identified 15,040 lncRNAs in the developing pig spleen. To the best of our knowledge, this is the first report to systematically identify lncRNAs during pig spleen development using RNA-seq data.

According to our results, protein-coding genes were highly temporally restricted in terms of both expression profiles and levels of alternative splicing, indicating the function of protein-coding genes is stage-specific [42]. Compared with protein-coding genes, lncRNAs were expressed during narrower time windows (Fig 2B), indicating lncRNAs were more temporally restricted than protein-coding genes [43]. Time-series analysis indicated that both protein-coding genes and lncRNAs are dynamically expressed during spleen development and have similar expression patterns. These results suggest consistent biological function between protein-coding genes and lncRNAs. Genes in red modules with up-regulated expression levels were enriched for genes related to immune and inflammatory responses, such as TNF, IL18, IL15, IL10 and IL7R [44, 45]. Among of these genes, the TNF superfamily plays an important role in immunization and inflammation [46, 47]. Genes in green modules with down-regulated expression levels were enriched for biological functions associated with cell division and DNA replication, such as the CDC family and CDK6 [48]. In addition, we also observed two GO terms, vascular endothelial growth factor receptor signaling pathway (GO: 0048010) and platelet degranulation (GO:0002576) (S7 Table), which implied the spleen might participate in the process of hematopoiesis [49] during the process of development. Functional analysis of co-expressed genes showed similar results, further confirming this biological function of the spleen. CP was differentially expressed in all five developmental stages, with expression levels ranging from almost none to maximum during the first two stages. B0d represented the moment when the pig birthed and the environment changed from womb to natural atmosphere. This process involves the piglet encountering a high amount of oxygen, which may activate antioxidant mechanisms. CP can oxidize Fe2+ to Fe3+ without releasing radical oxygen species, and is related to antioxidant processes. Its expression level gradually declined and tended to remain stable at subsequent stages, suggesting CP might participate in iron transport during these stages. This is also consistent with the splenic function of filtering blood and recycling iron from aging red blood cells [23].

In our study, we added the sample of WB, which enriched our data and made us obtain more lncRNAs in pig spleen. Based on our results, the character of lnRNAs in WB was consistent with domestic pig. Besides, according to the expression profiles of protein-coding genes and lncRNAs, WBs were clustered together with the A180d and A2Y domestic pigs but not separated into a single one, which implied that both protein-coding genes and lncRNAs had stronger stage-specific than breed-specific.

In summary, our study provides the first catalog of lncRNAs in the developing pig spleen. Our results suggest numerous roles of lncRNAs in spleen development and provide a high quality resource for future transcriptomic, genetic and genomic studies.

Supporting information

S1 Fig. Identification pipeline of lncRNAs.

https://doi.org/10.1371/journal.pone.0193552.s001

(TIF)

S2 Fig. Genomic characterization of lncRNAs.

(A) Distribution of transcript length for lncRNAs and protein-coding genes. (B) Exon number distribution of lncRNAs and protein-coding genes. (C) Comparison of the expression levels of lncRNAs and protein-coding genes. (D) Classification of lncRNAs.

https://doi.org/10.1371/journal.pone.0193552.s002

(TIF)

S3 Fig. Comparison of the expression levels of lncRNAs and protein-coding genes.

https://doi.org/10.1371/journal.pone.0193552.s003

(TIF)

S4 Fig. Expression profile and PCA of lncRNAs.

(A) Heat map shows the expression profile of lncRNAs. The top panel shows the tree constructed by Pearson correlation. (B) Two-way PCA plot of lncRNAs based on expression profile.

https://doi.org/10.1371/journal.pone.0193552.s004

(TIF)

S1 Table. Summary of samples for total RNA sequencing.

https://doi.org/10.1371/journal.pone.0193552.s005

(DOCX)

S2 Table. The identified lncRNAs in pig spleen.

https://doi.org/10.1371/journal.pone.0193552.s006

(XLSX)

S3 Table. Functional categories of genes of intronic overlapping lncRNAs.

https://doi.org/10.1371/journal.pone.0193552.s007

(XLSX)

S4 Table. The log2-transformed (FPKM+1) of protein-coding genes.

https://doi.org/10.1371/journal.pone.0193552.s008

(XLSX)

S5 Table. Protein-coding gene list in different model profiles of STEM.

https://doi.org/10.1371/journal.pone.0193552.s009

(XLSX)

S6 Table. LncRNA list in different model profiles of STEM.

https://doi.org/10.1371/journal.pone.0193552.s010

(XLSX)

S7 Table. Functional categories of genes in green modules.

https://doi.org/10.1371/journal.pone.0193552.s011

(XLSX)

S8 Table. Functional categories of genes in red modules.

https://doi.org/10.1371/journal.pone.0193552.s012

(XLSX)

S9 Table. Protein-coding gene and lncRNA list in co-expression network.

https://doi.org/10.1371/journal.pone.0193552.s013

(XLSX)

S10 Table. Functional categories of genes in co-expression network.

https://doi.org/10.1371/journal.pone.0193552.s014

(XLSX)

S11 Table. Differentially expressed protein-coding genes in pairwise stages.

https://doi.org/10.1371/journal.pone.0193552.s015

(XLSX)

S12 Table. Differentially expressed lncRNAs in pairwise stages.

https://doi.org/10.1371/journal.pone.0193552.s016

(XLSX)

S13 Table. Functional categories of genes specific differentially expressed in E55d.

https://doi.org/10.1371/journal.pone.0193552.s017

(XLSX)

S14 Table. Functional categories of genes differentially expressed in E55d and B0d.

https://doi.org/10.1371/journal.pone.0193552.s018

(XLSX)

S15 Table. Functional categories of genes differentially expressed between WB and domestic pig.

https://doi.org/10.1371/journal.pone.0193552.s019

(XLSX)

References

  1. 1. Morris KV, Mattick JS. The rise of regulatory RNA. Nat Rev Genet. 2014;15(6):423. pmid:24776770
  2. 2. Djebali S, Davis CA, Merkel A, Dobin A, Lassmann T, Mortazavi AM, et al. Landscape of transcription in human cells. Nature. 2012;489(7414):101. pmid:22955620
  3. 3. Batista PJ, Chang HY. Long noncoding RNAs: cellular address codes in development and disease. Cell. 2013;152(6):1298–307. pmid:23498938
  4. 4. Sauvageau M, Goff LA, Lodato S, Bonev B, Groff AF, Gerhardinger C, et al. Multiple knockout mouse models reveal lincRNAs are required for life and brain development. Elife. 2013;2:e01749. pmid:24381249
  5. 5. Bakhtiarizadeh MR, Hosseinpour B, Arefnezhad B, Shamabadi N, Salami SA. In silico prediction of long intergenic non-coding RNAs in sheep. Genome. 2016;59(4):263–75. pmid:27002388
  6. 6. Billerey C, Boussaha M, Esquerré D, Rebours E, Djari A, Meersseman C, et al. Identification of large intergenic non-coding RNAs in bovine muscle using next-generation transcriptomic sequencing. BMC genomics. 2014;15(1):499. pmid:24948191
  7. 7. Goff LA, Groff AF, Sauvageau M, Trayes-Gibson Z, Sanchez-Gomez DB, Morse M, et al. Spatiotemporal expression and transcriptional perturbations by long noncoding RNAs in the mouse brain. Proc. Natl. Acad. Sci. USA. 2015;112(22):6855–62. pmid:26034286
  8. 8. Iyer MK, Niknafs YS, Malik R, Singhal U, Sahu A, Hosono Y, et al. The landscape of long noncoding RNAs in the human transcriptome. Nat Genet. 2015;47(3):199–208. pmid:25599403
  9. 9. Ran M, Chen B, Li Z, Wu M, Liu X, He C, et al. Systematic identification of long noncoding RNAs in immature and mature porcine testes. Biol Reprod. 2016;94(4):77, 1–9. pmid:26935596
  10. 10. Lee JT, Bartolomei MS. X-inactivation, imprinting, and long noncoding RNAs in health and disease. Cell. 2013;152(6):1308–23. pmid:23498939
  11. 11. Lee JT. Epigenetic regulation by long noncoding RNAs. Science. 2012;338(6113):1435–9. pmid:23239728
  12. 12. Fatica A, Bozzoni I. Long non-coding RNAs: new players in cell differentiation and development. Nat Rev Genet. 2014;15(1):7. pmid:24296535
  13. 13. Frantz LA, Schraiber JG, Madsen O, Megens HJ, Bosse M, Paudel Y, et al. Genome sequencing reveals fine scale diversification and reticulation history during speciation in Sus. Genome Biol. 2013;14(9):R107. pmid:24070215
  14. 14. Larson G, Dobney K, Albarella U, Fang M, Matisoo-Smith E, Robins J, et al. Worldwide phylogeography of wild boar reveals multiple centers of pig domestication. Science. 2005;307(5715):1618–21. pmid:15761152
  15. 15. Ai H, Fang X, Yang B, Huang Z, Chen H, Mao L, et al. Adaptation and possible ancient interspecies introgression in pigs identified by whole-genome sequencing. Nat Genet. 2015;47(3):217–25. pmid:25621459
  16. 16. Fang X, Mou Y, Huang Z, Li Y, Han L, Zhang Y, et al. The sequence and analysis of a Chinese pig genome. Gigascience. 2012;1(1):16. pmid:23587058
  17. 17. Groenen MA, Archibald AL, Uenishi H, Tuggle CK, Takeuchi Y, Rothschild MF, et al. Analyses of pig genomes provide insight into porcine demography and evolution. Nature. 2012;491(7424):393–8. pmid:23151582
  18. 18. Li M, Chen L, Tian S, Lin Y, Tang Q, Zhou X, et al. Comprehensive variation discovery and recovery of missing sequence in the pig genome using multiple de novo assemblies. Genome Res. 2017;27(5):865–74. pmid:27646534
  19. 19. Vamathevan JJ, Hall MD, Hasan S, Woollard PM, Xu M, Yang Y, et al. Minipig and beagle animal model genomes aid species selection in pharmaceutical discovery and development. Toxicol Appl Pharmacol. 2013;270(2):149–57. pmid:23602889
  20. 20. Meurens F, Summerfield A, Nauwynck H, Saif L, Gerdts V. The pig: a model for human infectious diseases. Trends Microbiol. 2012;20(1):50–7. pmid:22153753
  21. 21. Prather RS, Lorson M, Ross JW, Whyte JJ, Walters E. Genetically engineered pig models for human diseases. Annu Rev Anim Biosci. 2013;1(1):203–19. pmid:25387017
  22. 22. Ozawa M, Himaki T, Ookutsu S, Mizobe Y, Ogawa J, Miyoshi K, et al. Production of cloned miniature pigs expressing high levels of human apolipoprotein (a) in plasma. PLoS One. 2015;10(7):e0132155. pmid:26147378
  23. 23. Bronte V, Pittet MJ. The spleen in local and systemic regulation of immunity. Immunity. 2013;39(5):806–18. pmid:24238338
  24. 24. Schroyen M, Tuggle CK. Current transcriptomics in pig immunity research. Mamm Genome. 2015;26(1–2):1–20. pmid:25398484
  25. 25. Skinner BM, Sargent CA, Churcher C, Hunt T, Herrero J, Loveland JE, et al. The pig X and Y Chromosomes: structure, sequence, and evolution. Genome Res. 2016;26(1):130–9. pmid:26560630
  26. 26. Trapnell C, Pachter L, Salzberg SL. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009;25(9):1105–11. pmid:19289445
  27. 27. Pertea M, Pertea GM, Antonescu CM, Chang T-C, Mendell JT, Salzberg SL. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat Biotechnol. 2015;33(3):290–5. pmid:25690850
  28. 28. Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, Van Baren MJ, et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010;28(5):511–5. pmid:20436464
  29. 29. Eddy SR. A new generation of homology search tools based on probabilistic inference. Genome Inform; 2009.
  30. 30. Kong L, Zhang Y, Ye Z-Q, Liu X-Q, Zhao S-Q, Wei L, et al. CPC: assess the protein-coding potential of transcripts using sequence features and support vector machine. Nucleic Acids Res. 2007;35(suppl_2):W345–W9. pmid:17631615
  31. 31. Wucher V, Legeai F, Hédan B, Rizk G, Lagoutte L, Leeb T, et al. FEELnc: a tool for long non-coding RNA annotation and its application to the dog transcriptome. Nucleic Acids Res. 2017;45(8):e57–e. pmid:28053114
  32. 32. Consortium GT. The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans. Science. 2015;348(6235):648–60. pmid:25954001
  33. 33. Schug J, Schuller W-P, Kappen C, Salbaum JM, Bucan M, Stoeckert CJ. Promoter features related to tissue specificity as measured by Shannon entropy. Genome Biol. 2005;6(4):R33. pmid:15833120
  34. 34. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15(12):550. pmid:25516281
  35. 35. Ernst J, Bar-Joseph Z. STEM: a tool for the analysis of short time series gene expression data. BMC Bioinformatics. 2006;7(1):191. pmid:16597342
  36. 36. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9(1):559. pmid:19114008
  37. 37. Barbosa-Morais NL, Irimia M, Pan Q, Xiong HY, Gueroussov S, Lee LJ, et al. The evolutionary landscape of alternative splicing in vertebrate species. Science. 2012;338(6114):1587–93. pmid:23258890
  38. 38. Guttman M, Garber M, Levin JZ, Donaghey J, Robinson J, Adiconis X, et al. Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs. Nat Biotechnol. 2010;28(5):503–10. pmid:20436462
  39. 39. Cabili MN, Trapnell C, Goff L, Koziol M, Tazon-Vega B, Regev A, et al. Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. Genes Dev. 2011;25(18):1915–27. pmid:21890647
  40. 40. Wang Y, Xue S, Liu X, Liu H, Hu T, Qiu X, et al. Analyses of long non-coding RNA and mRNA profiling using RNA sequencing during the pre-implantation phases in pig endometrium. Sci Rep. 2016;6. pmid:26822553
  41. 41. Goldstein IM, Charo IF. Ceruloplasmin: an acute phase reactant and antioxidant. Lymphokines. 2014;8:373–411.
  42. 42. Tang Z, Wu Y, Yang Y, Yang Y-CT, Wang Z, Yuan J, et al. Comprehensive analysis of long non-coding RNAs highlights their spatio-temporal expression patterns and evolutional conservation in Sus scrofa. Sci Rep. 2017;7. pmid:28233874
  43. 43. Pauli A, Valen E, Lin MF, Garber M, Vastenhouw NL, Levin JZ, et al. Systematic identification of long noncoding RNAs expressed during zebrafish embryogenesis. Genome Res. 2012;22(3):577–91. pmid:22110045
  44. 44. Lauw FN, Pajkrt D, Hack CE, Kurimoto M, van Deventer SJH, van der Poll T. Proinflammatory effects of IL-10 during human endotoxemia. J Immunol. 2000;165(5):2783–9. pmid:10946310
  45. 45. Locksley RM, Killeen N, Lenardo MJ. The TNF and TNF receptor superfamilies: integrating mammalian biology. Cell. 2001;104(4):487–501. pmid:11239407
  46. 46. Hotamisligil GS. Inflammation, metaflammation and immunometabolic disorders. Nature. 2017;542(7640):177–85. pmid:28179656
  47. 47. Kallaur AP, Reiche EMV, Oliveira SR, Pereira WLdCJ, Alfieri DF, Flauzino T, et al. Genetic, immune-inflammatory, and oxidative stress biomarkers as predictors for disability and disease progression in multiple sclerosis. Mol Neurobiol. 2017;54(1):31–44. pmid:26732588
  48. 48. Salazar-Roa M, Malumbres M. Fueling the cell division cycle. Trends Cell Biol. 2017;27(1):69–81. pmid:27746095
  49. 49. Wolber FM, Leonard E, Michael S, Orschell-Traycoff CM, Yoder MC, Srour EF. Roles of spleen and liver in development of the murine hematopoietic system. Exp Hematol. 2002;30(9):1010–9. pmid:12225792