RNA-sequencing analysis results
After successfully constructing six transcriptome libraries for mRNA expression profiling. We obtained 36.61 and 33.86 million raw sequence reads for each LTH and STH sample, respectively (Table 1), with 96.88% (24.33/25.08 million reads) and 94.53% (19.22/20.24 million reads) of these reads, respectively, being uniquely aligned to the Oar_v3.1 sheep reference genome. Over 99% of these reads exhibited a QC-score >20 and mapping rates were very high, consistent with the reliability of our results. Even distributions of mapped reads to all chromosomes were observed, with over 98% of the bases in these mapped reads corresponding to mRNA (LTH: 98.2%; STH 99.63%).
In total, RNA sequencing yielded 44,582,440 (L1), 21,670,754 (L2), 43,891,310 (L3), 15,088,076 (S1), 42,549,998 (S2), and 44,334,664 (S3) paired-end reads for the six individuals sequenced LTH and STH sheep. After quality control and filtering of these reads, 444,37,680, 21,622,512, 43,776,088, 15,040,666, 42,442,906, and 44,092,952 valid reads were obtained from the indicated sheep (Table 1). These reads were then mapped to the annotated ovine genome (Oar_v3.1), with 73.9%, 57.6%, 68.3%, 55.4%, 45.1%, and 75.4% of the reads in these six individual LTH and STH sheep having successfully mapped to the genome, including 72.9%, 61.4%, 64.8% in the LTH sheep group and 59.9%, 49.9%, and 69.3% in the STH group that had mapped to genes (Table 1).
An average of 16 million raw reads per library was obtained following miRNA library sequencing, yielding 12,248,962, 15,829,550, 16,172,015, 15,480,083, 12,823,610, and 12,314,699 clean reads that were successfully mapped to the sheep genome for the L1, L2, L3, S1, S2, and S3 libraries, respectively (Table 2). Of these cleaned reads, 27,123 and 27,072 from LTH and STH sheep, respectively, aligned to unique miRNA sequences (Table 2). Total and unique RNA frequency distributions are shown in Figure 1, revealing highly diverse size distribution mapping to the sheep genome (Fig. 2). Most reads were between 18 to 36 nucleotides-long reads, with 22 nucleotides-long reads being the most common, followed by 21, 23, and 32 nucleotides-long (Fig. 3), consistent with the typical length of miRNAs. Proportions of small non-coding RNAs expression levels in these samples are shown in Table 3. Of these RNAs, miRNAs were the most abundantly expressed in libraries from both LTH and STH sheep (mean = 72.8%; Fig. 1), whereas other RNAs including ribosomal RNAs (rRNAs), small nucleolar RNAs (snoRNAs), small nuclear RNAs (snRNAs), and transfer RNAs (tRNAs) comprised 27.2% of the total expressed reads. Repeat-associated RNAs made up just 3% of these unique tags, while 7% of these tags did not map to any small non-coding RNA databases or to the sheep genome.
While miRNA do not directly encode proteins, their expression levels can offer functional insights into their role as post-transcriptional regulators. As miRNA functionality is also dependent upon spatiotemporal expression of target mRNAs, we therefore conducted integrated analyses of mRNA and miRNA profiles for LTH and STH sheep. In total, we identified 132 miRNAs in the tail fat tissue of these sheep (Table 4), with an average of 130 known miRNAs per sample (range: 125-132). Maxium miRNAs numbers were detected in sheep S2, while the lowest number of miRNAs was detected in the sample from sheep L2.
Analysis of differentially expressed genes between STH and LTH sheep
We next utilized the Cuffdiff software to identify DEGs when comparing STH and LTH sheep samples [19]. In total, we detected 20,137 mRNAs that were expressed in these samples (RPKM>1), including the majority of annotated sheep reference genes (Additional file 1: Table S1). We then assessed correlation between gene expression profiles in different samples, revealing generally strong Pearson correlation coefficients (ranging from 0.724 - 0.916; Fig. 4). When selecting DEGs, STH sheep were treated as the control group and LTH sheep as the experimental group. Clustering analyses (Fig. 5A), volcano plots (Fig. 6) were used to identify DEGs that were up- and down-regulated in LTH sheep relative to STH sheep. In total, we detected 521 significant DEGs in tail adipose tissue samples from these sheep (Additional file 2: Table S2), of which 237 were up-regulated and 284 were down-regulated. Several to DEGs associated with fatty acid synthesis, including ADIRF, HSD17B12, LPL, APOBR, INSIG1, THRSP, ACSL5, FAAH, ACSS2, APOA1, ACLY, and ACSM3, were upregulated in LTH sheep, potentially explaining this phenotype.
Analysis of differentially expressed miRNAs and their target genes
Next, we quantified differential miRNA expression patterns in the tails of LTH and STH sheep in terms of reads per million (RPM) values. All miRNAs with an RPM>10 were annotated based upon mature miRNA sequences recorded in miRBase (release 18.0), leading to the identification of 143 mature miRNAs in these six tissue samples (Additional file 3: Table S3). All miRNAs with < 2 nucleotide substitutions outside of the seed region were considered to represent a single miRNA family for the purposes of calculating differential expression. When we assessed tail fat miRNA expression profiles for LTH and STH sheep (Fig. 5B), we detected 132 known miRNAs. Those miRNAs with an average PRM>10 and a fold change between groups of >1 or < -1 were then identified as DEMs. Relative to LTH samples, STH samples exhibited 14 DEMs (6 upregulated and 8 downregulated) (Additional file 4: Table S4). The putative targets of these DEMs were then identified using miRanda program [28], leading to the identification of 2,409 possible target genes (Additional file 5: Table S5). As such, these identified DEMs were associated multiple targets and a range of regulatory modules, with an average of 172 target genes per DEM (range: 14-357). Identified DEM target genes include those associated with carbohydrate metabolism, the formation of anatomical structures, morphogenesis, lipid metabolism, kinase activity, and immune and inflammatory responses. Identified DEM target genes related to lipid metabolism are ACSL4, FTO, FGF8, IGF2, GNPDA2, LIPG, PRKAA2, ELOVL7, SOAT2, and SIRT1.
GOanalysis and functional annotation of DEGs and DEM target genes
A GO analysis of DEGs (RPKM<0.01or |log2Ratio| ≥ 1) was next conducted, with the resultant enriched biological process, cellular component, and molecular function GO terms being summarized in Table S6 (Additional file 6). With respect to biological processes, these genes were highly enriched for genes associated with translation (P=3.29E-05) and lipid metabolic processes (P=9.97e-05). These DEGs were enriched for cellular component GO terms linked to the cytoplasm, extracellular region, extracellular space, and organelles, while enriched molecular function included various structural molecules or functional activities. Overall, the most significantly enriched GO terms included extracellular region (P=8.60E-11), external encapsulating structure (P=4.75E-08), and extracellular space (P= 4.1E-06).
To better understand the functional implications of DEM-mediated the target gene regulation, we focused on DEM target genes that were negatively correlated with corresponding DEMs and subjected these genes to a GO analysis (Additional file 7; Table S7). These target genes were significantly enriched in cell death processes (P=4.3E-09), and were also associated with carbohydrate metabolism, ligase activity, lipid metabolic processes, protein maturation, and response to stress.
KEGG pathway analysis of DEGs and DEM target genes
Next, we conducted a KEGG pathway analysis of DEGs and DEM target genes identified through the above pairwise comparisons, revealing them to cluster primarily into the metabolism, cellular processes, environmental information processing, human diseases, organismal system, and genetic information processing pathways (Additional file 8: Table S8, Additional file 9: Table S9). Many of the identified DEGs were particularly enriched in the metabolism and human diseases pathways, when comparing LTH sheep samples with those from STH sheep, the top significantly enriched pathways included translation (P=3.4E-07), lipid metabolism (P=4.91E-04), infectious diseases (P=7.1E-09), cancers (P=6.0E-06), carbohydrate metabolism (P=8.4E-06), cardiovascular diseases (P=4.2E-07), and xenobiotic biodegradation and metabolism (P=1.6E-07) (Additional file 8: Table S8). Subsequent KEGG enrichment analysis of the 2,409 identified DEM target genes revealed them to be enriched in three metabolism-related pathways (P < 0.05), including biosynthesis of other secondary metabolites, folding, sorting and degradation, cardiovascular diseases, carbohydrate metabolism, and immune diseases (Additional file 9: Table S9).
Integrated analysis of DEGs and DEMs
As detailed above, we selected DEGs and DEMs using either FDR <10% or |log2Ratio| ≥ 1 as cut-off criteria, with miRanda being used to predict DEM target gene [28]. In total, 14 DEMs had sheep orthologues, including 6 that were upregulated (oar-miR-133, oar-miR-150, oar-miR-376c-3p, oar-miR-154a-3p, oar-miR-655-3p, and oar-miR-376a-5p) and 8 that were downregulated (oar-let-7i, oar-miR-221, oar-miR-30a-3p, oar-miR-381-3p, oar-miR-411a-5p, oar-miR-495-3p, oar-miR-543-3p, oar-miR-543-3p, and oar-miR-412-3p). As such, these miRNAs and their putative target were subjected to downstream analysis. In total, these 14 DEMs were associated with 2,409 target genes. We then conducted an integrated analysis of these DEMs, the expression of their target genes, DEGs, and correlations between these expression profiles in sheep tail adipose tissues. We identified 65 total miRNA-mRNA interaction pairs through these analyses DEMs and DEGs (Additional file 10: Table S10).