I summarized sRNA expression for all three E5 species. This includes summary of both raw and normalized counts of all sRNAs, miRNAs, and siRNAs. sRNA, miRNA, and siRNA identification performed using ShortStack 4.1.0.
Rendered code files:
This also included generating count matrices for raw, normalized, and variance stabilized counts of all sRNA, just miRNA, and just siRNA.
count matrices:
As a reminder, normalization and variance stabilization are two different transformations that serve different purposes.
Normalizing the transcript counts adjusts for differences in library size or sequencing depth, but retains count-like properties. Normalized counts are most useful for things like visualizing expression levels and differential expression analysis.
Variance stabilization aims to make the variance of the transformed data approximately independent of the mean, reducing heteroscedasticity (the relationship between variance and mean) and “smoothing” out the variance at low counts. Notably, the transformed data is no longer on the original count scale. The transformation makes the variance roughly constant across the range of counts, which makes it easier to interpret patterns in the data visually. Variance stabilized data is most useful for exploratory data analysis, like PCA, clustering, and heatmaps, and is also the transformation we’ll want to use before WGCNA.
Here’s some of the figures generated in these summary docs
Total transcripts (raw)
All sRNA:
miRNA:
siRNA:
Total transcripts (normalized)
All sRNA:
miRNA:
siRNA:
Unique transcripts
All sRNA:
Little variation within a species, but quite a large difference among species. Apul specimens expressed ~19,000 unique sRNA transcripts, Peve expressed ~15,700, and Ptuh specimens expressed dramatically fewer at ~6700 unique sRNA transcripts.
miRNA:
Despite the differences in total unique sRNA transcripts, all three species have similar numbers of unique miRNAs (though, note our deep-dive exploration showed there is little miRNA overlap among species).
siRNA:
There is again a more noticable difference in numbers of unique siRNAs. Apul have ~120, Peve ~155, and Ptuh ~90.