Last updated: 2021-12-27

Code version: 07560b56369297666d28cbf860e87b0058253e51

Project Introduction

This this the project website for Blake, Roux, et al., “A comparison of gene expression and DNA methylation patterns across tissues and species”. Many of the main conclusions from each section are quoted directly from the publication. A link to the Github repo can be found here, although the code is much more readable using the links below.

Sample Information and Record of Technical Variables

RNA-seq Data

Pre-processing

Normalization

Testing the relationship between a large number of technical factors recorded throughout our experiments and the biological variables of interest in the study

Final filtering, normalizations, and analysis with technical factors (12,184 genes)

  • Processing using filtering with lowly expressed genes.
  • Conclusion from this section: I estimated gene expression levels using an approach designed to prevent biases driven by sequence divergence across species and excluded data from genes that were lowly expressed in more than half of the samples (as well as data from one human heart sample that was an obvious outlier, probably owing to a sample swap). I normalized the distribution of gene expression levels to remove systematic expression differences between species (maximizing the number of genes with invariant expression levels across species corresponds to our null hypothesis).

Univariate analysis on gene expression levels

Methylation Data

Differentially methylated regions (DMRs)

Identifying DMRs was computationally intensive using the BSmooth package. Therefore, tissue and species DMRs were identified using the orthologous CpGs on a HPC. The output of each file was either (1) a list of sDMRs where the same tissue was analyzed across species or (2) a list of tDMRs where different tissues were analyzed within a species.

Integration of gene expression levels and methylation levels- interspecies DE genes

What percentage of interspecies gene expression level differences can be explained by variation in DNA methylation levels?

Integration of gene expression levels and methylation levels- intertissue DE genes

Within a species, what percentage of intertissue gene expression level differences can be explained by variation in DNA methylation levels?

Integration of gene expression levels and methylation levels- conserved versus non-conserved intertissue DE genes

What percentage of conserved intertissue gene expression level differences can be explained by variation in DNA methylation levels? How does this percentage compare to the percentage of non-conserved intertissue gene expression level differences that can be explained by variation in DNA methylation levels?

Additional figure/table generation

Revisions

Many thanks to the editor and 3 reviewers who provided helpful comments on this manuscript.