From 58d890d9146f89e69a62cb84ebfe7fa06ac04c05 Mon Sep 17 00:00:00 2001 From: Anthony Gitter Date: Sat, 3 Aug 2019 08:05:06 -0500 Subject: [PATCH] Proofread methylation intro section --- content/04.study.md | 27 +++++++++++++++------------ 1 file changed, 15 insertions(+), 12 deletions(-) diff --git a/content/04.study.md b/content/04.study.md index 35b5c319..769f7e33 100644 --- a/content/04.study.md +++ b/content/04.study.md @@ -49,24 +49,27 @@ For example, the effects of cellular heterogeneity on basic biology and disease ### DNA methylation -DNA methylation, is the process of adding a methyl group to a cytosine in the context of a CpG dinucleotide. -This DNA-level epigenetic modification regulates gene transcription, is critical in development, and alterations to DNA methylation are well-established as contributing to pathophysiology of many diseases including cancers [@tag:Robertson2005],[@tag:Feinberg2018]. -Studies of DNA methylation have demonstrated the fundamental role of DNA methylation in cell lineage specification starting with stem cell differentiation [@tag:Meissner2008], [@tag:Nazor2012], as well as a strong relationship of DNA methylation with aging phenotypes [@tag:Kwabi-Addo2007], [@tag:Fraga2005], and pathogenesis in response to environmental exposures [@tag:Christensen2009], [@tag:Relton2010]. - -Traditional analytic approaches to DNA methylation data often focus on estimating differential DNA methylation between groups or related with an outcome using linear mixed effects models, so-called epigenome-wide association studies (EWAS) [@tag:Laird2010], [@tag:Wilhelm-Benartzi2013], [@tag:Liu2013], [@tag:Teschendorff2017]. -In addition, a growing application of DNA methylation measures is to infer cellular or subject phenotypes from samples and either examine the relation of these phenotypes with outcomes or disease states directly, and/or include them in models as covariates [@tag:Titus2017], [@tag:Salas2018_GR], [@tag:Zhang2019], [@tag:Horvath2014], [@tag:Quach2017]. -For example, inference of subject age using DNA methylation clock approaches are established [@tag:Horvath2013], and are starting to be applied to test the relation of biological age with disease risk and outcomes [@tag:Kresovich2019]. +DNA methylation is the process of adding a methyl group to a cytosine in the context of a CpG dinucleotide. +This DNA-level epigenetic modification regulates gene transcription and is critical in development. +Alterations to DNA methylation are well-established as contributing to pathophysiology of many diseases including cancers [@tag:Robertson2005; @tag:Feinberg2018]. +Studies of DNA methylation have demonstrated its fundamental role in cell lineage specification starting with stem cell differentiation [@tag:Meissner2008; @tag:Nazor2012] as well as a strong relationship with aging phenotypes [@tag:Kwabi-Addo2007; @tag:Fraga2005] and pathogenesis in response to environmental exposures [@tag:Christensen2009; @tag:Relton2010]. + +Traditional analytic approaches to DNA methylation data often focus on estimating differential DNA methylation between groups or related with an outcome using linear mixed effects models, so-called epigenome-wide association studies [@tag:Laird2010; @tag:Wilhelm-Benartzi2013; @tag:Liu2013; @tag:Teschendorff2017]. +In addition, a growing application of DNA methylation measures is to infer cellular or subject phenotypes from samples and either examine the relation of these phenotypes with outcomes or disease states directly or include them in models as covariates [@tag:Titus2017; @tag:Salas2018_GR; @tag:Zhang2019; @tag:Horvath2014; @tag:Quach2017]. +For example, inference of subject age using DNA methylation clock approaches are established [@tag:Horvath2013] and are starting to be applied to test the relation of biological age with disease risk and outcomes [@tag:Kresovich2019]. Different cell types have different DNA methylation profiles. -A novel approach to immunophenotyping combines measurements with Reference DNA methylation profiles of leukocytes, which are available, to infer immune cell type proportions [@tag:Houseman2012],[@tag:Salas2018]. +A novel approach to immunophenotyping combines measurements with reference DNA methylation profiles of leukocytes to infer immune cell type proportions [@tag:Houseman2012; @tag:Salas2018]. This strategy is particularly helpful when only DNA is available from a sample. Cell type inference is important for adjusting for cell-type composition in epigenome-wide association studies [@tag:Teschendorff2017]. -While reference-based libraries have strong predictive value for immune cell type estimation and has broad utility, cell composition mixtures per se, and methods to incorporate estimates of mixtures pose important considerations on the interpretation of underlying biology associated with disease manifestations and phenotypes. -When a reference library is not available for use, methods that do not rely on these reference libraries, so called reference-free deconvolution [@tag:Houseman2016], are available to decompose signal purported to be contributed by cell types. -However, using reference-free cell type proportion estimates as potential confounders in adjusted models can be overly conservative as as outcome-associated variation in DNA methylation may be decomposed into putative cell type estimates. +While reference-based libraries have strong predictive value for immune cell type estimation and have broad utility, methods to incorporate estimates of mixtures pose important considerations on the interpretation of underlying biology associated with disease manifestations and phenotypes. +When a reference library is not available, reference-free deconvolution methods [@tag:Houseman2016] that do not rely on these reference libraries are available to decompose signal purported to be contributed by cell types. +However, using reference-free cell type proportion estimates as potential confounders in adjusted models can be overly conservative. +Outcome-associated variation in DNA methylation may be decomposed into putative cell type estimates. Additional validated reference-based libraries for other tissue types, advancements in reference-free deconvolution methods, and application of deep learning methods are expected to provide new opportunities to understand and interpret DNA methylation in human health and disease. Deep learning approaches have numerous potential applications for DNA methylation data. -Imputation methods that capture complex interactions between different regions of DNA can expand the number of CpG sites whose DNA methylation state can be studied, and ideally these methods can derive their own informative, biologically-relevant features. +Imputation methods that capture complex interactions between different regions of DNA can expand the number of CpG sites whose DNA methylation state can be studied. +Ideally these methods can derive their own informative, biologically-relevant features. The primary deep learning methods developed to date focus on: 1) estimating regions of methylation status and imputing missing methylation values, 2) performing classification and regression tasks, and 3) using the latent embeddings of methylation states to derive biologically meaningful features, infer interpolated disease states, and uncover CpG sites that aid the above prediction tasks. #### Inference, imputation, and prediction