-
Notifications
You must be signed in to change notification settings - Fork 271
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #135 from greenelab/more-intro-work
Fill in more of the introduction
- Loading branch information
Showing
2 changed files
with
153 additions
and
73 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,97 +1,158 @@ | ||
## Introduction | ||
|
||
### Potential writing prompt | ||
Biology and medicine are rapidly becoming particularly data-intensive field with | ||
respect to both research and practice. A recent comparison of genomics with | ||
social media, online videos and other data-intensive scientific disciplines | ||
suggested that the field of genomics alone would equal or surpass other fields | ||
in data generation and analysis within the next decade | ||
[@doi:10.1371/journal.pbio.1002195]. These data present new opportunities, but | ||
also new challenges. We expect that algorithms to automatically extract | ||
meaningful patterns and provide actionable knowledge allowing us to better | ||
treat, categorize, or study disease will be required. | ||
|
||
One potential future that we could imagine is a world in which data, once | ||
gathered, is rapidly integrated into our understanding of living systems. What | ||
we learn is rapidly put to use. Our health-related activities are constantly | ||
monitored (e.g. by wearables) and all of our interactions with health care | ||
systems are extensively tracked. These sources of information are combined to | ||
help to guide our health care and maintenance. We'd be able to compare our state | ||
and trajectory to (anonymized) others, and identify means to improve our health. | ||
These means might contain drug combinations selected based on personalized | ||
predictions. | ||
Concurrent with this explosive growth in biomedical data, a new class of machine | ||
learning algorithm has become widespread in the domain of image analysis. | ||
Computer scientists are now building many-layered neural networks from | ||
collections of millions of images. In a famous example, scientists from Google | ||
demonstrated that a neural network could learn to identify cats simply by | ||
watching online videos | ||
[@url:http://research.google.com/archive/unsupervised_icml2012.html]. Such | ||
approaches, termed deep learning, seem like a solution to the challenge | ||
presented by the growth of data in biomedicine. Perhaps these algorithms could | ||
identify the "cats" hidden in our data - the patterns that exist but that we | ||
don't know to look for - and could act on them. | ||
|
||
### If this happens, is deep learning required for any of it? Are we any closer | ||
### because of the advent of deep learning? | ||
Deep learning has transformed image analysis, but researchers' initial forays | ||
into the use of these techniques in biomedicine have been less conclusive. | ||
There are certainly numerous promising examples, but we have not yet seen the | ||
massive convergence on these approaches that occurred in the field of image | ||
analysis. In this review, we discuss whether this is simply a matter of time or | ||
if there are unique challenges posed by biomedical data that render deep | ||
learning methods more challenging or less fruitful to apply. | ||
|
||
*"Categorize" and "treat" sound a bit like PMI goals. Another way to think about | ||
this would be: do we think that deep learning will make much of a difference | ||
for the precision medicine initiative (PMI)?* | ||
### What is deep learning? | ||
|
||
### General deep learning introduction | ||
Deep learning is built on a biologically-inspired approach from machine learning | ||
termed neural networks. Each neuron in a computational neural network, termed a | ||
node, has inputs, an activation function, and outputs. Each value from the | ||
inputs is usually multiplied by some weight and combined and summarized by the | ||
activation function. The value of the activation function is then multiplied by | ||
another set of weights to produce the output `TODO: we probably need a figure | ||
here - I see no way that we don't include this type of description in our paper, | ||
despite the fact that it's been done tons of times before. Corresponded with the | ||
author of the neural network zoo on twitter. He says that we should feel free to | ||
grab what we need. He does not intend to draw more and he does not have vector | ||
forms. He would like us to link to the original blog post.` These neural | ||
networks are trained by identifying weights that produce a desired output from | ||
some specific input. | ||
|
||
*Definitions, specific architectures, etc. We may want to clarify what we mean | ||
by "deep" learning when most methods use few hidden layers.* | ||
Neural networks can also be stacked. The outputs from one can be used as inputs | ||
to another. This process produces a stacked or multi-layer neural network. The | ||
multi-layer neural network techniques that underlie deep learning have a long | ||
history. Multi-layer methods have been discussed in the literature for more than | ||
five decades [@doi:10.1103/RevModPhys.34.135]. Given this context, it's | ||
challenging to consider "deep learning" as a new advance, though the term has | ||
only become widespread to describe analysis methods in the last decade. Much of | ||
the early history of neural networks has been extensively covered in a recent | ||
review [@doi:10.1016/j.neunet.2014.09.003]. For the purposes of this review, we | ||
identify deep learning approaches as those that use multi-layer neural networks | ||
to construct complex features from large-scale datasets. | ||
|
||
### Types of biological problems | ||
We also identify a class of algorithms that we term "shallow learning" | ||
approaches. We do not use this as a pejorative term, but instead to denote | ||
algorithms which have all of the hallmarks of deep approaches except that they | ||
employ networks of limited depth. We found it valuable to include these as we | ||
sought to identify the current contributions of deep learning and to predict its | ||
future impact. Researchers may employ these shallow learning methods for a | ||
number of reasons including: 1) shallow networks provide a degree of | ||
interpretability that better matches their use case; 2) the available data are | ||
insufficient to support deeper architectures, however new datasets that will | ||
support deep methods are expected; or 3) as building blocks to be combined with | ||
other non-neural-network-based approaches at subsequent stages. | ||
|
||
In this review, we are interested in the application of deep learning to | ||
topics of biomedical importance. This covers a large range of topics, which | ||
we divide into three broad classes based on their applied areas. We then | ||
briefly introduce the types of questions, approaches and data which are | ||
typical for each class in the application of deep learning. | ||
### Will deep learning transform the study of human disease? | ||
|
||
With this review, we specifically set out to address the question: "what would | ||
need to be true for deep learning to transform how we categorize, study, and | ||
treat individuals to maintain or restore health?." We set a high bar for the | ||
term "transform." Specifically we sought to identify whether deep learning was a | ||
disruptive innovation that would induce a strategic inflection point on the | ||
practice of biology or medicine. There are numerous examples where deep learning | ||
has been applied to biological problems and produced somewhat improved results, | ||
and there are numerous reviews that have focused on general applications of deep | ||
learning in biology [@doi:10.1038/nbt.3313 @doi:10.1002/minf.201501008 | ||
@doi:10.3109/10409238.2015.1135868 @doi:10.1021/acs.molpharmaceut.5b00982 | ||
@doi:10.15252/msb.20156651 @doi:10.1093/bib/bbw068]. We sought cases where deep | ||
learning was enabling researchers to solve challenges that were previously | ||
considered infeasible, or if it made difficult, tedious, and non-routine | ||
analyses routine. | ||
|
||
Based on our guiding question, we focused on the application of deep learning to | ||
topics of biomedical importance. This covers a large range of topics. We divided | ||
these into three broad classes based on their applied areas. We briefly | ||
introduce the types of questions, approaches and data which are typical for each | ||
class in the application of deep learning. | ||
|
||
#### Disease and Patient Categorization | ||
|
||
One important topic in the biomedical field is the accurate classification of | ||
diseases and disease subtypes. In the oncology field, current "gold standard" | ||
approaches are limited to either histological approaches, requiring manual | ||
human expertise, or shallow molecular markers, such as the cell surface | ||
receptors or small panels of genes. One example is the current PAM50 approach | ||
in classifying breast cancer, which utilizes the expression of 50 marker | ||
genes in order to divide breast cancer patients into four subtypes. | ||
Significant heterogeneity still remains within these four subtypes | ||
[@doi:10.1200/JCO.2008.18.1370 @doi:10.1158/1078-0432.CCR-13-0583]. Given the | ||
increasing wealth of molecular data available, it seems that a more | ||
One important topic in the biomedical field is the accurate classification of | ||
diseases and disease subtypes. In the oncology field, current "gold standard" | ||
approaches are limited to either histological approaches, requiring manual | ||
human expertise, or shallow molecular markers, such as the cell surface | ||
receptors or small panels of genes. One example is the current PAM50 approach | ||
in classifying breast cancer, which utilizes the expression of 50 marker | ||
genes in order to divide breast cancer patients into four subtypes. | ||
Significant heterogeneity still remains within these four subtypes | ||
[@doi:10.1200/JCO.2008.18.1370 @doi:10.1158/1078-0432.CCR-13-0583]. Given the | ||
increasing wealth of molecular data available, it seems that a more | ||
comprehensive subtyping is possible. | ||
|
||
Several studies have used deep learning methods in order to better categorize | ||
breast cancer patients. For example, Tan et al. applied denoising | ||
autoencoders (DA), an unsupervised approach, in order to cluster breast | ||
cancer patients [@doi:10.1142/9789814644730_0014]. Ciresan et al. utilized | ||
convolutional neural networks (CNN) to count mitotic divisions in | ||
histological images; a feature which is highly correlated with disease | ||
outcome [@doi:10.1007/978-3-642-40763-5_51]. Despite these recent advances, a | ||
number of challenges exist in this area of research, such as the integration | ||
of disparate types of data, including electronic health records (EHR), | ||
Several studies have used deep learning methods in order to better categorize | ||
breast cancer patients. For example, Tan et al. applied denoising | ||
autoencoders (DA), an unsupervised approach, in order to cluster breast | ||
cancer patients [@doi:10.1142/9789814644730_0014]. Ciresan et al. utilized | ||
convolutional neural networks (CNN) to count mitotic divisions in | ||
histological images; a feature which is highly correlated with disease | ||
outcome [@doi:10.1007/978-3-642-40763-5_51]. Despite these recent advances, a | ||
number of challenges exist in this area of research, such as the integration | ||
of disparate types of data, including electronic health records (EHR), | ||
imaging and histology data and molecular omics data. | ||
|
||
#### Fundamental Biological Study | ||
|
||
Broadly speaking, topics in this class aim to answer more fundamental | ||
biological questions. Deep learning is especially suited in leveraging the | ||
large amounts of data from high throughput omics studies. The development of | ||
deep learning techniques and complex network architectures allow researchers | ||
Broadly speaking, topics in this class aim to answer more fundamental | ||
biological questions. Deep learning is especially suited in leveraging the | ||
large amounts of data from high throughput omics studies. The development of | ||
deep learning techniques and complex network architectures allow researchers | ||
to answer fundamental biological questions with unprecedented accuracy. | ||
`TODO: revisit "unprecedented accuracy" once we set the overall tone`. One | ||
classic biological problem where machine learning has been extensively | ||
applied is the prediction of molecular targets. Recent advances using deep | ||
learning have shown higher accuracy in determining molecular targets. For | ||
example, Lee et al. used deep recurrent neural networks (RNN) to predict gene | ||
targets of micro-RNAs [@doi:10.1109/icnn.1994.374637]. Wang et al. used a | ||
residual CNN to predict protein-protein contact on a genome-wide scale | ||
[@doi:10.1101/073239]. Other biological questions that have been investigated | ||
include the prediction of protein secondary structure based on sequence data | ||
[@doi:10.1109/tcbb.2014.2343960 @doi:10.1038/srep18962 @doi:10.1038/srep18 | ||
962], recognition of functional genomic elements such as enhancers and | ||
promoters [@doi:10.1101/036129 @doi:10.1007/978-3-319-16706-0_20 | ||
@doi:10.1093/nar/gk u1058], predicting the deleterious effects of nucleotide | ||
`TODO: revisit "unprecedented accuracy" once we set the overall tone`. One | ||
classic biological problem where machine learning has been extensively | ||
applied is the prediction of molecular targets. Recent advances using deep | ||
learning have shown higher accuracy in determining molecular targets. For | ||
example, Lee et al. used deep recurrent neural networks (RNN) to predict gene | ||
targets of micro-RNAs [@doi:10.1109/icnn.1994.374637]. Wang et al. used a | ||
residual CNN to predict protein-protein contact on a genome-wide scale | ||
[@doi:10.1101/073239]. Other biological questions that have been investigated | ||
include the prediction of protein secondary structure based on sequence data | ||
[@doi:10.1109/tcbb.2014.2343960 @doi:10.1038/srep18962 @doi:10.1038/srep18 | ||
962], recognition of functional genomic elements such as enhancers and | ||
promoters [@doi:10.1101/036129 @doi:10.1007/978-3-319-16706-0_20 | ||
@doi:10.1093/nar/gk u1058], predicting the deleterious effects of nucleotide | ||
polymorphisms [@doi:10.1093/bioinformatics/btu703], etc. | ||
|
||
#### Patient Treatment | ||
|
||
Studies in this category aim to recommend patient treatment, predict | ||
treatment outcome, or guide future development of new therapies.. | ||
Specifically, effort in this area aims to identify drug targets, identify | ||
drug interactions or predict drug response. One recent approach for | ||
predicting drug response is the use of protein structure to predict drug | ||
interactions and drug bioactivity through CNN [@arxiv:1510.02855]. Since CNNs | ||
leverage spatial relationships within the data, this particular deep learning | ||
framework is well suited to the problem. Drug discovery and drug | ||
"repurposing" are two other hot topics. Aliper et al. used transcriptomic | ||
data to predict which drugs might be repurposed for other diseases through | ||
deep fully connected neural networks | ||
[@doi:10.1021/acs.molpharmaceut.6b00248]. In a similar vein, Wang et al. used | ||
restricted boltzman machines (RBM) to predict drug molecular targets | ||
Studies in this category aim to recommend patient treatment, predict | ||
treatment outcome, or guide future development of new therapies.. | ||
Specifically, effort in this area aims to identify drug targets, identify | ||
drug interactions or predict drug response. One recent approach for | ||
predicting drug response is the use of protein structure to predict drug | ||
interactions and drug bioactivity through CNN [@arxiv:1510.02855]. Since CNNs | ||
leverage spatial relationships within the data, this particular deep learning | ||
framework is well suited to the problem. Drug discovery and drug | ||
"repurposing" are two other hot topics. Aliper et al. used transcriptomic | ||
data to predict which drugs might be repurposed for other diseases through | ||
deep fully connected neural networks | ||
[@doi:10.1021/acs.molpharmaceut.6b00248]. In a similar vein, Wang et al. used | ||
restricted boltzman machines (RBM) to predict drug molecular targets | ||
[@doi:10.1093/bioinformatics/btt234]. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,22 @@ | ||
## Conclusions | ||
|
||
*Final thoughts and future outlook here* | ||
|
||
### Author contributions | ||
|
||
`TODO: not sure if it should go here, but somewhere we should talk about how we | ||
wrote this thing, since it is still somewhat unconventional to have a review | ||
written in this manner.` We recognized that writing a review on a rapidly | ||
developing area in a manner that allowed us to provide a forward-looking | ||
perspective on diverse approaches and biological problems would require | ||
expertise from across computational biology and medicine. We created an open | ||
repository on the GitHub version control system and engaged with numerous | ||
authors from papers within and outside of the area. Paper review was conducted | ||
in the open by `#` individuals, and the manuscript was drafted in a series of | ||
commits from `#` authors. Individuals who met the ICJME standards of authorship | ||
are included as authors. These were individuals who contributed to the review of | ||
the literature; drafted the manuscript or provided substantial critical | ||
revisions; approved the final manuscript draft; and agreed to be accountable in | ||
all aspects of the work. Individuals who did not contribute in one or more of | ||
these ways, but who did participate, are acknowledged at the end of the | ||
manuscript. |