Example 1: Genomics
Pharmaceutical companies are interested in determining how genes influence each other. A dataset may consist of jointly recorded activity levels for pairs of genes, measured across a set of patients.
Determining which gene influences which other gene is a costly experimental process, so generally only a subset of gene pairs is labeled with the ground-truth causal relationship.
The labeled dataset is an empirical sample from the “mother distribution” over pairs of genes and their causal labels. It can be used as training data for a classifier that predicts the unknown causal labels of new pairs of genes.
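The pipeline described above (labeled gene pairs, then a classifier, then predictions for new pairs) can be sketched roughly as follows. This is only an illustration under strong assumptions: the data are synthetic, the label convention (1 when the first gene drives the second, 0 for the reverse), the kurtosis features, and the nearest-centroid classifier are all hypothetical choices made for brevity, and none of the function names come from the source.

```python
import random
import statistics


def featurize(sample):
    """Map one gene pair's sample [(x, y), ...] to a fixed-length feature vector.

    Assumption: the kurtosis of each gene's activity is used as a toy feature.
    """
    def kurtosis(values):
        mean = statistics.fmean(values)
        std = statistics.pstdev(values)
        if std == 0:
            return 0.0
        return statistics.fmean([((v - mean) / std) ** 4 for v in values])

    xs = [x for x, _ in sample]
    ys = [y for _, y in sample]
    return [kurtosis(xs), kurtosis(ys)]


def make_pair(rng, x_causes_y, n_patients=200):
    """Generate synthetic activities for one gene pair (hypothetical mechanism)."""
    cause = [rng.uniform(-1, 1) for _ in range(n_patients)]
    effect = [c ** 3 + rng.gauss(0, 0.05) for c in cause]
    return list(zip(cause, effect)) if x_causes_y else list(zip(effect, cause))


def fit_centroids(features, labels):
    """Nearest-centroid 'training': average the feature vectors of each label."""
    centroids = {}
    for label in set(labels):
        rows = [f for f, lab in zip(features, labels) if lab == label]
        centroids[label] = [statistics.fmean(col) for col in zip(*rows)]
    return centroids


def predict(centroids, feature):
    """Return the label whose centroid is closest in squared Euclidean distance."""
    return min(
        centroids,
        key=lambda lab: sum((a - b) ** 2 for a, b in zip(feature, centroids[lab])),
    )


rng = random.Random(0)

# Labeled subset: 40 gene pairs with known causal direction.
labels = [i % 2 for i in range(40)]
train = [make_pair(rng, bool(lab)) for lab in labels]
centroids = fit_centroids([featurize(s) for s in train], labels)

# A new, unlabeled gene pair: predict its causal label.
new_pair = make_pair(rng, True)
predicted = predict(centroids, featurize(new_pair))
print(predicted)
```

The key design point is that each training example is an entire sample of paired measurements rather than a single row, so every gene pair is first summarized as a fixed-length vector before any standard classifier is applied.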
Updated 2020-07-14
Tags
Data Science