Diff of /README.md [000000] .. [6ac965]

Switch to unified view

a b/README.md
1
2
# Learning Personalized Treatment Decisions in Precision Medicine
3
4
This repository accompanies the paper [*Learning Personalized Treatment Decisions in Precision Medicine: Disentangling Treatment Assignment Bias in Counterfactual Outcome Prediction and Biomarker Identification*](https://arxiv.org/abs/2410.00509), accepted at the ML4H 2024 conference. It provides code for replicating experiments, analyzing treatment assignment biases, and exploring counterfactual outcome prediction and biomarker identification. 
5
6
## Overview
7
8
Precision medicine offers the potential to tailor treatment strategies to individual patients based on their unique characteristics. However, leveraging clinical observational data for personalized treatment decisions poses significant challenges due to inherent biases and the high-dimensional nature of biological data.
9
10
This study aims to model various types of treatment assignment biases, using mutual information to quantify their impacts on machine learning models for counterfactual prediction and biomarker identification. Unlike standard counterfactual benchmarks that rely on fixed treatment policies, this work explores the characteristics of observational treatment policies across different clinical settings. This approach helps in understanding how different biases affect model performance in predicting outcomes and identifying biomarkers.
11
12
### Contributions
13
- **Formalization and Quantification**: We formalize and quantify different types of treatment assignment biases induced by observational treatment policies and explain their relationships with clinical settings and biomarker types.
14
- **Simulation and Analysis**: We systematically simulate different types of treatment selection policies and analyze their impact on the performance of various state-of-the-art counterfactual ML models using toy, semi-synthetic, and real-world outcomes.
15
- **Novel Evaluation Approach**: We propose using in-vitro experiments for counterfactual evaluation, providing the community with a realistic evaluation approach characterized by empirical outcomes and multi-modal biological covariates.
16
- **Insights on Bias and Model Performance**: Our findings show that the type of bias significantly influences model performance. Importantly, the violation of the overlap assumption does not always harm prediction accuracy, highlighting the need for nuanced approaches in high-dimensional medical data.
17
- **Model Differentiation**: The results indicate that models respond differently to various biases, providing critical insights for developing new methodologies and algorithms tailored to specific clinical settings.
18
19
## Setup and Installation
20
If you have any questions about the implementation or content of the paper, feel free to reach out to me (michavol@ethz.ch).
21
Ensure you are using Python 3.10.12. Follow these steps to set up the environment:
22
23
```bash
24
# Create and activate a virtual environment
25
python3 -m venv .venv
26
source .venv/bin/activate
27
28
# Install dependencies
29
pip install -r requirements.txt
30
```
31
32
## Configuration
33
34
Update the `config_TY_tcga.yaml` and `config_TY_toy.yaml` files with the appropriate path to the repository code:
35
36
```yaml
37
repo_path = PATH_TO_REPO
38
```
39
40
## Running the Experiments
41
42
Navigate to the submission code folder and run:
43
44
```bash
45
python 01_run_simulation_experiment.py -cn=config_AY_tcga
46
python 01_run_simulation_experiment.py -cn=config_AY_toy
47
```
48
49
Modify parameters like `model_names` and `seeds` in the configuration files if needed. The results can be found in the `results` directory specified in `config_TY_tcga.yaml` or `config_TY_toy.yaml`.
50
51
### Optional Settings
52
Disable feature explainability evaluation by setting:
53
  ```yaml
54
  evaluate_explanations: false
55
  evaluate_prog_explanations: false
56
  ```
57
58
## Generating Plots
59
60
After running the experiments, generate the results plots with:
61
62
```bash
63
python 02_plot_results.py
64
```
65
66
## Citing This Work
67
68
Please reference this repository and the associated paper when using or extending this code:
69
```
70
@misc{vollenweider2024learningpersonalizedtreatmentdecisions,
71
      title={Learning Personalized Treatment Decisions in Precision Medicine: Disentangling Treatment Assignment Bias in Counterfactual Outcome Prediction and Biomarker Identification}, 
72
      author={Michael Vollenweider and Manuel Schürch and Chiara Rohrer and Gabriele Gut and Michael Krauthammer and Andreas Wicki},
73
      year={2024},
74
      eprint={2410.00509},
75
      archivePrefix={arXiv},
76
      primaryClass={cs.LG},
77
      url={https://arxiv.org/abs/2410.00509}, 
78
}
79
```
80
## Acknowledgments
81
82
Parts of the code in this repository are adapted from Alicia Curth's implementations of neural treatment effect models and the `EconML` library for the linear lasso models. Refer to their repositories for additional details:
83
- [Alicia Curth's code](https://github.com/AliciaCurth/CATENets)
84
- [EconML library](https://github.com/py-why/EconML)