Notes - Trial Outcome Prediction

Documenting research progress through the semester.

Week 1-2

Reading papers
- HINT (SOTA model): HINT: Hierarchical interaction network for clinical trial-outcome predictions
- PyTrial (Benchmark tool): PyTrial: Machine Learning Software and Benchmark for Clinical Trial Applications
- AnyPredict / MediTab: MediTab: Scaling Medical Tabular Data Predictors via Data Consolidation, Enrichment, and Refinement - The paper for PyTrial claimed that this model outperformed HINT. Their model is not available, thus the results cannot be verified or reproduced. The described approach does not seem very promising either.
- Factors of clinical trial outcomes: Factors Affecting Success of New Drug Clinical Trials
Implementing GPT Baseline
Evaluated GPT-4 and GPT-3.5 on the Trial Outcome Prediction datasets with various prompting strategies, including Chain of Thought, and breaking down the challenge into subtasks. When tested for binary classification with 30-50 uniformly distributed samples, the models showed very poor performance.

Testing HINT
Reproducing HINT's results on a virtual machine and analyzing which features could be improved.