This project is designed to match patient data with clinical trials based on eligibility criteria. It leverages web scraping techniques to gather trial information and processes patient electronic health records (EHR) to determine eligibility.
- Features
- Installation
- Usage
1. Clone the repository
2. Set up a virtual environment: `source env/bin/activate`
3. Install required packages: `pip install -r requirements.txt`
4. Set up WebDriver
- Patient data is in the `data/processed/patients` directory. (For the scope of the project, I have created another directory, `patients_small`, with a smaller dataset for demonstrating the results.)
- Scraped trial data is in the `data/raw/scraped` directory. (For the scope of the project, I have created another directory, `scraped_small`, for demonstrating the results.)
- Run `master.py` and pass the arguments `--scrape`, `--preprocess`, `--tests`, `--model` to run the scraping, preprocessing, unit test and AI model scripts.
- See the `notebooks` directory for the project notebooks.
- Set `spreadsheet_id`, `token_spreadsheet` and `openaiapi` in the `.env` file.
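
To make the command-line flags and `.env` settings above more concrete, here is a minimal sketch of how such an entry point could parse its flags and check its settings. It is not the actual `master.py`: the helper names `build_parser`, `selected_steps` and `missing_settings` are hypothetical, and it assumes the four flags are simple on/off switches and that the three settings are exposed under exactly the names listed above.

```python
import argparse

# Flag names are taken from the README; the help texts are paraphrased.
# Assumption: each flag is a boolean switch that enables one pipeline step.
STEPS = {
    "scrape": "scrape clinical trial information",
    "preprocess": "preprocess the patient and trial data",
    "tests": "run the unit tests",
    "model": "run the AI model scripts",
}

# Setting names are taken from the README; treating them as plain
# key/value pairs is an assumption of this sketch.
REQUIRED_SETTINGS = ("spreadsheet_id", "token_spreadsheet", "openaiapi")


def build_parser():
    """Build a parser with one on/off flag per pipeline step (hypothetical helper)."""
    parser = argparse.ArgumentParser(prog="master.py")
    for name, help_text in STEPS.items():
        parser.add_argument(f"--{name}", action="store_true", help=help_text)
    return parser


def selected_steps(argv):
    """Return the requested step names, in the order they are declared in STEPS."""
    args = build_parser().parse_args(argv)
    return [name for name in STEPS if getattr(args, name)]


def missing_settings(env):
    """Return the required setting names that are absent or empty in `env`."""
    return [key for key in REQUIRED_SETTINGS if not env.get(key)]


# Example: request scraping and the model step, with an incomplete configuration.
print(selected_steps(["--scrape", "--model"]))
print(missing_settings({"spreadsheet_id": "abc"}))
```

In a real script, `env` could be `os.environ` once the `.env` file has been loaded, and each selected step would be dispatched to the corresponding module.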