Search
About
Datasets
Models
Log In
Sign Up
Log In
Sign Up
Models
:
philipB
/
EHRKit-2022
Like
0
Downloads: 1
Card
Git
Discussions
[2d4573]
:
/
ehrkit
/ README.md
History
Download this file
76 lines (72 with data), 2.6 kB
EHRKit Documentation
Directory Structure
EHRKit
ehrkit
ehrkit.py
solr_lib.py
classes.py
external
icd9
status: incorporated but doesn't seem useful
patient2vec
status: not yet incorporated
could be useful:
phrase-at-scale
status: incorporated, but only a crude version
problem: does not return the phrases but outputs it to files
scripts
helper scripts for handling data
see README_scripts for more descriptions
tests
see README_tests for more descriptions
ehrkit.py functions
helper functions:
def start_session(db_user, db_pass): -> dict
:
def createPatient(data):
def flatten(lst):
def numbered_print(lst):
def init_embedding_model():
def get_abbs_sent_ids(text):
def post_single_dict_to_solr(d: dict, core: str)-> None
def abbs_disambiguate(ABB):
def get_documents_solr(query):
class ehr_db
attributes:
cnx
-- MySQL connection object
cur
-- MySQL cursor
patients = {}
-- Patient (from classes.py) dictionary
methods:
def close_session(self):
def get_patients(self, n):
def count_patients(self):
def count_gender(self, gender):
def count_docs(self, query, getAll = False, inverted = False):
def get_note_events(self):
def longest_NE(self):
def get_document(self, id):
def get_all_patient_document_ids(self, patientID):
def list_all_patient_ids(self):
def list_all_document_ids(self):
def get_document_sents(self, docID):
def get_abbreviations(self, doc_id):
def get_abbreviation_sent_ids(self, doc_id):
def get_documents_d(self, date):
def get_documents_q(self, query, n = -1):
def get_documents_icd9_alt(self,query):
def get_documents_icd9(self,code):
def get_prescription(self):
def extract_key_words(self, text):
def count_all_prescriptions(self):
def get_diagnoses(self):
def get_procedures(self):
def extract_patient_words(self, patientID):
def output_note_events_file_by_patients(self, directory):
def output_note_events_discharge_summary(self, directory):
yet to refactor:
def extract_key_words(self, text):
def get_abbreviations(self, doc_id):
def get_abbreviation_sent_ids(self, doc_id):
unfinished:
def docs_with_phrase(self, phrase):
def outputAbbreviation(self, directory):
def extract_phrases(self, docID):