[c3444c]: / coderpp / test / README.md

Download this file

5 lines (5 with data), 594 Bytes

Steps for testing

bash run.sh or
1. Generate idx2phrase and phrase2idx files for all the phrases we are using for testing. Typically, we are using all cleaned UMLS terms as our train set. python generate_use_data.py --umls_dir ../umls --use_data_dir data
2. Generate faiss index for testing. python generate_faiss_index.py --tokenizer_name GanjinZero/coder_eng_pp --model_name_or_path GanjinZero/coder_eng_pp --save_dir data --phrase2idx_path data/phrase2idx.pkl
3. Testing. python confusion_matrix.py --umls_dir ../umls --output_dir output/ --use_data_dir data/ --title coderpp_test