b/coderpp/clustering/README.md

# Getting started

## System requirements

### Operating system

Ubuntu 18.04.5 LTS

### Software dependencies

See `environment.yml` for the full list of dependencies.

### Hardware

We run it on an NVIDIA A100 or an NVIDIA GeForce RTX 3090.

## Installation guide

1. (Recommended) Use conda-pack to relocate the prebuilt conda environment.
   - Tutorial: https://litingchen16.medium.com/how-to-use-conda-pack-to-relocate-your-condo-environment-622b68e077df
   - Download the environment: https://cloud.tsinghua.edu.cn/f/c4248149f5ae4f388766/
   - Installation time: within 5 minutes
2. Alternatively, create the environment from the YAML file: `conda env create -f environment.yml`

## Instructions for use

1. Put your NER vocabulary file under `./ner_data/`, with one term per line.
2. Run `cd ./utils`, then `bash run.sh`.
3. The result is written to `./result/final_cluster_res.txt`.
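
The one-term-per-line format from step 1 can be produced with a small helper. The following is a minimal sketch, not part of this repository: the function name `write_vocabulary` and the cleaning rules (collapsing whitespace, dropping blank and duplicate terms) are assumptions chosen for illustration.

```python
from pathlib import Path


def write_vocabulary(terms, path):
    """Write terms to `path`, one per line (hypothetical helper, not in the repo)."""
    seen = set()
    cleaned = []
    for term in terms:
        # Collapse internal whitespace so a stray newline cannot split a term.
        term = " ".join(term.split())
        # Skip blank entries and exact duplicates (an assumed cleaning rule).
        if term and term not in seen:
            seen.add(term)
            cleaned.append(term)
    path = Path(path)
    path.parent.mkdir(parents=True, exist_ok=True)
    # One term per line, each terminated by a newline.
    path.write_text("".join(t + "\n" for t in cleaned), encoding="utf-8")
    return cleaned
```

For example, `write_vocabulary(["heart attack", "aspirin"], "./ner_data/data.txt")` would overwrite the demo file at that path. Which file name `run.sh` reads is not stated here, so check the script before choosing a different name.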

## Demo

1. Run on the demo data: a demo vocabulary is provided at `./ner_data/data.txt`. Run it with `cd ./utils`, then `bash run.sh`.
2. Expected output: the same result as in `./result/final_cluster_res.txt`.
3. Expected run time for the demo: 37.438 s
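
Because the demo writes to the same path as the shipped result, one way to check step 2 is to copy the shipped file aside before running and compare afterwards. The sketch below is an illustration only: the helper names and the byte-for-byte comparison are assumptions, and this README does not state that the output is deterministic down to the byte.

```python
import hashlib
from pathlib import Path


def sha256_of(path):
    """Return the SHA-256 hex digest of a file's bytes."""
    return hashlib.sha256(Path(path).read_bytes()).hexdigest()


def same_content(reference, candidate):
    """Return True if both files are byte-for-byte identical (hypothetical helper)."""
    return sha256_of(reference) == sha256_of(candidate)
```

A possible workflow: copy `./result/final_cluster_res.txt` to a reference file of your choosing (for example a hypothetical `reference_cluster_res.txt`), run the demo, then call `same_content("reference_cluster_res.txt", "./result/final_cluster_res.txt")`.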