|
a |
|
b/tests/readme.md |
|
|
1 |
# Testing the algorithm |
|
|
2 |
|
|
|
3 |
Various tests for the components of the spaCy pipeline. |
|
|
4 |
|
|
|
5 |
We decided to design tests entity-wise, meaning that we only check the validity |
|
|
6 |
of the computed modality on a set of entities. This design choice is motivated by |
|
|
7 |
the fact that : |
|
|
8 |
|
|
|
9 |
1. That's what we actually care about. We want our pipeline to detect negation, |
|
|
10 |
family context, patient history and hypothesis relative to a given entity. |
|
|
11 |
|
|
|
12 |
2. Deciding on the span of an annotation (negated, hypothesis, etc) is tricky. |
|
|
13 |
Consider the example : `"Le patient n'est pas malade."`. Should the negated span |
|
|
14 |
correspond to `["est", "malade"]`, `["malade"]`, `["n'", "est", "pas", "malade", "."]` ? |
|
|
15 |
|
|
|
16 |
3. Depending on the design of the algorithm, the span might be off, even though it |
|
|
17 |
can correctly assign polarity to a given entity (but considered that the punctuation |
|
|
18 |
was negated as well). |
|
|
19 |
By relaxing the need to infer the correct span, we avoid giving an unfair disadvantage |
|
|
20 |
to an otherwise great algorithm. |