|
a |
|
b/CHANGELOG.md |
|
|
1 |
# Change Log |
|
|
2 |
All notable changes to this project will be documented in this file. |
|
|
3 |
|
|
|
4 |
The format is based on [Keep a Changelog](http://keepachangelog.com/) |
|
|
5 |
and this project adheres to [Semantic Versioning](http://semver.org/). |
|
|
6 |
|
|
|
7 |
|
|
|
8 |
## [Unreleased] |
|
|
9 |
|
|
|
10 |
|
|
|
11 |
## [1.8.0] - 2025-01-11 |
|
|
12 |
### Removed |
|
|
13 |
- Support for Python 3.10. |
|
|
14 |
|
|
|
15 |
### Added |
|
|
16 |
- Clinical [PubMedBert SBERT embeddings] configuration. |
|
|
17 |
- [PyRuSH] Clincal clinical sententence chunker |
|
|
18 |
|
|
|
19 |
### Changed |
|
|
20 |
- Upgraded to [zensols.util] version 1.15. |
|
|
21 |
|
|
|
22 |
|
|
|
23 |
## [1.7.0] - 2024-04-14 |
|
|
24 |
### Removed |
|
|
25 |
- Deprecated feature document parser `mednlp_combine_medfirst_doc_parser` from |
|
|
26 |
resource library. |
|
|
27 |
|
|
|
28 |
### Changed |
|
|
29 |
- Renamed `MeddicalFeatureDocumentParser` to `MedCatFeatureDocumentParser` |
|
|
30 |
since the parser only adds MedCAT CUIs. |
|
|
31 |
- Fixed the configured parser that adds both ScispaCy biomed entities and |
|
|
32 |
MedCAT CUIs in `mednlp_combine_biomed_doc_parser`. |
|
|
33 |
- Separated combiner with non-combiner models in configuration space. The |
|
|
34 |
non-combiners (biomed ScispaCy and MedCAT) are parses that are used in their |
|
|
35 |
respective combiners. These are then used by a new composite parser that |
|
|
36 |
uses both (`mednlp_combine_biomed_doc_parser`). Unit test case added for |
|
|
37 |
all configured parsers. |
|
|
38 |
|
|
|
39 |
|
|
|
40 |
## [1.6.0] - 2024-02-27 |
|
|
41 |
### Added |
|
|
42 |
- Added a ScispaCy biomedical document parser, which is enabled by setting |
|
|
43 |
`mednlp_default:doc_parser = mednlp_combine_biomed_doc_parser`. This uses |
|
|
44 |
better linguistic features and detects more entity span(s). |
|
|
45 |
|
|
|
46 |
### Changed |
|
|
47 |
- Fixed normalize token and sentence/token indexes from being clobbered by |
|
|
48 |
combined medical parsers. |
|
|
49 |
- Fixed numeric entity feature from the combined medical parser is non-zero |
|
|
50 |
for medical entities. |
|
|
51 |
- Combined medical parser defines linguistic and medical features in the |
|
|
52 |
document parser object instance. |
|
|
53 |
|
|
|
54 |
|
|
|
55 |
## [1.5.0] - 2023-12-05 |
|
|
56 |
### Changed |
|
|
57 |
- Upgrade libraries: `numpy`, `lxml`, `scikitlearn`, `scipy`, `medcat`. |
|
|
58 |
- Added `scispacy` dependency. |
|
|
59 |
|
|
|
60 |
### Added |
|
|
61 |
- Support for Python 3.11. |
|
|
62 |
|
|
|
63 |
### Removed |
|
|
64 |
- Support for Python 3.9. |
|
|
65 |
|
|
|
66 |
|
|
|
67 |
## [1.4.1] - 2023-09-08 |
|
|
68 |
### Changed |
|
|
69 |
- Fix unit tests using the model provided for the MedCAT tutorials. |
|
|
70 |
- Re-enable GitHub workflow unit tests CI. |
|
|
71 |
|
|
|
72 |
|
|
|
73 |
## [1.4.0] - 2023-08-16 |
|
|
74 |
Downstream moderate risk update release. |
|
|
75 |
|
|
|
76 |
### Changed |
|
|
77 |
- Switch order of medical specific and general spaCy language parsing and |
|
|
78 |
chunking. Now the medical parser is the source parser and the default spaCy |
|
|
79 |
parser is the target in `MappingCombinerFeatureDocumentParser`. This was |
|
|
80 |
done to get better sentence chunking as MedCAT (used in the medical parser) |
|
|
81 |
does not sentence chunk well as it was not designed for it. |
|
|
82 |
- Upgrade to [zensols.util] 1.13.0 |
|
|
83 |
- Fix `cui2vec` weight archive re-download on each access. |
|
|
84 |
- Mapping combiner default is to use token's character absolute index. |
|
|
85 |
|
|
|
86 |
|
|
|
87 |
## [1.3.2] - 2023-06-29 |
|
|
88 |
### Added |
|
|
89 |
- Feature document parser shortcut from application factory. |
|
|
90 |
|
|
|
91 |
### Changed |
|
|
92 |
- Resource library configuration to clean up model resources after download. |
|
|
93 |
|
|
|
94 |
|
|
|
95 |
## [1.3.1] - 2023-06-27 |
|
|
96 |
### Changed |
|
|
97 |
- Updated to [zensols.nlp] to 1.7.2 and use its new feature to auto load any |
|
|
98 |
missing spaCy base model(s). |
|
|
99 |
- Remove configuration file requirement from the CLI. |
|
|
100 |
- Fix spaCy dependency requirement to align with `scipy` and [zensols.nlp]. |
|
|
101 |
|
|
|
102 |
|
|
|
103 |
## [1.3.0] - 2023-06-20 |
|
|
104 |
### Changed |
|
|
105 |
- Fix contraction tokenization. This was done by swapping the target and |
|
|
106 |
source parser in the `lang.conf:mednlp_combine_doc_parser` resource library |
|
|
107 |
configuration. |
|
|
108 |
|
|
|
109 |
|
|
|
110 |
## [1.2.0] - 2023-06-09 |
|
|
111 |
### Changed |
|
|
112 |
- Upgrade to [medcat] 1.7.0. |
|
|
113 |
- Better UTS error handling with raised exceptions and messages. |
|
|
114 |
|
|
|
115 |
|
|
|
116 |
## [1.1.0] - 2023-04-05 |
|
|
117 |
### Changed |
|
|
118 |
- Install missing models and packages on start up. |
|
|
119 |
- Updated [zensols.install] to 0.2.1 |
|
|
120 |
- Updated [zensols.nlp] to 1.6.0. |
|
|
121 |
|
|
|
122 |
|
|
|
123 |
## [1.0.0] - 2023-02-02 |
|
|
124 |
### Changed |
|
|
125 |
- Updated [zensols.util] to 1.12.1. |
|
|
126 |
- Updated [zensols.nlp] to 1.5.0. |
|
|
127 |
|
|
|
128 |
|
|
|
129 |
## [0.1.1] - 2022-10-02 |
|
|
130 |
### Removed |
|
|
131 |
- Make scispacy optional and remove the dependency. See the test cases in |
|
|
132 |
[test](test/entlink). |
|
|
133 |
|
|
|
134 |
|
|
|
135 |
## [0.1.0] - 2022-10-01 |
|
|
136 |
### Added |
|
|
137 |
- Unit test in GitHub CI. |
|
|
138 |
|
|
|
139 |
### Changed |
|
|
140 |
- Upgrade to spaCy 2.2, MedCAT 3.0, `zensols.nlp` 1.4.0. |
|
|
141 |
- Make `cui2vec` a standard word embedding with vectorizer and layer. |
|
|
142 |
- Replace entity splitter by not embedding named entities as default |
|
|
143 |
configuration. |
|
|
144 |
|
|
|
145 |
### Removed |
|
|
146 |
- Support for Python 3.7, 3.8 from dropped support in `zensols.util`. |
|
|
147 |
|
|
|
148 |
|
|
|
149 |
## [0.0.2] - 2022-05-04 |
|
|
150 |
### Added |
|
|
151 |
- A CSV features dump example. |
|
|
152 |
|
|
|
153 |
### Changed |
|
|
154 |
- Use token instead of sentence level mapping in cases where MedCAT creates |
|
|
155 |
unaligned sentence boundaries. |
|
|
156 |
- Make medical parser stand-alone and use delegate mapping combiner instead of |
|
|
157 |
using a class hierarchy. |
|
|
158 |
- Entity linker is now a token decorator. |
|
|
159 |
- Protect against `unk` (unknown) keys in TUIs. |
|
|
160 |
|
|
|
161 |
|
|
|
162 |
## [0.0.1] - 2022-01-30 |
|
|
163 |
### Added |
|
|
164 |
- Initial version. |
|
|
165 |
|
|
|
166 |
|
|
|
167 |
<!-- links --> |
|
|
168 |
[Unreleased]: https://github.com/Paul Landes/mednlp/compare/v1.8.0...HEAD |
|
|
169 |
[1.8.0]: https://github.com/Paul Landes/mednlp/compare/v1.7.0...v1.8.0 |
|
|
170 |
[1.7.0]: https://github.com/Paul Landes/mednlp/compare/v1.6.0...v1.7.0 |
|
|
171 |
[1.6.0]: https://github.com/Paul Landes/mednlp/compare/v1.5.0...v1.6.0 |
|
|
172 |
[1.5.0]: https://github.com/Paul Landes/mednlp/compare/v1.4.1...v1.5.0 |
|
|
173 |
[1.4.1]: https://github.com/Paul Landes/mednlp/compare/v1.4.0...v1.4.1 |
|
|
174 |
[1.4.0]: https://github.com/Paul Landes/mednlp/compare/v1.3.2...v1.4.0 |
|
|
175 |
[1.3.2]: https://github.com/Paul Landes/mednlp/compare/v1.3.1...v1.3.2 |
|
|
176 |
[1.3.1]: https://github.com/Paul Landes/mednlp/compare/v1.3.0...v1.3.1 |
|
|
177 |
[1.3.0]: https://github.com/Paul Landes/mednlp/compare/v1.2.0...v1.3.0 |
|
|
178 |
[1.2.0]: https://github.com/Paul Landes/mednlp/compare/v1.1.0...v1.2.0 |
|
|
179 |
[1.1.0]: https://github.com/Paul Landes/mednlp/compare/v1.0.0...v1.1.0 |
|
|
180 |
[1.0.0]: https://github.com/Paul Landes/mednlp/compare/v0.1.1...v1.0.0 |
|
|
181 |
[0.1.1]: https://github.com/Paul Landes/mednlp/compare/v0.1.0...v0.1.1 |
|
|
182 |
[0.1.0]: https://github.com/Paul Landes/mednlp/compare/v0.0.2...v0.1.0 |
|
|
183 |
[0.0.2]: https://github.com/Paul Landes/mednlp/compare/v0.0.1...v0.0.2 |
|
|
184 |
[0.0.1]: https://github.com/Paul Landes/mednlp/compare/v0.0.0...v0.0.1 |
|
|
185 |
|
|
|
186 |
[zensols.util]: https://github.com/plandes/util |
|
|
187 |
[zensols.nlp]: https://github.com/plandes/nlparse |
|
|
188 |
[zensols.install]: https://github.com/plandes/install |
|
|
189 |
[medcat]: https://github.com/CogStack/MedCAT |
|
|
190 |
[PyRuSH]: https://github.com/jianlins/PyRuSH |
|
|
191 |
[PubMedBert SBERT embeddings]: https://huggingface.co/pritamdeka/S-PubMedBert-MS-MARCO |