Diff of /CHANGELOG.md [000000] .. [ca4dac]

Switch to unified view

a b/CHANGELOG.md
1
# Change Log
2
All notable changes to this project will be documented in this file.
3
4
The format is based on [Keep a Changelog](http://keepachangelog.com/)
5
and this project adheres to [Semantic Versioning](http://semver.org/).
6
7
8
## [Unreleased]
9
10
11
## [1.8.0] - 2025-01-11
12
### Removed
13
- Support for Python 3.10.
14
15
### Added
16
- Clinical [PubMedBert SBERT embeddings] configuration.
17
- [PyRuSH] Clincal clinical sententence chunker
18
19
### Changed
20
- Upgraded to [zensols.util] version 1.15.
21
22
23
## [1.7.0] - 2024-04-14
24
### Removed
25
- Deprecated feature document parser `mednlp_combine_medfirst_doc_parser` from
26
  resource library.
27
28
### Changed
29
- Renamed `MeddicalFeatureDocumentParser` to `MedCatFeatureDocumentParser`
30
  since the parser only adds MedCAT CUIs.
31
- Fixed the configured parser that adds both ScispaCy biomed entities and
32
  MedCAT CUIs in `mednlp_combine_biomed_doc_parser`.
33
- Separated combiner with non-combiner models in configuration space.  The
34
  non-combiners (biomed ScispaCy and MedCAT) are parses that are used in their
35
  respective combiners.  These are then used by a new composite parser that
36
  uses both (`mednlp_combine_biomed_doc_parser`).  Unit test case added for
37
  all configured parsers.
38
39
40
## [1.6.0] - 2024-02-27
41
### Added
42
- Added a ScispaCy biomedical document parser, which is enabled by setting
43
  `mednlp_default:doc_parser = mednlp_combine_biomed_doc_parser`.  This uses
44
  better linguistic features and detects more entity span(s).
45
46
### Changed
47
- Fixed normalize token and sentence/token indexes from being clobbered by
48
  combined medical parsers.
49
- Fixed numeric entity feature from the combined medical parser is non-zero
50
  for medical entities.
51
- Combined medical parser defines linguistic and medical features in the
52
  document parser object instance.
53
54
55
## [1.5.0] - 2023-12-05
56
### Changed
57
- Upgrade libraries: `numpy`, `lxml`, `scikitlearn`, `scipy`, `medcat`.
58
- Added `scispacy` dependency.
59
60
### Added
61
- Support for Python 3.11.
62
63
### Removed
64
- Support for Python 3.9.
65
66
67
## [1.4.1] - 2023-09-08
68
### Changed
69
- Fix unit tests using the model provided for the MedCAT tutorials.
70
- Re-enable GitHub workflow unit tests CI.
71
72
73
## [1.4.0] - 2023-08-16
74
Downstream moderate risk update release.
75
76
### Changed
77
- Switch order of medical specific and general spaCy language parsing and
78
  chunking.  Now the medical parser is the source parser and the default spaCy
79
  parser is the target in `MappingCombinerFeatureDocumentParser`.  This was
80
  done to get better sentence chunking as MedCAT (used in the medical parser)
81
  does not sentence chunk well as it was not designed for it.
82
- Upgrade to [zensols.util] 1.13.0
83
- Fix `cui2vec` weight archive re-download on each access.
84
- Mapping combiner default is to use token's character absolute index.
85
86
87
## [1.3.2] - 2023-06-29
88
### Added
89
- Feature document parser shortcut from application factory.
90
91
### Changed
92
- Resource library configuration to clean up model resources after download.
93
94
95
## [1.3.1] - 2023-06-27
96
### Changed
97
- Updated to [zensols.nlp] to 1.7.2 and use its new feature to auto load any
98
  missing spaCy base model(s).
99
- Remove configuration file requirement from the CLI.
100
- Fix spaCy dependency requirement to align with `scipy` and  [zensols.nlp].
101
102
103
## [1.3.0] - 2023-06-20
104
### Changed
105
- Fix contraction tokenization.  This was done by swapping the target and
106
  source parser in the `lang.conf:mednlp_combine_doc_parser` resource library
107
  configuration.
108
109
110
## [1.2.0] - 2023-06-09
111
### Changed
112
- Upgrade to [medcat] 1.7.0.
113
- Better UTS error handling with raised exceptions and messages.
114
115
116
## [1.1.0] - 2023-04-05
117
### Changed
118
- Install missing models and packages on start up.
119
- Updated [zensols.install] to 0.2.1
120
- Updated [zensols.nlp] to 1.6.0.
121
122
123
## [1.0.0] - 2023-02-02
124
### Changed
125
- Updated [zensols.util] to 1.12.1.
126
- Updated [zensols.nlp] to 1.5.0.
127
128
129
## [0.1.1] - 2022-10-02
130
### Removed
131
- Make scispacy optional and remove the dependency.  See the test cases in
132
  [test](test/entlink).
133
134
135
## [0.1.0] - 2022-10-01
136
### Added
137
- Unit test in GitHub CI.
138
139
### Changed
140
- Upgrade to spaCy 2.2, MedCAT 3.0, `zensols.nlp` 1.4.0.
141
- Make `cui2vec` a standard word embedding with vectorizer and layer.
142
- Replace entity splitter by not embedding named entities as default
143
  configuration.
144
145
### Removed
146
- Support for Python 3.7, 3.8 from dropped support in `zensols.util`.
147
148
149
## [0.0.2] - 2022-05-04
150
### Added
151
- A CSV features dump example.
152
153
### Changed
154
- Use token instead of sentence level mapping in cases where MedCAT creates
155
  unaligned sentence boundaries.
156
- Make medical parser stand-alone and use delegate mapping combiner instead of
157
  using a class hierarchy.
158
- Entity linker is now a token decorator.
159
- Protect against `unk` (unknown) keys in TUIs.
160
161
162
## [0.0.1] - 2022-01-30
163
### Added
164
- Initial version.
165
166
167
<!-- links -->
168
[Unreleased]: https://github.com/Paul Landes/mednlp/compare/v1.8.0...HEAD
169
[1.8.0]: https://github.com/Paul Landes/mednlp/compare/v1.7.0...v1.8.0
170
[1.7.0]: https://github.com/Paul Landes/mednlp/compare/v1.6.0...v1.7.0
171
[1.6.0]: https://github.com/Paul Landes/mednlp/compare/v1.5.0...v1.6.0
172
[1.5.0]: https://github.com/Paul Landes/mednlp/compare/v1.4.1...v1.5.0
173
[1.4.1]: https://github.com/Paul Landes/mednlp/compare/v1.4.0...v1.4.1
174
[1.4.0]: https://github.com/Paul Landes/mednlp/compare/v1.3.2...v1.4.0
175
[1.3.2]: https://github.com/Paul Landes/mednlp/compare/v1.3.1...v1.3.2
176
[1.3.1]: https://github.com/Paul Landes/mednlp/compare/v1.3.0...v1.3.1
177
[1.3.0]: https://github.com/Paul Landes/mednlp/compare/v1.2.0...v1.3.0
178
[1.2.0]: https://github.com/Paul Landes/mednlp/compare/v1.1.0...v1.2.0
179
[1.1.0]: https://github.com/Paul Landes/mednlp/compare/v1.0.0...v1.1.0
180
[1.0.0]: https://github.com/Paul Landes/mednlp/compare/v0.1.1...v1.0.0
181
[0.1.1]: https://github.com/Paul Landes/mednlp/compare/v0.1.0...v0.1.1
182
[0.1.0]: https://github.com/Paul Landes/mednlp/compare/v0.0.2...v0.1.0
183
[0.0.2]: https://github.com/Paul Landes/mednlp/compare/v0.0.1...v0.0.2
184
[0.0.1]: https://github.com/Paul Landes/mednlp/compare/v0.0.0...v0.0.1
185
186
[zensols.util]: https://github.com/plandes/util
187
[zensols.nlp]: https://github.com/plandes/nlparse
188
[zensols.install]: https://github.com/plandes/install
189
[medcat]: https://github.com/CogStack/MedCAT
190
[PyRuSH]: https://github.com/jianlins/PyRuSH
191
[PubMedBert SBERT embeddings]: https://huggingface.co/pritamdeka/S-PubMedBert-MS-MARCO