[ca4dac]: / CHANGELOG.md

Download this file

192 lines (142 with data), 6.3 kB

Change Log

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog
and this project adheres to Semantic Versioning.

[Unreleased]

[1.8.0] - 2025-01-11

Removed

  • Support for Python 3.10.

Added

  • Clinical [PubMedBert SBERT embeddings] configuration.
  • [PyRuSH] Clincal clinical sententence chunker

Changed

  • Upgraded to [zensols.util] version 1.15.

[1.7.0] - 2024-04-14

Removed

  • Deprecated feature document parser mednlp_combine_medfirst_doc_parser from
    resource library.

Changed

  • Renamed MeddicalFeatureDocumentParser to MedCatFeatureDocumentParser
    since the parser only adds MedCAT CUIs.
  • Fixed the configured parser that adds both ScispaCy biomed entities and
    MedCAT CUIs in mednlp_combine_biomed_doc_parser.
  • Separated combiner with non-combiner models in configuration space. The
    non-combiners (biomed ScispaCy and MedCAT) are parses that are used in their
    respective combiners. These are then used by a new composite parser that
    uses both (mednlp_combine_biomed_doc_parser). Unit test case added for
    all configured parsers.

[1.6.0] - 2024-02-27

Added

  • Added a ScispaCy biomedical document parser, which is enabled by setting
    mednlp_default:doc_parser = mednlp_combine_biomed_doc_parser. This uses
    better linguistic features and detects more entity span(s).

Changed

  • Fixed normalize token and sentence/token indexes from being clobbered by
    combined medical parsers.
  • Fixed numeric entity feature from the combined medical parser is non-zero
    for medical entities.
  • Combined medical parser defines linguistic and medical features in the
    document parser object instance.

[1.5.0] - 2023-12-05

Changed

  • Upgrade libraries: numpy, lxml, scikitlearn, scipy, medcat.
  • Added scispacy dependency.

Added

  • Support for Python 3.11.

Removed

  • Support for Python 3.9.

[1.4.1] - 2023-09-08

Changed

  • Fix unit tests using the model provided for the MedCAT tutorials.
  • Re-enable GitHub workflow unit tests CI.

[1.4.0] - 2023-08-16

Downstream moderate risk update release.

Changed

  • Switch order of medical specific and general spaCy language parsing and
    chunking. Now the medical parser is the source parser and the default spaCy
    parser is the target in MappingCombinerFeatureDocumentParser. This was
    done to get better sentence chunking as MedCAT (used in the medical parser)
    does not sentence chunk well as it was not designed for it.
  • Upgrade to [zensols.util] 1.13.0
  • Fix cui2vec weight archive re-download on each access.
  • Mapping combiner default is to use token's character absolute index.

[1.3.2] - 2023-06-29

Added

  • Feature document parser shortcut from application factory.

Changed

  • Resource library configuration to clean up model resources after download.

[1.3.1] - 2023-06-27

Changed

  • Updated to [zensols.nlp] to 1.7.2 and use its new feature to auto load any
    missing spaCy base model(s).
  • Remove configuration file requirement from the CLI.
  • Fix spaCy dependency requirement to align with scipy and [zensols.nlp].

[1.3.0] - 2023-06-20

Changed

  • Fix contraction tokenization. This was done by swapping the target and
    source parser in the lang.conf:mednlp_combine_doc_parser resource library
    configuration.

[1.2.0] - 2023-06-09

Changed

  • Upgrade to [medcat] 1.7.0.
  • Better UTS error handling with raised exceptions and messages.

[1.1.0] - 2023-04-05

Changed

  • Install missing models and packages on start up.
  • Updated [zensols.install] to 0.2.1
  • Updated [zensols.nlp] to 1.6.0.

[1.0.0] - 2023-02-02

Changed

  • Updated [zensols.util] to 1.12.1.
  • Updated [zensols.nlp] to 1.5.0.

[0.1.1] - 2022-10-02

Removed

  • Make scispacy optional and remove the dependency. See the test cases in
    test.

[0.1.0] - 2022-10-01

Added

  • Unit test in GitHub CI.

Changed

  • Upgrade to spaCy 2.2, MedCAT 3.0, zensols.nlp 1.4.0.
  • Make cui2vec a standard word embedding with vectorizer and layer.
  • Replace entity splitter by not embedding named entities as default
    configuration.

Removed

  • Support for Python 3.7, 3.8 from dropped support in zensols.util.

[0.0.2] - 2022-05-04

Added

  • A CSV features dump example.

Changed

  • Use token instead of sentence level mapping in cases where MedCAT creates
    unaligned sentence boundaries.
  • Make medical parser stand-alone and use delegate mapping combiner instead of
    using a class hierarchy.
  • Entity linker is now a token decorator.
  • Protect against unk (unknown) keys in TUIs.

[0.0.1] - 2022-01-30

Added

  • Initial version.

[Unreleased]: https://github.com/Paul Landes/mednlp/compare/v1.8.0...HEAD
[1.8.0]: https://github.com/Paul Landes/mednlp/compare/v1.7.0...v1.8.0
[1.7.0]: https://github.com/Paul Landes/mednlp/compare/v1.6.0...v1.7.0
[1.6.0]: https://github.com/Paul Landes/mednlp/compare/v1.5.0...v1.6.0
[1.5.0]: https://github.com/Paul Landes/mednlp/compare/v1.4.1...v1.5.0
[1.4.1]: https://github.com/Paul Landes/mednlp/compare/v1.4.0...v1.4.1
[1.4.0]: https://github.com/Paul Landes/mednlp/compare/v1.3.2...v1.4.0
[1.3.2]: https://github.com/Paul Landes/mednlp/compare/v1.3.1...v1.3.2
[1.3.1]: https://github.com/Paul Landes/mednlp/compare/v1.3.0...v1.3.1
[1.3.0]: https://github.com/Paul Landes/mednlp/compare/v1.2.0...v1.3.0
[1.2.0]: https://github.com/Paul Landes/mednlp/compare/v1.1.0...v1.2.0
[1.1.0]: https://github.com/Paul Landes/mednlp/compare/v1.0.0...v1.1.0
[1.0.0]: https://github.com/Paul Landes/mednlp/compare/v0.1.1...v1.0.0
[0.1.1]: https://github.com/Paul Landes/mednlp/compare/v0.1.0...v0.1.1
[0.1.0]: https://github.com/Paul Landes/mednlp/compare/v0.0.2...v0.1.0
[0.0.2]: https://github.com/Paul Landes/mednlp/compare/v0.0.1...v0.0.2
[0.0.1]: https://github.com/Paul Landes/mednlp/compare/v0.0.0...v0.0.1