Oger

Latest version: v1.5

Safety actively analyzes 622435 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 3

1.5

- new output format: *pubanno_json.tgz*, gzipped archive of PubAnnotation JSON
- new input format: *txt_tsv*, two-column TSV <doc-ID, text>

1.4.1

- bug fixes in PubAnnotation output

1.4

- REST API:
* various bugfixes
* new default value `true` for the postfilter parameter
* don't allow pickle objects in user-defined configuration
* allow built-in postfilters in user-defined annotators
* background loading uses threads (concurrent.futures) instead of subprocesses (multiprocessing) -- more stable now
- new formats:
* europepmc and europepmc.zip (output)
* *bionlp* (output)
* *brat.ann*, *bionlp.ann*, *txt* (output) (individual files of bionlp and brat)
* *conll* (input and output)
- fixes to input/output formats:
* *pmc* (more robust)
* *tsv* (no newlines/tabs inside mentions)
- new normalization method: ise/ize spelling
- fixes to document/collection iteration
- evaluation: misc improvements

1.3

- new input format: *bioc_json*
- built-in postfilters for removing subsumed/overlapping matches and frequent false positives
- parameter `mesh-as-entities` for recognising MeSH descriptor names as entity annotations (*pubmed* and *pxml* format)
- UMLS CUIs (given in the termlist) included in _bioc*_, *(text_)tsv* and *xml* output formats

1.2.1

- bug fixes in input/output formats

1.2

- new input and output formats *pubtator* and *pubtator_fbk*
- new normalization method *mask*
- new parameters `byte-offsets-in` and `byte-offsets-out` for interpreting and producing BioC documents with byte-based offsets
- improved PipelineServer class for library usage
- fixed a security hole involving eval()
- allow specifying a pretrained sentence splitter

Page 1 of 3

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.