Simplemma

Latest version: v1.1.1

Safety actively analyzes 685670 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 2 of 3

0.8.1

-----

* better rules for English and German
* inconsistencies fixed for cy, de, en, ga, sv (16)
* docs: added language detection and citation info

0.8.0

-----

* code fully type checked, optional pre-compilation with ``mypyc``
* fixes: logging error (11), input type (12)
* code style: `black <https://github.com/psf/black>`_

0.7.0

-----

* **breaking change**: language data pre-loading now occurs internally, language codes are now directly provided in ``lemmatize()`` call, e.g. ``simplemma.lemmatize("test", lang="en")``
* faster lemmatization, result cache
* sentence-aware ``text_lemmatizer()``
* optional iterators for tokenization and lemmatization

0.6.0

-----

* improved language models
* improved tokenizer
* maintenance and code efficiency
* added basic language detection (undocumented)

0.5.0

-----

* faster, more efficient code
* dropped support for Python 3.5

0.4.0

-----

* new languages: Armenian, Greek, Macedonian, Norwegian (Bokmål), and Polish
* language data reviewed for: Dutch, Finnish, German, Hungarian, Latin, Russian, and Swedish
* Urdu removed of language list due to issues with the data
* add support for Python 3.10 and drop support for Python 3.4
* improved decomposition and tokenization algorithms

Page 2 of 3

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.