Simplemma

Latest version: v0.9.1

Safety actively analyzes 622001 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 3

0.9.0

-----

* smaller data files (especially for fi, la, pl, pt, sk & tr, 19)
* added support for Asturian (``ast``, 20)
* bug fixes (18, 26)

0.8.2

-----

* languages added: Albanian, Hindi, Icelandic, Malay, Middle English, Northern Sámi, Nynorsk, Serbo-Croatian, Swahili, Tagalog
* fix for slow language detection introduced in 0.7.0

0.8.1

-----

* better rules for English and German
* inconsistencies fixed for cy, de, en, ga, sv (16)
* docs: added language detection and citation info

0.8.0

-----

* code fully type checked, optional pre-compilation with ``mypyc``
* fixes: logging error (11), input type (12)
* code style: `black <https://github.com/psf/black>`_

0.7.0

-----

* **breaking change**: language data pre-loading now occurs internally, language codes are now directly provided in ``lemmatize()`` call, e.g. ``simplemma.lemmatize("test", lang="en")``
* faster lemmatization, result cache
* sentence-aware ``text_lemmatizer()``
* optional iterators for tokenization and lemmatization

0.6.0

-----

* improved language models
* improved tokenizer
* maintenance and code efficiency
* added basic language detection (undocumented)

Page 1 of 3

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.