* smaller data files (especially for fi, la, pl, pt, sk & tr, 19)
* added support for Asturian (``ast``, 20)
* bug fixes (18, 26)

0.8.2
-----
* languages added: Albanian, Hindi, Icelandic, Malay, Middle English, Northern Sámi, Nynorsk, Serbo-Croatian, Swahili, Tagalog
* fix for slow language detection introduced in 0.7.0

0.8.1
-----
* better rules for English and German
* inconsistencies fixed for cy, de, en, ga, sv (16)
* docs: added language detection and citation info

0.8.0
-----
* code fully type checked, optional pre-compilation with ``mypyc``
* fixes: logging error (11), input type (12)
* code style: `black <https://github.com/psf/black>`_

0.7.0
-----
* **breaking change**: language data pre-loading now occurs internally, language codes are now directly provided in ``lemmatize()`` call, e.g. ``simplemma.lemmatize("test", lang="en")``
* faster lemmatization, result cache
* sentence-aware ``text_lemmatizer()``
* optional iterators for tokenization and lemmatization

0.6.0
-----
* improved language models
* improved tokenizer
* maintenance and code efficiency
* added basic language detection (undocumented)