Classla

Latest version: v2.2


Page 2 of 2

1.1.0

- Added tokenizer pretag option for both obeliks and reldi-tokeniser (via `pos_lemma_pretag`)
- Updated the Slovene inflectional lexicon and moved it from the lemmatizer model to the morphosyntactic annotation model
- Added upos and ufeats control to Slovene inflectional lexicon
- Other smaller fixes

1.0.2

- Fixed an issue where the parser produced non-CoNLL-U-compliant extension labels with underscores (e.g. `cc_preconj`) instead of colon-separated labels (e.g. `cc:preconj`)
- During lemmatization, if a token consists of a character that is not present in the seq2seq vocabulary, the lemma is now identical to the token
- Added PUNCT control
- Fixed a MISC column bug for NER
- `punct` in Bulgarian UPOS was renamed to `Z`
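
The first two fixes in 1.0.2 can be illustrated with a minimal sketch. This is not the library's actual code: the names `normalize_deprel`, `fallback_lemma`, `vocab`, and `predict` are hypothetical, and the sketch assumes the fallback applies whenever any character of the token is missing from the vocabulary.

```python
from typing import Callable


def normalize_deprel(label: str) -> str:
    """Rewrite an underscore-separated subtype label in colon form.

    Hypothetical helper: CoNLL-U writes relation subtypes as
    `relation:subtype`, so `cc_preconj` becomes `cc:preconj`.
    Labels without an underscore are returned unchanged.
    """
    head, sep, subtype = label.partition("_")
    return f"{head}:{subtype}" if sep else label


def fallback_lemma(token: str, vocab: set[str], predict: Callable[[str], str]) -> str:
    """Return the token itself when it contains out-of-vocabulary characters.

    `vocab` stands in for the seq2seq character vocabulary and `predict`
    for the seq2seq lemmatizer; both are assumptions made for illustration.
    """
    if any(ch not in vocab for ch in token):
        return token
    return predict(token)
```

With a toy vocabulary of `set("abc")` and `str.upper` standing in for the model, `fallback_lemma("€", ...)` returns the token unchanged, while `fallback_lemma("abc", ...)` is passed on to the stand-in predictor.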

