Cltk

Latest version: v1.4.0


0.1.26

Add wrapper function, `nltk_tokenize_words()`, for `PunktLanguageVars.word_tokenize()`.

Move NER to `cltk.tag` (from `cltk.ner`).

Switch to semantic versioning ([read about it here](http://semver.org/)).

0.0.1.24

Addition of basic named entity recognition (NER) for Greek and Latin.

This works by taking word lists ([Latin](https://github.com/cltk/latin_proper_names_cltk) and [Greek](https://github.com/cltk/greek_proper_names_cltk)) and matching against tokens of incoming text.
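The word-list matching described above can be sketched roughly as follows. This is a minimal illustration, not the CLTK implementation: the `PROPER_NAMES` set, the `tag_entities` function, and the `"Entity"` label are hypothetical names chosen for the example, and the real word lists are far larger.

```python
import re

# Hypothetical stand-in for a proper-name word list; the real lists live in
# the repositories linked above.
PROPER_NAMES = {"Caesar", "Gallia", "Roma"}


def tag_entities(text, names=PROPER_NAMES):
    """Return (token, label) pairs, labelling tokens found in `names`."""
    # Naive tokenization on word characters; punctuation is discarded here.
    tokens = re.findall(r"\w+", text)
    return [(tok, "Entity" if tok in names else None) for tok in tokens]


tagged = tag_entities("Gallia est omnis divisa in partes tres.")
print(tagged)
```

Matching is exact and case-sensitive in this sketch, so inflected forms (for example `Galliam`) are only tagged if they appear in the list themselves.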

0.0.1.23

This release features the first robust interface for using the TLG indices, which were parsed by Stephen Margheim (smargh).

Also included are JSON versions of the PHI5 by Martín Pozzi (marpozzi), though no helper functions have been written yet.

Docs available at: http://docs.cltk.org/en/latest/greek.html#tlg-indices.

0.0.1.22

This release adds Tyler Kirby's [Greek Scansion module](https://github.com/kylepjohnson/cltk/blob/69e5a3d2d0430ab963167e1320a5a510c5ecc0f3/cltk/prosody/greek/scanner.py).

0.0.1.21

Improved enclitic handling for Latin's word tokenizer by Patrick J. Burns (diyclassics).

Original pull request: https://github.com/kylepjohnson/cltk/pull/84
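As a rough illustration of what enclitic handling in a Latin tokenizer involves, the sketch below splits a trailing `-que`, `-ne`, or `-ve` off a token unless the word is on an exception list. It is an assumption-laden toy, not the code from the pull request: `ENCLITICS`, `EXCEPTIONS`, and `split_enclitics` are hypothetical names, and the exception list is deliberately tiny.

```python
ENCLITICS = ("que", "ne", "ve")

# Hypothetical exception list: words that end in an enclitic-like string
# but should stay whole. A real tokenizer needs a much longer list.
EXCEPTIONS = {"atque", "neque", "quoque", "itaque", "sine", "bene"}


def split_enclitics(tokens):
    """Split a trailing enclitic off each token, skipping known exceptions."""
    result = []
    for tok in tokens:
        lower = tok.lower()
        # Find the first enclitic the token ends with (if any).
        enclitic = next(
            (e for e in ENCLITICS if lower.endswith(e) and len(lower) > len(e)),
            None,
        )
        if enclitic is None or lower in EXCEPTIONS:
            result.append(tok)
        else:
            result.extend([tok[: -len(enclitic)], "-" + enclitic])
    return result


print(split_enclitics("arma virumque cano".split()))
```

The exception list is the hard part in practice: a purely suffix-based rule over-splits ordinary words that happen to end in `ne` or `ve`, which is why this sketch should not be used on real text as-is.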

0.0.1.20

This tagged release features a few point changes, most notably Tyler Kirby's Latin prosody scanner.

Also included are improvements to the PHI5 and TLG text cleaners, as well as an overhaul of what used to be the stopword generator, now refactored and living in `cltk.utils.frequency`.
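For readers unfamiliar with frequency-based stopword generation, the general idea can be sketched with the standard library alone. This does not reproduce the `cltk.utils.frequency` interface; `top_n_words` and the sample text are hypothetical and serve only to show the technique of treating the most frequent tokens as stopword candidates.

```python
import re
from collections import Counter


def top_n_words(text, n=10):
    """Return the n most frequent lowercase tokens in `text`."""
    tokens = re.findall(r"\w+", text.lower())
    return [word for word, _ in Counter(tokens).most_common(n)]


sample = "et in terra et in caelo et in mari"
stops = top_n_words(sample, n=2)
print(stops)
```

On a real corpus the cutoff `n` is a judgment call, and the resulting list usually needs manual review before use.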

