- add several Tibetan corpora (per ticket 77)
- comment out old, pre-Git importer code
0.0.1.13
• add Latin syllabifier (by Luke Hollis)
• add concordance maker
• add Chinese, Coptic, and Pali corpora
0.0.1.12
Overhauled the importers to use Git for corpus download and, now, updates. The one known bug is that the Latin lemmatizer does not work; this will be fixed soon.
0.0.1.10
Extended the TLGU wrapper to support the `break_works` option. Also added two TLG indices.
0.0.1.8
This release fixes breakage caused by nltk v. 3.0.2, which removed `PunktWordTokenizer()`; the code now uses `PunktLanguageVars().word_tokenize()` instead. An improved TLG index has been added, too.
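
For anyone updating their own code for the same nltk change, here is a minimal sketch of the replacement call. It assumes nltk 3.0.2 or later is installed; the sample phrase and the `tokens` variable are illustrative only and are not taken from the CLTK source.

```python
# Sketch of the migration described above (assumes nltk >= 3.0.2).
# Before nltk 3.0.2 one could write:
#     from nltk.tokenize.punkt import PunktWordTokenizer
#     tokens = PunktWordTokenizer().tokenize(text)
# That class was removed, so the word tokenizer is reached through
# PunktLanguageVars instead.
from nltk.tokenize.punkt import PunktLanguageVars

text = "arma virumque cano"  # illustrative input, not from CLTK

# word_tokenize() is regex-based and needs no downloaded Punkt model.
tokens = PunktLanguageVars().word_tokenize(text)
print(tokens)  # ['arma', 'virumque', 'cano']
```

Note that this tokenizer does not split off a sentence-final period, so punctuation may remain attached to the last word of a sentence.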
0.0.1.7
This release improves TLG and PHI corpora support over the previous release (v0.0.1.0). It is also the first CLTK release to come with a [DOI](http://en.wikipedia.org/wiki/Digital_object_identifier), which is useful for academic citation.