Pythainlp

Latest version: v5.0.5

Safety actively analyzes 691168 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

5.0.5

- Add Thai Discourse Treebank postag 910
- Add Thai Universal Dependency Treebank postag 916
- Add Thai G2P v2 Grapheme-to-Phoneme model 923
- Add support for list of strings as input to sent_tokenize() 927
- Add pythainlp.tools.safe_print to handle UnicodeEncodeError on console 969
- Fix collate() to consider tonemark in ordering 926

5.0.4

- Add clause_tokenize warnings 1026
- Fix maiyamok() that expanding the wrong word 962

5.0.3

- Fix: pythainlp.util.maiyamok does not duplicate words when more than one
Maiyamok is used 917

5.0.2

- Fix: empty string ('') added when using word_tokenize with
join_broken_num=True 912

5.0.1

- Fix: crfcut: Ensure splitting of sentences using terminal punctuation 905

5.0.0

- Fix: delay calling syllable_tokenize to avoid pycrfsuite ImportError 901

Links

Releases

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.