Pythainlp

Latest version: v5.1.0


Page 1 of 2

5.1.0

[WIP]

5.0.5

- Add Thai Discourse Treebank POS tags #910
- Add Thai Universal Dependency Treebank POS tags #916
- Add Thai G2P v2 grapheme-to-phoneme model #923
- Add support for a list of strings as input to sent_tokenize() #927
- Add pythainlp.tools.safe_print to handle UnicodeEncodeError on the console #969
- Fix collate() to consider tone marks in ordering #926
- Fix nlpo3.load_dict() never printing an error message on failure #979
- Add conversion from Thai solar date to Thai lunar date #998
- Add Thai pangram text #1045
- Remove clause_tokenize #1024
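
The safe_print entry above concerns consoles whose encoding cannot represent Thai characters. The following is a minimal sketch of that general idea in plain Python, not the library's implementation: the function name safe_print_sketch and its stream parameter are hypothetical, and the fallback policy (replacing unencodable characters with "?") is an assumption.

```python
import io
import sys


def safe_print_sketch(text, stream=None):
    """Print text; on UnicodeEncodeError, retry with unencodable characters replaced.

    Hypothetical helper for illustration only, not pythainlp.tools.safe_print.
    """
    stream = stream if stream is not None else sys.stdout
    try:
        print(text, file=stream)
    except UnicodeEncodeError:
        # Assumption: fall back to the stream's own encoding with "?" substitutes.
        encoding = getattr(stream, "encoding", None) or "ascii"
        fallback = text.encode(encoding, errors="replace").decode(encoding)
        print(fallback, file=stream)


# Simulate an ASCII-only console with an in-memory stream.
ascii_bytes = io.BytesIO()
ascii_console = io.TextIOWrapper(ascii_bytes, encoding="ascii", newline="\n")
safe_print_sketch("Thai: ไทย", stream=ascii_console)
ascii_console.flush()
```

Passing the stream explicitly keeps the sketch testable without changing the real console encoding.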

5.0.4

- Add clause_tokenize warnings #1026
- Fix maiyamok() expanding the wrong word #962

5.0.3

- Fix: pythainlp.util.maiyamok does not duplicate words when more than one Maiyamok is used #917
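
For context, Maiyamok (ๆ) is the Thai repetition mark: it signals that the preceding word is repeated. The sketch below illustrates the behavior this fix describes on an already tokenized list. It is a simplified illustration, not the library's code: the name expand_maiyamok_sketch is hypothetical, and it assumes each Maiyamok is its own token with no whitespace tokens in between.

```python
MAIYAMOK = "ๆ"  # Thai repetition mark


def expand_maiyamok_sketch(tokens):
    """Replace each Maiyamok token with a copy of the word before it.

    Hypothetical helper for illustration only, not pythainlp.util.maiyamok.
    """
    expanded = []
    for token in tokens:
        if token == MAIYAMOK and expanded:
            # Repeat the most recent word; consecutive marks repeat it again.
            expanded.append(expanded[-1])
        else:
            # Ordinary words, and a leading mark with nothing to repeat, pass through.
            expanded.append(token)
    return expanded


repeated = expand_maiyamok_sketch(["ดี", "ๆ", "ๆ"])
```

Because each mark copies the last emitted word, two marks in a row yield three copies of the word rather than stopping after the first expansion.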

5.0.2

- Fix: empty string ('') added when using word_tokenize with join_broken_num=True #912

5.0.1

- Fix: crfcut: Ensure splitting of sentences using terminal punctuation #905


© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.