Nlpaug

Latest version: v1.1.11

Safety actively analyzes 688512 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 4 of 5

0.0.12

ContextualWordEmbsAug supports bert-base-multilingual-uncased (for non English inputs)
Fix missing library dependency 74
Fix single token error when using RandomWordAug 76
Fix replacing character in RandomCharAug error 77
Enhance word's augmenter to support regular expression stopwords 81
Enhance char's augmenter to support regular expression stopwords 86
KeyboardAug supports Thai language 92
Fix word casing issue 82

0.0.11

Support color noise (pink, blue, red and violet noise) in audio's NoiseAug
Support given background noise in audio's NoiseAug
Support inject noise to portion of audio only in audio's NoiseAug
Introduce zone, coverage to all audio augmenter. Support only augmented portion of audio input
Add VTLP augmentation methods (Audio's augmenter)
Adopt latest transformer's interface 59
Support RoBERTa (including DistilRoBERTa) and DistilBERT (ContextualWordEmbsAug)
Support DistilGPT2 (ContextualWordEmbsForSentenceAug)
Fix librosa hard dependency 62
Introduce optimize attribute ContextualWordEmbsForSentenceAug 63
Optimize word selection for ContextualWordEmbsAug and ContextualWordEmbsForSentenceAug (Speed up around 30%)
Add retry mechanism into ContextualWordEmbsAug insert action 68

0.0.10

- Add aug_max to control maximum number of augmented item
- Fix ContextualWordEmbsAug (for BERT) error when input is longer than max sequence length
- Add RandomWordAug Substitute action
- Fix ContextualWordEmbsAug error when no augmented data
- Support multi thread processing (for CPU only) to speed up the augmentation
- Fix KeyboardAug error [55](https://github.com/makcedward/nlpaug/issues/55)

0.0.9

* Added Swap Mode (adjacent, middle and random) for RandomAug (character level)
* Added SynonymAug (WordNet/ PPDB) and AntonymAug (WordNet)
* WordNetAug is deprecated. Uses SynonymAug instead
* Introduce parameter n. Returning more than 1 augmented data. Changing output format from text (or numpy) to list of text (or numpy) if n > 1
* Introduce parameter temperature in ContextualWordEmbsAug and ContextualWordEmbsForSentenceAug to control the randomness
* aug_n parameter is deprecated. This parameter will be replaced by top_k parameter
* Fixed tokenization issue [48](https://github.com/makcedward/nlpaug/issues/48)
* Upgraded transformers dependency (or pytorch_transformer) to 2.0.0
* Upgraded PyTorch dependency to 1.2.0
* Added SplitAug

0.0.8

- BertAug is replaced by ContextualWordEmbsAug
- Support GPU (for ContextualWordEmbsAug only) 26
- Upgraded pytorch_transformer to 1.1.0 version 33
- ContextualWordEmbsAug suuports both BERT and XLNet model
- Removed librosa dependency
- Add ContextualWordEmbsForSentenceAug for generating next sentence
- Fix sampling issue 38

0.0.7

- Add new augmenter (CropAug, LoudnessAug, MaskAug)
- QwertyAug is deprecated. It will be replaced by KeyboardAug
- Remove StopWordsAug. It will be replaced by RandomWordAug
- Code refactoring
- Added model download function for word2vec, GloVe and fasttext

Page 4 of 5

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.