Textaugment

Latest version: v2.0.0

Safety actively analyzes 682361 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 2 of 2

1.2

- Added support for [EDA algorithm](https://www.aclweb.org/anthology/D19-1670.pdf)
- Added examples using Jupyter notebook

1.1

Updated ReadMe and icons.

- Added licence icon.
- Release icon.
- Wheel icon.
- Python version icon.

**Added pre-print paper citation.**

1.0

TextAugment is a Python 3 library for augmenting text for natural language processing applications. TextAugment stands on the giant shoulders of NLTK, Gensim, and TextBlob and plays nicely with them.

Requirements
- Python 3
The following software packages are dependencies and will be installed automatically.
shell
$ pip install numpy nltk gensim textblob googletrans

The following code downloads wordnet, tokenizer, and part-of-speech tagger model.
python
nltk.download('wordnet')
nltk.download('punkt')
nltk.download('averaged_perceptron_tagger')


Install from pip [Recommended]

$ pip install textaugment

How to use
python
>>> from textaugment import Word2vec
>>> t = Word2vec(model='path/to/gensim/model'or 'gensim model itself')
>>> t.augment('The stories are good')
The films are good

Citation

article{marivate2019improving,
title={Improving short text classification through global augmentation methods},
author={Marivate, Vukosi and Sefara, Tshephisho},
journal={arXiv preprint arXiv:1907.03752},
year={2019}
}

Built with ❤ on Python

Page 2 of 2

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.