Nlpiper

Latest version: v0.3.1

Safety actively analyzes 622331 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

0.3.1

:pencil: Changelog:

- :hammer_and_wrench: Improvements:
- Support Python 3.7

- :books: Documentation:
- Improve general documentation, add new maintainer

0.3.0

:pencil: Changelog:

- :sparkles: Features:
- Add Spacy and TorchText Embeddings transformer

- :books: Documentation:
- Improve documentation and migrate links to new repository host

0.2.1

:pencil: Changelog:

- :hammer_and_wrench: Improvements:
- Allow `Document` and `Token` to be set extra attributes

- :recycle: Refactoring:
- Move `BaseTransform` out of init for better integration with IDEs

- :books: Documentation:
- Improve documentation `cleaners` for sphinix better compatibility and explanation on transforms

0.2.0

:pencil: Changelog:

- :sparkles: Features:
- Add transformer for embeddings
- Add ner and embeddings token wise
- Add embeddings document wise
- Improve Stanza integration allowing to extract lemma and ner directly to the token object

- 🐛 Bug:
- Fix bug on importing nlpiper

- :books: Documentation:
- Update and improve documentation

0.1.0

:pencil: Changelog:

- :sparkles: Features:
- Create transformation pipeline to a document
- Rollback transformation steps applied to a document
- Add document Cleaners: `CleanURL`, `CleanEmail`, `CleanNumber`, `CleanPunctuation`, `CleanEOF`, `CleanMarkup`, `CleanAccents`
- Add document Tokenizers: `BasicTokenizer`, `MosesTokenizer`, `StanzaTokenizer`
- Add document Normalizers: `CaseTokens`, `RemovePunctuation`, `RemoveStopWords`, `VocabularyFilter`, `SpellCheck`, `Stemmer`
- Add Embeddings: `GensimEmbeddings`

Links

Releases

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.