Sense2vec

Latest version: v2.0.2

Safety actively analyzes 681866 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 3 of 3

1.0.0a5

1.0.0a4

1.0.0a3

1.0.0a2

>
> The converted Reddit vectors (trained on all comments of 2015) are attached to this release as a `.tar.gz` file. For more details and usage instructions, see the [`README`](README.md).

---

✨ New features and improvements

* Completely rewrite package from scratch.
* Replace built-in vector storage with spaCy's [`Vectors`](https://spacy.io/api/vectors), making this package a pure Python package and allowing easy out-of-the-box serialization of vectors.
* Add fully serializable spaCy pipeline component and extension attributes.
* Add new methods `get_best_sense` and `get_other_senses` and improve `most_similar`.
* Add annotation recipes for [Prodigy](https://prodi.gy) to easily create word lists and match patterns from similar phrases using sense2vec vectors (like the `terms.teach` recipe, just with multi-word expressions).
* New and more efficient [training and preprocessing scripts](scripts) using GloVe.

⚠️ Backwards incompatibilities

* The `sense2vec.load` method has been removed. Use `Sense2Vec.from_disk` instead.
* The previous `VectorMap` and `VectorStorage` have been removed.
* This package now requires Python 3.6+.
* This update requires a new vectors format (see attached `.tar.gz`).

📖 Documentation and examples

* Rewrite [`README`](README.md) from scratch and include full API docs.

👥 Contributors

Thanks to kabirkhan for contributing the Prodigy recipes!

1.0.0a1

>
> Note that the library doesn't depend on spaCy anymore, so you might have to [install spaCy](https://spacy.io/usage) and the English model separately. The Reddit vectors (trained on all comments of 2015) are attached to this release as a `.tar.gz` file. For more details and usage instructions, see the [`README`](README.md).

---

✨ New features and improvements

* **NEW:** Remove spaCy dependency and allow standalone use of the `sense2vec` library.
* **NEW:** Include spaCy v2.x [pipeline component](https://spacy.io/usage/processing-pipelines#custom-components) to add sense2vec-compatible token merging and token attributes and methods.
* Attach `reddit_vectors` model to release and make it easier to download and load in models.

📖 Documentation and examples

* Rewrite [`README`](README.md) from scratch and include full API docs.

🚧 Todo

- [ ] Replace `VectorMap` implementation with spaCy's `Vectors` class.
- [ ] Don't merge tokens at runtime and adjust extension attributes accordingly.
- [ ] Update training and pre-processing scripts for spaCy v2.x.
- [ ] Retrain vectors on more data.

1.0.0a0

>
> Note that the library doesn't depend on spaCy anymore, so you might have to [install spaCy](https://spacy.io/usage) and the English model separately. The Reddit vectors (trained on all comments of 2015) are attached to this release as a `.tar.gz` file. For more details and usage instructions, see the [`README`](https://github.com/explosion/sense2vec/tree/develop/README.rst).

---

✨ New features and improvements

* **NEW:** Remove spaCy dependency and allow standalone use of the `sense2vec` library.
* **NEW:** Include spaCy v2.x [pipeline component](https://spacy.io/usage/processing-pipelines#custom-components) to add sense2vec-compatible token merging and token attributes and methods.
* Attach `reddit_vectors` model to release and make it easier to download and load in models.

📖 Documentation and examples

* Rewrite [`README`](https://github.com/explosion/sense2vec/tree/develop/README.rst) from scratch and include full API docs.

🚧 Todo

- [ ] Update training and pre-processing scripts for spaCy v2.x.

Page 3 of 3

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.