Bm25s

Latest version: v0.2.5

Page 2 of 5

0.2.0rc8

This release adds the `Tokenizer` class. See the README section on tokenization and `examples/tokenizer_class.py` for details.
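The release note does not describe the class's interface, so the following is only a rough sketch of what a stateful tokenizer class typically offers over a one-shot function: a vocabulary that persists across calls, so corpus and query tokens share the same IDs. `ToyTokenizer` and its attributes are hypothetical names for illustration and are not bm25s code.

```python
import re


class ToyTokenizer:
    """Hypothetical stand-in for a stateful tokenizer; NOT the bm25s class."""

    def __init__(self, stopwords=()):
        self.stopwords = set(stopwords)
        self.word_to_id = {}  # vocabulary shared across tokenize() calls

    def tokenize(self, texts):
        """Lowercase, split on word characters, drop stopwords, map to IDs."""
        all_ids = []
        for text in texts:
            ids = []
            for word in re.findall(r"\w+", text.lower()):
                if word in self.stopwords:
                    continue
                if word not in self.word_to_id:
                    # Assign the next free integer ID to an unseen word.
                    self.word_to_id[word] = len(self.word_to_id)
                ids.append(self.word_to_id[word])
            all_ids.append(ids)
        return all_ids


tokenizer = ToyTokenizer(stopwords={"the", "a"})
corpus_ids = tokenizer.tokenize(["The cat sat", "A cat purred"])
query_ids = tokenizer.tokenize(["the cat"])  # reuses the ID assigned to "cat"
```

Because the vocabulary lives on the object, the query's "cat" resolves to the same ID as in the corpus. For the actual interface, the README and the example file named above remain the reference.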

0.2.0rc7

This is the final version of the numba improvements:

* Refactor retrieval to make it faster to run in numba mode by xhluca in https://github.com/xhluca/bm25s/pull/47


**Full Changelog**: https://github.com/xhluca/bm25s/compare/data...0.2.0rc7

data
This contains data useful for bm25s. Among other things, it contains the original zip file from the BEIR dataset.

PLEASE IGNORE THE SOURCE CODE OF THIS RELEASE. THE DATA MAY CHANGE AT ANY TIME, IT IS NOT VERSIONED.

By downloading data from this release, you acknowledge and accept the [license of the BEIR dataset](https://github.com/beir-cellar/beir/blob/main/LICENSE), as well as the terms of use of the respective dataset. The terms of use are on the original homepage of each dataset; the homepages are listed [here](https://github.com/beir-cellar/beir/wiki/Datasets-available#beers-available-datasets).

0.2.0rc6

Speeding up retrieval with numba, and new stopwords

[🔈 Discussions](https://github.com/xhluca/bm25s/discussions/46)

This is a pretty exciting pre-release! It introduces a major new feature for the upcoming v0.2.0. I hope you get to try it and share your thoughts in the discussions!

What's Changed
* Add numba integration to allow for faster scoring and retrieval by xhluca in https://github.com/xhluca/bm25s/pull/41
* Add stopwords for 10 new languages by bm777 in https://github.com/xhluca/bm25s/pull/33
* Add type hint for `texts` argument in `tokenize` function and replace `time.time()` with `time.monotonic()` by dantetemplar in https://github.com/xhluca/bm25s/pull/44
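
As background on the last item: `time.time()` follows the system wall clock, which can jump if the clock is adjusted, whereas `time.monotonic()` never goes backwards, which makes it the safer choice for measuring elapsed time. A minimal standard-library sketch (the `timed` helper is a hypothetical name, not part of bm25s):

```python
import time


def timed(func, *args, **kwargs):
    """Run func and return (result, elapsed_seconds) using a monotonic clock."""
    start = time.monotonic()
    result = func(*args, **kwargs)
    elapsed = time.monotonic() - start  # never negative, even if the wall clock changes
    return result, elapsed


result, elapsed = timed(sum, range(1_000_000))
print(f"sum={result}, elapsed={elapsed:.4f}s")
```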

New Contributors
* bm777 made their first contribution in https://github.com/xhluca/bm25s/pull/33
* dantetemplar made their first contribution in https://github.com/xhluca/bm25s/pull/44

**Full Changelog**: https://github.com/xhluca/bm25s/compare/0.1.10...0.2.0rc6

0.2.0rc5

**Full Changelog**: https://github.com/xhluca/bm25s/compare/0.2.0rc3...0.2.0rc5

0.1.10

Update

- Add bibtex to the auto-generated readme for huggingface

**Full Changelog**: https://github.com/xhluca/bm25s/compare/0.1.9...0.1.10

0.1.9

Fix

- Allow retrieve() to take a tuple as input
- Add advanced type check for retrieve
- Add link to technical report when pushing to huggingface
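
The first two fixes concern what `retrieve()` accepts as a query. The notes do not show the actual check, so the sketch below only illustrates the general idea under stated assumptions: a query may arrive either as a list of token lists or as a 2-tuple of `(ids, vocab)`, and anything else is rejected early with a clear error. `normalize_queries` is a hypothetical helper, not a bm25s function.

```python
def normalize_queries(queries):
    """Return queries as a list of lists of string tokens.

    Hypothetical helper: accepts a list of token lists, or an
    (ids, vocab) tuple where vocab maps token -> integer ID.
    """
    if isinstance(queries, tuple):
        if len(queries) != 2:
            raise ValueError("expected a 2-tuple of (ids, vocab)")
        ids, vocab = queries
        # Invert the vocabulary so integer IDs can be mapped back to tokens.
        id_to_token = {idx: token for token, idx in vocab.items()}
        return [[id_to_token[i] for i in doc] for doc in ids]

    if isinstance(queries, list) and all(
        isinstance(q, list) and all(isinstance(t, str) for t in q)
        for q in queries
    ):
        return queries

    raise TypeError(
        "queries must be a list of lists of str or an (ids, vocab) tuple"
    )


from_tuple = normalize_queries(([[0, 1]], {"cat": 0, "sat": 1}))
from_list = normalize_queries([["cat", "sat"]])
```

Both calls yield the same token lists, while a bare string raises `TypeError` instead of failing later inside scoring.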

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.