Bm25s

Latest version: v0.2.7.post1

Safety actively analyzes 699354 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 5

0.2.7post1

What's Changed
* Fix query filtering and vocabulary dict by mossbee in https://github.com/xhluca/bm25s/pull/92 (1/2)
* Fix query filtering and vocabulary dict by xhluca in https://github.com/xhluca/bm25s/pull/96 (2/2)
* Update corpus.py by Restodecoca in https://github.com/xhluca/bm25s/pull/102
* Add pypi and pepy badges by xhluca in https://github.com/xhluca/bm25s/pull/103

Notes

The behavior of tokenizers have changed wrt null token. Now, the null token will be added first to the vocab rather than at the end, as the previous approach is inconsistent with the general standard (the "" string should map to 0 in general). However, it is a backward compatible change because the tokenizers should work the same way as before, but expect the tokenizers before 0.2.7 to differ from the tokenizers in 0.2.7 and beyond in the behavior, even though both will work with the retriever object.

New Contributors
* Restodecoca made their first contribution in https://github.com/xhluca/bm25s/pull/102

**Full Changelog**: https://github.com/xhluca/bm25s/compare/0.2.6...0.2.7

0.2.7pre3

What's Changed
* Update corpus.py by Restodecoca in https://github.com/xhluca/bm25s/pull/102

New Contributors
* Restodecoca made their first contribution in https://github.com/xhluca/bm25s/pull/102

**Full Changelog**: https://github.com/xhluca/bm25s/compare/0.2.7pre2...0.2.7pre3

0.2.7pre2

**Full Changelog**: https://github.com/xhluca/bm25s/compare/0.2.7pre1...0.2.7pre2

0.2.7pre1

What's Changed
* Fix query filtering and vocabulary dict by xhluca in https://github.com/xhluca/bm25s/pull/96 and mossbee in https://github.com/xhluca/bm25s/pull/92

Notes

* The behavior of tokenizers have changed wrt null token. Now, the null token will be added first to the vocab rather than at the end, as the previous approach is inconsistent with the general standard (the "" string should map to 0 in general). However, it is a backward compatible change because the tokenizers should work the same way as before, but expect the tokenizers before 0.2.7 to differ from the tokenizers in 0.2.7 and beyond in the behavior, even though both will work with the retriever object.

**Full Changelog**: https://github.com/xhluca/bm25s/compare/0.2.6...0.2.7

0.2.6

What's Changed
* Extending to Non-ASCII characters with corpora loading and saving by IssacXid in https://github.com/xhluca/bm25s/pull/93


**Full Changelog**: https://github.com/xhluca/bm25s/compare/0.2.5...0.2.6

0.2.5

What's Changed
* Update README.md by xhluca in https://github.com/xhluca/bm25s/pull/83
* Added support for saving and loading non ASCII chars in corpus and vocab by IssacXid in https://github.com/xhluca/bm25s/pull/86
* Update README.md by mrisher in https://github.com/xhluca/bm25s/pull/87

New Contributors
* IssacXid made their first contribution in https://github.com/xhluca/bm25s/pull/86
* mrisher made their first contribution in https://github.com/xhluca/bm25s/pull/87

**Full Changelog**: https://github.com/xhluca/bm25s/compare/0.2.4...0.2.5

Page 1 of 5

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.