Rubrix

Latest version: v0.19.0

Safety actively analyzes 681857 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 5

1.0.0

0.19.0

0.18.0

[0.18.0](https://github.com/recognai/rubrix/compare/v0.17.0...v0.18.0) (2022-10-05)

⚡ Highlights

Better validation of token classification records

When working with Token Classification records, there are very often misalignment problems between the entity spans and provided tokens.
Before this release, it was difficult to understand and fix these errors because validation happened on the server side.

With this release, records are validated during instantiation, giving you a clear error message which can help you to fix/ignore problematic records.

For example, the following record:
python
import rubrix as rb

rb.TokenClassificationRecord(
tokens=["I", "love", "Paris"],
text="I love Paris!",
prediction=[("LOC",7,13)]
)

Will give you the following error message:

python
ValueError: Following entity spans are not aligned with provided tokenization
Spans:
- [Paris!] defined in ...love Paris!
Tokens:
['I', 'love', 'Paris']


Delete records by query

Now it's possible to delete specific records, either by ids or by a query using Lucene's syntax. This is useful for clean up and better dataset maintenance:

python
import rubrix as rb

Delete by id
rb.delete_records(name="example-dataset", ids=[1,3,5])

Discard records by query
rb.delete_records(name="example-dataset", query="metadata.code=33", discard_only=True)


New tutorials

We have two new tutorials!

Few-shot classification with SetFit and a custom dataset: https://rubrix.readthedocs.io/en/stable/tutorials/few-shot-classification-with-setfit.html

Analyzing predictions with model explainability methods: https://rubrix.readthedocs.io/en/stable/tutorials/nlp_model_explainability.html
https://rubrix.readthedocs.io/en/stable/tutorials/few-shot-classification-with-setfit.html


Features

* **API:** provide a dict for record annotations/predictions ([1658](https://github.com/recognai/rubrix/issues/1658)) ([12b0f83](https://github.com/recognai/rubrix/commit/12b0f83ba584231de6bd2c2f775b8dcaf7b88dcc))
* **Client:** expose client extra headers in init function ([1715](https://github.com/recognai/rubrix/issues/1715)) ([79f0529](https://github.com/recognai/rubrix/commit/79f05298c408e6f0861408e2631423bff0860f01)), closes [#1706](https://github.com/recognai/rubrix/issues/1706)
* **Client:** improve httpx errors handling ([1662](https://github.com/recognai/rubrix/issues/1662)) ([85da336](https://github.com/recognai/rubrix/commit/85da336925f39a84df9577c72ce0ae7d508ca50f))
* **Client:** validate token classification annotations in client ([1709](https://github.com/recognai/rubrix/issues/1709)) ([936d1ca](https://github.com/recognai/rubrix/commit/936d1ca3e39d7df5516fc805f9501526e8a5f999)), closes [#1579](https://github.com/recognai/rubrix/issues/1579)
* **Datasets:** delete records by query ([1721](https://github.com/recognai/rubrix/issues/1721)) ([bc9685d](https://github.com/recognai/rubrix/commit/bc9685dbba092c8f29960fc79c2099febbe6b782)), closes [#1714](https://github.com/recognai/rubrix/issues/1714) [#1737](https://github.com/recognai/rubrix/issues/1737)
* **Datasets:** restrict dataset deletion only to creators and super-users ([1713](https://github.com/recognai/rubrix/issues/1713)) ([c1bef9d](https://github.com/recognai/rubrix/commit/c1bef9d0726790e9c981aa906a7c5ba8a24d6521)), closes [#1740](https://github.com/recognai/rubrix/issues/1740)
* **Server:** Add server telemetry ([1687](https://github.com/recognai/rubrix/issues/1687)) ([d7cc006](https://github.com/recognai/rubrix/commit/d7cc0064a6896c8bb4accc3a51d51fbab0cb3c77))


Bug Fixes

* 'MajorityVoter.score' when using multi-labels ([1678](https://github.com/recognai/rubrix/issues/1678)) ([0b94c86](https://github.com/recognai/rubrix/commit/0b94c868131f0fc0c366a28150f06766f8686fcd)), closes [#1628](https://github.com/recognai/rubrix/issues/1628)
* **Metadata limits:** exclude subfields from mappings ([1700](https://github.com/recognai/rubrix/issues/1700)) ([9f9650e](https://github.com/recognai/rubrix/commit/9f9650eb80a11281c0ea73606c39d6b066697f22)), closes [#1699](https://github.com/recognai/rubrix/issues/1699)
* Normalizes the UnauthorizationError for the API response ([1748](https://github.com/recognai/rubrix/issues/1748)) ([6a68048](https://github.com/recognai/rubrix/commit/6a68048b3742c17ef2f02c9b447f9266f9ef1428))
* Search tag reset prior annotation ([1736](https://github.com/recognai/rubrix/issues/1736)) ([dc0a17f](https://github.com/recognai/rubrix/commit/dc0a17fa9da80c5342cc283e841ba997aa36c9a9)), closes [#1711](https://github.com/recognai/rubrix/issues/1711)


Visual enhancements

* Align App UI with the design system ([1672](https://github.com/recognai/rubrix/issues/1672)) ([67d6de8](https://github.com/recognai/rubrix/commit/67d6de818ba1b092ec632fef74c33730df11e597)), closes [#1670](https://github.com/recognai/rubrix/issues/1670)


Documentation

* Add interpret tutorial with Transformers ([1728](https://github.com/recognai/rubrix/issues/1728)) ([c3fa079](https://github.com/recognai/rubrix/commit/c3fa079a94fe1b1515dd609e38facbc185cff38c)), closes [#1729](https://github.com/recognai/rubrix/issues/1729)
* Adds tutorial about custom few-shot classification with SetFit ([1739](https://github.com/recognai/rubrix/issues/1739)) ([4f15ee6](https://github.com/recognai/rubrix/commit/4f15ee656e199bf8ad3093939617765df23f6fdc)), closes [#1741](https://github.com/recognai/rubrix/issues/1741)
* fixing the active learning tutorial with `small-text` ([1726](https://github.com/recognai/rubrix/issues/1726)) ([909efdf](https://github.com/recognai/rubrix/commit/909efdfd07086dc1387d9957fcbb4489dbd4ae51)), closes [#1693](https://github.com/recognai/rubrix/issues/1693)
* raise small-text version to 1.1.0 and adapt tutorial ([1744](https://github.com/recognai/rubrix/issues/1744)) ([16f19b7](https://github.com/recognai/rubrix/commit/16f19b7dbb104eba492aea3cf17b15a3abefbeaa)), closes [#1693](https://github.com/recognai/rubrix/issues/1693)
* Resolve many typos in documentation, comments and tutorials ([1701](https://github.com/recognai/rubrix/issues/1701)) ([f05e1c1](https://github.com/recognai/rubrix/commit/f05e1c1059bee8bee7fc9abf03c4b196e890d85f))
* using official token class. mapper since is compatible now ([1738](https://github.com/recognai/rubrix/issues/1738)) ([e82fd13](https://github.com/recognai/rubrix/commit/e82fd139348f8a6b94a02cf195eeb33e9987281c)), closes [#482](https://github.com/recognai/rubrix/issues/482)


As always, thanks to our amazing contributors!
- refactor: accept flat text as input for token classification mapper (1686) by Ankush-Chander
- feat(Client): improve httpx errors handling (1662) by Ankush-Chander
- fix: 'MajorityVoter.score' when using multi-labels (1678) by dcfidalgo
- docs: raise small-text version to 1.1.0 and adapt tutorial (1744) by chschroeder
- refactor: Incompatible attribute type fixed (1675) by luca-digrazia
- docs: Resolve many typos in documentation, comments and tutorials (1701) by tomaarsen
- refactor: Collection of changes, primarily regarding test suite and its coverage (1702) by tomaarsen

0.17.0

[0.17.0](https://github.com/recognai/rubrix/compare/v0.16.1...v0.17.0) (2022-08-22)

⚡ Highlights

Preparing a training set in the spaCy DocBin format
`prepare_for_training` is a method that prepares a dataset for training. Before `prepare_for_training` prepared the data for easily training Hugginface Transformers.

Now, you can prepare your training data for `spaCy` NER pipelines, thanks to our great community contributor ignacioct !

With the example below, you can export your Rubrix dataset into a Docbin, save it to disk, and then use it with the spacy train [command](https://spacy.io/api/cli#train).

python
import spacy
import rubrix as rb

from datasets import load_dataset

Load annotated dataset from Rubrix
rb_dataset = rb.load("ner_dataset")

Loading an spaCy blank language model to create the Docbin, as it works faster
nlp = spacy.blank("en")

After this line, the file will be stored in disk
rb_dataset.prepare_for_training(framework="spacy", lang=nlp).to_disk("train.spacy")


You can find a full example at: https://rubrix.readthedocs.io/en/v0.17.0/guides/cookbook.html#Train-a-spaCy-model-by-exporting-to-Docbin


Load large datasets using batches
Before this release, the `rb.load` method to read datasets from Python retrieved the full dataset. For large datasets, this could cause high memory consumption, network timeouts, and the inability to read datasets larger than the available memory.

Thanks to the awesome work by maxserras. Now it's possible to optimize memory consumption and avoid network timeouts when working with large datasets. To that end, a simple batch-iteration over the whole database can be done employing the `from_id` parameter in the `rb.load` method.

An example of reading the first 1000 records and the next batch of up to 1000 records:

python
import rubrix as rb
dataset_batch_1 = rb.load(name="example-dataset", limit=1000)
dataset_batch_2 = rb.load(name="example-dataset", limit=1000, id_from=dataset_batch_1[-1].id)


The reference to the `rb.load` method can be found at: https://rubrix.readthedocs.io/en/v0.17.0/reference/python/python_client.html#rubrix.load

Larger pagination sizes for faster bulk review and annotation
Using filters and search for data annotation and review, some users are able to filter and quickly review dozens of records in one go. To serve those users, it's now possible to see and bulk annotate 50 and 100 records in each page.

<img width="1679" alt="Screenshot 2022-08-25 at 10 33 58" src="https://user-images.githubusercontent.com/1107111/186616549-09280798-8a5e-4153-9680-febafd48d65c.png">

Copy record text to clipboard
Sometimes is useful to copy the text in records to use inspect it or process it with another application. Now, this is possible thanks to the feature request by our great community member and contributor Ankush-Chander !

<img width="1028" alt="Screenshot 2022-08-25 at 10 38 19" src="https://user-images.githubusercontent.com/1107111/186617710-f91a9be5-ad0b-4ee9-bbc9-449395a5a260.png">

Better error logging for generic errors
Thanks to work done by Ankush-Chander and frascuchon we now have more meaningful messages for generic server errors!

Features

* Add new pagination size ranges ([1667](https://github.com/recognai/rubrix/issues/1667)) ([5b4f1f2](https://github.com/recognai/rubrix/commit/5b4f1f290d8a1307186059be3df330a5136a9b69)), closes [#1578](https://github.com/recognai/rubrix/issues/1578)
* Allow `rb.load` fetch records in batches passing the `from_id` argument ([3e6344a](https://github.com/recognai/rubrix/commit/3e6344a7b2a6adce4a9289753cfb268f7bb9a0a8))
* Copy to clipboard the record text ([1625](https://github.com/recognai/rubrix/issues/1625)) ([d634a7b](https://github.com/recognai/rubrix/commit/d634a7bfc1bd972b3fb1e82de0a1a0fda4eb7e3b)), closes [#1616](https://github.com/recognai/rubrix/issues/1616)
* **Error Logging:** send error detail in response for generic server errors ([1648](https://github.com/recognai/rubrix/issues/1648)) ([ad17631](https://github.com/recognai/rubrix/commit/ad17631107a108010ad27895e0b47246205d0378))
* **Listeners:** allow using query params in the condition through search parameter ([1627](https://github.com/recognai/rubrix/issues/1627)) ([a0a245d](https://github.com/recognai/rubrix/commit/a0a245d936c139cd7a229c17246cd3f4ed2233e6)), closes [#1622](https://github.com/recognai/rubrix/issues/1622)
* `prepare_for_training` supports spacy ([1635](https://github.com/recognai/rubrix/issues/1635)) ([8587808](https://github.com/recognai/rubrix/commit/85878087144723695568ebde0d04e16fc9de3c1c))


Bug Fixes

* **Client:** reusing the inner `httpx` client ([1640](https://github.com/recognai/rubrix/issues/1640)) ([854a972](https://github.com/recognai/rubrix/commit/854a972e3ad0e3beac9bdc62d79fdf8d53e15f84)), closes [#1646](https://github.com/recognai/rubrix/issues/1646)
* **docker-compose.yaml:** default volume and disable disk threshold ([1656](https://github.com/recognai/rubrix/issues/1656)) ([05ae688](https://github.com/recognai/rubrix/commit/05ae6883e681a773cef280fc6cc1c3d115337924)), closes [#1275](https://github.com/recognai/rubrix/issues/1275)
* Encode rule name in Weak Labeling API requests ([1649](https://github.com/recognai/rubrix/issues/1649)) ([4634df8](https://github.com/recognai/rubrix/commit/4634df8a74b06020f0ab6cf386ad3bbd419b7cbd)), closes [#1645](https://github.com/recognai/rubrix/issues/1645)
* handle stream api connection errors gracefully ([1636](https://github.com/recognai/rubrix/issues/1636)) ([a106ec4](https://github.com/recognai/rubrix/commit/a106ec4541ae9c90e61c835173c11876d249e775)), closes [#1559](https://github.com/recognai/rubrix/issues/1559)
* Update progress bar when refreshing after adding new records ([1666](https://github.com/recognai/rubrix/issues/1666)) ([7e0d915](https://github.com/recognai/rubrix/commit/7e0d9156d614787996b6a5212bd4715132616a12)), closes [#1590](https://github.com/recognai/rubrix/issues/1590)


Documentation

* Add Slack support link in README's get started ([1688](https://github.com/recognai/rubrix/issues/1688)) ([bef010c](https://github.com/recognai/rubrix/commit/bef010cd6fb1c772c21a04c205cde64a7991bd9a))
* Adding Elasticsearch persistence to docker compose section ([1643](https://github.com/recognai/rubrix/issues/1643)) ([ecdc854](https://github.com/recognai/rubrix/commit/ecdc854a5ee5714a85ef3f87202eb102954667d8))
* spacy `DocBin` cookbook ([1642](https://github.com/recognai/rubrix/issues/1642)) ([bb98278](https://github.com/recognai/rubrix/commit/bb9827845053ff47e3575103a1f51097ace9f396)), closes [#420](https://github.com/recognai/rubrix/issues/420)


Visual enhancements

* Small visual adjustments for Text2Text record card ([1632](https://github.com/recognai/rubrix/issues/1632)) ([9c87cf1](https://github.com/recognai/rubrix/commit/9c87cf191093f1779f894f5fb55b2e1febe27123)), closes [#1138](https://github.com/recognai/rubrix/issues/1138)
* Improve card spacing ([1638](https://github.com/recognai/rubrix/issues/1638)) ([fd4016a](https://github.com/recognai/rubrix/commit/fd4016a6a654db54999ff91a3974d4616722c634)), closes [#1624](https://github.com/recognai/rubrix/issues/1624)

You can see all work included in the release here
- fix: Update progress bar when refreshing after adding new records (1666) by leiyre
- chore: configure miniconda for readthedocs builder by frascuchon
- style: Small visual adjustments for Text2Text record card (1632) by leiyre
- feat: Copy to clipboard the record text (1625) by leiyre
- docs: Add Slack support link in README's get started (1688) by dvsrepo
- chore: update version by frascuchon
- feat: Add new pagination size ranges (1667) by leiyre
- fix: handle stream api connection errors gracefully (1636) by Ankush-Chander
- feat: allow `rb.load` fetch records in batches passing the `from_id` argument by maxserras
- fix(Client): reusing the inner `httpx` client (1640) by frascuchon
- feat(Error Logging): send error detail in response for generic server errors (1648) by frascuchon
- docs: spacy `DocBin` cookbook (1642) by ignacioct
- feat: prepare_for_training supports spacy (1635) by frascuchon
- style: Improve card spacing (1638) by leiyre
- docs: Adding Elasticsearch persistence to docker compose section (1643) by maxserras
- chore: remove old rubrix client class (1639) by frascuchon
- feat(Listeners): allow using query params in the condition through search parameter (1627) by frascuchon
- doc: show metric graphs in documentation (1669) by leiyre
- fix(docker-compose.yaml): default volume and disable disk threshold (1656) by frascuchon
- fix: Encode rule name in Weak Labeling API requests (1649) by leiyre

0.16.1

[0.16.1](https://github.com/recognai/rubrix/compare/v0.16.0...v0.16.1) (2022-07-22)


Bug Fixes

* 'WeakMultiLabels.summary' and 'show_records' after extending the weak label matrix ([1633](https://github.com/recognai/rubrix/issues/1633)) ([3cb4c07](https://github.com/recognai/rubrix/commit/3cb4c073456ab71b495459ec4a0d64875c0cd134)), closes [#1631](https://github.com/recognai/rubrix/issues/1631)
* Display metadata in Text2Text dataset ([1626](https://github.com/recognai/rubrix/issues/1626)) ([0089e0a](https://github.com/recognai/rubrix/commit/0089e0a674d05813cc7889b0530ffedae75a1ad5)), closes [#1623](https://github.com/recognai/rubrix/issues/1623)
* Show predicted OK/KO when predictions exist ([1620](https://github.com/recognai/rubrix/issues/1620)) ([ef66e9c](https://github.com/recognai/rubrix/commit/ef66e9ca78416ee3a33deda05234b9392302f635)), closes [#1619](https://github.com/recognai/rubrix/issues/1619)


Documentation

* Fix typo in Getting Started -> Concepts ([1618](https://github.com/recognai/rubrix/issues/1618)) ([b236cb8](https://github.com/recognai/rubrix/commit/b236cb887c7c3d99fbbc24a2495b96f66747816e)), closes [#1617](https://github.com/recognai/rubrix/issues/1617)

You can see all work included in the release here
- fix: 'WeakMultiLabels.summary' and 'show_records' after extending the weak label matrix (1633) by dcfidalgo
- fix: Display metadata in Text2Text dataset (1626) by leiyre
- chore: set version by dcfidalgo
- docs: Fix typo in Getting Started -> Concepts (1618) by dcfidalgo
- fix: Show predicted OK/KO when predictions exist (1620) by leiyre

0.16.0

[0.16.0](https://github.com/recognai/rubrix/compare/v0.15.0...v0.16.0) (2022-07-08)

Highlights

👂 Listeners: enable more interactive workflows between client and server

Listeners enable you to define functions that get executed under certain conditions when something changes in a dataset. There are many use cases for this: monitoring annotation jobs, monitoring model predictions, enabling active learning workflows, and many more.

You can find the Python API reference docs here: https://rubrix.readthedocs.io/en/stable/reference/python/python_listeners.html#python-listeners

We will be documenting these use cases with practical examples, but for this release, we've included a new tutorial for using this with active learning: https://rubrix.readthedocs.io/en/stable/tutorials/active_learning_with_small_text.html. This tutorial includes the following listener function, which implements the active learning loop:

python
from rubrix.listeners import listener
from sklearn.metrics import accuracy_score

Define some helper variables
LABEL2INT = trec["train"].features["label-coarse"].str2int
ACCURACIES = []

Set up the active learning loop with the listener decorator
listener(
dataset=DATASET_NAME,
query="status:Validated AND metadata.batch_id:{batch_id}",
condition=lambda search: search.total==NUM_SAMPLES,
execution_interval_in_seconds=3,
batch_id=0
)
def active_learning_loop(records, ctx):

1. Update active learner
print(f"Updating with batch_id {ctx.query_params['batch_id']} ...")
y = np.array([LABEL2INT(rec.annotation) for rec in records])

initial update
if ctx.query_params["batch_id"] == 0:
indices = np.array([rec.id for rec in records])
active_learner.initialize_data(indices, y)
update with the prior queried indices
else:
active_learner.update(y)
print("Done!")

2. Query active learner
print("Querying new data points ...")
queried_indices = active_learner.query(num_samples=NUM_SAMPLES)
ctx.query_params["batch_id"] += 1
new_records = [
rb.TextClassificationRecord(
text=trec["train"]["text"][idx],
metadata={"batch_id": ctx.query_params["batch_id"]},
id=idx,
)
for idx in queried_indices
]

3. Log the batch to Rubrix
rb.log(new_records, DATASET_NAME)

4. Evaluate current classifier on the test set
print("Evaluating current classifier ...")
accuracy = accuracy_score(
dataset_test.y,
active_learner.classifier.predict(dataset_test),
)
ACCURACIES.append(accuracy)
print("Done!")

print("Waiting for annotations ...")



📖 New docs!

https://rubrix.readthedocs.io/

<img width="1643" alt="Screenshot 2022-07-13 at 12 49 42" src="https://user-images.githubusercontent.com/1107111/178716820-f675ec48-486f-4763-bd48-60e5e7d773da.png">

🧱 `extend_matrix`: Weak label augmentation using embeddings

This release includes an exciting feature to augment the coverage of your weak labels using embeddings. You can find a practical tutorial here: https://rubrix.readthedocs.io/en/stable/tutorials/extend_weak_labels_with_embeddings.html

Features

* **1561:** standardize icons ([1565](https://github.com/recognai/rubrix/issues/1565)) ([15254e7](https://github.com/recognai/rubrix/commit/15254e73c1cec4b9f3f2bb940a89e72d66da78e5)), closes [#1561](https://github.com/recognai/rubrix/issues/1561)
* **1602:** new rubrix dataset listeners ([1507](https://github.com/recognai/rubrix/issues/1507), [#1586](https://github.com/recognai/rubrix/issues/1586), [#1583](https://github.com/recognai/rubrix/issues/1583), [#1596](https://github.com/recognai/rubrix/issues/1596)) ([65747ab](https://github.com/recognai/rubrix/commit/65747abcde1283356465cfc9836bd600ff354535)), closes [#1602](https://github.com/recognai/rubrix/issues/1602)
* Add 'extend_matrix' to the WeakMultiLabel class ([1577](https://github.com/recognai/rubrix/issues/1577)) ([cf89311](https://github.com/recognai/rubrix/commit/cf89311473c5446b7e01baf9429e0b673e3cf5a1))
* Improve from datasets ([1567](https://github.com/recognai/rubrix/issues/1567)) ([2b0d607](https://github.com/recognai/rubrix/commit/2b0d6075ec3f4eb2cf2783583dd21d4f4a0d5c4f))
* **token-class:** adjust token spans spaces ([1599](https://github.com/recognai/rubrix/issues/1599)) ([0fb3576](https://github.com/recognai/rubrix/commit/0fb3576e6ade30cc7dbbb9d6af947fa3f85ea4c0))


Bug Fixes

* **1264:** discard first space after a token ([1591](https://github.com/recognai/rubrix/issues/1591)) ([eff0ac5](https://github.com/recognai/rubrix/commit/eff0ac5b0e2f7198e695ede905737497bba451cf)), closes [#1264](https://github.com/recognai/rubrix/issues/1264)
* **1545:** highlight words with accents ([1550](https://github.com/recognai/rubrix/issues/1550)) ([c42e77b](https://github.com/recognai/rubrix/commit/c42e77be021e57ba6b15074f457e99d4d06f0a33)), closes [#1545](https://github.com/recognai/rubrix/issues/1545)
* **1548:** access datasets for superusers when workspace is not provided ([1572](https://github.com/recognai/rubrix/issues/1572), [#1608](https://github.com/recognai/rubrix/issues/1608)) ([0b04bc8](https://github.com/recognai/rubrix/commit/0b04bc8920b78e346cb6fef8fa650fc485e54819)), closes [#1548](https://github.com/recognai/rubrix/issues/1548)
* **1551:** don't show error traces for EntityNotFoundError's ([1569](https://github.com/recognai/rubrix/issues/1569)) ([04e101c](https://github.com/recognai/rubrix/commit/04e101c36e00c87d32359ca0df7c92b2cf9ed55c)), closes [#1551](https://github.com/recognai/rubrix/issues/1551)
* **1557:** allow text editing when clicking the "edit" button ([1558](https://github.com/recognai/rubrix/issues/1558)) ([e751414](https://github.com/recognai/rubrix/commit/e7514149be3632062dd755e79c099a0f091d70df)), closes [#1557](https://github.com/recognai/rubrix/issues/1557)
* **1574:** search highlighting for a single dot ([1592](https://github.com/recognai/rubrix/issues/1592)) ([53474a1](https://github.com/recognai/rubrix/commit/53474a1db9fd9a92d263988169833af0507f6ffe)), closes [#1574](https://github.com/recognai/rubrix/issues/1574)
* **1575:** show predicted ok/ko in Text Classifier explore mode ([1576](https://github.com/recognai/rubrix/issues/1576)) ([ada87c0](https://github.com/recognai/rubrix/commit/ada87c07d0a603fff56f61ff1c321434ce028791)), closes [#1575](https://github.com/recognai/rubrix/issues/1575)
* compatibility with new dataset version ([1566](https://github.com/recognai/rubrix/issues/1566)) ([ac26e30](https://github.com/recognai/rubrix/commit/ac26e301a636d193ed5036dfa31370c29e2f1462))


Documentation

* **1512:** change theme to furo ([1564](https://github.com/recognai/rubrix/issues/1564), [#1604](https://github.com/recognai/rubrix/issues/1604)) ([98869d2](https://github.com/recognai/rubrix/commit/98869d20efcff27c0c884fe76f5f32cc2a1bfe35)), closes [#1512](https://github.com/recognai/rubrix/issues/1512)
* add 'how to prepare your data for training' to basics ([1589](https://github.com/recognai/rubrix/issues/1589)) ([a21bcf3](https://github.com/recognai/rubrix/commit/a21bcf3e1a89e74e3ce4db0f66a7854aa4a41e7c))
* add active learning with small text and listener tutorial ([1585](https://github.com/recognai/rubrix/issues/1585), [#1609](https://github.com/recognai/rubrix/issues/1609)) ([d59573f](https://github.com/recognai/rubrix/commit/d59573fefa46be55159b4f08fdfa92ee75b76973)), closes [#1601](https://github.com/recognai/rubrix/issues/1601) [#421](https://github.com/recognai/rubrix/issues/421)
* Add MajorityVoter to references + Add comments about multi-label support of the label models ([1582](https://github.com/recognai/rubrix/issues/1582)) ([ab481c7](https://github.com/recognai/rubrix/commit/ab481c77551e00d5f11bec51f48f1d1d1adda6a0))
* add pip version and dockertag as parameter in the build process ([1560](https://github.com/recognai/rubrix/issues/1560)) ([73a31e2](https://github.com/recognai/rubrix/commit/73a31e26d50883bc7ece90f287e64295ba0c17ee))

You can see all work included in the release here
- chore(docs): remove by frascuchon
- docs: add active learning with small text and listener tutorial (1585, 1609) by dcfidalgo
- docs(1512): change theme to furo (1564, 1604) by frascuchon
- chore: set version by frascuchon
- feat(token-class): adjust token spans spaces (1599) by frascuchon
- feat(1602): new rubrix dataset listeners (1507, 1586, 1583, 1596) by frascuchon
- docs: add 'how to prepare your data for training' to basics (1589) by dcfidalgo
- test: configure numpy to disable multi threading (1593) by frascuchon
- docs: Add MajorityVoter to references + Add comments about multi-label support of the label models (1582) by dcfidalgo
- feat(1561): standardize icons (1565) by leiyre
- Feat: Improve from datasets (1567) by dcfidalgo
- feat: Add 'extend_matrix' to the WeakMultiLabel class (1577) by dcfidalgo
- docs: add pip version and dockertag as parameter in the build process (1560) by frascuchon
- refactor: remove `words` references in searches (1571) by frascuchon
- ci: check conda env cache (1570) by frascuchon
- fix(1264): discard first space after a token (1591) by frascuchon
- ci(package): regenerate view snapshot (1600) by frascuchon
- fix(1574): search highlighting for a single dot (1592) by leiyre
- fix(1575): show predicted ok/ko in Text Classifier explore mode (1576) by leiyre
- fix(1548): access datasets for superusers when workspace is not provided (1572, 1608) by frascuchon
- fix(1551): don't show error traces for EntityNotFoundError's (1569) by frascuchon
- fix: compatibility with new dataset version (1566) by dcfidalgo
- fix(1557): allow text editing when clicking the "edit" button (1558) by leiyre
- fix(1545): highlight words with accents (1550) by leiyre

Page 1 of 5

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.