[0.11.0](https://github.com/recognai/rubrix/compare/v0.10.0...v0.11.0) (2022-02-19)
Highlights
Introducing `rb.Dataset*` and 🤗 Hub integration
The Dataset classes are lightweight containers for Rubrix records. These classes facilitate importing from and exporting to different formats (e.g., `pandas.DataFrame`, `datasets.Dataset`) as well as sharing and versioning Rubrix datasets using the Hugging Face Hub.
With this release, Rubrix users and teams can use the Hugging Face Hub to share and read both public and private Rubrix datasets for the TextClassification, TokenClassification, and Text2Text tasks. This opens up new possibilities for data reproducibility and sharing. Let's see an example:
```python
import rubrix as rb
from datasets import load_dataset

# Leire has labeled a text classification dataset using a local Rubrix instance
dataset_rb = rb.load("text_classification_ds", as_pandas=False)

# Leire exports the Rubrix Dataset to a Hugging Face Dataset
dataset_ds = dataset_rb.to_datasets()

# Leire shares the labeled dataset with the world
dataset_ds.push_to_hub("text_classification_ds")

# John downloads the dataset from the Hugging Face Hub
dataset_ds = load_dataset("leire/text_classification_ds", split="train")

# John reads the Hugging Face Dataset into a Rubrix Dataset
dataset_rb = rb.read_datasets(dataset_ds, task="TextClassification")

# John logs the dataset and continues labeling with his own Rubrix instance
rb.log(dataset_rb, "john_text_classification_ds")
```
You can read more at https://rubrix.readthedocs.io/en/stable/guides/datasets.html
For each record type, there's a corresponding Dataset class called `DatasetFor<RecordType>`. You can look up their API in the [reference section](https://rubrix.readthedocs.io/en/stable/reference/python/python_client.html#module-rubrix.client.datasets).
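To make the `DatasetFor<RecordType>` naming pattern concrete, here is a minimal sketch that builds the expected class name for each of the three tasks named in these notes. The helper `dataset_class_name` and the `TASKS` tuple are hypothetical names used only for illustration; they are not part of the Rubrix API, and the sketch does not import Rubrix.

```python
# Task names as listed in the release notes above.
TASKS = ("TextClassification", "TokenClassification", "Text2Text")


def dataset_class_name(task: str) -> str:
    """Return the class name following the DatasetFor<RecordType> pattern.

    Hypothetical helper for illustration only; not part of Rubrix.
    """
    if task not in TASKS:
        raise ValueError(f"Unknown task {task!r}; expected one of {TASKS}")
    return f"DatasetFor{task}"


for task in TASKS:
    print(f"{task} -> {dataset_class_name(task)}")
```

Assuming the classes are exposed on the top-level `rubrix` module, the returned name could then be resolved with something like `getattr(rb, dataset_class_name("Text2Text"))`; check the reference section linked above before relying on that.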
Improving NER UI and UX
The UI for Token Classification has been completely redesigned to provide a better user experience for exploration and annotation. This is the first of a set of changes focusing on annotation productivity for token classification.
<img width="1564" alt="Screenshot 2022-02-21 at 12 39 22" src="https://user-images.githubusercontent.com/1107111/154948266-e458f604-1a8b-4e18-8f70-48f5f1c6b7b4.png">
Features
* **1051:** keep prediction labels when annotating ([1077](https://github.com/recognai/rubrix/issues/1077)) ([f1824ba](https://github.com/recognai/rubrix/commit/f1824ba2c759d6eb310de10c3791dde740e68118)), closes [#1051](https://github.com/recognai/rubrix/issues/1051)
* **1063:** Token Classifier fine tuning content selection ([1084](https://github.com/recognai/rubrix/issues/1084)) ([9e14d05](https://github.com/recognai/rubrix/commit/9e14d0566f033e1e12fba4f1f2fc1d194ee108f8)), closes [#1063](https://github.com/recognai/rubrix/issues/1063)
* **1127:** raise startup app error from es connection error ([1145](https://github.com/recognai/rubrix/issues/1145)) ([7e7e9d8](https://github.com/recognai/rubrix/commit/7e7e9d83ec73708d0f6bf13efd2933e0fb7ecdf7)), closes [#1127](https://github.com/recognai/rubrix/issues/1127)
* **422:** introducing the rb.Dataset* classes ([1109](https://github.com/recognai/rubrix/issues/1109)) ([b5bbca6](https://github.com/recognai/rubrix/commit/b5bbca62f9d8a833088c009ff1539d2843b30859)), closes [#422](https://github.com/recognai/rubrix/issues/422)
* **821:** token classifier show predictions in explore view ([1009](https://github.com/recognai/rubrix/issues/1009)) ([6ba6764](https://github.com/recognai/rubrix/commit/6ba6764c8ceef09037f6c76bc01f84f4660e9b36)), closes [#821](https://github.com/recognai/rubrix/issues/821)
* **951:** new *"not covered records by rules"* filter ([991](https://github.com/recognai/rubrix/issues/991)) ([0649f2a](https://github.com/recognai/rubrix/commit/0649f2ab30430e0f2486e2e4fd0f0c2096bd94cc)), closes [#951](https://github.com/recognai/rubrix/issues/951) [#1156](https://github.com/recognai/rubrix/issues/1156)
Bug Fixes
* **1140:** fix/make client models more consistent ([1147](https://github.com/recognai/rubrix/issues/1147)) ([926bb16](https://github.com/recognai/rubrix/commit/926bb1605c14cf4afbe6431f5914303b0cb01328)), closes [#1140](https://github.com/recognai/rubrix/issues/1140)
* **client:** parse unauthorized api error properly ([1164](https://github.com/recognai/rubrix/issues/1164)) ([1a5a08d](https://github.com/recognai/rubrix/commit/1a5a08d8cce98efa6c9d5ea2fc7ccc4b71494684))
* **search:** prevent metrics computation from breaking searches ([1175](https://github.com/recognai/rubrix/issues/1175)) ([9f2adc9](https://github.com/recognai/rubrix/commit/9f2adc97d9958798be52cce5c2dd3e6af264a221))