Argilla

Latest version: v2.8.0

Safety actively analyzes 723650 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 9 of 22

1.7.0

🔆 Highlights

OpenAI fine-tuning support
Use your data in Argilla to fine-tune OpenAI models. You can do this by getting your data in the specific format through the `prepare_for_training` method or train directly using `ArgillaTrainer`.

Argilla Trainer improvements
We’ve added CLI support for Argilla Trainer and two new frameworks for training: `OpenAI` & `SpanMarker`.

Logging and loading enhancements
We’ve improved the speed and robustness of `rg.log` and `rg.load` methods.

`typer` CLI
A more user-friendly command line interface with `typer` that includes argument suggestions and colorful messages.

[Changelog 1.7.0](https://github.com/argilla-io/argilla/compare/v1.6.0...v1.7.0)

Added

- add `max_retries` and `num_threads` parameters to `rg.log` to run data logging request concurrently with backoff retry policy. See [2458](https://github.com/argilla-io/argilla/issues/2458) and [#2533](https://github.com/argilla-io/argilla/issues/2533)
- `rg.load` accepts `include_vectors` and `include_metrics` when loading data. Closes [2398](https://github.com/argilla-io/argilla/issues/2398)
- Added `settings` param to `prepare_for_training` ([2689](https://github.com/argilla-io/argilla/issues/2689))
- Added `prepare_for_training` for `openai` ([2658](https://github.com/argilla-io/argilla/issues/2658))
- Added `ArgillaOpenAITrainer` ([2659](https://github.com/argilla-io/argilla/issues/2659))
- Added `ArgillaSpanMarkerTrainer` for Named Entity Recognition ([2693](https://github.com/argilla-io/argilla/pull/2693))
- Added `ArgillaTrainer` CLI support. Closes ([2809](https://github.com/argilla-io/argilla/issues/2809))

Changed

- Argilla quickstart image dependencies are externalized into `quickstart.requirements.txt`. See [2666](https://github.com/argilla-io/argilla/pull/2666)
- bulk endpoints will upsert data when record `id` is present. Closes [2535](https://github.com/argilla-io/argilla/issues/2535)
- moved from `click` to `typer` CLI support. Closes ([2815](https://github.com/argilla-io/argilla/issues/2815))
- Argilla server docker image is built with PostgreSQL support. Closes [2686](https://github.com/argilla-io/argilla/issues/2686)
- The `rg.log` computes all batches and raise an error for all failed batches.
- The default batch size for `rg.log` is now 100.

Fixed

- `argilla.training` bugfixes and unification ([2665](https://github.com/argilla-io/argilla/issues/2665))
- Resolved several small bugs in the `ArgillaTrainer`.

Deprecated

- The `rg.log_async` function is deprecated and will be removed in next minor release.

As always, thanks to out amazing contributors!
- docs: Fix broken links in README.md (2759) by stephantul
- Update how_to.ipynb by chainyo
- Update log_load_and_prepare_data.ipynb by ignacioct

1.6.0

🔆 Highlights

User roles & settings page
We've introduced two user roles to help you manage your annotation team: `admin` and `annotator`. `admin` users can create, list and delete other users, workspaces and datasets. The `annotator` role is specifically designed for users who focus solely on annotating datasets.

<img width="1507" alt="User settings page" src="https://user-images.githubusercontent.com/126158523/230874953-402034f0-d170-4f6a-9e6a-4a0fea9754a1.png">

We've also added a page to see your user's settings in the Argilla UI. To access it click on your user avatar at the top right corner and then select `My settings`.

Argilla Trainer
The new `Argilla.training` module deals with all data transformations and basic default configurations to train a model with annotations from Argilla using popular NLP frameworks. It currently supports `spacy`, `setfit` and `transformers`.

Additionally, `admin` users can access ready-made code snippets to copy-paste directly from the Argilla UI. Just go to the dataset you want to use, click the `</> Train` button in the top banner and select your preferred framework.

<img width="1506" alt="Ready-made code snippet to train a model using the `alpaca-clean-en` dataset and `setfit` as shown in the Argilla UI" src="https://user-images.githubusercontent.com/126158523/230872180-9a71159e-7dbc-46cd-8043-73325971f4eb.png">

Learn more about `Argilla.training` [in our docs](https://docs.argilla.io/en/latest/guides/train_a_model.html).

Database support
Argilla will now create a default SQLite database to store users and workspaces. PostgreSQL is also officially supported. Simply set a custom value for the `ARGILLA_DATABASE_URL` environment variable pointing to your PostgreSQL instance.

[Changelog 1.6.0](https://github.com/argilla-io/argilla/compare/v1.5.1...v1.6.0)

Added

- `ARGILLA_HOME_PATH` new environment variable ([2564]).
- `ARGILLA_DATABASE_URL` new environment variable ([2564]).
- Basic support for user roles with `admin` and `annotator` ([2564]).
- `id`, `first_name`, `last_name`, `role`, `inserted_at` and `updated_at` new user fields ([2564]).
- `/api/users` new endpoint to list and create users ([2564]).
- `/api/users/{user_id}` new endpoint to delete users ([2564]).
- `/api/workspaces` new endpoint to list and create workspaces ([2564]).
- `/api/workspaces/{workspace_id}/users` new endpoint to list workspace users ([2564]).
- `/api/workspaces/{workspace_id}/users/{user_id}` new endpoint to create and delete workspace users ([2564]).
- `argilla.tasks.users.migrate` new task to migrate users from old YAML file to database ([2564]).
- `argilla.tasks.users.create` new task to create a user ([2564]).
- `argilla.tasks.users.create_default` new task to create a user with default credentials ([2564]).
- `argilla.tasks.database.migrate` new task to execute database migrations ([2564]).
- `release.Dockerfile` and `quickstart.Dockerfile` now creates a default `argilladata` volume to persist data ([2564]).
- Add user settings page. Closes [2496](https://github.com/argilla-io/argilla/issues/2496)
- Added `Argilla.training` module with support for `spacy`, `setfit`, and `transformers`. Closes [2504](https://github.com/argilla-io/argilla/issues/2496)

Fixes
- Now the `prepare_for_training` method is working when `multi_label=True`. Closes [2606](https://github.com/argilla-io/argilla/issues/2606)

Changed

- `ARGILLA_USERS_DB_FILE` environment variable now it's only used to migrate users from YAML file to database ([2564]).
- `full_name` user field is now deprecated and `first_name` and `last_name` should be used instead ([2564]).
- `password` user field now requires a minimum of `8` and a maximum of `100` characters in size ([2564]).
- `quickstart.Dockerfile` image default users from `team` and `argilla` to `admin` and `annotator` including new passwords and API keys ([2564]).
- Datasets to be managed only by users with `admin` role ([2564]).
- The list of rules is now accessible while metrics are computed. Closes[2117](https://github.com/argilla-io/argilla/issues/2117)
- Style updates for weak labelling and adding feedback toast when delete rules. See [2626](https://github.com/argilla-io/argilla/pull/2626) and [#2648](https://github.com/argilla-io/argilla/pull/2648)

Removed

- `email` user field ([2564]).
- `disabled` user field ([2564]).
- Support for private workspaces ([2564]).
- `ARGILLA_LOCAL_AUTH_DEFAULT_APIKEY` and `ARGILLA_LOCAL_AUTH_DEFAULT_PASSWORD` environment variables. Use `python -m argilla.tasks.users.create_default` instead ([2564]).
- The old headers for `API Key` and `workspace` from python client
- The default value for old `API Key` constant. Closes [2251](https://github.com/argilla-io/argilla/issues/2251)

[2564]: https://github.com/argilla-io/argilla/issues/2564

As always, thanks to our amazing contributors!
- feat: add ArgillaSpaCyTrainer for both TokenClassification and TextClassification (2604) by alvarobartt
- Move dataset dump to train, ignored unnecessary imports, & remove _required_fields attribute (2642) by alvarobartt
- fix: update field name in metadata for image url (2609) by burtenshaw
- fix Install doc spell error by PhilipMay
- fix: broken README.md link (2616) by alvarobartt

1.5.1

[1.5.1](https://github.com/recognai/rubrix/compare/v1.5.0...v1.5.1)

Fixes

- Copying datasets between workspaces with proper owner/workspace info. Closes [2562](https://github.com/argilla-io/argilla/issues/2562)
- Copy dataset with empty workspace to the default user workspace. See [2618](https://github.com/argilla-io/argilla/pull/2618)
- Using elasticsearch config to request backend version. Closes [2311](https://github.com/argilla-io/argilla/issues/2311)
- Remove sorting by score in labels. Closes [2622](https://github.com/argilla-io/argilla/issues/2622)

Changed

- Update field name in metadata for image url. See [2609](https://github.com/argilla-io/argilla/pull/2609)

1.5.0

🔆 Highlights

Dataset Settings page
<img width="1505" alt="Captura de pantalla 2023-03-23 a las 8 28 31" src="https://user-images.githubusercontent.com/126158523/227145823-420c0dcb-67b1-42b3-a27a-94e21d498707.png">

We have added a Settings page for your datasets. From there, you will be able to manage your dataset. Currently, it is possible to add labels to your labeling schema and delete the dataset.


Add images to your records
<img width="1371" alt="Captura de pantalla 2023-03-23 a las 9 48 52" src="https://user-images.githubusercontent.com/126158523/227165672-a97f1d72-00c1-4b54-bca5-b9ef5eb5d491.png"><figcaption> The image in this record was generated using [https://robohash.org](url) </figcaption>

You can pass a URL in the metadata field `_image_url` and the image will be rendered in the Argilla UI. You can use this in the Text Classification and the Token Classification tasks.


Non-searchable metadata fields
Apart from the `_image_url` field you can also pass other metadata fields that won't be used in queries or filters by adding an underscore at the start e.g. `_my_field`.

Load only what you need using `rg.load`
You can now specify the fields you want to load from your Argilla dataset. That way, you can avoid loading heavy vectors if you're using them for your annotations.

Two new tutorials (kudos embonhomme & burtenshaw)
Check out our new tutorials created by the community!
- Compare the performance of two text classification models [here](https://docs.argilla.io/en/latest/tutorials/notebooks/monitoring-textclassification-setfit-explainability.html?highlight=models%20comparison)
- Multimodal bulk annotation [here](https://docs.argilla.io/en/latest/tutorials/notebooks/labelling-textclassification-image-bulk-labelling.html#Bulk-Labelling-Multimodal-Data)

Changelog

All notable changes to this project will be documented in this file. See [standard-version](https://github.com/conventional-changelog/standard-version) for commit guidelines.

[1.5.0](https://github.com/recognai/rubrix/compare/v1.4.0...v1.5.0) - 2023-03-21

Added

- Add the fields to retrieve when loading the data from argilla. `rg.load` takes too long because of the vector field, even when users don't need it. Closes [2398](https://github.com/argilla-io/argilla/issues/2398)
- Add new page and components for dataset settings. Closes [2442](https://github.com/argilla-io/argilla/issues/2003)
- Add ability to show image in records (for TokenClassification and TextClassification) if an URL is passed in metadata with the key `_image_url`
- Non-searchable fields support in metadata. [2570](https://github.com/argilla-io/argilla/pull/2570)

Changed

- Labels are now centralized in a specific vuex ORM called GlobalLabel Model, see https://github.com/argilla-io/argilla/issues/2210. This model is the same for TokenClassification and TextClassification (so both task have labels with color_id and shortcuts parameters in the vuex ORM)
- The shortcuts improvement for labels [2339](https://github.com/argilla-io/argilla/pull/2339) have been moved to the vuex ORM in dataset settings feature [#2444](https://github.com/argilla-io/argilla/commit/eb37c3bcff3ad253481d6a10f8abb093384f2dcb)
- Update "Define a labeling schema" section in docs.
- The record inputs are sorted alphabetically in UI by default. [2581](https://github.com/argilla-io/argilla/pull/2581)

Fixes

- Allow URL to be clickable in Jupyter notebook again. Closes [2527](https://github.com/argilla-io/argilla/issues/2527)

Removed

- Removing some data scan deprecated endpoints used by old clients. This change will break compatibility with client `<v1.3.0`
- Stop using old scan deprecated endpoints in python client. This logic will break client compatibility with server version `<1.3.0`
- Remove the previous way to add labels through the dataset page. Now labels can be added only through dataset settings page.



As always, thanks to our amazing contributors!
- Documentation update: tutorial for text classification models comparison (2426) by embonhomme
- Docs: fix little typo (2522) by anakin87
- Docs: Tutorial on image classification (2420) by burtenshaw

1.4.1

[1.4.1](https://github.com/recognai/rubrix/compare/v1.4.0...v1.4.1)


Bug Fixes

- Copying datasets between workspaces with proper owner/workspace info. Closes [2562](https://github.com/argilla-io/argilla/issues/2562)
- Copy dataset with empty workspace to the default user workspace [905d4de](https://github.com/recognai/argilla/commit/905d4deaa769bfc9bbc022cd2dc75c7435cfe865)
- Using elasticsearch config to request backend version. Closes [2311](https://github.com/argilla-io/argilla/issues/2311)

1.4.0

🔆 Highlights

Enhanced annotation flow for all tasks

Improved bulk annotation and actions
A more stylish banner for available global actions. It includes an improved label selector to apply and remove labels in bulk.

<img width="1372" alt="features-multiclass-bulk-labels" src="https://user-images.githubusercontent.com/126158523/224002594-e1ebea1d-a2a5-4fbf-a9bf-96d2802ece5a.png">

We enhanced multi-label text classification annotations and now adding labels in bulk doesn't remove previous labels. This action will change the status of the records to _Pending_ and you will need to validate the annotation to save the changes.

Learn more about [bulk annotations](https://docs.argilla.io/en/latest/reference/webapp/features.html#bulk-annotate) and [multi-level text classification annotations](https://docs.argilla.io/en/latest/reference/webapp/features.html#multi-label) in our docs.

Clear and Reset actions
New actions to clear all annotations and reset changes. They can be used at the record level or as bulk actions.

Unvalidate and undiscard
Click the _Validate_ or _Discard_ buttons in a record to undo this action.

Optimized one-record view
Improved view for a single record to enable a more focused annotation experience.

Prepare for training for SparkNLP Text2Text
Extended support to prepare Text2Text datasets for training with SparkNLP.

Learn more in our [docs](https://docs.argilla.io/en/latest/reference/webapp/features.html#bulk-annotate).

Extended shortcuts for token classification (kudos cceyda)
In token classification tasks that have 10+ options, labels get assigned QWERTY keys as shortcuts.

Changelog

All notable changes to this project will be documented in this file. See [standard-version](https://github.com/conventional-changelog/standard-version) for commit guidelines.

[1.4.0](https://github.com/recognai/rubrix/compare/v1.3.1...v1.4.0) (2023-03-09)

Features

* `configure_dataset` accepts a workspace as argument ([2503](https://github.com/recognai/rubrix/issues/2503)) ([29c9ee3](https://github.com/recognai/rubrix/commit/29c9ee3bf95a3cb77932c4c19638da464cf98830)),
* Add `active_client` function to main argilla module ([2387](https://github.com/recognai/rubrix/issues/2387)) ([4e623d4](https://github.com/recognai/rubrix/commit/4e623d4c1adc97776812355d2587e41cb487221e)), closes [#2183](https://github.com/recognai/rubrix/issues/2183)
* Add text2text support for prepare for training spark nlp ([2466](https://github.com/recognai/rubrix/issues/2466)) ([21efb83](https://github.com/recognai/rubrix/commit/21efb839e051f1cf48c16a5d25373f25da192d53)), closes [#2465](https://github.com/recognai/rubrix/issues/2465) [#2482](https://github.com/recognai/rubrix/issues/2482)
* Allow passing workspace as client param for `rg.log` or `rg.load` ([2425](https://github.com/recognai/rubrix/issues/2425)) ([b3b897a](https://github.com/recognai/rubrix/commit/b3b897ac731d0113ea5e078b5bf5974f351935fd)), closes [#2059](https://github.com/recognai/rubrix/issues/2059)
* Bulk annotation improvement ([2437](https://github.com/recognai/rubrix/issues/2437)) ([3fce915](https://github.com/recognai/rubrix/commit/3fce9151d82f943e5956137544a5ea4f434f5410)), closes [#2264](https://github.com/recognai/rubrix/issues/2264)
* Deprecate `chunk_size` in favor of `batch_size` for `rg.log` ([2455](https://github.com/recognai/rubrix/issues/2455)) ([3ebea76](https://github.com/recognai/rubrix/commit/3ebea7644bf21cf9e74f376225252c95ed82707f)), closes [#2453](https://github.com/recognai/rubrix/issues/2453)
* Expose `batch_size` parameter for `rg.load` ([2460](https://github.com/recognai/rubrix/issues/2460)) ([e25be3e](https://github.com/recognai/rubrix/commit/e25be3e5e2d0b886e15433d109cc24bebc6cf93d)), closes [#2454](https://github.com/recognai/rubrix/issues/2454) [#2434](https://github.com/recognai/rubrix/issues/2434)
* Extend shortcuts to include alphabet for token classification ([2339](https://github.com/recognai/rubrix/issues/2339)) ([4a92b35](https://github.com/recognai/rubrix/commit/4a92b3510a9c1703cddd925782bf5d57c4d28259))


Bug Fixes

* added flexible app redirect to docs page ([2428](https://github.com/recognai/rubrix/issues/2428)) ([5600301](https://github.com/recognai/rubrix/commit/5600301a59899bc6fbc5d40384478e90cb2e21d7)), closes [#2377](https://github.com/recognai/rubrix/issues/2377)
* added regex match to set workspace method ([2427](https://github.com/recognai/rubrix/issues/2427)) ([d789fa1](https://github.com/recognai/rubrix/commit/d789fa1570e70cd0fdeb9d9ec0b4a7ac1c12f5aa)), closes [#2388]
* error when loading record with empty string query ([2429](https://github.com/recognai/rubrix/issues/2429)) ([fc71c3b](https://github.com/recognai/rubrix/commit/fc71c3b538e767133ec4fa53333a76490e0d9449)), closes [#2400](https://github.com/recognai/rubrix/issues/2400) [#2303](https://github.com/recognai/rubrix/issues/2303)
* Remove extra-action dropdown state after navigation ([2479](https://github.com/recognai/rubrix/issues/2479)) ([9328994](https://github.com/recognai/rubrix/commit/932899473c25bbb27b9d9626ba6d23a60c34a3d8)), closes [#2158](https://github.com/recognai/rubrix/issues/2158)


Documentation

* Add AutoTrain to readme ([7199780](https://github.com/recognai/rubrix/commit/71997809dbe1d5330d907d5c81195e13979d8205))
* Add migration to label schema section ([2435](https://github.com/recognai/rubrix/issues/2435)) ([d57a1e5](https://github.com/recognai/rubrix/commit/d57a1e50014858c844a56da17471351332689359)), closes [#2003](https://github.com/recognai/rubrix/issues/2003) [#2003](https://github.com/recognai/rubrix/issues/2003)
* Adds zero+few shot tutorial with SetFit ([2409](https://github.com/recognai/rubrix/issues/2409)) ([6c679ad](https://github.com/recognai/rubrix/commit/6c679ada0d79a11b332bb34c96d8d16331093b69))
* Update readme with quickstart section and new links to guides ([2333](https://github.com/recognai/rubrix/issues/2333)) ([91a77ad](https://github.com/recognai/rubrix/commit/91a77ad85244cd885fafd40c38251ecdf2e37147))


As always, thanks to our amazing contributors!
- Documentation update: adding missing n (2362) by Gnonpi
- feat: Extend shortcuts to include alphabet for token classification (2339) by cceyda

Page 9 of 22

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.