Argilla

Latest version: v2.8.0

Safety actively analyzes 723650 Python packages for vulnerabilities to keep your Python projects secure.

Page 6 of 22

1.19.0

Added

- Added `POST /api/v1/datasets/:dataset_id/records/search` endpoint to search for records without user context, including responses by all users. ([4143](https://github.com/argilla-io/argilla/pull/4143))
- Added `POST /api/v1/datasets/:dataset_id/vectors-settings` endpoint for creating vector settings for a dataset. ([3776](https://github.com/argilla-io/argilla/pull/3776))
- Added `GET /api/v1/datasets/:dataset_id/vectors-settings` endpoint for listing the vectors settings for a dataset. ([3776](https://github.com/argilla-io/argilla/pull/3776))
- Added `DELETE /api/v1/vectors-settings/:vector_settings_id` endpoint for deleting a vector settings. ([3776](https://github.com/argilla-io/argilla/pull/3776))
- Added `PATCH /api/v1/vectors-settings/:vector_settings_id` endpoint for updating a vector settings. ([4092](https://github.com/argilla-io/argilla/pull/4092))
- Added `GET /api/v1/records/:record_id` endpoint to get a specific record. ([4039](https://github.com/argilla-io/argilla/pull/4039))
- Added support to include vectors for `GET /api/v1/datasets/:dataset_id/records` endpoint response using `include` query param. ([4063](https://github.com/argilla-io/argilla/pull/4063))
- Added support to include vectors for `GET /api/v1/me/datasets/:dataset_id/records` endpoint response using `include` query param. ([4063](https://github.com/argilla-io/argilla/pull/4063))
- Added support to include vectors for `POST /api/v1/me/datasets/:dataset_id/records/search` endpoint response using `include` query param. ([4063](https://github.com/argilla-io/argilla/pull/4063))
- Added `show_progress` argument to `from_huggingface()` method to make the progress bar for parsing records process optional.([4132](https://github.com/argilla-io/argilla/pull/4132)).
- Added a progress bar for parsing records process to `from_huggingface()` method with `trange` in `tqdm`.([4132](https://github.com/argilla-io/argilla/pull/4132)).
- Added to sort by `inserted_at` or `updated_at` for datasets with no metadata. ([4147](https://github.com/argilla-io/argilla/pull/4147))
- Added `max_records` argument to `pull()` method for `RemoteFeedbackDataset`.([4074](https://github.com/argilla-io/argilla/pull/4074))
- Added functionality to push your models to the Hugging Face hub with `ArgillaTrainer.push_to_huggingface` ([3976](https://github.com/argilla-io/argilla/pull/3976)). Contributed by Racso-3141.
- Added `filter_by` argument to `ArgillaTrainer` to filter by `response_status` ([4120](https://github.com/argilla-io/argilla/pull/4120)).
- Added `sort_by` argument to `ArgillaTrainer` to sort by `metadata` ([4120](https://github.com/argilla-io/argilla/pull/4120)).
- Added `max_records` argument to `ArgillaTrainer` to limit record used for training ([4120](https://github.com/argilla-io/argilla/pull/4120)).
- Added `add_vector_settings` method to local and remote `FeedbackDataset`. ([4055](https://github.com/argilla-io/argilla/pull/4055))
- Added `update_vectors_settings` method to local and remote `FeedbackDataset`. ([4122](https://github.com/argilla-io/argilla/pull/4122))
- Added `delete_vectors_settings` method to local and remote `FeedbackDataset`. ([4130](https://github.com/argilla-io/argilla/pull/4130))
- Added `vector_settings_by_name` method to local and remote `FeedbackDataset`. ([4055](https://github.com/argilla-io/argilla/pull/4055))
- Added `find_similar_records` method to local and remote `FeedbackDataset`. ([4023](https://github.com/argilla-io/argilla/pull/4023))
- Added `ARGILLA_SEARCH_ENGINE` environment variable to configure the search engine to use. ([4019](https://github.com/argilla-io/argilla/pull/4019))

Changed

- [breaking] Remove support for Elasticsearch < 8.5 and OpenSearch < 2.4. ([4173](https://github.com/argilla-io/argilla/pull/4173))
- [breaking] Users working with OpenSearch engines must use version >=2.4 and set `ARGILLA_SEARCH_ENGINE=opensearch`. ([4019](https://github.com/argilla-io/argilla/pull/4019) and [#4111](https://github.com/argilla-io/argilla/pull/4111))
- [breaking] Changed `FeedbackDataset.*_by_name()` methods to return `None` when no match is found ([4101](https://github.com/argilla-io/argilla/pull/3976)).
- [breaking] `limit` query parameter for `GET /api/v1/datasets/:dataset_id/records` endpoint is now only accepting values greater or equal than `1` and less or equal than `1000`. ([4143](https://github.com/argilla-io/argilla/pull/4143))
- [breaking] `limit` query parameter for `GET /api/v1/me/datasets/:dataset_id/records` endpoint is now only accepting values greater or equal than `1` and less or equal than `1000`. ([4143](https://github.com/argilla-io/argilla/pull/4143))
- Update `GET /api/v1/datasets/:dataset_id/records` endpoint to fetch record using the search engine. ([4142](https://github.com/argilla-io/argilla/pull/4142))
- Update `GET /api/v1/me/datasets/:dataset_id/records` endpoint to fetch record using the search engine. ([4142](https://github.com/argilla-io/argilla/pull/4142))
- Update `POST /api/v1/datasets/:dataset_id/records` endpoint to allow to create records with `vectors` ([4022](https://github.com/argilla-io/argilla/pull/4022))
- Update `PATCH /api/v1/datasets/:dataset_id` endpoint to allow updating `allow_extra_metadata` attribute. ([4112](https://github.com/argilla-io/argilla/pull/4112))
- Update `PATCH /api/v1/datasets/:dataset_id/records` endpoint to allow to update records with `vectors`. ([4062](https://github.com/argilla-io/argilla/pull/4062))
- Update `PATCH /api/v1/records/:record_id` endpoint to allow to update record with `vectors`. ([4062](https://github.com/argilla-io/argilla/pull/4062))
- Update `POST /api/v1/me/datasets/:dataset_id/records/search` endpoint to allow to search records with vectors. ([4019](https://github.com/argilla-io/argilla/pull/4019))
- Update `BaseElasticAndOpenSearchEngine.index_records` method to also index record vectors. ([4062](https://github.com/argilla-io/argilla/pull/4062))
- Update `FeedbackDataset.__init__` to allow passing a list of vector settings. ([4055](https://github.com/argilla-io/argilla/pull/4055))
- Update `FeedbackDataset.push_to_argilla` to also push vector settings. ([4055](https://github.com/argilla-io/argilla/pull/4055))
- Update `FeedbackDatasetRecord` to support the creation of records with vectors. ([4043](https://github.com/argilla-io/argilla/pull/4043))
- Using cosine similarity to compute similarity between vectors. ([4124](https://github.com/argilla-io/argilla/pull/4124))

Fixed

- Fixed svg images out of screen with too large images ([4047](https://github.com/argilla-io/argilla/pull/4047))
- Fixed creating records with responses from multiple users. Closes [3746](https://github.com/argilla-io/argilla/issues/3746) and [#3808](https://github.com/argilla-io/argilla/issues/3808) ([#4142](https://github.com/argilla-io/argilla/pull/4142))
- Fixed deleting or updating responses as an owner for annotators. (Commit [403a66d](https://github.com/argilla-io/argilla/commit/403a66d16d816fa8a62e3f76314ccc90e0073297))
- Fixed passing user_id when getting records by id. (Commit [98c7927](https://github.com/argilla-io/argilla/commit/98c792757a21da05bac89b7f625e7e5792ad59f9))
- Fixed non-basic tags serialized when pushing a dataset to the Hugging Face Hub. Closes [4089](https://github.com/argilla-io/argilla/issues/4089) ([#4200](https://github.com/argilla-io/argilla/pull/4200))

Contributors

- Racso-3141 Added a progress bar for parsing records process to `from_huggingface()` method with `trange` in `tqdm`.([4132](https://github.com/argilla-io/argilla/pull/4132)).

1.18.0

Added

- New `GET /api/v1/datasets/:dataset_id/metadata-properties` endpoint for listing dataset metadata properties. ([3813](https://github.com/argilla-io/argilla/pull/3813))
- New `POST /api/v1/datasets/:dataset_id/metadata-properties` endpoint for creating dataset metadata properties. ([3813](https://github.com/argilla-io/argilla/pull/3813))
- New `PATCH /api/v1/metadata-properties/:metadata_property_id` endpoint allowing the update of a specific metadata property. ([3952](https://github.com/argilla-io/argilla/pull/3952))
- New `DELETE /api/v1/metadata-properties/:metadata_property_id` endpoint for deletion of a specific metadata property. ([3911](https://github.com/argilla-io/argilla/pull/3911))
- New `GET /api/v1/metadata-properties/:metadata_property_id/metrics` endpoint to compute metrics for a specific metadata property. ([3856](https://github.com/argilla-io/argilla/pull/3856))
- New `PATCH /api/v1/records/:record_id` endpoint to update a record. ([3920](https://github.com/argilla-io/argilla/pull/3920))
- New `PATCH /api/v1/dataset/:dataset_id/records` endpoint to bulk update the records of a dataset. ([3934](https://github.com/argilla-io/argilla/pull/3934))
- Missing validations to `PATCH /api/v1/questions/:question_id`. Now `title` and `description` are using the same validations used to create questions. ([3967](https://github.com/argilla-io/argilla/pull/3967))
- Added `TermsMetadataProperty`, `IntegerMetadataProperty` and `FloatMetadataProperty` classes allowing to define metadata properties for a `FeedbackDataset`. ([3818](https://github.com/argilla-io/argilla/pull/3818))
- Added `metadata_filters` to `filter_by` method in `RemoteFeedbackDataset` to filter based on metadata i.e. `TermsMetadataFilter`, `IntegerMetadataFilter`, and `FloatMetadataFilter`. ([3834](https://github.com/argilla-io/argilla/pull/3834))
- Added a validation layer for both `metadata_properties` and `metadata_filters` in their schemas and as part of the `add_records` and `filter_by` methods, respectively. ([3860](https://github.com/argilla-io/argilla/pull/3860))
- Added `sort_by` query parameter to listing records endpoints that allows to sort the records by `inserted_at`, `updated_at` or metadata property. ([3843](https://github.com/argilla-io/argilla/pull/3843))
- Added `add_metadata_property` method to both `FeedbackDataset` and `RemoteFeedbackDataset` (i.e. `FeedbackDataset` in Argilla). ([3900](https://github.com/argilla-io/argilla/pull/3900))
- Added fields `inserted_at` and `updated_at` in `RemoteResponseSchema`. ([3822](https://github.com/argilla-io/argilla/pull/3822))
- Added support for `sort_by` for `RemoteFeedbackDataset` i.e. a `FeedbackDataset` uploaded to Argilla. ([3925](https://github.com/argilla-io/argilla/pull/3925))
- Added `metadata_properties` support for both `push_to_huggingface` and `from_huggingface`. ([3947](https://github.com/argilla-io/argilla/pull/3947))
- Add support for update records (`metadata`) from Python SDK. ([3946](https://github.com/argilla-io/argilla/pull/3946))
- Added `delete_metadata_properties` method to delete metadata properties. ([3932](https://github.com/argilla-io/argilla/pull/3932))
- Added `update_metadata_properties` method to update `metadata_properties`. ([3961](https://github.com/argilla-io/argilla/pull/3961))
- Added automatic model card generation through `ArgillaTrainer.save` ([3857](https://github.com/argilla-io/argilla/pull/3857))
- Added `FeedbackDataset` `TaskTemplateMixin` for pre-defined task templates. ([3969](https://github.com/argilla-io/argilla/pull/3969))
- A maximum limit of 50 on the number of options a ranking question can accept. ([3975](https://github.com/argilla-io/argilla/pull/3975))
- New `last_activity_at` field to `FeedbackDataset` exposing when the last activity for the associated dataset occurs. ([3992](https://github.com/argilla-io/argilla/pull/3992))

Changed

- `GET /api/v1/datasets/{dataset_id}/records`, `GET /api/v1/me/datasets/{dataset_id}/records` and `POST /api/v1/me/datasets/{dataset_id}/records/search` endpoints to return the `total` number of records. ([3848](https://github.com/argilla-io/argilla/pull/3848), [#3903](https://github.com/argilla-io/argilla/pull/3903))
- Implemented `__len__` method for filtered datasets to return the number of records matching the provided filters. ([3916](https://github.com/argilla-io/argilla/pull/3916))
- Increase the default max result window for Elasticsearch created for Feedback datasets. ([3929](https://github.com/argilla-io/argilla/pull/))
- Force elastic index refresh after records creation. ([3929](https://github.com/argilla-io/argilla/pull/))
- Validate metadata fields for filtering and sorting in the Python SDK. ([3993](https://github.com/argilla-io/argilla/pull/3993))
- Using metadata property name instead of id for indexing data in search engine index. ([3994](https://github.com/argilla-io/argilla/pull/3994))

Fixed

- Fixed response schemas to allow `values` to be `None` i.e. when a record is discarded the `response.values` are set to `None`. ([3926](https://github.com/argilla-io/argilla/pull/3926))
- New Contributors
* splevine made their first contribution in https://github.com/argilla-io/argilla/pull/3832

**Full Changelog**: https://github.com/argilla-io/argilla/compare/v1.17.0...v1.18.0

1.17.0

☀️ Highlights

This release comes with a lot of new goodies and quality improvements. We added model card support for the `ArgillaTrainer`, worked on the `FeedbackDataset` task templates and added timestamps to responses. We also fixed a lot of bugs and improved the overall quality of the codebase. Enjoy!

🚨 Breaking change in updating existing Hugging Face Spaces deployments

The quickstart image startup script was changed from `from /start_quickstart.sh` to `/home/argilla/start_quickstart.sh`, which might cause existing Hugging Face Spaces deployments to malfunction. A fix was added for the Argilla template space via [this PR](https://huggingface.co/spaces/argilla/argilla-template-space/discussions/19/files). Alternatively, you can just [create a new deployment](https://huggingface.co/new-space?template=argilla%2Fargilla-template-space).

⚠️ Breaking change using SQLite as backend in a docker deployment

From version 1.17.0 a new `argilla` os user is configured for the provided docker images. If you are using the docker deployment and you want to upload to this version, you should do some actions once update your container and before working with Argilla. Execute the following command:

bash
docker exec --user root <argilla_server_container_id> /bin/bash -c 'chown -R argilla:argilla "$ARGILLA_HOME_PATH"'

This will change the permissions on the argilla home path, which allows it to work with new containers.

Note: You can find the docker container id by running:
bash
docker ps | grep -i argilla-server

bash
713973693fb7 argilla/argilla-server:v1.17.0 "/bin/bash start_arg…" 11 hours ago Up 7 minutes 0.0.0.0:6900->6900/tcp docker-argilla-1

💾 `ArgillaTrainer` Model Card Generation

The `ArgillaTrainer` now supports automatic model card generation. This means that you can now generate a model card with all the required info for Hugging Face and directly share these models to the hub, as you would expect within the Hugging Face ecosystem. See [the docs](https://docs.argilla.io/en/v1.17.0/practical_guides/fine_tune.html#model-card-generation) for more info.

python
model_card_kwargs = {
"language": ["en", "es"],
"license": "Apache-2.0",
"model_id": "all-MiniLM-L6-v2",
"dataset_name": "argilla/emotion",
"tags": ["nlp", "few-shot-learning", "argilla", "setfit"],
"model_summary": "Small summary of what the model does",
"model_description": "An extended explanation of the model",
"model_type": "A 1.3B parameter embedding model fine-tuned on an awesome dataset",
"finetuned_from": "all-MiniLM-L6-v2",
"repo": "https://github.com/..."
"developers": "",
"shared_by": "",
}

trainer = ArgillaTrainer(
dataset=dataset,
task=task,
framework="setfit",
framework_kwargs={"model_card_kwargs": model_card_kwargs}
)
trainer.train(output_dir="my_model")
or get the card as `str` by calling the `generate_model_card` method
argilla_model_card = trainer.generate_model_card("my_model")

🦮 `FeedbackDataset` Task Templates

The Argilla `FeedbackDataset` now supports a number of task templates that can be used to quickly create a dataset for specific tasks out of the box. This should help starting users get right into the action without having to worry about the dataset structure. We support basic tasks like Text Classification but also allow you to setup complex RAG-pipelines. See [the docs](https://docs.argilla.io/en/v1.17.0/practical_guides/create_dataset.html#task-templates) for more info.

python
import argilla as rg

ds = rg.FeedbackDataset.for_text_classification(
labels=["positive", "negative"],
multi_label=False,
use_markdown=True,
guidelines=None,
)
ds
FeedbackDataset(
fields=[TextField(name="text", use_markdown=True)],
questions=[LabelQuestion(name="label", labels=["positive", "negative"])]
guidelines="<Guidelines for the task>",
)

⏱️ `inserted_at` and `updated_at` are added to responses

What are responses without timestamps? The `RemoteResponseSchema` now supports `inserted_at` and `updated_at` fields. This should help you to keep track of the time when a response was created and updated. Perfectly, for keeping track of annotator performance within your company.

[1.17.0](https://github.com/argilla-io/argilla/compare/v1.16.0...v1.17.0)

Added

- Added fields `inserted_at` and `updated_at` in `RemoteResponseSchema` ([3822](https://github.com/argilla-io/argilla/pull/3822)).
- Added automatic model card generation through `ArgillaTrainer.save` ([3857](https://github.com/argilla-io/argilla/pull/3857)).
- Added task templates to the `FeedbackDataset` ([3973](https://github.com/argilla-io/argilla/pull/3973)).

Changed

- Updated `Dockerfile` to use multi stage build ([3221](https://github.com/argilla-io/argilla/pull/3221) and [#3793](https://github.com/argilla-io/argilla/pull/3793)).
- Updated active learning for text classification notebooks to use the most recent small-text version ([3831](https://github.com/argilla-io/argilla/pull/3831)).
- Changed argilla dataset name in the active learning for text classification notebooks to be consistent with the default names in the huggingface spaces ([3831](https://github.com/argilla-io/argilla/pull/3831)).
- FeedbackDataset API methods have been aligned to be accessible through the several implementations ([3937](https://github.com/argilla-io/argilla/pull/3937)).
- The `unify_responses` support for remote datasets ([3937](https://github.com/argilla-io/argilla/pull/3937)).

Fixed

- Fix field not shown in the order defined in the dataset settings. Closes [3959](https://github.com/argilla-io/argilla/issues/3959) ([#3984](https://github.com/argilla-io/argilla/pull/3984))
- Updated active learning for text classification notebooks to pass ids of type int to `TextClassificationRecord` ([3831](https://github.com/argilla-io/argilla/pull/3831)).
- Fixed record fields validation that was preventing from logging records with optional fields (i.e. `required=True`) when the field value was `None` ([3846](https://github.com/argilla-io/argilla/pull/3846)).
- Always set `pretrained_model_name_or_path` attribute as string in `ArgillaTrainer` ([3914](https://github.com/argilla-io/argilla/pull/3914)).
- The `inserted_at` and `updated_at` attributes are create using the `utcnow` factory to avoid unexpected race conditions on timestamp creation ([3945](https://github.com/argilla-io/argilla/pull/3945))
- Fixed `configure_dataset_settings` when providing the workspace via the arg `workspace` ([3887](https://github.com/argilla-io/argilla/pull/3887)).
- Fixed saving of models trained with `ArgillaTrainer` with a `peft_config` parameter ([3795](https://github.com/argilla-io/argilla/pull/3795)).
- Fixed backwards compatibility on `from_huggingface` when loading a `FeedbackDataset` from the Hugging Face Hub that was previously dumped using another version of Argilla, starting at 1.8.0, when it was first introduced ([3829](https://github.com/argilla-io/argilla/pull/3829)).
- Fixed `TrainingTaskForQuestionAnswering.__repr__` ([3969](https://github.com/argilla-io/argilla/pull/3969))
- Fixed potential dictionary key-errors in `TrainingTask.prepare_for_training_with_*`-methods ([3969](https://github.com/argilla-io/argilla/pull/3969))

Deprecated

- Function `rg.configure_dataset` is deprecated in favour of `rg.configure_dataset_settings`. The former will be removed in version 1.19.0

New Contributors
* osintalex made their first contribution in https://github.com/argilla-io/argilla/pull/3221
* kursathalat made their first contribution in https://github.com/argilla-io/argilla/pull/3756
* splevine made their first contribution in https://github.com/argilla-io/argilla/pull/3832

**Full Changelog**: https://github.com/argilla-io/argilla/compare/v1.16.0...v1.17.0

1.16.0

☀️ Highlights

This release comes with an auto save feature for the UI, an enhanced Argilla CLI app, new keyboard shortcuts for the annotation process in the Feedback Dataset and new integrations for the `ArgillaTrainer`.

💾 Auto save

![Argilla UI Feedback Record getting auto saved](https://github.com/argilla-io/argilla/assets/29572918/5b664195-5068-49c7-a5db-b1cd91ab195d)

Have you been writing a long corrected text in a `TextField` for a completion given by an LLM and you have refreshed the page before submitting it? Well, since this release you are covered! The Argilla UI will save every few seconds the responses given in the annotation form of a `FeedbackDataset`. Annotators can partially annotate one record and then come back to finish the annotation process without losing the previous work.

👨🏻‍💻 More operations directly from the Argilla CLI

![Argilla CLI displaying help information](https://github.com/argilla-io/argilla/assets/29572918/ce99a622-e8f9-4349-b128-2c7f85aeeb7c)

The Argilla CLI has been updated to include an extensive list of new commands, from users and datasets management to training models all from the terminal!

⌨️ New keyboard shorcuts for the Feedback Dataset

![Feedback dataset shortcuts](https://github.com/argilla-io/argilla/assets/29572918/99918b9b-8fa5-406e-96d3-0113605b4602)

Now, you can seamlessly navigate within the feedback form using just your keyboard. We've extended the functionality of these shortcuts to cover all types of available questions: Label, Multi-label, Ranking, Rating and Text

QnA, Chat Completion with OpenAI and Sentence Transformers model training now in the `ArgillaTrainer`

The `ArgillaTrainer` doesn't stop getting new features and improvements!

- A new `TrainingTask` has been added for [Question and Answering (QnA)](https://docs.argilla.io/en/latest/practical_guides/fine_tune.html#question-answering)
- Use a `FeedbackDataset` for fine-tuning an [OpenAI model for Chat Completion](https://docs.argilla.io/en/latest/practical_guides/fine_tune.html#chat-completion)
- New integration with Sentence Transformers for fine-tuning a model for embedding generation

[1.16.0](https://github.com/argilla-io/argilla/compare/v1.15.1...v1.16.0)

Added

- Added `ArgillaTrainer` integration with sentence-transformers, allowing fine tuning for sentence similarity ([3739](https://github.com/argilla-io/argilla/pull/3739))
- Added `ArgillaTrainer` integration with `TrainingTask.for_question_answering` ([3740](https://github.com/argilla-io/argilla/pull/3740))
- Added `Auto save record` to save automatically the current record that you are working on ([3541](https://github.com/argilla-io/argilla/pull/3541))
- Added `ArgillaTrainer` integration with OpenAI, allowing fine tuning for chat completion ([3615](https://github.com/argilla-io/argilla/pull/3615))
- Added `workspaces list` command to list Argilla workspaces ([3594](https://github.com/argilla-io/argilla/pull/3594)).
- Added `datasets list` command to list Argilla datasets ([3658](https://github.com/argilla-io/argilla/pull/3658)).
- Added `users create` command to create users ([3667](https://github.com/argilla-io/argilla/pull/3667)).
- Added `whoami` command to get current user ([3673](https://github.com/argilla-io/argilla/pull/3673)).
- Added `users delete` command to delete users ([3671](https://github.com/argilla-io/argilla/pull/3671)).
- Added `users list` command to list users ([3688](https://github.com/argilla-io/argilla/pull/3688)).
- Added `workspaces delete-user` command to remove a user from a workspace ([3699](https://github.com/argilla-io/argilla/pull/3699)).
- Added `datasets list` command to list Argilla datasets ([3658](https://github.com/argilla-io/argilla/pull/3658)).
- Added `users create` command to create users ([3667](https://github.com/argilla-io/argilla/pull/3667)).
- Added `users delete` command to delete users ([3671](https://github.com/argilla-io/argilla/pull/3671)).
- Added `workspaces create` command to create an Argilla workspace ([3676](https://github.com/argilla-io/argilla/pull/3676)).
- Added `datasets push-to-hub` command to push a `FeedbackDataset` from Argilla into the HuggingFace Hub ([3685](https://github.com/argilla-io/argilla/pull/3685)).
- Added `info` command to get info about the used Argilla client and server ([3707](https://github.com/argilla-io/argilla/pull/3707)).
- Added `datasets delete` command to delete a `FeedbackDataset` from Argilla ([3703](https://github.com/argilla-io/argilla/pull/3703)).
- Added `created_at` and `updated_at` properties to `RemoteFeedbackDataset` and `FilteredRemoteFeedbackDataset` ([3709](https://github.com/argilla-io/argilla/pull/3709)).
- Added handling `PermissionError` when executing a command with a logged in user with not enough permissions ([3717](https://github.com/argilla-io/argilla/pull/3717)).
- Added `workspaces add-user` command to add a user to workspace ([3712](https://github.com/argilla-io/argilla/pull/3712)).
- Added `workspace_id` param to `GET /api/v1/me/datasets` endpoint ([3727](https://github.com/argilla-io/argilla/pull/3727)).
- Added `workspace_id` arg to `list_datasets` in the Python SDK ([3727](https://github.com/argilla-io/argilla/pull/3727)).
- Added `argilla` script that allows to execute Argilla CLI using the `argilla` command ([3730](https://github.com/argilla-io/argilla/pull/3730)).
- Added `server_info` function to check the Argilla server information (also accessible via `rg.server_info`) ([3772](https://github.com/argilla-io/argilla/issues/3772)).

Changed

- Move `database` commands under `server` group of commands ([3710](https://github.com/argilla-io/argilla/pull/3710))
- `server` commands only included in the CLI app when `server` extra requirements are installed ([3710](https://github.com/argilla-io/argilla/pull/3710)).
- Updated `PUT /api/v1/responses/{response_id}` to replace `values` stored with received `values` in request ([3711](https://github.com/argilla-io/argilla/pull/3711)).
- Display a `UserWarning` when the `user_id` in `Workspace.add_user` and `Workspace.delete_user` is the ID of an user with the owner role as they don't require explicit permissions ([3716](https://github.com/argilla-io/argilla/issues/3716)).
- Rename `tasks` sub-package to `cli` ([3723](https://github.com/argilla-io/argilla/pull/3723)).
- Changed `argilla database` command in the CLI to now be accessed via `argilla server database`, to be deprecated in the upcoming release ([3754](https://github.com/argilla-io/argilla/pull/3754)).
- Changed `visible_options` (of label and multi label selection questions) validation in the backend to check that the provided value is greater or equal than/to 3 and less or equal than/to the number of provided options ([3773](https://github.com/argilla-io/argilla/pull/3773)).

Fixed

- Fixed `remove user modification in text component on clear answers` ([3775](https://github.com/argilla-io/argilla/pull/3775))
- Fixed `Highlight raw text field in dataset feedback task` ([3731](https://github.com/argilla-io/argilla/pull/3731))
- Fixed `Field title too long` ([3734](https://github.com/argilla-io/argilla/pull/3734))
- Fixed error messages when deleting a `DatasetForTextClassification` ([3652](https://github.com/argilla-io/argilla/pull/3652))
- Fixed `Pending queue` pagination problems when during data annotation ([3677](https://github.com/argilla-io/argilla/pull/3677))
- Fixed `visible_labels` default value to be 20 just when `visible_labels` not provided and `len(labels) > 20`, otherwise it will either be the provided `visible_labels` value or `None`, for `LabelQuestion` and `MultiLabelQuestion` ([3702](https://github.com/argilla-io/argilla/pull/3702)).
- Fixed `DatasetCard` generation when `RemoteFeedbackDataset` contains suggestions ([3718](https://github.com/argilla-io/argilla/pull/3718)).
- Add missing `draft` status in `ResponseSchema` as now there can be responses with `draft` status when annotating via the UI ([3749](https://github.com/argilla-io/argilla/pull/3749)).
- Searches when queried words are distributed along the record fields ([3759](https://github.com/argilla-io/argilla/pull/3759)).
- Fixed Python 3.11 compatibility issue with `/api/datasets` endpoints due to the `TaskType` enum replacement in the endpoint URL ([3769](https://github.com/argilla-io/argilla/pull/3769)).

As always, thanks to our amazing contributors

* sdiazlor made their first contribution in https://github.com/argilla-io/argilla/pull/3384

**Full Changelog**: https://github.com/argilla-io/argilla/compare/v1.15.1...v1.16.0

1.15.1

[Changelog 1.15.1](https://github.com/argilla-io/argilla/compare/v1.15.0...v1.15.1)

Fixed

- Fixed `Text component` text content sanitization behavior just for markdown to prevent disappear the text ([3738](https://github.com/argilla-io/argilla/pull/3738))
- Fixed `Text component` now you need to press Escape to exit the text area ([3733](https://github.com/argilla-io/argilla/pull/3733))
- Fixed `SearchEngine` was creating the same number of primary shards and replica shards for each `FeedbackDataset` ([3736](https://github.com/argilla-io/argilla/pull/3736)).

1.15.0

🔆 Highlights

Argilla 1.15.0 comes with an enhanced `FeedbackDataset` settings page enabling the update of the dataset settings, an integration of the [TRL](https://github.com/huggingface/trl) package with the `ArgillaTrainer`, and continues adding improvements to the Python client for managing `FeedbackDataset`s.

⚙️ Update `FeedbackDataset` settings from the UI

![Update Feedback Dataset settings from the UI](https://github.com/argilla-io/argilla/assets/29572918/2402b192-abca-4d49-a689-1d7fa60f621e)

`FeedbackDataset` settings page has been updated and now it allows to update the `guidelines` and some attributes of the `fields` and `questions` of the dataset. Did you misspell the title or description of a field or question? Well, you don't have to remove your dataset and create it again anymore! Just go to the settings page and fix it.

🤖 TRL integration with the `ArgillaTrainer`

![ArgillaTrainer code snippet for training reward model with TRL](https://github.com/argilla-io/argilla/assets/29572918/7e03dabc-456a-4745-bf37-4ea89d35e4da)

The famous TRL package for training Transformers with Reinforcement Learning techniques has been integrated with the ArgillaTrainer, that comes with four new `TrainingTask`: SFT, Reward Modeling, PPO and DPO. Each training task expects a formatting function that will return the data in the expected format for training the model.

Check this [🆕 tutorial for training a Reward Model using the Argilla Trainer](https://docs.argilla.io/en/latest/guides/llms/examples/train-reward-model-rlhf.html).

🐍 Filter `FeedbackDataset` and remove suggestions

![Using FeedbackDataset filter method](https://github.com/argilla-io/argilla/assets/29572918/a49136bc-49c9-47e3-b961-4b37952939e0)

In the 1.14.0 release we added many improvements for working with remote `FeedbackDataset`s. In this release, a new `filter_by` method has been added that allows to filter the records of a dataset from the Python client. For now, the records can be only filtered using the `response_status`, but we're planning adding more complex filters for the upcoming releases. In addition, new methods have been added allowing to remove the suggestions created for a record.

[1.15.0](https://github.com/argilla-io/argilla/compare/v1.14.1...v1.15.0)

Added

- Added `Enable to update guidelines and dataset settings for Feedback Datasets directly in the UI` ([3489](https://github.com/argilla-io/argilla/pull/3489))
- Added `ArgillaTrainer` integration with TRL, allowing for easy supervised finetuning, reward modeling, direct preference optimization and proximal policy optimization ([3467](https://github.com/argilla-io/argilla/pull/3467))
- Added `formatting_func` to `ArgillaTrainer` for `FeedbackDataset` datasets add a custom formatting for the data ([3599](https://github.com/argilla-io/argilla/pull/3599)).
- Added `login` function in `argilla.client.login` to login into an Argilla server and store the credentials locally ([3582](https://github.com/argilla-io/argilla/pull/3582)).
- Added `login` command to login into an Argilla server ([3600](https://github.com/argilla-io/argilla/pull/3600)).
- Added `logout` command to logout from an Argilla server ([3605](https://github.com/argilla-io/argilla/pull/3605)).
- Added `DELETE /api/v1/suggestions/{suggestion_id}` endpoint to delete a suggestion given its ID ([3617](https://github.com/argilla-io/argilla/pull/3617)).
- Added `DELETE /api/v1/records/{record_id}/suggestions` endpoint to delete several suggestions linked to the same record given their IDs ([3617](https://github.com/argilla-io/argilla/pull/3617)).
- Added `response_status` param to `GET /api/v1/datasets/{dataset_id}/records` to be able to filter by `response_status` as previously included for `GET /api/v1/me/datasets/{dataset_id}/records` ([3613](https://github.com/argilla-io/argilla/pull/3613)).
- Added `list` classmethod to `ArgillaMixin` to be used as `FeedbackDataset.list()`, also including the `workspace` to list from as arg ([3619](https://github.com/argilla-io/argilla/pull/3619)).
- Added `filter_by` method in `RemoteFeedbackDataset` to filter based on `response_status` ([3610](https://github.com/argilla-io/argilla/pull/3610)).
- Added `list_workspaces` function (to be used as `rg.list_workspaces`, but `Workspace.list` is preferred) to list all the workspaces from an user in Argilla ([3641](https://github.com/argilla-io/argilla/pull/3641)).
- Added `list_datasets` function (to be used as `rg.list_datasets`) to list the `TextClassification`, `TokenClassification`, and `Text2Text` datasets in Argilla ([3638](https://github.com/argilla-io/argilla/pull/3638)).
- Added `RemoteSuggestionSchema` to manage suggestions in Argilla, including the `delete` method to delete suggestios from Argilla via `DELETE /api/v1/suggestions/{suggestion_id}` ([3651](https://github.com/argilla-io/argilla/pull/3651)).
- Added `delete_suggestions` to `RemoteFeedbackRecord` to remove suggestions from Argilla via `DELETE /api/v1/records/{record_id}/suggestions` ([3651](https://github.com/argilla-io/argilla/pull/3651)).

Changed

- Changed `Optional label for * mark for required question` ([3608](https://github.com/argilla-io/argilla/pull/3608))
- Updated `RemoteFeedbackDataset.delete_records` to use batch delete records endpoint ([3580](https://github.com/argilla-io/argilla/pull/3580)).
- Included `allowed_for_roles` for some `RemoteFeedbackDataset`, `RemoteFeedbackRecords`, and `RemoteFeedbackRecord` methods that are only allowed for users with roles `owner` and `admin` ([3601](https://github.com/argilla-io/argilla/pull/3601)).
- Renamed `ArgillaToFromMixin` to `ArgillaMixin` ([3619](https://github.com/argilla-io/argilla/pull/3619)).
- Move `users` CLI app under `database` CLI app ([3593](https://github.com/argilla-io/argilla/pull/3593)).
- Move server `Enum` classes to `argilla.server.enums` module ([3620](https://github.com/argilla-io/argilla/pull/3620)).

Fixed

- Fixed `Filter by workspace in breadcrumbs` ([3577](https://github.com/argilla-io/argilla/pull/3577))
- Fixed `Filter by workspace in datasets table` ([3604](https://github.com/argilla-io/argilla/pull/3604))
- Fixed `Query search highlight` for Text2Text and TextClassification ([3621](https://github.com/argilla-io/argilla/pull/3621))
- Fixed `RatingQuestion.values` validation to raise a `ValidationError` when values are out of range i.e. [1, 10] ([3626](https://github.com/argilla-io/argilla/pull/3626)).

Removed

- Removed `multi_task_text_token_classification` from `TaskType` as not used ([3640](https://github.com/argilla-io/argilla/pull/3640)).
- Removed `argilla_id` in favor of `id` from `RemoteFeedbackDataset` ([3663](https://github.com/argilla-io/argilla/pull/3663)).
- Removed `fetch_records` from `RemoteFeedbackDataset` as now the records are lazily fetched from Argilla ([3663](https://github.com/argilla-io/argilla/pull/3663)).
- Removed `push_to_argilla` from `RemoteFeedbackDataset`, as it just works when calling it through a `FeedbackDataset` locally, as now the updates of the remote datasets are automatically pushed to Argilla ([3663](https://github.com/argilla-io/argilla/pull/3663)).
- Removed `set_suggestions` in favor of `update(suggestions=...)` for both `FeedbackRecord` and `RemoteFeedbackRecord`, as all the updates of any "updateable" attribute of a record will go through `update` instead ([3663](https://github.com/argilla-io/argilla/pull/3663)).
- Remove unused `owner` attribute for client Dataset data model ([3665](https://github.com/argilla-io/argilla/pull/3665))

As always, thanks to our amazing contributors

* peppinob-ol made their first contribution in https://github.com/argilla-io/argilla/pull/3472
* eshwarhs made their first contribution in https://github.com/argilla-io/argilla/pull/3605

**Full Changelog**: https://github.com/argilla-io/argilla/compare/v1.14.1...v1.15.0

Page 6 of 22

Releases

Has known vulnerabilities

Previous Next

Argilla

Page 6 of 22

1.19.0

1.18.0

1.17.0

1.16.0

1.15.1

1.15.0

Page 6 of 22

Links

Releases