Argilla

Latest version: v2.8.0

Safety actively analyzes 723650 Python packages for vulnerabilities to keep your Python projects secure.

Page 8 of 22

1.12.1

[1.12.1](https://github.com/argilla-io/argilla/compare/v1.12.0...v1.12.1)

Fixed

- Using `rg.init` with default `argilla` user skips setting the default workspace if not available. (Closes [3340](https://github.com/argilla-io/argilla/issues/3340))
- Resolved wrong import structure `ArgillaTrainer` `TrainingTaskMapping` (Closes [3345](https://github.com/argilla-io/argilla/issues/3345))
- Pin pydantic dependency to version < 2 (Closes [3348](https://github.com/argilla-io/argilla/issues/3348))

1.12.0

🔆 Highlights

New `RankingQuestion` in Feedback Task datasets
![Ranking Question snapshot](https://github.com/argilla-io/argilla/assets/126158523/2e58bf8a-0172-435d-9483-ad0dc498051f)
Now you will be able to include `RankingQuestion`s in your Feedback datasets. These are specially designed to gather feedback on labeler's preferences, by providing a set of options that labelers can order.
Here's how you can add a `RankingQuestion` to a `FeedbackDataset`:
`python
dataset = FeedbackDataset(
fields=[
rg.TextField(name="prompt"),
rg.TextField(name="reply-1", title="Reply 1"),
rg.TextField(name="reply-2", title="Reply 2"),
rg.TextField(name="reply-3", title="Reply 3"),
],
questions=[
rg.RankingQuestion(
name="ranking",
title="Order replies based on your preference",
description="1 = best, 3 = worst. Ties are allowed.",
required=True,
values={"reply-1": "Reply 1", "reply-2": "Reply 2", "reply-3": "Reply 3"} or ["reply-1", "reply-2", "reply-3"]
]
)
`
More info in [our docs](https://docs.argilla.io/en/latest/guides/llms/practical_guides/create_dataset.html#define-questions).

Extended training support
You can now format responses from `RatingQuestion`, `LabelQuestion` and `MultiLabelQuestion` for your preferred training framework using the `prepare_for_training` method.

Also, we've added support for `spacy-transformers` in our Argilla Trainer.

Here's an example code snippet:
python
import argilla.feedback as rg

dataset = rg.FeedbackDataset.from_huggingface(
repo_id="argilla/stackoverflow_feedback_demo"
)
task_mapping = rg.TrainingTaskMapping.for_text_classification(
text=dataset.field_by_name("question"),
label=dataset.question_by_name("tags")
)
trainer = rg.ArgillaTrainer(
dataset=dataset,
task_mapping=task_mapping,
framework="spacy-transformers",
fetch_records=False
)
trainer.update_config(num_train_epochs=2)
trainer.train(output_dir="my_awesone_model")
`
To learn more about how to use Argilla Trainer check [our docs](https://docs.argilla.io/en/latest/guides/llms/practical_guides/fine_tune_others.html).

[Changelog 1.12.0](https://github.com/argilla-io/argilla/compare/v1.11.0...v1.12.0)

Added

- Added `RankingQuestionSettings` class allowing to create ranking questions in the API using `POST /api/v1/datasets/{dataset_id}/questions` endpoint ([3232](https://github.com/argilla-io/argilla/pull/3232))
- Added `RankingQuestion` in the Python client to create ranking questions ([3275](https://github.com/argilla-io/argilla/issues/3275)).
- Added `Ranking` component in feedback task question form ([3177](https://github.com/argilla-io/argilla/pull/3177) & [#3246](https://github.com/argilla-io/argilla/pull/3246)).
- Added `FeedbackDataset.prepare_for_training` method for generaring a framework-specific dataset with the responses provided for `RatingQuestion`, `LabelQuestion` and `MultiLabelQuestion` ([3151](https://github.com/argilla-io/argilla/pull/3151)).
- Added `ArgillaSpaCyTransformersTrainer` class for supporting the training with `spacy-transformers` ([3256](https://github.com/argilla-io/argilla/pull/3256)).

Changed

- All docker related files have been moved into the `docker` folder ([3053](https://github.com/argilla-io/argilla/pull/3053)).
- `release.Dockerfile` have been renamed to `Dockerfile` ([3133](https://github.com/argilla-io/argilla/pull/3133)).
- Updated `rg.load` function to raise a `ValueError` with a explanatory message for the cases in which the user tries to use the function to load a `FeedbackDataset` ([3289](https://github.com/argilla-io/argilla/pull/3289)).
- Updated `ArgillaSpaCyTrainer` to allow re-using `tok2vec` ([3256](https://github.com/argilla-io/argilla/pull/3256)).

Fixed

- Check available workspaces on Argilla on `rg.set_workspace` (Closes [3262](https://github.com/argilla-io/argilla/issues/3262))

New Contributors
* garimau made their first contribution in https://github.com/argilla-io/argilla/pull/3255
* adurante92 made their first contribution in https://github.com/argilla-io/argilla/pull/3242

**Full Changelog**: https://github.com/argilla-io/argilla/compare/v1.11.0...v1.12.0

1.11.0

🔆 Highlights

New `owner` role and user update command
We've added a new user role, `owner`, that has permissions over all users, workspaces and datasets in Argilla (like the `admin` role in earlier versions). From this version, the `admin` role will only have permissions over datasets and users in workspaces assigned to them.
You can change a user from admin to owner using a simple CLI command: `python -m argilla users update argilla --role owner`.

Improved user and workspace management
You can now get lists of users and workspaces, create new ones and give users access to workspaces directly from the Python SDK. Note that only owners will have permissions for all these actions. Admins will be able to give users access to workspaces where they have access.

Metadata fields for Feedback records

You can now add metadata information to your records. This is useful to store information that's not needed for the labeling UI but important for downstream usage (e.g., prompt id, model IDs, etc.)

[Changelog 1.11.0](https://github.com/argilla-io/argilla/compare/v1.10.0...v1.11.0)

Fixed

- Replaced `np.float` alias by `float` to avoid `AttributeError` when using `find_label_errors` function with `numpy>=1.24.0` ([3214](https://github.com/argilla-io/argilla/pull/3214)).
- Fixed `format_as("datasets")` when no responses or optional respones in `FeedbackRecord`, to set their value to what 🤗 Datasets expects instead of just `None` ([3224](https://github.com/argilla-io/argilla/pull/3224)).
- Fixed `push_to_huggingface()` when `generate_card=True` (default behaviour), as we were passing a sample record to the `ArgillaDatasetCard` class, and `UUID`s introduced in 1.10.0 ([3192](https://github.com/argilla-io/argilla/pull/3192)), are not JSON-serializable ([#3231](https://github.com/argilla-io/argilla/pull/3231)).
- Fixed `from_argilla` and `push_to_argilla` to ensure consistency on both field and question re-construction, and to ensure `UUID`s are properly serialized as `str`, respectively ([3234](https://github.com/argilla-io/argilla/pull/3234)).

Added

- Added `metadata` attribute to the `Record` of the `FeedbackDataset` ([3194](https://github.com/argilla-io/argilla/pull/3194))
- New `users update` command to update the role for an existing user ([3188](https://github.com/argilla-io/argilla/pull/3188))
- New `Workspace` class to allow users manage their Argilla workspaces and the users assigned to those workspaces via the Python client ([3180](https://github.com/argilla-io/argilla/pull/3180))
- Added `User` class to let users manage their Argilla users via the Python client ([3169](https://github.com/argilla-io/argilla/pull/3169)).
- Added an option to display `tqdm` progress bar to `FeedbackDataset.push_to_argilla` when looping over the records to upload ([3233](https://github.com/argilla-io/argilla/pull/3233)).

Changed

- The role system now support three different roles `owner`, `admin` and `annotator` ([3104](https://github.com/argilla-io/argilla/pull/3104))
- `admin` role is scoped to workspace-level operations ([3115](https://github.com/argilla-io/argilla/pull/3115))
- The `owner` user is created among the default pool of users in the quickstart, and the default user in the server has now `owner` role ([3248](https://github.com/argilla-io/argilla/pull/3248)), reverting ([#3188](https://github.com/argilla-io/argilla/pull/3188)).

Deprecated

- As of Python 3.7 end-of-life (EOL) on 2023-06-27, Argilla will no longer support Python 3.7 ([3188](https://github.com/argilla-io/argilla/pull/33188)). More information at https://peps.python.org/pep-0537/

As always, thanks to our amazing contributors!

* damianpumar made their first contribution in https://github.com/argilla-io/argilla/pull/2950
* MedAmine-SUDO made their first contribution in https://github.com/argilla-io/argilla/pull/3204
* manulpatel made their first contribution in https://github.com/argilla-io/argilla/pull/3233

1.10.0

🔆 Highlights

Search records in Feedback Task
We've added a search bar in the Feedback Task UI so you can filter records based on specific words or phrases.

Extended markdown support
Annotation guidelines are now rendered as markdown text to make them easier to read and have a more flexible format.

Train button in Feedback Task
Admin users have access to a `Train </>` button in the Feedback Task UI with quick links to all the information needed to train a model with the feedback gathered in Argilla.

[Changelog 1.10.0](https://github.com/argilla-io/argilla/compare/v1.9.0...v1.10.0)

Added

- Added search component for feedback datasets ([3138](https://github.com/argilla-io/argilla/pull/3138))
- Added markdown support for feedback dataset guidelines ([3153](https://github.com/argilla-io/argilla/pull/3153))
- Added Train button for feedback datasets ([3170](https://github.com/argilla-io/argilla/pull/3170))

Changed

- Updated `SearchEngine` and `POST /api/v1/me/datasets/{dataset_id}/records/search` to return the `total` number of records matching the search query ([3166](https://github.com/argilla-io/argilla/pull/3166))

Fixed

- Replaced Enum for string value in URLs for client API calls (Closes [3149](https://github.com/argilla-io/argilla/issues/3149))
- Resolve breaking issue with `ArgillaSpanMarkerTrainer` for Named Entity Recognition with `span_marker` v1.1.x onwards.
- Move `ArgillaDatasetCard` import under `requires_version` decorator, so that the `ImportError` on `huggingface_hub` is handled properly ([3174](https://github.com/argilla-io/argilla/pull/3174))
- Allow flow `FeedbackDataset.from_argilla` -> `FeedbackDataset.push_to_argilla` under different dataset names and/or workspaces ([3192](https://github.com/argilla-io/argilla/issues/3192))

As always, thanks to our amazing contributors!
* hjain5164 made their first contribution in https://github.com/argilla-io/argilla/pull/3146
* Fancman made their first contribution in https://github.com/argilla-io/argilla/pull/3150
* preetgami made their first contribution in https://github.com/argilla-io/argilla/pull/3196

**Full Changelog**: https://github.com/argilla-io/argilla/compare/v1.9.0...v1.10.0

1.9.0

🔆 Highlights
New question types in Feedback Datasets
<img width="1508" alt="Screenshot of a Feedback Dataset with the new Label and MultiLabel questions and markdown support" src="https://github.com/argilla-io/argilla/assets/126158523/83f018e5-d503-4d72-baea-eee749563c3c">

We've included two new question types in Feedback Datasets: `LabelQuestion` and `MultiLabelQuestion`. These are specially useful for applying one or multiple labels to a record, for example, for text classification tasks. In this new view, you can add multiple classification questions and even combine them with the other question types available in Feedback Datasets: `RatingQuestion` and `TextQuestion`.

Markdown support in Feedback Fields and Text Questions
You can now add the `use_markdown=True` tag to a `TextField` or a `TextQuestion` to have the UI render the text as markdown. You can use this to read and write code, tables or even add images.

<img width="1507" alt="Screenshot of a Feedback Dataset with rendered markdown in a record field and a text question" src="https://github.com/argilla-io/argilla/assets/126158523/b92440d7-d44e-4f04-b613-3d652427c437">

Further improvements in Feedback Datasets
We continue to add improvements to our new Feedback Datasets:
- We've added checks to avoid having fields and questions with repeated names.
- Dataset cards generated using `FeedbackDataset.push_to_huggingface(generate_card=True)` now follow the official Hugging Face template.

[Changelog 1.9.0](https://github.com/argilla-io/argilla/compare/v1.8.0...v1.9.0)

Added

- Added boolean `use_markdown` property to `TextFieldSettings` model ([3000](https://github.com/argilla-io/argilla/pull/3000))
- Added boolean `use_markdown` property to `TextQuestionSettings` model ([3000](https://github.com/argilla-io/argilla/pull/3000)).
- Added new status `draft` for the `Response` model ([3033](https://github.com/argilla-io/argilla/pull/3033))
- Added `LabelSelectionQuestionSettings` class allowing to create label selection (single-choice) questions in the API ([3005](https://github.com/argilla-io/argilla/pull/3005))
- Added `MultiLabelSelectionQuestionSettings` class allowing to create multi-label selection (multi-choice) questions in the API ([3010](https://github.com/argilla-io/argilla/pull/3010)).
- Added `POST /api/v1/me/datasets/{dataset_id}/records/search` endpoint ([3068](https://github.com/argilla-io/argilla/pull/3068)).
- Added new components in feedback task Question form: MultiLabel ([3064](https://github.com/argilla-io/argilla/pull/3064)) and SingleLabel ([#3016](https://github.com/argilla-io/argilla/pull/3016)).
- Added docstrings to the `pydantic.BaseModel`s defined at `argilla/client/feedback/schemas.py` ([3137](https://github.com/argilla-io/argilla/pull/3137))

Changed

- Updated `GET /api/v1/me/datasets/:dataset_id/metrics` output payload to include the count of responses with `draft` status ([3033](https://github.com/argilla-io/argilla/pull/3033))
- Database setup for unit tests. Now the unit tests use a different database than the one used by the local Argilla server (Closes [2987](https://github.com/argilla-io/argilla/issues/2987)).
- Updated `alembic` setup to be able to autogenerate revision/migration scripts using SQLAlchemy metadata from Argilla server models ([3044](https://github.com/argilla-io/argilla/pull/3044))
- Improved `DatasetCard` generation on `FeedbackDataset.push_to_huggingface` when `generate_card=True`, following the official HuggingFace Hub template, but suited to `FeedbackDataset`s from Argilla ([3110](https://github.com/argilla-io/argilla/pull/3100))

Fixed

- Disallow `fields` and `questions` in `FeedbackDataset` with the same name ([3126](https://github.com/argilla-io/argilla/pull/3126)).

As always, thanks to our amazing contributors!
* gitrock made their first contribution in https://github.com/argilla-io/argilla/pull/3091
* ChadDa3mon made their first contribution in https://github.com/argilla-io/argilla/pull/3092

1.8.0

🔆 Highlights

New Feedback Task 🎉
<img width="1508" alt="snapshot-feedback-demo" src="https://github.com/argilla-io/argilla/assets/126158523/c07d0918-146f-4d1f-97b3-1db06a6d741c">
Big welcome to our new `FeedbackDataset`! This new type of dataset is designed to cover the specific needs of working with LLMs. Use this task to gather demonstration examples, human feedback, curate other datasets... Questions of different types can be combined so you can adapt your dataset to the specific needs of your project. Currently, it supports `RatingQuestion` and `TextQuestion`, but more question types will be added shortly in the coming releases.

In addition, these datasets support multiple annotations: all users with access to the dataset can give their responses.

The `FeedbackDataset` has an enhanced integration with the Hugging Face Hub, so that saving a dataset to the Hub or pushing a `FeedbackDataset` from the Hub directly to Argilla is seamless.

Check all the things you can do with Feedback Tasks [in our docs](https://docs.argilla.io/en/latest/guides/llms/conceptual_guides/conceptual_guides.html)

New LLM section in our docs
We've added a new section in our docs that covers:
- Useful concepts around work with [LLMs](https://docs.argilla.io/en/latest/guides/llms/conceptual_guides/conceptual_guides.html)
- How-to guides that cover all the functionalities of the new [Feedback Task](https://docs.argilla.io/en/latest/guides/llms/practical_guides/practical_guides.html)
- [End-to-end examples](https://docs.argilla.io/en/latest/guides/llms/examples/examples.html)

More training integrations
We've added new frameworks for the `ArgillaTrainer`: `ArgillaPeftTrainer` for Text and Token Classification and `ArgillaAutoTrainTrainer` for Text Classification.

[Changelog 1.8.0](https://github.com/argilla-io/argilla/compare/v1.7.0...v1.8.0)

Added

- `/api/v1/datasets` new endpoint to list and create datasets ([2615]).
- `/api/v1/datasets/{dataset_id}` new endpoint to get and delete datasets ([2615]).
- `/api/v1/datasets/{dataset_id}/publish` new endpoint to publish a dataset ([2615]).
- `/api/v1/datasets/{dataset_id}/questions` new endpoint to list and create dataset questions ([2615])
- `/api/v1/datasets/{dataset_id}/fields` new endpoint to list and create dataset fields ([2615])
- `/api/v1/datasets/{dataset_id}/questions/{question_id}` new endpoint to delete a dataset questions ([2615])
- `/api/v1/datasets/{dataset_id}/fields/{field_id}` new endpoint to delete a dataset field ([2615])
- `/api/v1/workspaces/{workspace_id}` new endpoint to get workspaces by id ([2615])
- `/api/v1/responses/{response_id}` new endpoint to update and delete a response ([2615])
- `/api/v1/datasets/{dataset_id}/records` new endpoint to create and list dataset records ([2615])
- `/api/v1/me/datasets` new endpoint to list user visible datasets ([2615])
- `/api/v1/me/dataset/{dataset_id}/records` new endpoint to list dataset records with user responses ([2615])
- `/api/v1/me/datasets/{dataset_id}/metrics` new endpoint to get the dataset user metrics ([2615])
- `/api/v1/me/records/{record_id}/responses` new endpoint to create record user responses ([2615])
- showing new feedback task datasets in datasets list ([2719])
- new page for feedback task ([2680])
- show feedback task metrics ([2822])
- user can delete dataset in dataset settings page ([2792])
- Support for `FeedbackDataset` in Python client (parent PR [2615], and nested PRs: [2949], [2827], [2943], [2945], [2962], and [3003])
- Integration with the HuggingFace Hub ([2949])
- Added `ArgillaPeftTrainer` for text and token classification [2854](https://github.com/argilla-io/argilla/issues/2854)
- Added `predict_proba()` method to `ArgillaSetFitTrainer`
- Added `ArgillaAutoTrainTrainer` for Text Classification [2664](https://github.com/argilla-io/argilla/issues/2664)
- New `database revisions` command showing database revisions info [2615]: https://github.com/argilla-io/argilla/issues/2615

Fixes

- Avoid rendering html for invalid html strings in Text2text ([2911]https://github.com/argilla-io/argilla/issues/2911)

Changed

- The `database migrate` command accepts a `--revision` param to provide specific revision id
- `tokens_length` metrics function returns empty data ([3045])
- `token_length` metrics function returns empty data ([3045])
- `mention_length` metrics function returns empty data ([3045])
- `entity_density` metrics function returns empty data ([3045])

Deprecated

- Using argilla with python 3.7 runtime is deprecated and support will be removed from version 1.9.0 ([2902](https://github.com/argilla-io/argilla/issues/2902))
- `tokens_length` metrics function has been deprecated and will be removed in 1.10.0 ([3045])
- `token_length` metrics function has been deprecated and will be removed in 1.10.0 ([3045])
- `mention_length` metrics function has been deprecated and will be removed in 1.10.0 ([3045])
- `entity_density` metrics function has been deprecated and will be removed in 1.10.0 ([3045])

Removed

- Removed mention `density`, `tokens_length` and `chars_length` metrics from token classification metrics storage ([3045])
- Removed token `char_start`, `char_end`, `tag`, and `score` metrics from token classification metrics storage ([3045])
- Removed tags-related metrics from token classification metrics storage ([3045])

[3045]: https://github.com/argilla-io/argilla/pull/3045

As always, thanks to our amazing contributors!
* Fix image alignment on token classification by cceyda in https://github.com/argilla-io/argilla/pull/2779
* Update cloud_providers.md by chainyo in https://github.com/argilla-io/argilla/pull/2866

Page 8 of 22

Releases

Has known vulnerabilities

Previous Next

Argilla

Page 8 of 22

1.12.1

1.12.0

1.11.0

1.10.0

1.9.0

1.8.0

Page 8 of 22

Links

Releases