Unitxt

Latest version: v1.20.0


1.7.6

What's Changed
The most significant change in this release is the addition of the `\N` notation (backslash, capital N) to formats. With `\N` you can mark places where you want exactly one newline, collapsing any newlines that come right before it.

A more detailed explanation, if you want to go deeper:
> The Capital New Line Notation (\N) is designed to manage newline behavior in a string efficiently.
This custom notation aims to consolidate multiple newline characters (\n) into a single newline under
specific conditions, with tailored handling based on whether there's preceding text. The function
distinguishes between two primary scenarios:
1. If there's text (referred to as a prefix) followed by any number of \n characters and then one or
more \N, the entire sequence is replaced with a single \n. This effectively simplifies multiple
newlines and notation characters into a single newline when there's preceding text.
2. If the string starts with \n characters followed by \N without any text before this sequence, or if
\N is at the very beginning of the string, the sequence is completely removed. This case is
applicable when the notation should not introduce any newlines due to the absence of preceding text.
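The two rules above can be sketched in a few lines of Python. This is an illustrative reimplementation based only on the description given here, not necessarily the code unitxt ships; the function name `collapse_capital_newlines` is made up for this example, and the sketch also accepts `\n` and `\N` interleaved within a run, which is slightly broader than the wording above.

```python
import re


def collapse_capital_newlines(text: str) -> str:
    """Apply the \\N rules described above (illustrative sketch, hypothetical name)."""
    # Rule 2: a run of "\n" / "\N" that ends in "\N" at the very start of the string is dropped.
    text = re.sub(r"^(?:\n|\\N)*\\N", "", text)
    # Rule 1: anywhere else, such a run (which now has text before it) becomes one "\n".
    return re.sub(r"(?:\n|\\N)*\\N", "\n", text)


# "\\N" in a normal Python string literal is the two characters backslash + N.
print(repr(collapse_capital_newlines("prompt\n\n\\N\\Nquestion")))  # 'prompt\nquestion'
print(repr(collapse_capital_newlines("\n\\Nquestion")))  # 'question'
```

Running rule 2 before rule 1 matters: a leading run has to be deleted first, otherwise rule 1 would turn it into a newline at the start of the string.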


This allows two things:
First, system formats can be defined without unnecessary newlines when the instruction or the system prompt is missing.
Second, any newlines created by the template can be ignored, so the number of newlines is set by the format alone.

For example if we defined the system format in the following way:
```python
from unitxt.formats import SystemFormat

format = SystemFormat(model_input_format="{system_prompt}\n{instruction}\n|user|\n{source}\n|assistant|\n{target_prefix}")
```

We would face two issues:
1. If the system prompt or the instruction is empty, its slot still leaves a newline behind for no reason (two newlines if both are empty).
2. If the source ends with a newline (mostly due to the template structure), we would have an unnecessary empty line before "|assistant|".
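Both issues can be reproduced without unitxt at all. In the sketch below, plain `str.format` stands in for `SystemFormat` (an assumption made purely for illustration), and the sample source text is invented.

```python
# The "\n"-based layout from the example above, rendered with plain str.format.
layout = "{system_prompt}\n{instruction}\n|user|\n{source}\n|assistant|\n{target_prefix}"

rendered = layout.format(
    system_prompt="",           # empty system prompt
    instruction="",             # empty instruction
    source="What is 2 + 2?\n",  # source that already ends with a newline
    target_prefix="",
)
print(repr(rendered))
# '\n\n|user|\nWhat is 2 + 2?\n\n|assistant|\n'
```

The output starts with two stray newlines left by the empty fields, and the newline at the end of the source doubles up with the one written in the layout.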

Both problems are solved with \N notation:
```python
from unitxt.formats import SystemFormat

format = SystemFormat(model_input_format="{system_prompt}\\N{instruction}\\N|user|\n{source}\\N|assistant|\n{target_prefix}")
```


Breaking changes

* Fix typo in MultipleChoiceTemplate field choices_seperator -> choices_separator
* Deprecation of the use_query option in all operators. For now it only raises a warning, but it will be removed in the next major release. The new default behavior is equivalent to use_query=True.

All Changes

Bug Fixes:
* Fix error in unitxt versions conflict and improve message by elronbandel in https://github.com/IBM/unitxt/pull/730
* Fix wrong handling of list in dict_get by yoavkatz in https://github.com/IBM/unitxt/pull/733
* Fix classification datasets with wrong schema by elronbandel in https://github.com/IBM/unitxt/pull/735
* Fix codespell by elronbandel in https://github.com/IBM/unitxt/pull/742
* Fix UI errors caused by grammar tasks by elronbandel in https://github.com/IBM/unitxt/pull/750
* Fix src layout and enforce its rules with pre-commit hooks by elronbandel in https://github.com/IBM/unitxt/pull/753

Assets Fixes:
* Rename to correct model name by eladven in https://github.com/IBM/unitxt/pull/729

New Features:
* Add notion of \N to formats, to fix format new line clashes by elronbandel in https://github.com/IBM/unitxt/pull/751
* Ability to dynamically change InstanceMetric inputs + grammar metrics by arielge in https://github.com/IBM/unitxt/pull/736
* Add DeprecatedFIeld for more informative procedure for deprecating fields of artifacts by dafnapension in https://github.com/IBM/unitxt/pull/741

New Assets:
* Add rerank recall metric to unitxt by jlqibm in https://github.com/IBM/unitxt/pull/662
* Add many selection and human preference tasks and datasets by elronbandel in https://github.com/IBM/unitxt/pull/746
* Adding Detector metric for running any classifier from huggingface as a metric by mnagired in https://github.com/IBM/unitxt/pull/745
* Add operators: RegexSplit, TokensSplit, Chunk by elronbandel in https://github.com/IBM/unitxt/pull/749
* Add bert score large and base versions by assaftibm in https://github.com/IBM/unitxt/pull/748

Enhancements:
* Remove use_dpath parameter from dict_get and dict_set by dafnapension in https://github.com/IBM/unitxt/pull/727
* Add mock judge test to cohere for ai by perlitz in https://github.com/IBM/unitxt/pull/720

New Contributors
* mnagired made their first contribution in https://github.com/IBM/unitxt/pull/745

**Full Changelog**: https://github.com/IBM/unitxt/compare/1.7.4...1.7.6

1.7.4

Breaking Changes
* Add generic mechanism to check prediction and reference types in metrics by yoavkatz in https://github.com/IBM/unitxt/pull/667. See the explanation in the previous sections for why this change is breaking.

New Features
* Add ability to fuse sources with disjoint splits by yoavkatz in https://github.com/IBM/unitxt/pull/707
* Allow max reduction type in metric to find the best overall score over all instances by yoavkatz in https://github.com/IBM/unitxt/pull/709
* Add string operators module with many standard string operators by elronbandel in https://github.com/IBM/unitxt/pull/721
* Allow disabling per group f1 scores in customF1 by yoavkatz in https://github.com/IBM/unitxt/pull/719
* Add improved type inference capabilities, inferring type_string from a given object, and infer_type therefrom via parse_type_string by dafnapension in https://github.com/IBM/unitxt/pull/706
* Add description and tags to every catalog artifact by elronbandel in https://github.com/IBM/unitxt/pull/725
* allow contexts not to be entered to metric by perlitz in https://github.com/IBM/unitxt/pull/653
* Add control over metrics and postprocessors through the recipe by elronbandel in https://github.com/IBM/unitxt/pull/663
* Add coqa and dialog processing capabilities by elronbandel in https://github.com/IBM/unitxt/pull/640
* Add pandas_load_args for LoadCSV by elronbandel in https://github.com/IBM/unitxt/pull/696
* Add safe and complete type parsing function to type_utils, for allowing better type checking. by elronbandel in https://github.com/IBM/unitxt/pull/688
* Add deprecation decorator for warning and errors for deprecation of functions and classes by elronbandel in https://github.com/IBM/unitxt/pull/689
* Add choices shuffling to MultipleChoiceTemplate by elronbandel in https://github.com/IBM/unitxt/pull/678
* Make settings utils type sensitive by elronbandel in https://github.com/IBM/unitxt/pull/674

New Assets
* Add intl to korean and arabic + improved packaged dependency checks by pklpriv in https://github.com/IBM/unitxt/pull/698
* Added BERT Score with new embedding model "distilbert-base-uncased" by shivangibithel in https://github.com/IBM/unitxt/pull/703
* Grammatical error correction task by arielge in https://github.com/IBM/unitxt/pull/718
* Add trec dataset by elronbandel in https://github.com/IBM/unitxt/pull/723
* Add templates for flan text similarity by elronbandel in https://github.com/IBM/unitxt/pull/728
* Add metrics for binary tasks with float predictions by lilacheden in https://github.com/IBM/unitxt/pull/654
* Add mistral format by elronbandel in https://github.com/IBM/unitxt/pull/660
* Added new metric for unsorted_list_exact_math by yoavkatz in https://github.com/IBM/unitxt/pull/685
* Add flan wnli truthfulness format by elronbandel in https://github.com/IBM/unitxt/pull/665
* DuplicateInstances operator by pawelknes in https://github.com/IBM/unitxt/pull/682
* introduce arabic to normalized sacrebleu by pklpriv in https://github.com/IBM/unitxt/pull/638
* 20newsgroup from sklearn by ilyashnil in https://github.com/IBM/unitxt/pull/659
* Add match_closest_option post processor for multiple choice qa by elronbandel in https://github.com/IBM/unitxt/pull/679
* Duplicate instance operator - new functionality by pawelknes in https://github.com/IBM/unitxt/pull/687
* Add babi qa dataset by elronbandel in https://github.com/IBM/unitxt/pull/666

Asset Fixes
* Add missing instruction in labrador zero shot format by alonh in https://github.com/IBM/unitxt/pull/716
* Fix title template for classification by elronbandel in https://github.com/IBM/unitxt/pull/722
* prevent cohere4ai using judge as default by perlitz in https://github.com/IBM/unitxt/pull/664
* fix summarization template by gitMichal in https://github.com/IBM/unitxt/pull/652

Bug Fixes
* Fix handling of boolean environment variables by arielge in https://github.com/IBM/unitxt/pull/711
* Handle all env variables with expected types by arielge in https://github.com/IBM/unitxt/pull/714
* Properly define the abstract fields by elronbandel in https://github.com/IBM/unitxt/pull/724
* Fix places not using general settings or logger by elronbandel in https://github.com/IBM/unitxt/pull/656
* removal of dpath -- ready for review by dafnapension in https://github.com/IBM/unitxt/pull/680
* fix: LoadFromIBMCloud empty data_dir breaks processing by jezekra1 in https://github.com/IBM/unitxt/pull/668
* Fix bug in references with none by elronbandel in https://github.com/IBM/unitxt/pull/677
* Validating that the prepare dir is consistent with catalog by eladven in https://github.com/IBM/unitxt/pull/683

New Contributors
* shivangibithel made their first contribution in https://github.com/IBM/unitxt/pull/703
* jezekra1 made their first contribution in https://github.com/IBM/unitxt/pull/668
* pklpriv made their first contribution in https://github.com/IBM/unitxt/pull/638
* pawelknes made their first contribution in https://github.com/IBM/unitxt/pull/682

**Full Changelog**: https://github.com/IBM/unitxt/compare/1.7.1...1.7.4

1.7.3

What's Changed
* added BERT Score with new embedding model "distilbert-base-uncased" by shivangibithel in https://github.com/IBM/unitxt/pull/703
* Fix handling of boolean environment variables by arielge in https://github.com/IBM/unitxt/pull/711
* Allow max reduction type in metric to find the best overall score over all instances by yoavkatz in https://github.com/IBM/unitxt/pull/709
* Add ability to fuse sources with disjoint splits by yoavkatz in https://github.com/IBM/unitxt/pull/707
* Handle all env variables with expected types by arielge in https://github.com/IBM/unitxt/pull/714
* Add missing instruction in labrador zero shot format by alonh in https://github.com/IBM/unitxt/pull/716
* Add string operators module by elronbandel in https://github.com/IBM/unitxt/pull/721
* Fix title template for classification by elronbandel in https://github.com/IBM/unitxt/pull/722
* Grammatical error correction task by arielge in https://github.com/IBM/unitxt/pull/718
* properly define the abstract fields by elronbandel in https://github.com/IBM/unitxt/pull/724
* Add trec dataset by elronbandel in https://github.com/IBM/unitxt/pull/723
* add intl to korean and arabic + improved packaged dependency checks by pklpriv in https://github.com/IBM/unitxt/pull/698

New Contributors
* shivangibithel made their first contribution in https://github.com/IBM/unitxt/pull/703

**Full Changelog**: https://github.com/IBM/unitxt/compare/1.7.2...1.7.3

1.7.2

What's Changed
* Add metrics for binary tasks with float predictions by lilacheden in https://github.com/IBM/unitxt/pull/654
* Fix places not using general settings or logger by elronbandel in https://github.com/IBM/unitxt/pull/656
* Add mistral format by elronbandel in https://github.com/IBM/unitxt/pull/660
* allow contexts not to be entered to metric by perlitz in https://github.com/IBM/unitxt/pull/653
* Add control over metrics and postprocessors through the recipe by elronbandel in https://github.com/IBM/unitxt/pull/663
* prevent cohere4ai using judge as default by perlitz in https://github.com/IBM/unitxt/pull/664
* 20newsgroup from sklearn by ilyashnil in https://github.com/IBM/unitxt/pull/659
* Add flan wnli truthfulness format by elronbandel in https://github.com/IBM/unitxt/pull/665
* Add babi qa dataset by elronbandel in https://github.com/IBM/unitxt/pull/666
* fix: LoadFromIBMCloud empty data_dir breaks processing by jezekra1 in https://github.com/IBM/unitxt/pull/668
* Make settings utils type sensitive by elronbandel in https://github.com/IBM/unitxt/pull/674
* Fix bug in references with none by elronbandel in https://github.com/IBM/unitxt/pull/677
* Add choices shuffling to MultipleChoiceTemplate by elronbandel in https://github.com/IBM/unitxt/pull/678
* Add match_closest_option post processor for multiple choice qa by elronbandel in https://github.com/IBM/unitxt/pull/679
* introduce arabic to normalized sacrebleu by pklpriv in https://github.com/IBM/unitxt/pull/638
* DuplicateInstances operator by pawelknes in https://github.com/IBM/unitxt/pull/682
* Validating that the prepare dir is consistent with catalog by eladven in https://github.com/IBM/unitxt/pull/683
* fix summarization template by gitMichal in https://github.com/IBM/unitxt/pull/652
* Added new metric for unsorted_list_exact_math by yoavkatz in https://github.com/IBM/unitxt/pull/685
* Add deprecation decorator for warning and errors for deprecation of functions and classes by elronbandel in https://github.com/IBM/unitxt/pull/689
* Duplicate instance operator - new functionality by pawelknes in https://github.com/IBM/unitxt/pull/687
* Add safe and complete type parsing function to type_utils, for allowing better type checking. by elronbandel in https://github.com/IBM/unitxt/pull/688
* Add pandas_load_args for LoadCSV by elronbandel in https://github.com/IBM/unitxt/pull/696
* Add coqa and dialog processing capabilities by elronbandel in https://github.com/IBM/unitxt/pull/640
* Add generic mechanism to check prediction and reference types in metrics by yoavkatz in https://github.com/IBM/unitxt/pull/667
* removal of dpath -- ready for review by dafnapension in https://github.com/IBM/unitxt/pull/680
* Update version to 1.7.2 by elronbandel in https://github.com/IBM/unitxt/pull/704

New Contributors
* jezekra1 made their first contribution in https://github.com/IBM/unitxt/pull/668
* pklpriv made their first contribution in https://github.com/IBM/unitxt/pull/638
* pawelknes made their first contribution in https://github.com/IBM/unitxt/pull/682

**Full Changelog**: https://github.com/IBM/unitxt/compare/1.7.1...1.7.2

1.7.1

What's Changed
* Update version to 1.7.0 by elronbandel in https://github.com/IBM/unitxt/pull/630
* Return copies of artifacts from the artifacts cache by matanor in https://github.com/IBM/unitxt/pull/612
* Avoid RuntimeWarning in confidence interval computation by matanor in https://github.com/IBM/unitxt/pull/632
* Add essential table processing operators by csrajmohan in https://github.com/IBM/unitxt/pull/627
* Add Capitalize and Substring operators. Add tests. by jlqibm in https://github.com/IBM/unitxt/pull/609
* Add codespell spell checker to pre-commit and fix spelling by elronbandel in https://github.com/IBM/unitxt/pull/633
* Add processors and metrics by lilacheden in https://github.com/IBM/unitxt/pull/634
* Add recipe metadata to the internal stream by elronbandel in https://github.com/IBM/unitxt/pull/636
* Add instance field operator by elronbandel in https://github.com/IBM/unitxt/pull/637
* Fix split in mmlu which was removed in huggingface by elronbandel in https://github.com/IBM/unitxt/pull/645
* Separate inputs processing and instruction processing in templates by elronbandel in https://github.com/IBM/unitxt/pull/644
* Add some operators requirements by elronbandel in https://github.com/IBM/unitxt/pull/643
* more careful before rejecting queries by dafnapension in https://github.com/IBM/unitxt/pull/647
* Add format args and labrador format by elronbandel in https://github.com/IBM/unitxt/pull/649
* Fix instruction preparation for multiple choice by elronbandel in https://github.com/IBM/unitxt/pull/651
* Add utilities for comparing datasets examples between unitxt versions by eladven in https://github.com/IBM/unitxt/pull/650
* add LlamaIndexCorrectnessMetric by perlitz in https://github.com/IBM/unitxt/pull/594

New Contributors
* jlqibm made their first contribution in https://github.com/IBM/unitxt/pull/609

**Full Changelog**: https://github.com/IBM/unitxt/compare/1.7.0...1.7.1

1.7.0

**This release introduces a few significant changes that modify existing conventions:**

1. **Instructions renamed to system_prompts**

This means that from now on, to define a new system-level instruction, you can use this code:

```python
from unitxt.catalog import add_to_catalog
from unitxt.system_prompts import TextualSystemPrompt

system_prompt = TextualSystemPrompt(  # <<<< Class name has changed
    "Below is an instruction that describes a task. Write a response that appropriately completes the request.\n\n"
)

add_to_catalog(system_prompt, "system_prompts.models.alpaca", overwrite=True)  # <<<< Catalog name has changed
```


It also means that all the system-level instructions were moved to the catalog under `system_prompts` instead of `instructions`.
This change breaks old instructions, but it was necessary to enable the next, very useful change.

2. **Templates can now (1) generate a task-specific instruction once at the head of the example, and (2) add a few words the model will say before its final prediction**

This change was requested by many people.

For example, in this COLA dataset example:

```
User: Classify the grammatical acceptability of the following text to one of these options: unacceptable, acceptable. text: Fred watered the plants flat.
Agent: acceptable

User: Classify the grammatical acceptability of the following text to one of these options: unacceptable, acceptable. text: The pond froze solid.
Agent:
```

The instruction "Classify the ..." is repeated for every demonstration. Also, with the current template there is no way to add a few words that the agent will say before the prediction, for instance "Agent: The class is ". With the new changes, both of these important features are enabled.

If the old way of defining templates for classification was:

```python
from unitxt.catalog import add_to_catalog
from unitxt.templates import InputOutputTemplate

add_to_catalog(
    InputOutputTemplate(
        input_format="Classify the {type_of_class} of the following {text_type} to one of these options: {classes}. {text_type}: {text}",
        output_format="{label}",
    ),
    "templates.classification.multi_class.default_no_instruction",
    overwrite=True,
)
```


It is now defined this way:

```python
from unitxt.catalog import add_to_catalog
from unitxt.templates import InputOutputTemplate

add_to_catalog(
    InputOutputTemplate(
        input_format="{text_type}: {text}",  # <<<< Changed
        output_format="{label}",
        target_prefix="The {type_of_class} is ",  # <<<< Added
        instruction="Classify the {type_of_class} of the following {text_type} to one of these options: {classes}.\n",  # <<<< Added
    ),
    "templates.classification.multi_class.instruction",
    overwrite=True,
)
```


The new template fields `instruction` and `target_prefix` will produce this example:

```
Classify the grammatical acceptability of the following text to one of these options: unacceptable, acceptable.

User: text: Fred watered the plants flat.
Agent: The grammatical acceptability is acceptable

User: text: The pond froze solid.
Agent: The grammatical acceptability is
```


Notice how the instruction appears only once, and the target prefix appears right after 'Agent:'.

Read more in the [tutorial on preparing templates](https://unitxt.readthedocs.io/en/latest/docs/adding_template.html).
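To make the roles of `instruction` and `target_prefix` concrete, here is a small plain-Python sketch that assembles the COLA example above. It does not use unitxt: the helper `build_prompt`, its parameters, and the hard-coded "User:" / "Agent:" turn markers are assumptions made for illustration (in unitxt the turn markers would come from the format, not from the template).

```python
def build_prompt(instruction, input_format, output_format, target_prefix, shared, demos, query):
    """Render the instruction once, then one User/Agent turn per example."""
    prefix = target_prefix.format(**shared)
    lines = [instruction.format(**shared)]  # ends with "\n", so a blank line follows it
    for demo in demos:
        lines.append("User: " + input_format.format(**shared, **demo))
        lines.append("Agent: " + prefix + output_format.format(**demo))
        lines.append("")  # blank line between turns
    lines.append("User: " + input_format.format(**shared, **query))
    lines.append("Agent: " + prefix)  # left open for the model to complete
    return "\n".join(lines)


shared = {
    "type_of_class": "grammatical acceptability",
    "text_type": "text",
    "classes": "unacceptable, acceptable",
}
prompt = build_prompt(
    instruction="Classify the {type_of_class} of the following {text_type} to one of these options: {classes}.\n",
    input_format="{text_type}: {text}",
    output_format="{label}",
    target_prefix="The {type_of_class} is ",
    shared=shared,
    demos=[{"text": "Fred watered the plants flat.", "label": "acceptable"}],
    query={"text": "The pond froze solid."},
)
print(prompt)
```

The instruction is formatted once, outside the loop, while the target prefix is added to every "Agent:" line, including the final one that the model is expected to continue.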


3. **Loading from catalog with modifications**

Now you can load an item from the catalog and change its fields. For example, if you want to use a task but with a different metric, you can use this syntax:

```python
from unitxt.blocks import LoadHF, TaskCard
from unitxt.catalog import add_to_catalog

card = TaskCard(
    loader=LoadHF(path="glue", name="cola"),
    preprocess_steps=[...],
    task="tasks.classification.multi_class[metrics=[metrics.matthews_correlation]]",  # <<<< Modified
    templates="templates.classification.multi_class.all",
)

add_to_catalog(card, "cards.cola", overwrite=True)
```


Read more in the [tutorial on loading from the catalog](https://unitxt.readthedocs.io/en/latest/docs/saving_and_loading_from_catalog.html).
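The `name[field=value]` syntax can be read as "a catalog name followed by field overrides in square brackets". The sketch below splits such a reference into those two parts. It is a hypothetical helper (`split_catalog_reference` is not a unitxt function) written only to illustrate the shape of the syntax; unitxt's own parsing may differ, and the override values are kept as raw strings rather than resolved into artifacts.

```python
def split_catalog_reference(reference: str):
    """Split 'name[key=value,...]' into the catalog name and a dict of raw override strings."""
    name, bracket, rest = reference.partition("[")
    if not bracket:
        return name, {}
    if not rest.endswith("]"):
        raise ValueError(f"unbalanced brackets in {reference!r}")
    body = rest[:-1]
    overrides, depth, start = {}, 0, 0
    # Walk the body and cut it at commas that are not nested inside [...] values.
    for i, ch in enumerate(body + ","):
        if ch == "[":
            depth += 1
        elif ch == "]":
            depth -= 1
        elif ch == "," and depth == 0:
            key, _, value = body[start:i].partition("=")
            if key.strip():
                overrides[key.strip()] = value.strip()
            start = i + 1
    return name, overrides


name, overrides = split_catalog_reference(
    "tasks.classification.multi_class[metrics=[metrics.matthews_correlation]]"
)
print(name)       # tasks.classification.multi_class
print(overrides)  # {'metrics': '[metrics.matthews_correlation]'}
```

Tracking bracket depth keeps list values such as `[metrics.matthews_correlation]` in one piece even when they contain commas.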

4. **Renaming of `additional_inputs` to `task_data`**

In an effort to more accurately represent the origin of certain fields within our system, we've renamed the `additional_inputs` parameter to `task_data`. This modification underscores the fact that these fields are derived directly from the task definition itself. This change is crucial for maintaining the integrity and reliability of metrics, as it ensures these fields are validated against the task schema. Consequently, developers crafting metrics for specific tasks can effortlessly ascertain which fields are accessible to them by simply referring to the task schema. This alignment between task definitions and metrics development fosters a more intuitive and efficient workflow for unitxt contributors.

Release Changes
BugFixes:

* Fix parser to allow source name that starts with numeric by marukaz in https://github.com/IBM/unitxt/pull/530
* Avoid race condition when download files to IBM COS cache by yoavkatz in https://github.com/IBM/unitxt/pull/536
* Updating perplexity computation, to apply exp(-x) by assaftibm in https://github.com/IBM/unitxt/pull/534
* Avoid duplicate values in UI by Roni-Friedman in https://github.com/IBM/unitxt/pull/552
* Fixed the test that generated a new entry in the catalog by dafnapension in https://github.com/IBM/unitxt/pull/550
* Fix artifact initialization dict creation to be recursive by elronbandel in https://github.com/IBM/unitxt/pull/559
* Enforce tests to use only local catalogs by elronbandel in https://github.com/IBM/unitxt/pull/564
* Fix multi label classification template and improve debugging by yoavkatz in https://github.com/IBM/unitxt/pull/571
* Fix classification code so multi-label metrics are not aware of 'none' by yoavkatz in https://github.com/IBM/unitxt/pull/580
* Fix MultiReferenceTemplate import by perlitz in https://github.com/IBM/unitxt/pull/583
* Add uncommitted processor by elronbandel in https://github.com/IBM/unitxt/pull/588
* Add missing processor in catalog by yoavkatz in https://github.com/IBM/unitxt/pull/590
* Docfix: Fix incorrect artifact names in Adding Dataset doc by yifanmai in https://github.com/IBM/unitxt/pull/591
* fixes to perplexity metric, updates to catalog by assaftibm in https://github.com/IBM/unitxt/pull/592
* Fix many datasets and templates by elronbandel in https://github.com/IBM/unitxt/pull/599
* Fix Test catalog preparation without huggingface access by elronbandel in https://github.com/IBM/unitxt/pull/601
* Fix format instruction same as source in templates by dafnapension in https://github.com/IBM/unitxt/pull/607
* Fixed formats and system prompts by elronbandel in https://github.com/IBM/unitxt/pull/604
* Add scipy to base requirements by matanor in https://github.com/IBM/unitxt/pull/611
* Reverse undocumented capitalization in templates by elronbandel in https://github.com/IBM/unitxt/pull/616
* Fix broken OptionalField in dataclass by elronbandel in https://github.com/IBM/unitxt/pull/619
* Fix some features of the Template for ffqa by dafnapension in https://github.com/IBM/unitxt/pull/613
* Fix problem in process_instance by yoavkatz in https://github.com/IBM/unitxt/pull/628


New Assets:

* Added table serializers operators and add Wikitq table question answering dataset by csrajmohan in https://github.com/IBM/unitxt/pull/544
* Added human eval dataset by OfirArviv in https://github.com/IBM/unitxt/pull/509
* Added Clinc and news datasets by ilyashnil in https://github.com/IBM/unitxt/pull/578
* Added cards for cohere for ai aya dataset by dafnapension in https://github.com/IBM/unitxt/pull/579
* Add multi class relation classification task and change nli datasets to use it by elronbandel in https://github.com/IBM/unitxt/pull/586
* Eval metrics by lilacheden in https://github.com/IBM/unitxt/pull/587
* Add tab_fact dataset, a dataset for classification of textual entailment from tables by csrajmohan in https://github.com/IBM/unitxt/pull/582
* Add filtered ffqa dataset by marukaz in https://github.com/IBM/unitxt/pull/593
* Add universal_ner by elronbandel in https://github.com/IBM/unitxt/pull/622
* Add atis dataset by elronbandel in https://github.com/IBM/unitxt/pull/629

Enhancements

* Tests can be done now also on PRs from forks. by elronbandel in https://github.com/IBM/unitxt/pull/537 https://github.com/IBM/unitxt/pull/538
* Show artifact class details in the documentation. by dafnapension in https://github.com/IBM/unitxt/pull/528
* UI improvements by Roni-Friedman in https://github.com/IBM/unitxt/pull/541
* Update README.md by eltociear in https://github.com/IBM/unitxt/pull/540
* Add artifact_identifier to Artifact objects loaded from the catalog, linking them to their catalog name. by matanor in https://github.com/IBM/unitxt/pull/545 https://github.com/IBM/unitxt/pull/547 https://github.com/IBM/unitxt/pull/546
* allow imports list for executequery and filterbyquery and rename to ExecuteExpression and FilterByExpression by dafnapension in https://github.com/IBM/unitxt/pull/542
* Add tests for api is presented in the unitxt paper. by elronbandel in https://github.com/IBM/unitxt/pull/558
* Extend the function that evaluate with unitxt metric on external data to new types of data by assaftibm in https://github.com/IBM/unitxt/pull/557
* Add Kendall's tau metric by lilacheden in https://github.com/IBM/unitxt/pull/535
* Add new table operators for serialization & truncation by csrajmohan in https://github.com/IBM/unitxt/pull/567
* Unitxt should operate with no package requirements by default. This adds some tools to do so. by elronbandel in https://github.com/IBM/unitxt/pull/570
* Separate library tests and catalog preparation by elronbandel in https://github.com/IBM/unitxt/pull/572
* Add class for constants handling by elronbandel in https://github.com/IBM/unitxt/pull/575
* Add code needed for evaluating metrics as models by lilacheden in https://github.com/IBM/unitxt/pull/573
* Improved error message when using TemplateDict by yoavkatz in https://github.com/IBM/unitxt/pull/499
* Add ability to load from catalog with arguments overwrite by elronbandel in https://github.com/IBM/unitxt/pull/581
* Add Grouped instance metric inherit from InstanceMetrics by sam-data-guy-iam in https://github.com/IBM/unitxt/pull/452
* Website touch up. by elronbandel in https://github.com/IBM/unitxt/pull/597
* Add structured data operators for serializing tablerows, triples and keyvalue pairs added by csrajmohan in https://github.com/IBM/unitxt/pull/589
* Allow dicts in overwrite args of fetched artifact by dafnapension in https://github.com/IBM/unitxt/pull/598
* filter on loading rather than increase loading limit by dafnapension in https://github.com/IBM/unitxt/pull/584
* Reduce log size by printing less by elronbandel in https://github.com/IBM/unitxt/pull/605
* Add Support for execution of metrics on a remote host by matanor in https://github.com/IBM/unitxt/pull/568
* Add signatures to dataclasses __init__ for clearer docs by elronbandel in https://github.com/IBM/unitxt/pull/624


🚨 Breaking Changes 🚨

* Rename answer_relevance to answer_reward by assaftibm in https://github.com/IBM/unitxt/pull/539
* Migrate (task-related) Instruction into Template, and introduce (task independent) SystemPrompt by dafnapension in https://github.com/IBM/unitxt/pull/565
* Rename additional_inputs to task_data and make it a simple json dumped by elronbandel in https://github.com/IBM/unitxt/pull/595
* Add shuffling to banking77 and a few more classification datasets by ilyashnil in https://github.com/IBM/unitxt/pull/603. This was necessary in order to balance the classes in those datasets.

New Contributors

* eltociear made their first contribution in https://github.com/IBM/unitxt/pull/540
* csrajmohan made their first contribution in https://github.com/IBM/unitxt/pull/544
* yifanmai made their first contribution in https://github.com/IBM/unitxt/pull/591
* sam-data-guy-iam made their first contribution in https://github.com/IBM/unitxt/pull/452

**Full Changelog**: https://github.com/IBM/unitxt/compare/1.6.1...1.7.0
