Unitxt

Latest version: v1.15.6

1.15.6

Detailed blog post to be released soon!

What's Changed
* Add seed bench dataset and support for videos by elronbandel in https://github.com/IBM/unitxt/pull/1309
* Add LMMSEvalInferenceEngine by elronbandel in https://github.com/IBM/unitxt/pull/1301
* a more elaborated message from performace-test-summary, and doc-string of card_profiler by dafnapension in https://github.com/IBM/unitxt/pull/1307
* Make package requirements compatible with requirements.txt like format by elronbandel in https://github.com/IBM/unitxt/pull/1310
* Make inference engine tests run only when inference.py has changed by elronbandel in https://github.com/IBM/unitxt/pull/1311
* Minor fixes by elronbandel in https://github.com/IBM/unitxt/pull/1312
* added QTSUMM taskcard for query-focused table summarization task by csrajmohan in https://github.com/IBM/unitxt/pull/1304
* Add OptionSelectingByLogProbsInferenceEngine by martinscooper in https://github.com/IBM/unitxt/pull/1317
* Vision robustness blog by elronbandel in https://github.com/IBM/unitxt/pull/1318
* Seperate examples tests by elronbandel in https://github.com/IBM/unitxt/pull/1322
* add score name prefix for judge_raw_output/input in llmaj metric by OfirArviv in https://github.com/IBM/unitxt/pull/1323
* Fix pyproject.toml to be standalone and comply with modern standards by elronbandel in https://github.com/IBM/unitxt/pull/1324
* Update error checking and documentation of processors by yoavkatz in https://github.com/IBM/unitxt/pull/1325
* adding generic inference binary+idk judges by Roni-Friedman in https://github.com/IBM/unitxt/pull/1316
* Add table augmentors by elronbandel in https://github.com/IBM/unitxt/pull/1328
* Add loaders cache by elronbandel in https://github.com/IBM/unitxt/pull/1333
* A swifty PR suggested for issue 1332 by dafnapension in https://github.com/IBM/unitxt/pull/1334
* Allow turning single stream to dataset by elronbandel in https://github.com/IBM/unitxt/pull/1335
* Add chat api format with standard open ai chat format by elronbandel in https://github.com/IBM/unitxt/pull/1314
* A swifty PR suggested for issue 1331 by dafnapension in https://github.com/IBM/unitxt/pull/1337
* Bluebench Update by perlitz in https://github.com/IBM/unitxt/pull/1342
* Replace 20 newsgroup with a shorter version in bluebench by perlitz in https://github.com/IBM/unitxt/pull/1347
* Fix catalog links by elronbandel in https://github.com/IBM/unitxt/pull/1348
* Make tests faster and clearer by dafnapension in https://github.com/IBM/unitxt/pull/1345
* Fix GitHub Actions concurrence execution by elronbandel in https://github.com/IBM/unitxt/pull/1349
* Batched multi class classification by yoavkatz in https://github.com/IBM/unitxt/pull/1340
* Print catalog entries as yamls by dafnapension in https://github.com/IBM/unitxt/pull/1351
* Update Blue Bench description by elronbandel in https://github.com/IBM/unitxt/pull/1354
* Add ability to load_dataset without a template for simpler usage for beginners by elronbandel in https://github.com/IBM/unitxt/pull/1350
* Unified catalog terminology by yoavkatz in https://github.com/IBM/unitxt/pull/1355
* move rag binary llmaj under rag metrics by lilacheden in https://github.com/IBM/unitxt/pull/1338
* Add option selecting huggingface inference engine by elronbandel in https://github.com/IBM/unitxt/pull/1357
* Fix docs compilation and links from docs to github by elronbandel in https://github.com/IBM/unitxt/pull/1359
* Fix website docs-code links by elronbandel in https://github.com/IBM/unitxt/pull/1360
* Align augmenters with task and types mechanisms by elronbandel in https://github.com/IBM/unitxt/pull/1356
* Add link to source in catalog assets by elronbandel in https://github.com/IBM/unitxt/pull/1362
* add serializers to catalog + new table operators by ShirApp in https://github.com/IBM/unitxt/pull/1365
* Add target_prefix erasing post processor by default by elronbandel in https://github.com/IBM/unitxt/pull/1361
* Add multi api inference engine by elronbandel in https://github.com/IBM/unitxt/pull/1343


New Contributors
* martinscooper made their first contribution in https://github.com/IBM/unitxt/pull/1317

**Full Changelog**: https://github.com/IBM/unitxt/compare/1.14.1...1.15.6

1.14.1

Important Change: Unitxt is Faster!

To improve Unitxt’s performance, we've made several optimizations:

1. **Operator Acceleration**: Many operators have been sped up by removing unnecessary deep copying in their code, enhancing runtime efficiency.

2. **Caching Hugging Face Datasets**: We added the option to cache Hugging Face datasets in loaders, which can prevent redundant loading operations. To enable this, do one of the following:
- Set it globally in code:
```python
import unitxt

unitxt.settings.disable_hf_datasets_cache = False
```
- Use the settings context:
```python
from unitxt import settings

with settings.context(disable_hf_datasets_cache=False):
    ...  # your code
```
- Or set the environment variable:
```bash
export UNITXT_DISABLE_HF_DATASETS_CACHE=False
```


3. **Eager Execution Mode**: Unitxt can now run without streaming, which can be faster in certain scenarios. Enable eager execution using the environment variable or directly in code:
```python
import unitxt
from unitxt import settings

unitxt.settings.use_eager_execution = True
# or
with settings.context(use_eager_execution=True):
    ...  # your code
```


4. **Partial Stream Loading**: This feature lets you load only the necessary data instances, avoiding full dataset loads when not required. Here's an example:
```python
from unitxt import load_dataset

dataset = load_dataset(
    card="cards.doc_vqa.lmms_eval",
    template="templates.qa.with_context.title",
    format="formats.models.llava_interleave",
    loader_limit=300,
    streaming=True,
)
print(next(iter(dataset["test"])))  # Loads only the first instance
```


**Complete Example**: Combining the optimizations above can make dataset loading nearly 1000x faster:
```python
from unitxt import load_dataset, settings

with settings.context(
    disable_hf_datasets_cache=False,
    use_eager_execution=True,
):
    dataset = load_dataset(
        card="cards.doc_vqa.lmms_eval",
        template="templates.qa.with_context.title",
        format="formats.models.llava_interleave",
        loader_limit=300,
        streaming=True,
    )
    print(next(iter(dataset["test"])))  # Loads only the first instance
```


5. **Execution Speed Tracking**: A GitHub action has been added to monitor Unitxt’s execution speed in new pull requests, helping ensure that optimizations are maintained.
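
The idea behind partial stream loading (item 4 above) can be illustrated without Unitxt at all. The sketch below uses only the standard library and is not Unitxt's implementation: `generate_instances` and `produced` are hypothetical names, and the generator merely stands in for an expensive loader. It shows that taking the first instance from a lazy stream produces exactly one instance, and that a short prefix advances the stream only as far as needed.

```python
from itertools import islice
from typing import Dict, Iterator, List

produced: List[int] = []  # records which instances were actually built


def generate_instances(total: int) -> Iterator[Dict[str, int]]:
    """Hypothetical stand-in for an expensive, lazily evaluated loader."""
    for index in range(total):
        produced.append(index)  # simulate the per-instance loading cost
        yield {"id": index}


# A lazy stream does no work until it is iterated.
stream = generate_instances(300)
first = next(iter(stream))
produced_after_first = list(produced)

# Taking a small prefix advances the stream only as far as needed.
next_two = list(islice(stream, 2))
```

Although the stream could yield 300 instances, only three are ever built here; the remaining 297 are never touched.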

---

Summary

This release is focused on accelerating performance in Unitxt by introducing several key optimizations. Operator efficiency has been enhanced by removing deep copies, making operations faster. Users can now enable dataset caching for Hugging Face datasets to prevent redundant loading, configured directly in code or through environment variables. An optional eager execution mode has been added, bypassing streaming to increase speed in certain scenarios. Additionally, partial stream loading allows selective instance loading, reducing memory usage and improving response times. To maintain these improvements, a new GitHub action now monitors Unitxt’s execution speed in pull requests, ensuring consistent performance across updates.
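
To make the deep-copy point concrete, here is a minimal standard-library sketch. It is not Unitxt code: `add_field_with_deepcopy` and `add_field_shallow` are hypothetical helpers that mimic an operator adding one field to an instance. The first copies the whole nested structure; the second builds only a new top-level dict and shares the untouched nested values.

```python
import copy
from typing import Any, Dict


def add_field_with_deepcopy(instance: Dict[str, Any], key: str, value: Any) -> Dict[str, Any]:
    """Copies every nested object before adding the field (safe but slower)."""
    new_instance = copy.deepcopy(instance)
    new_instance[key] = value
    return new_instance


def add_field_shallow(instance: Dict[str, Any], key: str, value: Any) -> Dict[str, Any]:
    """Builds a new top-level dict; nested values are shared, not copied."""
    return {**instance, key: value}


instance = {"question": "2+2?", "context": ["some", "long", "list"]}
deep = add_field_with_deepcopy(instance, "answer", "4")
shallow = add_field_shallow(instance, "answer", "4")
```

Both calls return equal results and leave `instance` unchanged, but only the deep version duplicates `context`. The shallow version is safe only when later steps do not mutate the shared nested values in place.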

All Changes
* Enhancements to inference engines by lilacheden in https://github.com/IBM/unitxt/pull/1243
* add post processor to convert log probs dictionary to probabilities of a specific class by lilacheden in https://github.com/IBM/unitxt/pull/1247
* CI for metrics other than main + Bugfix in RetrievalAtK by lilacheden in https://github.com/IBM/unitxt/pull/1246
* Add huggingface cache disabling option to unitxt settings by elronbandel in https://github.com/IBM/unitxt/pull/1250
* Make F1Strings faster by elronbandel in https://github.com/IBM/unitxt/pull/1248
* Fix duplicate column deletion bug in pandas serializer by elronbandel in https://github.com/IBM/unitxt/pull/1249
* revived no_deep just to compare performance by dafnapension in https://github.com/IBM/unitxt/pull/1254
* fixed scigen post-processor by csrajmohan in https://github.com/IBM/unitxt/pull/1253
* Add prediction length metric by perlitz in https://github.com/IBM/unitxt/pull/1252
* Fix faithfulness confidence intervals by matanor in https://github.com/IBM/unitxt/pull/1257
* Allow role names to be captialized in SerializeOpenAiFormatDialog by yoavkatz in https://github.com/IBM/unitxt/pull/1259
* Accelerate image example 1000X by elronbandel in https://github.com/IBM/unitxt/pull/1258
* Fix the empty few-shot target issue when using produce() by marukaz in https://github.com/IBM/unitxt/pull/1266
* fix postprocessors in turl_col_type taskcard by csrajmohan in https://github.com/IBM/unitxt/pull/1261
* Fix answer correctness confidence intervals by matanor in https://github.com/IBM/unitxt/pull/1256
* add BlueBench as a benchmark to the catalog by shachardon in https://github.com/IBM/unitxt/pull/1262
* Fix MultipleSourceLoader documentation by marukaz in https://github.com/IBM/unitxt/pull/1270
* Ignore unitxt-venv by marukaz in https://github.com/IBM/unitxt/pull/1269
* Add mmmu by elronbandel in https://github.com/IBM/unitxt/pull/1271
* A fix for a bug in metric pipeline by elronbandel in https://github.com/IBM/unitxt/pull/1268
* Added Tablebench taskcard by csrajmohan in https://github.com/IBM/unitxt/pull/1273
* Fix missing deep copy in MapInstanceValues by yoavkatz in https://github.com/IBM/unitxt/pull/1267
* Add stream name to generation of dataset by elronbandel in https://github.com/IBM/unitxt/pull/1276
* Fix demos pool inference by elronbandel in https://github.com/IBM/unitxt/pull/1278
* Fix quality github action by elronbandel in https://github.com/IBM/unitxt/pull/1281
* add operators for robustness check on tables by csrajmohan in https://github.com/IBM/unitxt/pull/1279
* Instruction in SystemFormet demo support. by piotrhelm in https://github.com/IBM/unitxt/pull/1274
* change the max_test_instances of bluebench.recipe.attaq_500 to 100 by shachardon in https://github.com/IBM/unitxt/pull/1285
* Add documentation for types and serializers by elronbandel in https://github.com/IBM/unitxt/pull/1286
* Add example for image processing with different templates by elronbandel in https://github.com/IBM/unitxt/pull/1280
* Integrate metrics team LLMaJ with current unitxt implemantation by lilacheden in https://github.com/IBM/unitxt/pull/1205
* performance profiler with visualization by dafnapension in https://github.com/IBM/unitxt/pull/1255
* Remove split arg to support old hf datasets versions by elronbandel in https://github.com/IBM/unitxt/pull/1288
* add post-processors for tablebench taskcard by csrajmohan in https://github.com/IBM/unitxt/pull/1289
* recursive copy seems safer here by dafnapension in https://github.com/IBM/unitxt/pull/1295
* Fix performance tracking action by elronbandel in https://github.com/IBM/unitxt/pull/1296
* try num of instances in nested global scores by dafnapension in https://github.com/IBM/unitxt/pull/1282
* Update version to 1.14.0 by elronbandel in https://github.com/IBM/unitxt/pull/1298
* expand performance table by dafnapension in https://github.com/IBM/unitxt/pull/1299
* Fix doc_vqa lmms_eval by elronbandel in https://github.com/IBM/unitxt/pull/1300
* prepare for int-ish group names and type names and add the exposing card by dafnapension in https://github.com/IBM/unitxt/pull/1303
* remove groups breakdowns from global score of grouped instance metrics by dafnapension in https://github.com/IBM/unitxt/pull/1306
* Update the safety metric batch size to 10 by perlitz in https://github.com/IBM/unitxt/pull/1305

New Contributors
* piotrhelm made their first contribution in https://github.com/IBM/unitxt/pull/1274

**Full Changelog**: https://github.com/IBM/unitxt/compare/1.13.1...1.14.1

1.14.0

What's Changed
* Simplify qa example by yoavkatz in https://github.com/IBM/unitxt/pull/1234
* allow multiple references for f1 strings metric by ShirApp in https://github.com/IBM/unitxt/pull/1225
* Add bluebench recipes by shachardon in https://github.com/IBM/unitxt/pull/1237
* Allow templates dicts to be python dicts and fix a bug in the TemplatesDict definition by elronbandel in https://github.com/IBM/unitxt/pull/1240
* Deep copy artifacts that fetched twice by elronbandel in https://github.com/IBM/unitxt/pull/1239
* Adding of ANLS metric to doc_vqa and info_vqa datasets by alfassy in https://github.com/IBM/unitxt/pull/1241
* Update README.md by elronbandel in https://github.com/IBM/unitxt/pull/1242
* Update version to 1.13.1 by elronbandel in https://github.com/IBM/unitxt/pull/1244
* Enhancements to inference engines by lilacheden in https://github.com/IBM/unitxt/pull/1243
* add post processor to convert log probs dictionary to probabilities of a specific class by lilacheden in https://github.com/IBM/unitxt/pull/1247
* CI for metrics other than main + Bugfix in RetrievalAtK by lilacheden in https://github.com/IBM/unitxt/pull/1246
* Add huggingface cache disabling option to unitxt settings by elronbandel in https://github.com/IBM/unitxt/pull/1250
* Make F1Strings faster by elronbandel in https://github.com/IBM/unitxt/pull/1248
* Fix duplicate column deletion bug in pandas serializer by elronbandel in https://github.com/IBM/unitxt/pull/1249
* revived no_deep just to compare performance by dafnapension in https://github.com/IBM/unitxt/pull/1254
* fixed scigen post-processor by csrajmohan in https://github.com/IBM/unitxt/pull/1253
* Add prediction length metric by perlitz in https://github.com/IBM/unitxt/pull/1252
* Fix faithfulness confidence intervals by matanor in https://github.com/IBM/unitxt/pull/1257
* Allow role names to be captialized in SerializeOpenAiFormatDialog by yoavkatz in https://github.com/IBM/unitxt/pull/1259
* Accelerate image example 1000X by elronbandel in https://github.com/IBM/unitxt/pull/1258
* Fix the empty few-shot target issue when using produce() by marukaz in https://github.com/IBM/unitxt/pull/1266
* fix postprocessors in turl_col_type taskcard by csrajmohan in https://github.com/IBM/unitxt/pull/1261
* Fix answer correctness confidence intervals by matanor in https://github.com/IBM/unitxt/pull/1256
* add BlueBench as a benchmark to the catalog by shachardon in https://github.com/IBM/unitxt/pull/1262
* Fix MultipleSourceLoader documentation by marukaz in https://github.com/IBM/unitxt/pull/1270
* Ignore unitxt-venv by marukaz in https://github.com/IBM/unitxt/pull/1269
* Add mmmu by elronbandel in https://github.com/IBM/unitxt/pull/1271
* A fix for a bug in metric pipeline by elronbandel in https://github.com/IBM/unitxt/pull/1268
* Added Tablebench taskcard by csrajmohan in https://github.com/IBM/unitxt/pull/1273
* Fix missing deep copy in MapInstanceValues by yoavkatz in https://github.com/IBM/unitxt/pull/1267
* Add stream name to generation of dataset by elronbandel in https://github.com/IBM/unitxt/pull/1276
* Fix demos pool inference by elronbandel in https://github.com/IBM/unitxt/pull/1278
* Fix quality github action by elronbandel in https://github.com/IBM/unitxt/pull/1281
* add operators for robustness check on tables by csrajmohan in https://github.com/IBM/unitxt/pull/1279
* Instruction in SystemFormet demo support. by piotrhelm in https://github.com/IBM/unitxt/pull/1274
* change the max_test_instances of bluebench.recipe.attaq_500 to 100 by shachardon in https://github.com/IBM/unitxt/pull/1285
* Add documentation for types and serializers by elronbandel in https://github.com/IBM/unitxt/pull/1286
* Add example for image processing with different templates by elronbandel in https://github.com/IBM/unitxt/pull/1280
* Integrate metrics team LLMaJ with current unitxt implemantation by lilacheden in https://github.com/IBM/unitxt/pull/1205
* performance profiler with visualization by dafnapension in https://github.com/IBM/unitxt/pull/1255
* Remove split arg to support old hf datasets versions by elronbandel in https://github.com/IBM/unitxt/pull/1288
* add post-processors for tablebench taskcard by csrajmohan in https://github.com/IBM/unitxt/pull/1289
* recursive copy seems safer here by dafnapension in https://github.com/IBM/unitxt/pull/1295
* Fix performance tracking action by elronbandel in https://github.com/IBM/unitxt/pull/1296
* try num of instances in nested global scores by dafnapension in https://github.com/IBM/unitxt/pull/1282
* Update version to 1.14.0 by elronbandel in https://github.com/IBM/unitxt/pull/1298

New Contributors
* alfassy made their first contribution in https://github.com/IBM/unitxt/pull/1241
* piotrhelm made their first contribution in https://github.com/IBM/unitxt/pull/1274

**Full Changelog**: https://github.com/IBM/unitxt/compare/1.13.0...1.14.0

1.13.1

1.13.0

New type handling capabilities
The most significant change in this release is the introduction of type serializers to unitxt.
Type serializers are in charge of taking a specific type of data structure, such as Table or Dialog, and serializing it to a textual representation.
Now you can define tasks in unitxt that have complex types such as Table or Dialog and define serializers that handle their transformation to text.

This lets you control the representation of different types from the recipe API:

```python
from unitxt import load_dataset
from unitxt.struct_data_operators import SerializeTableAsMarkdown

serializer = SerializeTableAsMarkdown(shuffle_rows=True, seed=0)
dataset = load_dataset(card="cards.wikitq", template_card_index=0, serializer=serializer)
```

And if you want to serialize this table differently you can change any of the many available [table serializers](https://github.com/IBM/unitxt/blob/80b284fec1954bdf48638d9442c75808cd79a4c5/src/unitxt/struct_data_operators.py#L103-L203).
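
As a rough illustration of what a markdown table serializer with seeded row shuffling does, here is a standard-library sketch. It is not the `SerializeTableAsMarkdown` implementation: `table_to_markdown` and its parameters are hypothetical, and the table layout (a `header` list plus a `rows` list of lists) is an assumption for the example.

```python
import random
from typing import Any, Dict, List, Optional


def table_to_markdown(
    table: Dict[str, Any], shuffle_rows: bool = False, seed: Optional[int] = None
) -> str:
    """Render {"header": [...], "rows": [[...], ...]} as a markdown table."""
    rows: List[List[Any]] = list(table["rows"])  # copy so the input is untouched
    if shuffle_rows:
        # A dedicated Random instance makes the shuffle reproducible per seed.
        random.Random(seed).shuffle(rows)
    lines = [
        "| " + " | ".join(table["header"]) + " |",
        "| " + " | ".join("---" for _ in table["header"]) + " |",
    ]
    lines += ["| " + " | ".join(str(cell) for cell in row) + " |" for row in rows]
    return "\n".join(lines)


table = {"header": ["name", "score"], "rows": [["a", 1], ["b", 2], ["c", 3]]}
plain = table_to_markdown(table)
shuffled_a = table_to_markdown(table, shuffle_rows=True, seed=0)
shuffled_b = table_to_markdown(table, shuffle_rows=True, seed=0)
```

Fixing the seed means two runs see the same row order, which keeps robustness experiments with shuffled tables reproducible.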

Defining New Type

If you wish to define a new type with custom serializers, you can do so with Python's `typing` library:

```python
from typing import Any, List, TypedDict

class Table(TypedDict):
    header: List[str]
    rows: List[List[Any]]
```

Once your type is ready, register it with unitxt's type handling in the code you are running:
```python
from unitxt.type_utils import register_type

register_type(Table)
```

Now your type can be used anywhere across unitxt (e.g., in task definitions or serializers).
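
One detail worth keeping in mind: at runtime a `TypedDict` value is a plain `dict`, and `isinstance(value, Table)` is not supported, so any check against such a type has to be structural. The sketch below illustrates this with the standard library only. `looks_like_table` is a hypothetical helper, not part of unitxt, and unitxt's own type checking may work differently.

```python
from typing import Any, List, TypedDict, get_type_hints


class Table(TypedDict):
    header: List[str]
    rows: List[List[Any]]


def looks_like_table(value: Any) -> bool:
    """Hypothetical structural check: a dict with exactly the Table keys."""
    return isinstance(value, dict) and set(value) == set(get_type_hints(Table))


# The annotation is only a hint; the value itself is an ordinary dict.
table: Table = {"header": ["name", "score"], "rows": [["a", 1], ["b", 2]]}
field_types = get_type_hints(Table)
```

The check above compares keys only; a fuller validator would also compare each value against the entry in `field_types`.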

Defining a Serializer For a Type

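If you want to define a serializer for your custom type, or for any combination of `typing` types, you can do so as follows:
```python
from typing import Any, Dict
from unitxt.serializers import SingleTypeSerializer

class MySerializer(SingleTypeSerializer):
    serialized_type = Table  # the Table type defined above
    def serialize(self, value: Table, instance: Dict[str, Any]) -> str:
        ...  # your code to turn a value of type Table into a string
```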
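
For a sense of what the body of `serialize` might contain, here is a self-contained sketch. It deliberately does not import unitxt: `SketchTableSerializer` is a hypothetical stand-in with the same two members, and the output format (one line per row, each cell labelled with its column name) is an arbitrary choice for illustration.

```python
from typing import Any, Dict, List, TypedDict


class Table(TypedDict):
    header: List[str]
    rows: List[List[Any]]


class SketchTableSerializer:
    """Hypothetical stand-in; a real serializer would subclass the unitxt base class."""

    serialized_type = Table

    def serialize(self, value: Table, instance: Dict[str, Any]) -> str:
        # One line per row, each cell labelled with its column name.
        return "\n".join(
            ", ".join(f"{name}: {cell}" for name, cell in zip(value["header"], row))
            for row in value["rows"]
        )


text = SketchTableSerializer().serialize(
    {"header": ["name", "score"], "rows": [["a", 1], ["b", 2]]}, instance={}
)
```

The `instance` argument is unused here; it is kept in the signature because the notes above pass the whole instance to `serialize`, presumably so a serializer can consult other fields.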

Multi-Modality
You can now process Image-Text to Text or Image-Audio to Text datasets in unitxt.
For example, if you want to load the doc-vqa dataset, you can do so by:
```python
from unitxt import load_dataset

dataset = load_dataset(
    card="cards.doc_vqa.en",
    template="templates.qa.with_context.title",
    format="formats.models.llava_interleave",
    loader_limit=20,
)
```

Since unitxt already has data augmentation mechanisms, it is natural to use them for images. For example, if you want your images in grey scale:
```python
dataset = load_dataset(
    card="cards.doc_vqa.en",
    template="templates.qa.with_context.title",
    format="formats.models.llava_interleave",
    loader_limit=20,
    augmentor="augmentors.image.grey_scale",  # <= Just like the text augmenters!
)
```


Then if you want to get the scores of a model on this dataset you can use:

```python
from unitxt.inference import HFLlavaInferenceEngine
from unitxt.text_utils import print_dict
from unitxt import evaluate

inference_model = HFLlavaInferenceEngine(
    model_name="llava-hf/llava-interleave-qwen-0.5b-hf", max_new_tokens=32
)

test_dataset = dataset["test"].select(range(5))

predictions = inference_model.infer(test_dataset)
evaluated_dataset = evaluate(predictions=predictions, data=test_dataset)

print_dict(
    evaluated_dataset[0],
    keys_to_print=["source", "media", "references", "processed_prediction", "score"],
)
```

Multi-modality support in unitxt builds upon the type handling introduced in the previous section, with two new types: Image and Audio.

What's Changed
* add revision option to hf loader by OfirArviv in https://github.com/IBM/unitxt/pull/1189
* Support dataset field in nested JSON files by antonpibm in https://github.com/IBM/unitxt/pull/1188
* Add TURL Table column type annotation task card by csrajmohan in https://github.com/IBM/unitxt/pull/1186
* Update operators.py - copy edits (grammar, consistency, clarity) by welisheva22 in https://github.com/IBM/unitxt/pull/1187
* Numeric nlg postproc by ShirApp in https://github.com/IBM/unitxt/pull/1185
* Add support for Literal, TypedDict and NewType for unitxt type checking by elronbandel in https://github.com/IBM/unitxt/pull/1191
* Scarebleu metric: remove mecab_ko and mecab_ko_dic from metric requir… by eladven in https://github.com/IBM/unitxt/pull/1197
* Add rag dataset + openai format dialog operator by OfirArviv in https://github.com/IBM/unitxt/pull/1192
* Update README.md by elronbandel in https://github.com/IBM/unitxt/pull/1198
* add decorator with init warning by MikolajCharchut in https://github.com/IBM/unitxt/pull/1200
* Add mock inference mode setting and allow testing without gen ai key by elronbandel in https://github.com/IBM/unitxt/pull/1204
* Fix using OpenAiInferenceEngine for LLMAsJudge by yifanmai in https://github.com/IBM/unitxt/pull/1194
* Add TogetherAiInferenceEngine by yifanmai in https://github.com/IBM/unitxt/pull/1203
* Fix OpenAiInferenceEngine by yifanmai in https://github.com/IBM/unitxt/pull/1193
* Add serializers to templates and reorganize and unite all templates by elronbandel in https://github.com/IBM/unitxt/pull/1195
* Add demos to task_data by elronbandel in https://github.com/IBM/unitxt/pull/1206
* Move test_context_correctness by matanor in https://github.com/IBM/unitxt/pull/1207
* Add image-text to text datasets by elronbandel in https://github.com/IBM/unitxt/pull/1211
* Refactor augmentors to be more scaleable + add image aumgentors by elronbandel in https://github.com/IBM/unitxt/pull/1212
* Fix grey scale augmentor and add to image example by elronbandel in https://github.com/IBM/unitxt/pull/1213
* Add images to UI by elronbandel in https://github.com/IBM/unitxt/pull/1216
* add unified decorator for warnings and unit tests by MikolajCharchut in https://github.com/IBM/unitxt/pull/1209
* Add templates list option to standard recipe by elronbandel in https://github.com/IBM/unitxt/pull/1219
* Use read token for huggingface datasets reading by elronbandel in https://github.com/IBM/unitxt/pull/1223
* add Llava-next system prompt by OfirArviv in https://github.com/IBM/unitxt/pull/1221
* Improve performance for huggingface tokenizer based format by elronbandel in https://github.com/IBM/unitxt/pull/1224
* Fix compute expression to use the instance variables as globals by elronbandel in https://github.com/IBM/unitxt/pull/1217
* Add generic inference engine to allow dynamic selection by the user by eladven in https://github.com/IBM/unitxt/pull/1226
* A suggested PR for issue 1106: More meaningful error message when catalog consistency fails by dafnapension in https://github.com/IBM/unitxt/pull/1201
* Add random templates for bluebench by perlitz in https://github.com/IBM/unitxt/pull/1222
* A suggested PR for issue 1214: fixed a bug in score_prefix for grouped instance scores by dafnapension in https://github.com/IBM/unitxt/pull/1228
* Add control over serizliers from recipe + improve serializers construction + allow seed for table shuffling serizliers by elronbandel in https://github.com/IBM/unitxt/pull/1229
* Fix table tasks to use default table serializers by elronbandel in https://github.com/IBM/unitxt/pull/1230
* Add concurency_limit parameter to WMLInferenceEngine by elronbandel in https://github.com/IBM/unitxt/pull/1231
* Add wml and generic based llmaj metric by perlitz in https://github.com/IBM/unitxt/pull/1227
* Update version to 1.13.0 by elronbandel in https://github.com/IBM/unitxt/pull/1232

New Contributors
* MikolajCharchut made their first contribution in https://github.com/IBM/unitxt/pull/1200

**Full Changelog**: https://github.com/IBM/unitxt/compare/1.12.4...1.13.0

1.12.4

Main changes
* Enable defining benchmarks in Unitxt by adding the ability to produce scores for groups based on task attributes and recipe metadata. For more information, see https://www.unitxt.ai/en/latest/docs/benchmark.html by elronbandel in https://github.com/IBM/unitxt/pull/1130
* Enable inference/production APIs to support invocation by task without specifying a card. It enables using any task in the Unitxt catalog as an inference function. Check https://www.unitxt.ai/en/latest/docs/production.html for details (#957)
* Add support for multi-modality. For details see https://www.unitxt.ai/en/latest/docs/multimodality.html by elronbandel in https://github.com/IBM/unitxt/pull/1175
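
The first item above describes producing scores for groups of instances. As a rough sketch of that idea (standard library only; `group_mean_scores`, the `subset` key, and the sample data are hypothetical and do not reflect Unitxt's benchmark API), instance scores can be bucketed by a metadata attribute and averaged per bucket:

```python
from collections import defaultdict
from statistics import mean
from typing import Any, Dict, Iterable, List, Mapping


def group_mean_scores(
    instances: Iterable[Mapping[str, Any]], group_key: str
) -> Dict[str, float]:
    """Average the "score" field separately for each value of group_key."""
    buckets: Dict[str, List[float]] = defaultdict(list)
    for instance in instances:
        buckets[instance[group_key]].append(instance["score"])
    return {group: mean(scores) for group, scores in buckets.items()}


# Illustrative per-instance results, tagged with a made-up grouping attribute.
results = [
    {"subset": "qa", "score": 1.0},
    {"subset": "qa", "score": 0.0},
    {"subset": "summarization", "score": 0.5},
]
group_scores = group_mean_scores(results, "subset")
```

See the linked benchmark documentation for how Unitxt actually defines and aggregates groups.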


Additions to catalog
* Add ProvoQ dataset artifacts by bnayahu in https://github.com/IBM/unitxt/pull/1168
* Add Wikitq metric by ShirApp in https://github.com/IBM/unitxt/pull/1167
* Add more LLMs as judges ensembles by pvn25 in https://github.com/IBM/unitxt/pull/1171
* Add Scigen table2text task with llm_as_judge metric by csrajmohan in https://github.com/IBM/unitxt/pull/1134

New Features
* Add LLM as judge ensemble metrics, and add LLMaaJ ensemble example by pvn25 in https://github.com/IBM/unitxt/pull/1081
* Refactor RenameFields operator to Rename. The old operator is still supported but raises a deprecation warning by elronbandel in https://github.com/IBM/unitxt/pull/1123

Bug Fixes
* Make cache compatible with python 3.8 by elronbandel in https://github.com/IBM/unitxt/pull/1172
* Deprecated field used to print warning message with wrong reason by dafnapension in https://github.com/IBM/unitxt/pull/1174

Documentation changes
* Update llm_as_judge.py --- copy edits (grammar, consistency, clarity) by welisheva22 in https://github.com/IBM/unitxt/pull/1164
* Update formats.py --- copy edits (grammar, consistency, clarity) by welisheva22 in https://github.com/IBM/unitxt/pull/1163
* Update loaders.py --- copy edits (grammar, consistency, clarity) by welisheva22 in https://github.com/IBM/unitxt/pull/1162
* Update card.py - minor documentation changes by welisheva22 in https://github.com/IBM/unitxt/pull/1161
* Update adding_dataset.rst - a few more minor documentation changes by welisheva22 in https://github.com/IBM/unitxt/pull/1160
* Update artifact.py --- documentation edits (grammar, consistency, cla… by welisheva22 in https://github.com/IBM/unitxt/pull/1159
* Update glossary.rst --- copy edits (grammar, consistency, clarity) by welisheva22 in https://github.com/IBM/unitxt/pull/1155
* Update helm.rst --- copy edits (grammar, consistency, clarity) by welisheva22 in https://github.com/IBM/unitxt/pull/1154
* Update operators.py --- copy edits (grammar, consistency, clarity) - take 2 by welisheva22 in https://github.com/IBM/unitxt/pull/1158
* Docfix: Fix typo in Installation doc by yifanmai in https://github.com/IBM/unitxt/pull/1181

New Contributors
* pvn25 made their first contribution in https://github.com/IBM/unitxt/pull/1081
