The codebase is fully validated for Transformers v4.38.
- Upgrade to Transformers 4.38 #788 @regisss
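Since this release is validated against Transformers v4.38, a quick environment check can confirm the installed version before running the examples. This is a generic sketch using only the standard library (the helper names are illustrative, not part of the release):

```python
from importlib.metadata import version, PackageNotFoundError


def transformers_version():
    """Return the installed transformers version string, or None if absent."""
    try:
        return version("transformers")
    except PackageNotFoundError:
        return None


def is_validated(v, target="4.38"):
    """Check whether a version string matches the validated major.minor series."""
    return v is not None and v.startswith(target)
```

For instance, `is_validated("4.38.2")` is true while `is_validated("4.37.0")` is not.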
## Model optimizations
- Add optimization for BLIP text model generation #653 @sywangyi
- Enable internal KV bucket in Llama #720 @xt574chen
- Enable Mixtral-8x7B #739 @jychen-habana
- Update Mixtral-8x7B FP8 HQT example #756 @jychen-habana
- Further fixes for performance with internal bucketing #781 @puneeshkhanna
- SpeechT5 optimization #722 @sywangyi
- Move img_mask get_attn_mask() to HPU #795 @hsubramony
- Mistral optimizations #804 @ssarkar2
## Image-to-text and VQA examples
- Add image-to-text and visual question answering examples #738 @sywangyi
## torch.compile
- Enable torch_compile mode for distributed runs #659 @kalyanjk
- Fix graph breaks in torch.compile mode #806 @hlahkar
- Fix torch.compile for text generation #811 @regisss
- Add Llama-7B FSDP test for torch.compile mode #818 @pankd
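The torch.compile entries above all build on the same mechanism: wrapping a function or model in `torch.compile` so PyTorch traces and optimizes it. A minimal sketch is shown below; it uses the `"eager"` backend so it runs without a compiler toolchain, whereas Gaudi runs go through Habana's own compile backend rather than the default one assumed here:

```python
import torch


def scaled_add(x, y):
    # Simple elementwise op standing in for a model's forward pass.
    return 2 * x + y


# backend="eager" keeps this sketch self-contained; real Gaudi runs use
# the Habana backend instead of the CPU default.
compiled = torch.compile(scaled_add, backend="eager")

a = torch.ones(4)
b = torch.arange(4, dtype=torch.float32)
out = compiled(a, b)  # same result as the eager function
```

The compiled callable is a drop-in replacement for the original function, which is why "graph breaks" (points where tracing falls back to eager execution) matter for performance, as fixed in #806.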
## Bug fixes
- Fix beam-search crash and incorrect output in decoder-only and encoder-decoder models #627 @sywangyi
- Fix translation models #710 @vidyasiv
- Fix throughput calculation for diffusion models #715 @skavulya
- Fix crash in Llama model for LLaVA image-to-text generation #755 @sywangyi
- Fix backward error in DDP when running reward model fine-tuning in RLHF #507 @sywangyi
- Fix get_dtype and convert_into_dtypes #769 @regisss
- Override SDPA option on Gaudi #771 @jiminha
- Fix Llama-70B FSDP model loading issue #752 @hlahkar
- Fix FSDP in Transformers 4.38 #812 @libinta
- Delay importing DeepSpeed comm for performance #810 @jiminha
- Fix Llama rotary position embedding issue for Transformers 4.38 #813 @libinta
- Fix torch.full issue when running DeepSpeed ZeRO-3 for Llama #820 @libinta
- Fix profiling issue with the first step #837 @libinta
- Fix Mistral after SynapseAI 1.15 update #858 @ssarkar2
## Others
- Small test_text_generation_example.py refactoring #725 @regisss
- Update README, add PPO support #721 @sywangyi
- Update the Mistral model naming #726 @yafshar
- Change backend name #708 @vivekgoe
- Update ppo_trainer.py #718 @skaulintel
- Add seed in SFT example to make results reproducible #735 @sywangyi
- Add a flag to control checkpoint saving in run_lora_clm.py #736 @yeonsily
- Refactor and update CI for encoder-decoders #742 @regisss
- Expose Llama fused ops control from run_lora_clm.py #751 @hlahkar
- Fix tests by setting static_shapes to False #778 @bhargaveede
- Fix ControlNet README #785 @regisss
- Workaround for RoPE computed in bf16 for GPT-NeoX #746 @regisss
- Add Whisper and SpeechT5 to the model table #790 @regisss
- Update summarization example README #791 @srajabos
- Block TorchScript pytest because of a segmentation fault issue #793 @yeonsily
- Fix test_encoder_decoder.py for opus-mt-zh-en #798 @regisss
- Replace obsolete API for MediaPipe #796 @MohitIntel
- Add --distribution_strategy fast_ddp in contrastive-image-text README and BridgeTower test #799 @regisss
- Fix redundant internal bucket and HPU graph settings #797 @puneeshkhanna
- Add Llama test for FSDP #761 @hlahkar
- Enable dynamic shapes for ESMFold #803 @hsubramony
- Add Llama/Llama2 support in question answering #745 @kplau1128
- Update MLM example #830 @regisss
- Revert Wav2Vec2 TDNNLayer forward function to match Transformers v4.37.2 #827 @yeonsily
- Save CI test output images #835 @MohitIntel
- Update checkpoint loading #773 @schoi-habana
- Skip SDXL test in CI #840 @regisss
- Fix FSDP test on Gaudi1 #841 @regisss
- Remove installation from source for Diffusers in CI #846 @regisss
- Fix FP8 CI #852 @regisss
- Fix PR #848 #853 @regisss
- Disable safe loading tests in CI #854 @regisss
- Add warmup for evaluation #855 @libinta
## Known issue
- A crash may occur with [unify_measurements.py](https://github.com/huggingface/optimum-habana/blob/main/examples/text-generation/quantization_tools/unify_measurements.py)