DeepSpeed

Latest version: v0.16.1

0.16.1

What's Changed
* Update version.txt after 0.16.0 release by loadams in https://github.com/microsoft/DeepSpeed/pull/6786
* Domino news update on readme.md by GuanhuaWang in https://github.com/microsoft/DeepSpeed/pull/6815
* Fix zero checkpoint by xu-song in https://github.com/microsoft/DeepSpeed/pull/6792
* Update python version but now we need to include setuptools on our own by loadams in https://github.com/microsoft/DeepSpeed/pull/6787
* Adding the new feature of FPDT by YJHMITWEB in https://github.com/microsoft/DeepSpeed/pull/6462
* Pin transformers to avoid errors with latest version by loadams in https://github.com/microsoft/DeepSpeed/pull/6820
* Ulysses offload blog by samadejacobs in https://github.com/microsoft/DeepSpeed/pull/6814
* add FPDT tutorial by samadejacobs in https://github.com/microsoft/DeepSpeed/pull/6813
* Update README.md by samadejacobs in https://github.com/microsoft/DeepSpeed/pull/6824
* Update README.md by samadejacobs in https://github.com/microsoft/DeepSpeed/pull/6825
* Pin transformers version in cpu-torch-latest due to multiprocessing error. by loadams in https://github.com/microsoft/DeepSpeed/pull/6823


**Full Changelog**: https://github.com/microsoft/DeepSpeed/compare/v0.16.0...v0.16.1

0.16.0

What's Changed
* Update version.txt after 0.15.4 release by loadams in https://github.com/microsoft/DeepSpeed/pull/6731
* Update GH hosted workflows to 24.04 by loadams in https://github.com/microsoft/DeepSpeed/pull/6717
* Add COMMITTER file by tjruwase in https://github.com/microsoft/DeepSpeed/pull/6741
* Update AMD apex version by loadams in https://github.com/microsoft/DeepSpeed/pull/6739
* Fix Type Name Inconsistency & Typo in cpu_adam by xylian86 in https://github.com/microsoft/DeepSpeed/pull/6732
* Add Domino code by zhangsmallshark in https://github.com/microsoft/DeepSpeed/pull/6733
* Add data type check for bf16 by hwchen2017 in https://github.com/microsoft/DeepSpeed/pull/6742
* add zero3 module_granularity_threshold to zero optimization. by inkcherry in https://github.com/microsoft/DeepSpeed/pull/6649
* AIO File Offsets by jomayeri in https://github.com/microsoft/DeepSpeed/pull/6641
* Update path for BingBertSquad from DeepSpeedExamples by loadams in https://github.com/microsoft/DeepSpeed/pull/6746
* Sanitize inputs to eval() by loadams in https://github.com/microsoft/DeepSpeed/pull/6745
* Adding the governance doc by minjiazhang in https://github.com/microsoft/DeepSpeed/pull/6748
* Add no_sync context manager by tjruwase in https://github.com/microsoft/DeepSpeed/pull/6675
* Gaudi2 Nightly job for daily check by raza-sikander in https://github.com/microsoft/DeepSpeed/pull/6753
* Disable failing python tests by loadams in https://github.com/microsoft/DeepSpeed/pull/6758
* A faster and more memory-efficient implementation of `zero_to_fp32` by xu-song in https://github.com/microsoft/DeepSpeed/pull/6658
* Pin transformers version to work around latest torch requirements by loadams in https://github.com/microsoft/DeepSpeed/pull/6759
* make xpu ops compatible with oneapi 2025.0 by baodii in https://github.com/microsoft/DeepSpeed/pull/6760
* Add explicit parameters for torch.load by loadams in https://github.com/microsoft/DeepSpeed/pull/6751
* Fix setup.py bash cmd generation to correctly extract git info by nelyahu in https://github.com/microsoft/DeepSpeed/pull/6762
* Use `json_schema_extra` instead of extra keyword in `Field` by qgallouedec in https://github.com/microsoft/DeepSpeed/pull/6764
* Fix potential memory issues when using DeepSpeed Z3 by wenbinc-Bin in https://github.com/microsoft/DeepSpeed/pull/6726
* Removes unnecessary cloning by swigls in https://github.com/microsoft/DeepSpeed/pull/6761
* Enable torch compile on _allgather_params by deepcharm in https://github.com/microsoft/DeepSpeed/pull/6769
* Unpin with latest transformers fixes by loadams in https://github.com/microsoft/DeepSpeed/pull/6763
* docs: fix HF links by imba-tjd in https://github.com/microsoft/DeepSpeed/pull/6780
* Fix Doc Error: ZeRO Stage 2 gradient partitioning by yewentao256 in https://github.com/microsoft/DeepSpeed/pull/6775
* Cleanup code docs warnings by loadams in https://github.com/microsoft/DeepSpeed/pull/6783
* Domino Blog by GuanhuaWang in https://github.com/microsoft/DeepSpeed/pull/6776
* Update version.txt before release by loadams in https://github.com/microsoft/DeepSpeed/pull/6784
* Revert release workflow by loadams in https://github.com/microsoft/DeepSpeed/pull/6785

New Contributors
* zhangsmallshark made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6733
* hwchen2017 made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6742
* minjiazhang made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6748
* qgallouedec made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6764
* wenbinc-Bin made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6726
* swigls made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6761
* imba-tjd made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6780
* yewentao256 made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6775

**Full Changelog**: https://github.com/microsoft/DeepSpeed/compare/v0.15.4...v0.16.0
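
The "Sanitize inputs to eval()" entry above concerns a common hardening step. The following is a minimal sketch of one general way to avoid calling `eval()` on untrusted strings, using the standard library's `ast.literal_eval`. It is not taken from the DeepSpeed change itself, which may use a different approach, and the helper name `parse_literal` is hypothetical.

```python
import ast


def parse_literal(text: str):
    """Parse a string into a Python literal without executing any code.

    Hypothetical helper for illustration; not part of DeepSpeed.
    """
    try:
        # literal_eval accepts only literals (numbers, strings, lists,
        # dicts, tuples, sets, booleans, None) and rejects calls or names.
        return ast.literal_eval(text)
    except (ValueError, SyntaxError) as exc:
        raise ValueError(f"not a plain Python literal: {text!r}") from exc


print(parse_literal("[1, 2, 3]"))
print(parse_literal("{'stage': 3}"))

try:
    parse_literal("__import__('os').system('echo unsafe')")
except ValueError as err:
    print("rejected:", err)
```

The trade-off is that only literal values are accepted, so inputs that legitimately need arithmetic or name lookups require a purpose-built parser instead.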

0.15.4

What's Changed
* Update version.txt after 0.15.3 release by loadams in https://github.com/microsoft/DeepSpeed/pull/6652
* Fix expert grad scaling problem with ZeRO optimizer by wyooyw in https://github.com/microsoft/DeepSpeed/pull/6546
* Add attribute check for language_model when replacing the last linear module by Yejing-Lai in https://github.com/microsoft/DeepSpeed/pull/6650
* fix init_device_mesh for torch 2.4 by Lzhang-hub in https://github.com/microsoft/DeepSpeed/pull/6614
* Fix dynamo issue by oraluben in https://github.com/microsoft/DeepSpeed/pull/6527
* sequence parallel for uneven heads by inkcherry in https://github.com/microsoft/DeepSpeed/pull/6392
* Add fallback for is_compiling by tohtana in https://github.com/microsoft/DeepSpeed/pull/6663
* Update profiler registration check by loadams in https://github.com/microsoft/DeepSpeed/pull/6668
* Add support for H100/sm_90 arch compilation by loadams in https://github.com/microsoft/DeepSpeed/pull/6669
* Update Gaudi2 docker image by loadams in https://github.com/microsoft/DeepSpeed/pull/6677
* Update gaudi2 docker version to latest release (1.18) by raza-sikander in https://github.com/microsoft/DeepSpeed/pull/6648
* Update base docker image for A6000 GPU tests by loadams in https://github.com/microsoft/DeepSpeed/pull/6681
* Remove packages that no longer need to be updated in the latest container by loadams in https://github.com/microsoft/DeepSpeed/pull/6682
* Fix training of pipeline based peft's lora model by xuanhua in https://github.com/microsoft/DeepSpeed/pull/5477
* Update checkout action to latest version by loadams in https://github.com/microsoft/DeepSpeed/pull/5021
* Add attribute check to support git-base autotp by Yejing-Lai in https://github.com/microsoft/DeepSpeed/pull/6688
* fix memcpy issue on backward for zero-infinity by xylian86 in https://github.com/microsoft/DeepSpeed/pull/6670
* Free memory in universal checkpointing tests by tohtana in https://github.com/microsoft/DeepSpeed/pull/6693
* Explicitly set device when reusing dist env by tohtana in https://github.com/microsoft/DeepSpeed/pull/6696
* Update URL in README Pipeline Status for Huawei Ascend NPU by xuedinge233 in https://github.com/microsoft/DeepSpeed/pull/6706
* Pin transformers to 4.45.2 in nv-ds-chat workflow by loadams in https://github.com/microsoft/DeepSpeed/pull/6710
* [Bug Fix] Support threads_per_head < 64 for wavefront size of 64 by jagadish-amd in https://github.com/microsoft/DeepSpeed/pull/6622
* Use one param coordinator for both train/inference scenarios by tohtana in https://github.com/microsoft/DeepSpeed/pull/6662
* Update yapf version by loadams in https://github.com/microsoft/DeepSpeed/pull/6721
* Update flake8 version by loadams in https://github.com/microsoft/DeepSpeed/pull/6722
* Switch what versions of python are supported by loadams in https://github.com/microsoft/DeepSpeed/pull/5676

New Contributors
* wyooyw made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6546
* xuanhua made their first contribution in https://github.com/microsoft/DeepSpeed/pull/5477

**Full Changelog**: https://github.com/microsoft/DeepSpeed/compare/v0.15.3...v0.15.4

0.15.3

What's Changed
* Update version.txt after 0.15.2 release by loadams in https://github.com/microsoft/DeepSpeed/pull/6615
* Clean up prefetched parameters by tohtana in https://github.com/microsoft/DeepSpeed/pull/6557
* AIO CPU Locked Tensor by jomayeri in https://github.com/microsoft/DeepSpeed/pull/6592
* reduce setting global variables to reduce torch compile graph breaks by NirSonnenschein in https://github.com/microsoft/DeepSpeed/pull/6541
* Add API to get devices of offload states by tohtana in https://github.com/microsoft/DeepSpeed/pull/6586
* Ignore reuse_dist_env by tohtana in https://github.com/microsoft/DeepSpeed/pull/6623
* Add API for updating ZeRO gradients by tjruwase in https://github.com/microsoft/DeepSpeed/pull/6590
* [compile] Show breakdown of graph break by delock in https://github.com/microsoft/DeepSpeed/pull/6601
* Accept btl_tcp_if_include option through launcher_args by diskkid in https://github.com/microsoft/DeepSpeed/pull/6613
* Add first Step in LR Schedulers by jomayeri in https://github.com/microsoft/DeepSpeed/pull/6597
* Support safetensors export by xu-song in https://github.com/microsoft/DeepSpeed/pull/6579
* add option to disable logger while compiling to avoid graph breaks by ShellyNR in https://github.com/microsoft/DeepSpeed/pull/6496
* Lock cache file of HF model list by tohtana in https://github.com/microsoft/DeepSpeed/pull/6628
* Add README Pipeline Status for Huawei Ascend NPU by xuedinge233 in https://github.com/microsoft/DeepSpeed/pull/6588
* Update torch version in workflows by tohtana in https://github.com/microsoft/DeepSpeed/pull/6631
* Use file store for tests by tohtana in https://github.com/microsoft/DeepSpeed/pull/6632
* Fix Memory Leak In AIO by jomayeri in https://github.com/microsoft/DeepSpeed/pull/6630
* [XPU] upgrade xpu max1100 CI workflow to pytorch2.3 by Liangliang-Ma in https://github.com/microsoft/DeepSpeed/pull/6646
* [XPU] host timer check version from Torch 2.5 to Torch 2.6 by YizhouZ in https://github.com/microsoft/DeepSpeed/pull/6633
* [XPU] [DeepNVMe] use same cpu_op_desc_t with cuda by Liangliang-Ma in https://github.com/microsoft/DeepSpeed/pull/6645

New Contributors
* diskkid made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6613
* ShellyNR made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6496

**Full Changelog**: https://github.com/microsoft/DeepSpeed/compare/v0.15.2...v0.15.3

0.15.2

What's Changed
* Update version.txt after 0.15.1 release by loadams in https://github.com/microsoft/DeepSpeed/pull/6493
* HPU: add required ENV vars to accelerator init by nelyahu in https://github.com/microsoft/DeepSpeed/pull/6495
* Op_builder->is_compatible quiet warning by terry-for-github in https://github.com/microsoft/DeepSpeed/pull/6093
* fix pipeline eval_batch micro_batches argument for schedule by nelyahu in https://github.com/microsoft/DeepSpeed/pull/6484
* Fix the broken url link by rogerxfeng8 in https://github.com/microsoft/DeepSpeed/pull/6500
* fix environment variable export bug for MultiNodeRunner by TideDra in https://github.com/microsoft/DeepSpeed/pull/5878
* Revert "BF16 optimizer: Clear lp grads after updating hp grads in hook" by nelyahu in https://github.com/microsoft/DeepSpeed/pull/6508
* wrap include cuda_bf16.h with ifdef BF16_AVAILABLE by oelayan7 in https://github.com/microsoft/DeepSpeed/pull/6520
* Avoid security issues of subprocess shell by tjruwase in https://github.com/microsoft/DeepSpeed/pull/6498
* Add conditional on torch version for scaled_dot_product_attention by loadams in https://github.com/microsoft/DeepSpeed/pull/6517
* Added Intel Gaudi to Accelerator Setup Guide by ShifaAbu in https://github.com/microsoft/DeepSpeed/pull/6543
* Skip failing newly added tests in accelerate by loadams in https://github.com/microsoft/DeepSpeed/pull/6574
* Use msgpack for p2p comm by tohtana in https://github.com/microsoft/DeepSpeed/pull/6547
* DeepNVMe perf tuning by tjruwase in https://github.com/microsoft/DeepSpeed/pull/6560
* [Accelerator] Cambricon MLU support by Andy666G in https://github.com/microsoft/DeepSpeed/pull/6472
* Fix gradient accumulation for Z2+offload by tohtana in https://github.com/microsoft/DeepSpeed/pull/6550
* fix errors when setting zero3 leaf modules with torch.compile by NirSonnenschein in https://github.com/microsoft/DeepSpeed/pull/6564
* [XPU] Support DeepNVMe new code structure by Liangliang-Ma in https://github.com/microsoft/DeepSpeed/pull/6532
* Add APIs to offload states of model, optimizer, and engine by tohtana in https://github.com/microsoft/DeepSpeed/pull/6011
* add bfloat16 to inference support dtypes by nelyahu in https://github.com/microsoft/DeepSpeed/pull/6528
* [COMPILE] workflow for deepspeed + torch.compile by YizhouZ in https://github.com/microsoft/DeepSpeed/pull/6570
* Fixes on the accelerate side mean we do not need to skip this test by loadams in https://github.com/microsoft/DeepSpeed/pull/6583
* Fix torch include in `op_builder/mlu/fused_adam.py` and update no-torch workflow triggers by loadams in https://github.com/microsoft/DeepSpeed/pull/6584
* [ROCm] Fix subprocess error by jagadish-amd in https://github.com/microsoft/DeepSpeed/pull/6587
* Cleanup CODEOWNERS file to be valid by loadams in https://github.com/microsoft/DeepSpeed/pull/6603
* Add SSF Best practices badge by loadams in https://github.com/microsoft/DeepSpeed/pull/6604
* Move V100 workflows from cuda 11.1/11.7 to 12.1 by loadams in https://github.com/microsoft/DeepSpeed/pull/6607
* Fix SD workflow by loadams in https://github.com/microsoft/DeepSpeed/pull/6609
* Pin accelerate to fix CI failures/issues by loadams in https://github.com/microsoft/DeepSpeed/pull/6610
* Add llama3.2 vision autotp by Yejing-Lai in https://github.com/microsoft/DeepSpeed/pull/6577
* Improve DS logging control by tjruwase in https://github.com/microsoft/DeepSpeed/pull/6602
* Fix device selection using CUDA_VISIBLE_DEVICES by tohtana in https://github.com/microsoft/DeepSpeed/pull/6530
* Handle when `backend` is also in compile_kwargs by oraluben in https://github.com/microsoft/DeepSpeed/pull/6502
* Rearrange inference OPS and stop using builder.load by oelayan7 in https://github.com/microsoft/DeepSpeed/pull/5490
* Unpin accelerate tests, update lightning with node16 removal. by loadams in https://github.com/microsoft/DeepSpeed/pull/6611
* Enabled Qwen2-MoE Tensor Parallelism (TP) inference by gyou2021 in https://github.com/microsoft/DeepSpeed/pull/6551

New Contributors
* TideDra made their first contribution in https://github.com/microsoft/DeepSpeed/pull/5878
* ShifaAbu made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6543
* jagadish-amd made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6587
* gyou2021 made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6551

**Full Changelog**: https://github.com/microsoft/DeepSpeed/compare/v0.15.1...v0.15.2
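
The "Avoid security issues of subprocess shell" entry above refers to a well-known class of problem: building a shell command string from untrusted input. As a rough illustration of the general technique (not the actual DeepSpeed patch), the sketch below passes the command as an argument list so no shell ever parses it. The helper name `run_command` and the sample input are hypothetical.

```python
import subprocess
import sys
from typing import Sequence


def run_command(args: Sequence[str]) -> str:
    """Run a command given as an argument list, with no shell involved.

    Hypothetical helper for illustration; not part of DeepSpeed.
    """
    # shell=False is the default: each list element reaches the child
    # process as one argument, so shell metacharacters are not interpreted.
    result = subprocess.run(args, capture_output=True, text=True, check=True)
    return result.stdout.strip()


# This value would run a second command if pasted into a shell string.
untrusted = "hello; echo injected"

# The child process simply echoes back its first argument.
child_code = "import sys; print(sys.argv[1])"
print(run_command([sys.executable, "-c", child_code, untrusted]))
```

Because the value is delivered as a single argument, the `;` is treated as ordinary text rather than a command separator.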

0.15.1

Not secure
What's Changed
* Update version.txt after 0.15.0 release by loadams in https://github.com/microsoft/DeepSpeed/pull/6403
* Fix Type Mismatch by jomayeri in https://github.com/microsoft/DeepSpeed/pull/6410
* Fix redundant seq data parallel grp argument in Z3/MiCS by samadejacobs in https://github.com/microsoft/DeepSpeed/pull/5352
* add Huawei Ascend NPU setup guide by xuedinge233 in https://github.com/microsoft/DeepSpeed/pull/6445
* Add documentation for launcher without SSH by dogacancolak-kensho in https://github.com/microsoft/DeepSpeed/pull/6455
* Dtype support check for accelerator in UTs by raza-sikander in https://github.com/microsoft/DeepSpeed/pull/6360
* Store/Load CIFAR from local/offline by raza-sikander in https://github.com/microsoft/DeepSpeed/pull/6390
* Add the accelerator setup guide link in Getting Started page by rogerxfeng8 in https://github.com/microsoft/DeepSpeed/pull/6452
* Allow triton==3.0.x for fp_quantizer by siddartha-RE in https://github.com/microsoft/DeepSpeed/pull/6447
* Change GDS to 1 AIO thread by jomayeri in https://github.com/microsoft/DeepSpeed/pull/6459
* [CCL] fix condition issue in ccl.py by YizhouZ in https://github.com/microsoft/DeepSpeed/pull/6443
* Avoid gds build errors on ROCm by rraminen in https://github.com/microsoft/DeepSpeed/pull/6456
* TestLowCpuMemUsage UT get device by device_name by raza-sikander in https://github.com/microsoft/DeepSpeed/pull/6397
* Add workflow to build DS without torch to better test before releases by loadams in https://github.com/microsoft/DeepSpeed/pull/6450
* Fix patch for parameter partitioning in zero.Init() by tohtana in https://github.com/microsoft/DeepSpeed/pull/6388
* Add default value to "checkpoint_folder" in "load_state_dict" of bf16_optimizer by ljcc0930 in https://github.com/microsoft/DeepSpeed/pull/6446
* DeepNVMe tutorial by tjruwase in https://github.com/microsoft/DeepSpeed/pull/6449
* bf16_optimizer: fixes to different grad acc dtype by nelyahu in https://github.com/microsoft/DeepSpeed/pull/6485
* print warning if actual triton cache dir is on NFS, not just for default by jrandall in https://github.com/microsoft/DeepSpeed/pull/6487
* DS_BUILD_OPS should build only compatible ops by tjruwase in https://github.com/microsoft/DeepSpeed/pull/6489
* Safe usage of popen by tjruwase in https://github.com/microsoft/DeepSpeed/pull/6490
* Handle an edge case where `CUDA_HOME` is not defined on ROCm systems by amorehead in https://github.com/microsoft/DeepSpeed/pull/6488

New Contributors
* xuedinge233 made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6445
* siddartha-RE made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6447
* ljcc0930 made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6446
* jrandall made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6487
* amorehead made their first contribution in https://github.com/microsoft/DeepSpeed/pull/6488

**Full Changelog**: https://github.com/microsoft/DeepSpeed/compare/v0.15.0...v0.15.1
