Mosaicml

Latest version: v0.29.0

Page 9 of 15

0.13.1

What's Changed
* Add configurability to HF checkpointer timeout by dakinggg in https://github.com/mosaicml/llm-foundry/pull/1599

**Full Changelog**: https://github.com/mosaicml/llm-foundry/compare/v0.13.0...v0.13.1

0.13.0

🛠️ Bug Fixes & Cleanup
PyTorch 2.4 Checkpointing (1569, 1581, 1583)
Resolved issues related to checkpointing for Curriculum Learning (CL) callbacks.

🔧 Dependency Updates
Bumped tiktoken from 0.4.0 to 0.8.0 (1572)
Updated onnxruntime from 1.19.0 to 1.19.2 (1590)

What's Changed
* Update mcli yamls by dakinggg in https://github.com/mosaicml/llm-foundry/pull/1552
* Use `allenai/c4` instead of `c4` dataset by eitanturok in https://github.com/mosaicml/llm-foundry/pull/1554
* Tensor Parallelism by eitanturok in https://github.com/mosaicml/llm-foundry/pull/1521
* Insufficient Permissions Error when trying to access table by KuuCi in https://github.com/mosaicml/llm-foundry/pull/1555
* Add NoOp optimizer by snarayan21 in https://github.com/mosaicml/llm-foundry/pull/1560
* Deterministic GCRP Errors by KuuCi in https://github.com/mosaicml/llm-foundry/pull/1559
* Simplify CL API by b-chu in https://github.com/mosaicml/llm-foundry/pull/1510
* Reapply 1389 by dakinggg in https://github.com/mosaicml/llm-foundry/pull/1561
* Add dataset swap callback by b-chu in https://github.com/mosaicml/llm-foundry/pull/1536
* Add error to catch more unknown example types by milocress in https://github.com/mosaicml/llm-foundry/pull/1562
* Add FileExtensionNotFoundError by b-chu in https://github.com/mosaicml/llm-foundry/pull/1564
* Add InvalidConversationError by b-chu in https://github.com/mosaicml/llm-foundry/pull/1565
* Release docker img by KuuCi in https://github.com/mosaicml/llm-foundry/pull/1547
* Revert FT dataloader changes from 1561, keep 1564 by snarayan21 in https://github.com/mosaicml/llm-foundry/pull/1566
* Cleanup TP by eitanturok in https://github.com/mosaicml/llm-foundry/pull/1556
* Changes for dataset swap callback by gupta-abhay in https://github.com/mosaicml/llm-foundry/pull/1569
* Do not consider run_name when auto-detecting autoresume by irenedea in https://github.com/mosaicml/llm-foundry/pull/1571
* Allow parameters with requires_grad=False in meta init by sashaDoubov in https://github.com/mosaicml/llm-foundry/pull/1567
* Bump tiktoken from 0.4.0 to 0.8.0 by dependabot in https://github.com/mosaicml/llm-foundry/pull/1572
* Add extensions to FinetuningFileNotFoundError by b-chu in https://github.com/mosaicml/llm-foundry/pull/1578
* Handle long file names in convert text to mds by irenedea in https://github.com/mosaicml/llm-foundry/pull/1579
* Set streaming log level by mvpatel2000 in https://github.com/mosaicml/llm-foundry/pull/1582
* Fix pytorch checkpointing for CL callback by b-chu in https://github.com/mosaicml/llm-foundry/pull/1581
* Fix pytorch checkpointing for CL callback by b-chu in https://github.com/mosaicml/llm-foundry/pull/1583
* Error if filtered dataset contains 0 examples by irenedea in https://github.com/mosaicml/llm-foundry/pull/1585
* Change cluster errors from NetworkError to UserError by irenedea in https://github.com/mosaicml/llm-foundry/pull/1586
* Do not autoresume if a default name is set, only on user defined ones by irenedea in https://github.com/mosaicml/llm-foundry/pull/1588
* Bump onnxruntime from 1.19.0 to 1.19.2 by dependabot in https://github.com/mosaicml/llm-foundry/pull/1590
* Make FinetuningStreamingDataset parameters more flexible by XiaohanZhangCMU in https://github.com/mosaicml/llm-foundry/pull/1580
* Add build callback tests by irenedea in https://github.com/mosaicml/llm-foundry/pull/1577
* Bump version to 0.14.0.dev0 by irenedea in https://github.com/mosaicml/llm-foundry/pull/1587
* Fix typo in eval code by using 'fsdp' instead of 'fsdp_config' by irenedea in https://github.com/mosaicml/llm-foundry/pull/1593


**Full Changelog**: https://github.com/mosaicml/llm-foundry/compare/v0.12.0...v0.13.0

0.12.1

Not secure
New Features

1. **📚 In-Context Learning (1876)**

With Composer and MosaicML Cloud you can now evaluate LLMs on in-context learning tasks (LAMBADA, HellaSwag, PIQA, and more) hundreds of times faster than other evaluation harnesses. Please see our "[Blazingly Fast LLM Evaluation for In-Context Learning](https://www.mosaicml.com/blog/llm-evaluation-for-icl)" blog post for more details!

1. **💾 Added support for Coreweave [Object Storage](https://docs.coreweave.com/storage/object-storage) (1915)**

Coreweave object store is compatible with `boto3`. Uploading objects to Coreweave object store is almost exactly like writing to S3, except that an `endpoint_url` must be set via the `S3_ENDPOINT_URL` environment variable. For example:

```python
import os
os.environ['S3_ENDPOINT_URL'] = 'https://object.las1.coreweave.com'

from composer.trainer import Trainer

# `model` and `train_dataloader` are assumed to be defined elsewhere.
# Save checkpoints every epoch to s3://my_bucket/checkpoints
trainer = Trainer(
    model=model,
    train_dataloader=train_dataloader,
    max_duration='10ep',
    save_folder='s3://my_bucket/checkpoints',
    save_interval='1ep',
    save_overwrite=True,
    save_filename='ep{epoch}.pt',
    save_num_checkpoints_to_keep=0,  # delete all checkpoints locally
)

trainer.fit()
```


Please see our [checkpointing documentation](https://docs.mosaicml.com/en/stable/trainer/checkpointing.html#uploading-checkpoints-to-object-store) for more details.
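
As a rough illustration of why setting the environment variable is enough, the sketch below shows how an S3-compatible client could pick up a custom endpoint. The helper name `s3_client_kwargs` is hypothetical and is not part of Composer; only the `S3_ENDPOINT_URL` variable and the `endpoint_url` argument accepted by `boto3` clients come from the text above.

```python
import os


def s3_client_kwargs(env=None):
    """Build keyword arguments for an S3-compatible client (hypothetical helper)."""
    env = os.environ if env is None else env
    kwargs = {}
    endpoint_url = env.get('S3_ENDPOINT_URL')
    if endpoint_url:
        # Non-AWS stores such as Coreweave need an explicit endpoint.
        kwargs['endpoint_url'] = endpoint_url
    return kwargs


# With boto3 installed, the result could be passed straight through:
#   import boto3
#   client = boto3.client('s3', **s3_client_kwargs())
print(s3_client_kwargs({'S3_ENDPOINT_URL': 'https://object.las1.coreweave.com'}))
```

When the variable is unset the helper returns an empty dict, so the same code path would fall back to the client's default AWS endpoint.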

1. **🪵 Automatic logging of Trainer hparams (1855)**

Hyperparameter arguments passed to the `Trainer` are now automatically logged. Simply set the `Trainer` argument `auto_log_hparams=True`.
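
To make the idea concrete, here is a toy sketch of what automatic hyperparameter logging amounts to: the constructor snapshots its own arguments and records them when the flag is set. `ToyTrainer` and `logged_hparams` are hypothetical stand-ins, not Composer's implementation; with the real library the only change needed is passing `auto_log_hparams=True` to `Trainer`.

```python
class ToyTrainer:
    """Stand-in for a trainer; not Composer's implementation."""

    def __init__(self, max_duration='1ep', save_interval='1ep', auto_log_hparams=False):
        # Snapshot the constructor arguments before any other locals exist.
        hparams = dict(locals())
        hparams.pop('self')
        # Only record them when the caller opted in.
        self.logged_hparams = hparams if auto_log_hparams else {}


trainer = ToyTrainer(max_duration='10ep', auto_log_hparams=True)
print(trainer.logged_hparams)
```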

Bug Fixes

* Update Docker images to use ‘posix_prefix’ paths (1854)
* Disable new notebook in CI (1875)
* [Fix] Enable logging of metrics from Callbacks to ConsoleLogging (1884)
* Ensure loggers run init event before callbacks in Engine (1890)
* Raise an error in FSDP meta tensor initialization if there are no initialization functions, fix associated flaky FSDP test (1905)
* Add primitive list support (1906)
* Add logic for shifting labels before computing metrics (1913)
* Fix misspecified dependency (1919)
* pin setuptools in build requirements (1926)
* Pin pip<23 in Docker images (1936)
* Fix bug in trainer.eval and add test cases for test_console_logger (1937)
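
The label-shifting fix (1913) presumably concerns causal language models, where the prediction at position `t` should be scored against the token at position `t + 1`. The pure-Python sketch below illustrates that alignment under this assumption. The function names and the `ignore_index` value of -100 are chosen for illustration and are not taken from Composer's code.

```python
def shift_labels(labels, ignore_index=-100):
    """Shift labels left by one so position t holds the token at t + 1."""
    return labels[1:] + [ignore_index]


def accuracy(predictions, labels, ignore_index=-100):
    """Token accuracy over positions whose label is not ignore_index."""
    pairs = [(p, l) for p, l in zip(predictions, labels) if l != ignore_index]
    return sum(p == l for p, l in pairs) / len(pairs)


tokens = [5, 8, 2, 9]
# A perfect next-token predictor outputs tokens[t + 1] at position t.
predictions = [8, 2, 9, 0]

print(accuracy(predictions, tokens))                # unshifted labels: 0.0
print(accuracy(predictions, shift_labels(tokens)))  # shifted labels: 1.0
```

Without the shift, a perfect next-token predictor scores zero, which is why metrics have to be computed on shifted labels.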

What's Changed
* Rename GradMonitor -> OptimizerMonitor; add functionality to log optimizer-specific metrics to assist loss spike investigation by bmosaicml in https://github.com/mosaicml/composer/pull/1743
* Add GCS uri support for loading and saving checkpoints by eracah in https://github.com/mosaicml/composer/pull/1833
* HF factory function tests by dakinggg in https://github.com/mosaicml/composer/pull/1832
* Fix doc issue, Trainer hparam log_to_console defaults to False by eracah in https://github.com/mosaicml/composer/pull/1840
* Removed YAHP references from Docs by bandish-shah in https://github.com/mosaicml/composer/pull/1841
* Typo by nguyenhoan1988 in https://github.com/mosaicml/composer/pull/1843
* Fix source code links in docs by bandish-shah in https://github.com/mosaicml/composer/pull/1844
* add importorskip by dakinggg in https://github.com/mosaicml/composer/pull/1847
* Update Docker images to use ‘posix_prefix’ paths by mvpatel2000 in https://github.com/mosaicml/composer/pull/1854
* Fix typo by standardAI in https://github.com/mosaicml/composer/pull/1849
* ConsoleLogger: log first batch and first epoch when using console_log_interval by eracah in https://github.com/mosaicml/composer/pull/1860
* Simpler auto log hparams by eracah in https://github.com/mosaicml/composer/pull/1855
* Fix typos by cclauss in https://github.com/mosaicml/composer/pull/1850
* Bump sphinxext-opengraph from 0.7.3 to 0.7.4 by dependabot in https://github.com/mosaicml/composer/pull/1851
* Bump coverage[toml] from 6.5.0 to 7.0.1 by dependabot in https://github.com/mosaicml/composer/pull/1853
* Bump traitlets from 5.7.0 to 5.8.0 by dependabot in https://github.com/mosaicml/composer/pull/1852
* Bump ipython from 7.32.0 to 8.8.0 by dependabot in https://github.com/mosaicml/composer/pull/1865
* Update monai requirement from <0.10,>=0.9.1 to >=0.9.1,<1.2 by dependabot in https://github.com/mosaicml/composer/pull/1869
* Bump sphinxcontrib-katex from 0.9.3 to 0.9.4 by dependabot in https://github.com/mosaicml/composer/pull/1868
* Bump coverage[toml] from 7.0.1 to 7.0.4 by dependabot in https://github.com/mosaicml/composer/pull/1867
* Upgrade docker images to `torch==1.13.1` by abhi-mosaic in https://github.com/mosaicml/composer/pull/1863
* add more useful info to state by dakinggg in https://github.com/mosaicml/composer/pull/1848
* Feature/lambada evaluator by bmosaicml in https://github.com/mosaicml/composer/pull/1845
* multi-node distributed training, submitit & composer integration demo by YilunKuang in https://github.com/mosaicml/composer/pull/1753
* Daily tests by mvpatel2000 in https://github.com/mosaicml/composer/pull/1870
* Disable new notebook in CI by mvpatel2000 in https://github.com/mosaicml/composer/pull/1875
* Update deepspeed by mvpatel2000 in https://github.com/mosaicml/composer/pull/1864
* fix fail fast in daily by mvpatel2000 in https://github.com/mosaicml/composer/pull/1880
* Fix getting started docs by mvpatel2000 in https://github.com/mosaicml/composer/pull/1878
* Speed up test_lm_task_evaluation by mvpatel2000 in https://github.com/mosaicml/composer/pull/1879
* Fix unprotected import by mvpatel2000 in https://github.com/mosaicml/composer/pull/1874
* add ignore_modules to fsdp by vchiley in https://github.com/mosaicml/composer/pull/1877
* Change vision image by mvpatel2000 in https://github.com/mosaicml/composer/pull/1881
* Fix eval_forward in the ComposerModel ABC by eracah in https://github.com/mosaicml/composer/pull/1871
* Fix fsdp weight tying by bcui19 in https://github.com/mosaicml/composer/pull/1856
* Bump pytest from 7.2.0 to 7.2.1 by dependabot in https://github.com/mosaicml/composer/pull/1886
* Bump ipykernel from 6.19.2 to 6.20.1 by dependabot in https://github.com/mosaicml/composer/pull/1887
* Bump gitpython from 3.1.28 to 3.1.30 by dependabot in https://github.com/mosaicml/composer/pull/1888
* Update Vision Image in Pytest by mvpatel2000 in https://github.com/mosaicml/composer/pull/1882
* Streaming data tests by dakinggg in https://github.com/mosaicml/composer/pull/1842
* Add NLP Algorithms Tests by nik-mosaic in https://github.com/mosaicml/composer/pull/1839
* rename HF notebook by dakinggg in https://github.com/mosaicml/composer/pull/1873
* Ensure loggers run init event before callbacks in Engine by eracah in https://github.com/mosaicml/composer/pull/1890
* [Fix] Enable logging of metrics from Callbacks to ConsoleLogging by eracah in https://github.com/mosaicml/composer/pull/1884
* Updating how we load metrics in a state_dict so we don't add extra memory overhead by bcui19 in https://github.com/mosaicml/composer/pull/1892
* Getting daily tests passing by dakinggg in https://github.com/mosaicml/composer/pull/1893
* Bump nbsphinx from 0.8.10 to 0.8.12 by dependabot in https://github.com/mosaicml/composer/pull/1897
* Fix docker image by mvpatel2000 in https://github.com/mosaicml/composer/pull/1894
* Add primitive list support by mvpatel2000 in https://github.com/mosaicml/composer/pull/1906
* Raise an error in FSDP `meta` tensor initialization if there's no initialization functions, fix associated flaky FSDP test by bcui19 in https://github.com/mosaicml/composer/pull/1905
* Gpu Test by mvpatel2000 in https://github.com/mosaicml/composer/pull/1907
* Update docker with FFCV fix by mvpatel2000 in https://github.com/mosaicml/composer/pull/1908
* Restore GPU tests by mvpatel2000 in https://github.com/mosaicml/composer/pull/1909
* Update workflow names by mvpatel2000 in https://github.com/mosaicml/composer/pull/1910
* Enable daily gpu tests by mvpatel2000 in https://github.com/mosaicml/composer/pull/1911
* Tweak daily GPU tests by mvpatel2000 in https://github.com/mosaicml/composer/pull/1912
* Daily GPU Tests -- Change to Git Commit by mvpatel2000 in https://github.com/mosaicml/composer/pull/1914
* Add logic for shifting labels before computing metrics by alextrott16 in https://github.com/mosaicml/composer/pull/1913
* Add coreweave object store support. by eracah in https://github.com/mosaicml/composer/pull/1915
* Fixes mis specified dependency by dakinggg in https://github.com/mosaicml/composer/pull/1919
* Bump coverage[toml] from 7.0.4 to 7.1.0 by dependabot in https://github.com/mosaicml/composer/pull/1923
* Update importlib-metadata requirement from <6,>=5.0.0 to >=5.0.0,<7 by dependabot in https://github.com/mosaicml/composer/pull/1921
* pin setuptools in build requirements by dakinggg in https://github.com/mosaicml/composer/pull/1926
* Remove synthetic testing infrastructure for HF/NLP by dakinggg in https://github.com/mosaicml/composer/pull/1895
* Add upgrade flags to pip installs by dakinggg in https://github.com/mosaicml/composer/pull/1916
* Temporarily pin pip to <23 by dakinggg in https://github.com/mosaicml/composer/pull/1930
* add link protection by mvpatel2000 in https://github.com/mosaicml/composer/pull/1927
* Cleaning up error checking for FSDP sharding strategies with fp32 precision by bcui19 in https://github.com/mosaicml/composer/pull/1925
* Fix mcp script to avoid follow by mvpatel2000 in https://github.com/mosaicml/composer/pull/1932
* Emit Eval progress in console logging by eracah in https://github.com/mosaicml/composer/pull/1917
* Remove Fused LayerNorm deprecation by nik-mosaic in https://github.com/mosaicml/composer/pull/1931
* Add EFA Support for Multinode in AWS by mvpatel2000 in https://github.com/mosaicml/composer/pull/1891
* remove jenkins gpu tests by mvpatel2000 in https://github.com/mosaicml/composer/pull/1933
* Typo due to stale MCLI docs by mvpatel2000 in https://github.com/mosaicml/composer/pull/1934
* Pin pip<23 in Docker images by bandish-shah in https://github.com/mosaicml/composer/pull/1936
* Fix bug in trainer.eval and add test cases for test_console_logger by eracah in https://github.com/mosaicml/composer/pull/1937
* Add few shot and multiple choice to ICL evaluation by bmosaicml in https://github.com/mosaicml/composer/pull/1876
* Disable test_streaming_datasets in pytest-daily by bandish-shah in https://github.com/mosaicml/composer/pull/1939

New Contributors
* bmosaicml made their first contribution in https://github.com/mosaicml/composer/pull/1743
* nguyenhoan1988 made their first contribution in https://github.com/mosaicml/composer/pull/1843
* standardAI made their first contribution in https://github.com/mosaicml/composer/pull/1849
* cclauss made their first contribution in https://github.com/mosaicml/composer/pull/1850
* YilunKuang made their first contribution in https://github.com/mosaicml/composer/pull/1753
* vchiley made their first contribution in https://github.com/mosaicml/composer/pull/1877

**Full Changelog**: https://github.com/mosaicml/composer/compare/v0.12.0...v0.12.1

0.12.0

Not secure
New Features

PyTorch 2.4 (1505)
This release updates LLM Foundry to the PyTorch 2.4 release, bringing with it support for the new features and optimizations in PyTorch 2.4.

Extensibility improvements (1450, 1449, 1468, 1467, 1478, 1493, 1495, 1511, 1512, 1527)
Numerous improvements to the extensibility of the modeling and data loading code, enabling easier reuse for subclassing and extending. Please see the linked PRs for more details on each change.

Improved error messages (1457, 1459, 1519, 1518, 1522, 1534, 1548, 1551)
Various improved error messages, making user errors easier to debug.

Sliding window in torch attention (1455)
We've added support for sliding window attention to the reference attention implementation, allowing easier testing and comparison against more optimized attention variants.
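
For readers unfamiliar with the technique, the following sketch builds a causal sliding-window mask in plain Python. It assumes one common convention, in which each query attends to itself and the `window_size` tokens before it; the exact convention and function names used in LLM Foundry may differ, so treat `sliding_window_causal_mask` as a hypothetical helper.

```python
def sliding_window_causal_mask(seq_len, window_size):
    """Return mask[i][j] == True where query i may attend to key j."""
    return [
        [0 <= i - j <= window_size for j in range(seq_len)]
        for i in range(seq_len)
    ]


for row in sliding_window_causal_mask(seq_len=5, window_size=2):
    print(''.join('x' if allowed else '.' for allowed in row))
# x....
# xx...
# xxx..
# .xxx.
# ..xxx
```

A reference implementation like this is slow but easy to inspect, which is what makes it useful for checking optimized attention kernels against.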

Bug fixes

Extra BOS token for llama 3.1 with completion data (1476)
A bug resulted in an extra BOS token being added between prompt and response during finetuning. This is fixed so that the prompt and response supplied by the user are concatenated without any extra tokens put between them.
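
The effect of the fix can be sketched with a toy tokenizer. Everything here (`BOS`, `toy_encode`, `build_example`) is hypothetical and only illustrates the intended result, a single BOS at the start and none between prompt and response; it is not how LLM Foundry implements the change.

```python
BOS = 1  # hypothetical BOS token id


def toy_encode(text, add_bos):
    """Toy tokenizer: one id per character, optionally prefixed with BOS."""
    ids = [ord(ch) for ch in text]
    return [BOS] + ids if add_bos else ids


def build_example(prompt, response):
    """Concatenate prompt and response with a single leading BOS."""
    prompt_ids = toy_encode(prompt, add_bos=True)
    # Encoding the response with add_bos=True would insert a second BOS
    # in the middle of the sequence, which is the bug described above.
    response_ids = toy_encode(response, add_bos=False)
    return prompt_ids + response_ids


example = build_example('Hi', '!')
print(example)  # [1, 72, 105, 33]
```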

What's Changed
* Add test for logged_config transforms by b-chu in https://github.com/mosaicml/llm-foundry/pull/1441
* Bump version to 0.12.0.dev0. by irenedea in https://github.com/mosaicml/llm-foundry/pull/1447
* Update pytest-codeblocks requirement from <0.17,>=0.16.1 to >=0.16.1,<0.18 by dependabot in https://github.com/mosaicml/llm-foundry/pull/1445
* Bump coverage[toml] from 7.4.4 to 7.6.1 by dependabot in https://github.com/mosaicml/llm-foundry/pull/1442
* Enabled generalizing build_inner_model in ComposerHFCausalLM by gupta-abhay in https://github.com/mosaicml/llm-foundry/pull/1450
* Update llm foundry version in mcli yamls by irenedea in https://github.com/mosaicml/llm-foundry/pull/1451
* merge to main by XiaohanZhangCMU in https://github.com/mosaicml/llm-foundry/pull/865
* allow embedding resizing passed through by jdchang1 in https://github.com/mosaicml/llm-foundry/pull/1449
* Update packaging requirement from <23,>=21 to >=21,<25 by dependabot in https://github.com/mosaicml/llm-foundry/pull/1444
* Update pytest requirement from <8,>=7.2.1 to >=7.2.1,<9 by dependabot in https://github.com/mosaicml/llm-foundry/pull/1443
* Implement ruff rules enforcing PEP 585 by snarayan21 in https://github.com/mosaicml/llm-foundry/pull/1453
* Adding sliding window attn to scaled_multihead_dot_product_attention by ShashankMosaicML in https://github.com/mosaicml/llm-foundry/pull/1455
* Add user error for UnicodeDecodeError in convert text to mds by irenedea in https://github.com/mosaicml/llm-foundry/pull/1457
* Fix log_config by josejg in https://github.com/mosaicml/llm-foundry/pull/1432
* Add EnvironmentLogger Callback by josejg in https://github.com/mosaicml/llm-foundry/pull/1350
* Update mosaicml/ci-testing to 0.1.2 by irenedea in https://github.com/mosaicml/llm-foundry/pull/1458
* Correct error message for inference wrapper by josejg in https://github.com/mosaicml/llm-foundry/pull/1459
* Update CI tests to v0.1.2 by KuuCi in https://github.com/mosaicml/llm-foundry/pull/1466
* Bump onnxruntime from 1.18.1 to 1.19.0 by dependabot in https://github.com/mosaicml/llm-foundry/pull/1461
* Update tenacity requirement from <9,>=8.2.3 to >=8.2.3,<10 by dependabot in https://github.com/mosaicml/llm-foundry/pull/1460
* Simple change to enable mapping functions for ft constructor by gupta-abhay in https://github.com/mosaicml/llm-foundry/pull/1468
* use default eval interval from composer by milocress in https://github.com/mosaicml/llm-foundry/pull/1369
* Consistent Naming EnvironmentLoggingCallback by josejg in https://github.com/mosaicml/llm-foundry/pull/1470
* Register NaN Monitor Callback by josejg in https://github.com/mosaicml/llm-foundry/pull/1471
* Add train subset num batches by mvpatel2000 in https://github.com/mosaicml/llm-foundry/pull/1472
* Parent class hf models by jdchang1 in https://github.com/mosaicml/llm-foundry/pull/1467
* Remove extra bos for prompt/response data with llama3.1 by dakinggg in https://github.com/mosaicml/llm-foundry/pull/1476
* Add prepare fsdp back by dakinggg in https://github.com/mosaicml/llm-foundry/pull/1477
* Add date_string when applying tokenizer chat template by snarayan21 in https://github.com/mosaicml/llm-foundry/pull/1474
* Make sample tokenization extensible by gupta-abhay in https://github.com/mosaicml/llm-foundry/pull/1478
* Use Streaming version 0.8.1 by snarayan21 in https://github.com/mosaicml/llm-foundry/pull/1479
* Bump hf-transfer from 0.1.3 to 0.1.8 by dependabot in https://github.com/mosaicml/llm-foundry/pull/1480
* fix hf checkpointer by milocress in https://github.com/mosaicml/llm-foundry/pull/1489
* Fix device mismatch when running hf.generate by ShashankMosaicML in https://github.com/mosaicml/llm-foundry/pull/1486
* Bump composer to 0.24.1 + FSDP config device_mesh deprecation by snarayan21 in https://github.com/mosaicml/llm-foundry/pull/1487
* master_weights_dtype not supported by ComposerHFCausalLM.__init__() by eldarkurtic in https://github.com/mosaicml/llm-foundry/pull/1485
* Detect loss spikes and high losses during training by joyce-chen-uni in https://github.com/mosaicml/llm-foundry/pull/1473
* Enable passing in external position ids by gupta-abhay in https://github.com/mosaicml/llm-foundry/pull/1493
* Align logged attributes for errors and run metadata in kill_loss_spike_callback.py by joyce-chen-uni in https://github.com/mosaicml/llm-foundry/pull/1494
* tokenizer is never built when converting finetuning dataset by eldarkurtic in https://github.com/mosaicml/llm-foundry/pull/1496
* Removing error message for reusing kv cache with torch attn by ShashankMosaicML in https://github.com/mosaicml/llm-foundry/pull/1497
* Fix formatting of loss spike & high loss error messages by joyce-chen-uni in https://github.com/mosaicml/llm-foundry/pull/1498
* Enable cross attention layers by gupta-abhay in https://github.com/mosaicml/llm-foundry/pull/1495
* Update to ci-testing 0.2.0 by dakinggg in https://github.com/mosaicml/llm-foundry/pull/1500
* [WIP] Torch 2.4 in docker images by snarayan21 in https://github.com/mosaicml/llm-foundry/pull/1491
* [WIP] Only torch 2.4.0 compatible by snarayan21 in https://github.com/mosaicml/llm-foundry/pull/1505
* Update mlflow requirement from <2.16,>=2.14.1 to >=2.14.1,<2.17 by dependabot in https://github.com/mosaicml/llm-foundry/pull/1506
* Update ci-testing to 0.2.2 by dakinggg in https://github.com/mosaicml/llm-foundry/pull/1503
* Allow passing key_value_states for x-attn through MPT Block by gupta-abhay in https://github.com/mosaicml/llm-foundry/pull/1511
* Fix cross attention for blocks by gupta-abhay in https://github.com/mosaicml/llm-foundry/pull/1512
* Put 2.3 image back in release examples by dakinggg in https://github.com/mosaicml/llm-foundry/pull/1513
* Sort callbacks so that CheckpointSaver goes before HuggingFaceCheckpointer by irenedea in https://github.com/mosaicml/llm-foundry/pull/1515
* Raise MisconfiguredDatasetError from original error by irenedea in https://github.com/mosaicml/llm-foundry/pull/1519
* Peft fsdp by dakinggg in https://github.com/mosaicml/llm-foundry/pull/1520
* Raise DatasetTooSmall exception if canonical nodes is less than num samples by irenedea in https://github.com/mosaicml/llm-foundry/pull/1518
* Add permissions check for delta table reading by irenedea in https://github.com/mosaicml/llm-foundry/pull/1522
* Add HuggingFaceCheckpointer option for only registering final checkpoint by irenedea in https://github.com/mosaicml/llm-foundry/pull/1516
* Replace FSDP args by KuuCi in https://github.com/mosaicml/llm-foundry/pull/1517
* enable correct padding_idx for embedding layers by gupta-abhay in https://github.com/mosaicml/llm-foundry/pull/1527
* Revert "Replace FSDP args" by KuuCi in https://github.com/mosaicml/llm-foundry/pull/1533
* Delete unneeded inner base model in PEFT HF Checkpointer by snarayan21 in https://github.com/mosaicml/llm-foundry/pull/1532
* Add deprecation warning to fsdp_config by KuuCi in https://github.com/mosaicml/llm-foundry/pull/1530
* Fix reuse kv cache for torch attention by ShashankMosaicML in https://github.com/mosaicml/llm-foundry/pull/1539
* Error on text dataset file not found by milocress in https://github.com/mosaicml/llm-foundry/pull/1534
* Make ICL tasks not required for eval by snarayan21 in https://github.com/mosaicml/llm-foundry/pull/1540
* Bumping flash attention version to 2.6.3 and adding option for softcap in attention and lm_head logits. by ShashankMosaicML in https://github.com/mosaicml/llm-foundry/pull/1374
* Register mosaic logger by dakinggg in https://github.com/mosaicml/llm-foundry/pull/1542
* Hfcheckpointer optional generation config by KuuCi in https://github.com/mosaicml/llm-foundry/pull/1543
* Bump composer version to 0.25.0 by dakinggg in https://github.com/mosaicml/llm-foundry/pull/1546
* Bump streaming version to 0.9.0 by dakinggg in https://github.com/mosaicml/llm-foundry/pull/1550
* Bump version to 0.13.0.dev0 by dakinggg in https://github.com/mosaicml/llm-foundry/pull/1549
* Add proper user error for accessing schema by KuuCi in https://github.com/mosaicml/llm-foundry/pull/1548
* Validate Cluster Access Mode by KuuCi in https://github.com/mosaicml/llm-foundry/pull/1551

New Contributors
* jdchang1 made their first contribution in https://github.com/mosaicml/llm-foundry/pull/1449
* joyce-chen-uni made their first contribution in https://github.com/mosaicml/llm-foundry/pull/1473

**Full Changelog**: https://github.com/mosaicml/llm-foundry/compare/v0.11.0...v0.12.0

0.11.1

Not secure
Bug Fixes

* Fixes for Notebooks (1659)
* Documentation updates and fixes (1685, 1696, 1702, 1709)
* Addressed warnings and speed improvements for Torchmetrics (1674)
* Fixes to Gated Linear Units method (1575, 1689)
* Set `NCCL_ASYNC_ERROR_HANDLING` ENV variable in Composer launcher to enable distributed timeout (1695)
* Fix epoch count when `eval` is called before `fit` (1697)
* Constrain PyTorch package versions to avoid unintended upgrades (1688)
* Fix Optimizer state sharding issue with FSDP (1732)
* Raise `ValueError` if an evaluation dataloader of infinite length is specified
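
The last item in the list above can be pictured as a guard that refuses to start an evaluation that could never finish. The sketch below is an assumption about the general shape of such a check, not Composer's code: `check_eval_dataloader` is a hypothetical helper, and a dataloader without `__len__` is used as a stand-in for "infinite length".

```python
def check_eval_dataloader(dataloader, eval_subset_num_batches=-1):
    """Raise ValueError for an unbounded evaluation run (hypothetical helper)."""
    try:
        num_batches = len(dataloader)
    except TypeError:
        # Objects without __len__, such as generators, have no known length.
        num_batches = None
    if num_batches is None and eval_subset_num_batches == -1:
        raise ValueError(
            'Evaluation dataloader has no length; set a batch limit '
            'so that evaluation can terminate.'
        )
    return num_batches


def endless_batches():
    """Generator that never stops yielding batches."""
    while True:
        yield [0, 1, 2]


print(check_eval_dataloader([[0, 1], [2, 3]]))  # finite: 2 batches
try:
    check_eval_dataloader(endless_batches())
except ValueError as err:
    print(f'ValueError: {err}')
```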

**Full Changelog**: https://github.com/mosaicml/composer/compare/v0.11.0...v0.11.1

0.11.0

Not secure
Streaming `v0.11.0` is released! Install via `pip`:


```
pip install --upgrade mosaicml-streaming==0.11.0
```
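
After upgrading, one way to confirm which version is active is to query the package metadata from Python. This is a small sketch using only the standard library; `installed_version` is a hypothetical helper name, and `mosaicml-streaming` is the distribution name from the install command above.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package):
    """Return the installed version string, or None if the package is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


print(installed_version('mosaicml-streaming'))
```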


What's new
1. Introducing registry for customizable components (https://github.com/mosaicml/streaming/pull/858)
`StreamingDataset` can now be used with custom `Stream` implementations via a registry. See [the documentation page](https://docs.mosaicml.com/projects/streaming/en/stable/dataset_configuration/mixing_data_sources.html) for example usage.
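
For orientation, the general registry pattern looks roughly like the sketch below: classes are recorded under a string key and later constructed by name. All names here (`STREAM_REGISTRY`, `register_stream`, `build_stream`, `MyStream`) are hypothetical and do not reflect the Streaming library's actual API, which is described in the linked documentation.

```python
STREAM_REGISTRY = {}


def register_stream(name):
    """Class decorator that records a stream class under a string key."""
    def decorator(cls):
        STREAM_REGISTRY[name] = cls
        return cls
    return decorator


class BaseStream:
    def __init__(self, remote):
        self.remote = remote


@register_stream('stream')
class DefaultStream(BaseStream):
    pass


@register_stream('my_stream')
class MyStream(BaseStream):
    """Custom stream that normalizes the remote path."""

    def __init__(self, remote):
        super().__init__(remote.rstrip('/'))


def build_stream(kind, **kwargs):
    """Look up a registered class by name and construct it."""
    return STREAM_REGISTRY[kind](**kwargs)


stream = build_stream('my_stream', remote='s3://bucket/data/')
print(type(stream).__name__, stream.remote)  # MyStream s3://bucket/data
```

The benefit of this design is that the dataset code only needs the string key, so custom stream types can be added without modifying the dataset class itself.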

🐛 Bug fixes
* Fix `simulation` module import paths (srstevenson)
* Fix `S3Downloader` serialization issues (wouterzwerink)

What's Changed
* Bound numpy version below 2.2.0 by snarayan21 in https://github.com/mosaicml/streaming/pull/849
* Fix import paths in `simulation` module by srstevenson in https://github.com/mosaicml/streaming/pull/838
* Prevent _s3_client from being serialized by wouterzwerink in https://github.com/mosaicml/streaming/pull/847
* Fix a few typos by srstevenson in https://github.com/mosaicml/streaming/pull/843
* Change broken user guide link to quick start by srstevenson in https://github.com/mosaicml/streaming/pull/841
* Remove unused import from quick start example by srstevenson in https://github.com/mosaicml/streaming/pull/842
* Change simulator UI help text to refer to directory by srstevenson in https://github.com/mosaicml/streaming/pull/839
* Bump fastapi from 0.115.5 to 0.115.6 by dependabot in https://github.com/mosaicml/streaming/pull/845
* Bump pydantic from 2.10.2 to 2.10.3 by dependabot in https://github.com/mosaicml/streaming/pull/846
* Update mosaicml-cli requirement from <0.7,>=0.5.25 to >=0.5.25,<0.8 by dependabot in https://github.com/mosaicml/streaming/pull/850
* Bump uvicorn from 0.32.1 to 0.34.0 by dependabot in https://github.com/mosaicml/streaming/pull/855
* Bump pydantic from 2.10.3 to 2.10.4 by dependabot in https://github.com/mosaicml/streaming/pull/856
* Update huggingface-hub requirement from <0.27,>=0.23.4 to >=0.23.4,<0.28 by dependabot in https://github.com/mosaicml/streaming/pull/859
* Set `epoch_seed_change` attribute on `SimulationDataset` by srstevenson in https://github.com/mosaicml/streaming/pull/840
* Use registry when creating Stream in StreamingDataset by es94129 in https://github.com/mosaicml/streaming/pull/858
* Bump pydantic from 2.10.4 to 2.10.5 by dependabot in https://github.com/mosaicml/streaming/pull/861

New Contributors
* srstevenson made their first contribution in https://github.com/mosaicml/streaming/pull/838
* wouterzwerink made their first contribution in https://github.com/mosaicml/streaming/pull/847
* es94129 made their first contribution in https://github.com/mosaicml/streaming/pull/858

**Full Changelog**: https://github.com/mosaicml/streaming/compare/v0.10.0...v0.11.0
