Datachain

Latest version: v0.8.3

Safety actively analyzes 693883 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 7 of 11

0.3.15

What's Changed
* Add resolve files by EdwardLi-coder in https://github.com/iterative/datachain/pull/313
* unskip test_udf_parallel by mattseddon in https://github.com/iterative/datachain/pull/432
* fix last modified comparison in resolve file test by mattseddon in https://github.com/iterative/datachain/pull/436
* Refactor `Client.parse_url()` by ilongin in https://github.com/iterative/datachain/pull/435
* Set stream for nested file signals by dberenbaum in https://github.com/iterative/datachain/pull/443
* Read arrow files from cache by dberenbaum in https://github.com/iterative/datachain/pull/442
* Auto-detect huggingface datasets when reading tabular data by dberenbaum in https://github.com/iterative/datachain/pull/398
* Add `datachain.lib.tar.process_tar()` generator by rlamy in https://github.com/iterative/datachain/pull/440
* Fix storage dependencies by ilongin in https://github.com/iterative/datachain/pull/421


**Full Changelog**: https://github.com/iterative/datachain/compare/0.3.14...0.3.15

0.3.14

What's Changed
* fix dependency install instructions for examples by mattseddon in https://github.com/iterative/datachain/pull/426
* Show progress bar for pytorch conversion by dberenbaum in https://github.com/iterative/datachain/pull/429
* Fix calculating datasets stats size by dreadatour in https://github.com/iterative/datachain/pull/418
* use the correct fixtures in tests by mattseddon in https://github.com/iterative/datachain/pull/428
* Adding Complex Type Support to Signal Schema by dtulga in https://github.com/iterative/datachain/pull/422
* tests: fix mock for subprocess stdout/stderr to return BytesIO by skshetry in https://github.com/iterative/datachain/pull/431
* prevent tests from hanging on CI (windows) by mattseddon in https://github.com/iterative/datachain/pull/427
* Remove Entry class and use File instead by rlamy in https://github.com/iterative/datachain/pull/419


**Full Changelog**: https://github.com/iterative/datachain/compare/0.3.13...0.3.14

0.3.13

What's Changed
* Remove legacy columns by rlamy in https://github.com/iterative/datachain/pull/263


**Full Changelog**: https://github.com/iterative/datachain/compare/0.3.12...0.3.13

0.3.12

What's Changed
* Fixes settings by dberenbaum in https://github.com/iterative/datachain/pull/397
* fix open file method for tar files by dberenbaum in https://github.com/iterative/datachain/pull/412
* disable execution of last query expression by default by skshetry in https://github.com/iterative/datachain/pull/407

New Contributors
* yathomasi made their first contribution in https://github.com/iterative/datachain/pull/408

**Full Changelog**: https://github.com/iterative/datachain/compare/0.3.11...0.3.12

0.3.11

What's Changed
* query: remove use of pipe for communication by skshetry in https://github.com/iterative/datachain/pull/393
* do not require last statement to be an expression or an instance of DatasetQuery by skshetry in https://github.com/iterative/datachain/pull/395
* pin pydantic < 2.9 by mattseddon in https://github.com/iterative/datachain/pull/399
* unpin pydantic, use python API for datamodel_codegen by skshetry in https://github.com/iterative/datachain/pull/400
* Update the DataChain logo in the README and docs by djsauble in https://github.com/iterative/datachain/pull/402
* avoid splitting script into feature files/scripts by skshetry in https://github.com/iterative/datachain/pull/385
* allow merge on expressions by mattseddon in https://github.com/iterative/datachain/pull/388

New Contributors
* djsauble made their first contribution in https://github.com/iterative/datachain/pull/402

**Full Changelog**: https://github.com/iterative/datachain/compare/0.3.10...0.3.11

0.3.10

What's Changed
* Support for reading from huggingface hub with `hf://` filesystem by dberenbaum in https://github.com/iterative/datachain/pull/375
* Simplify datachain.lib.listing by reusing Cilent.scandir() by rlamy in https://github.com/iterative/datachain/pull/376
* Use stderr for sql debug prints by shcheklein in https://github.com/iterative/datachain/pull/378
* Refactor `DataChain.from_storage()` to use new listing generator by ilongin in https://github.com/iterative/datachain/pull/294
* remove unused finally block by mattseddon in https://github.com/iterative/datachain/pull/379
* [pre-commit.ci] pre-commit autoupdate by pre-commit-ci in https://github.com/iterative/datachain/pull/382
* increase timeout of e2e test by mattseddon in https://github.com/iterative/datachain/pull/383
* metrics: save metrics in realtime by skshetry in https://github.com/iterative/datachain/pull/387
* query: remove support for saving dataset query with a given name by skshetry in https://github.com/iterative/datachain/pull/389
* Using job class instead of hardcodced `Job` by ilongin in https://github.com/iterative/datachain/pull/391
* cli: remove preview from `datachain query` command by skshetry in https://github.com/iterative/datachain/pull/392
* fix issues with new version of huggingface datasets package by mattseddon in https://github.com/iterative/datachain/pull/394
* Add `DataChain.listings()` method and use it in getting storages by ilongin in https://github.com/iterative/datachain/pull/331


**Full Changelog**: https://github.com/iterative/datachain/compare/0.3.9...0.3.10

Page 7 of 11

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.