Dolma

Latest version: v1.0.3

Safety actively analyzes 638773 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 2 of 3

0.9.1

What's Changed
* Fix Jekyll Docs Build by soldni in https://github.com/allenai/dolma/pull/55
* Adding Citation text back to README by soldni in https://github.com/allenai/dolma/pull/56
* Bump rustix from 0.37.20 to 0.37.25 by dependabot in https://github.com/allenai/dolma/pull/59
* Documentation on BaseParallelProcessor by soldni in https://github.com/allenai/dolma/pull/62
* Add download instruction by Muennighoff in https://github.com/allenai/dolma/pull/63
* Fix spawn method for multiprocessing by soldni in https://github.com/allenai/dolma/pull/64
* Fix hardcoded URL by soldni in https://github.com/allenai/dolma/pull/65
* Fix Accidental Override of Boolean Value by soldni in https://github.com/allenai/dolma/pull/66

New Contributors
* Muennighoff made their first contribution in https://github.com/allenai/dolma/pull/63

**Full Changelog**: https://github.com/allenai/dolma/compare/v0.9.0...v0.9.1

0.9.0

What's Changed
* Skipping AWS checks when aws access key is not available by soldni in https://github.com/allenai/dolma/pull/28
* env variable is not passed to tests by soldni in https://github.com/allenai/dolma/pull/29
* Fix make by chris-ha458 in https://github.com/allenai/dolma/pull/24
* Fix `make` more by chris-ha458 in https://github.com/allenai/dolma/pull/31
* ff by soldni in https://github.com/allenai/dolma/pull/36
* Adding C4 example, dryrun mode, profiling taggers by soldni in https://github.com/allenai/dolma/pull/37
* Only run Python style checks on source and tests by soldni in https://github.com/allenai/dolma/pull/38
* fix rust parts by chris-ha458 in https://github.com/allenai/dolma/pull/23
* Add rust unit tests by chris-ha458 in https://github.com/allenai/dolma/pull/35
* Bump webpki from 0.22.0 to 0.22.2 by dependabot in https://github.com/allenai/dolma/pull/52
* Adding Tokenizer, Writing Documentation, Misc Bugs & CLI improvements by soldni in https://github.com/allenai/dolma/pull/54

New Contributors
* chris-ha458 made their first contribution in https://github.com/allenai/dolma/pull/24
* dependabot made their first contribution in https://github.com/allenai/dolma/pull/52

**Full Changelog**: https://github.com/allenai/dolma/compare/v0.8.0...v0.9.0

0.8.0

What's Changed
* Analyzer to save and plot taggers distribution by soldni in https://github.com/allenai/dolma/pull/21
* Scripts to compute statistics by soldni in https://github.com/allenai/dolma/pull/22


**Full Changelog**: https://github.com/allenai/dolma/compare/v0.7.0...v0.8.0

0.7.0

What's Changed
* CLI improvements, remove need of experiment name by soldni in https://github.com/allenai/dolma/pull/20


**Full Changelog**: https://github.com/allenai/dolma/compare/v0.6.5...v0.7.0

0.6.5

What's Changed
* added validation of configs, tagger bugfixes by soldni in https://github.com/allenai/dolma/pull/18
* upping version by soldni in https://github.com/allenai/dolma/pull/19


**Full Changelog**: https://github.com/allenai/dolma/compare/v0.6.4...v0.6.5

0.6.4

What's Changed
* adding tests in CI by soldni in https://github.com/allenai/dolma/pull/17
* Added tests for local/remote bindings for deduper/mixer by soldni in https://github.com/allenai/dolma/pull/15


**Full Changelog**: https://github.com/allenai/dolma/compare/v0.6.3...v0.6.4

Page 2 of 3

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.