Llm-dataset-converter

Latest version: v0.2.6

Safety actively analyzes 722491 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 3

0.2.6

------------------

- switched to underscores in project name
- requiring seppl>=0.2.13 now
- added support for aliases
- added `discard-by-name` filter, which uses the `file` filed in the meta-data for its matching
- added placeholder support
- method `text_utils.empty_str_if_none` now handles bool/int/float as well
- CSV/TSV writers now have an `--encoding` option to use a specific encoding other than the default, e.g., UTF-8

0.2.5

------------------

- added `setuptools` as dependency

0.2.4

------------------

- requiring seppl>=0.2.6 now
- readers use default globs now, allowing the user to simply supply directories as input
- renamed `split` filter to `split-records` to avoid name clash with meta-data key `split` as parameter

0.2.3

------------------

- requiring seppl>=0.2.4 now

0.2.2

------------------

- requiring seppl>=0.2.3 now

0.2.1

------------------

- filters `split` and `tee` now support `ClassificationData` as well
- added `metadata-from-name` filter to extract meta-data from the current input file name
- added `inspect` filter that allows inspecting data interactively as it passes through the pipeline
- added `empty_str_if_none` helper method to `ldc.text_utils` to ensure no None/null values are output with writers
- upgraded seppl to 0.2.2 and switched to using `seppl.ClassListerRegistry`

Page 1 of 3

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.