Unstructured

Latest version: v0.16.11

Safety actively analyzes 687918 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 20 of 34

0.7.1

Enhancements

Features

* Add `stage_for_weaviate` to stage `unstructured` outputs for upload to Weaviate, along with
a helper function for defining a class to use in Weaviate schemas.
* Builds from Unstructured base image, built off of Rocky Linux 8.7, this resolves almost all CVE's in the image.

Fixes

0.7.0

Enhancements

* Installing `detectron2` from source is no longer required when using the `local-inference` extra.
* Updates `.pptx` parsing to include text in tables.

Features

Fixes

* Fixes an issue in `_add_element_metadata` that caused all elements to have `page_number=1`
in the element metadata.
* Adds `.log` as a file extension for TXT files.
* Adds functionality to try other common encodings for email (`.eml`) files if an error related to the encoding is raised and the user has not specified an encoding.
* Allow passed encoding to be used in the `replace_mime_encodings`
* Fixes page metadata for `partition_html` when `include_metadata=False`
* A `ValueError` now raises if `file_filename` is not specified when you use `partition_via_api`
with a file-like object.

0.6.11

Enhancements

* Supports epub tests since pandoc is updated in base image

Features

Fixes

0.6.10

Enhancements

* XLS support from auto partition

Features

Fixes

0.6.9

Enhancements

* fast strategy for pdf now keeps element bounding box data
* setup.py refactor

Features

Fixes

* Adds functionality to try other common encodings if an error related to the encoding is raised and the user has not specified an encoding.
* Adds additional MIME types for CSV

0.6.8

Enhancements

Features

* Add `partition_csv` for CSV files.

Fixes

Page 20 of 34

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.