Unstructured

Latest version: v0.17.2

Safety actively analyzes 723607 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 24 of 39

0.6.5

Not secure
Enhancements

* Added support for SpooledTemporaryFile file argument.

Features

Fixes

0.6.4

Not secure
Enhancements

* Added an "ocr_only" strategy for `partition_pdf`. Refactored the strategy decision
logic into its own module.

Features

Fixes

0.6.3

Not secure
Enhancements

* Add an "ocr_only" strategy for `partition_image`.

Features

* Added `partition_multiple_via_api` for partitioning multiple documents in a single REST
API call.
* Added `stage_for_baseplate` function to prepare outputs for ingestion into Baseplate.
* Added `partition_odt` for processing Open Office documents.

Fixes

* Updates the grouping logic in the `partition_pdf` fast strategy to group together text
in the same bounding box.

0.6.2

Not secure
Enhancements

* Added logic to `partition_pdf` for detecting copy protected PDFs and falling back
to the hi res strategy when necessary.

Features

* Add `partition_via_api` for partitioning documents through the hosted API.

Fixes

* Fix how `exceeds_cap_ratio` handles empty (returns `True` instead of `False`)
* Updates `detect_filetype` to properly detect JSONs when the MIME type is `text/plain`.

0.6.1

Not secure
Enhancements

* Updated the table extraction parameter name to be more descriptive

Features

Fixes

0.6.0

Not secure
Enhancements

* Adds an `ssl_verify` kwarg to `partition` and `partition_html` to enable turning off
SSL verification for HTTP requests. SSL verification is on by default.
* Allows users to pass in ocr language to `partition_pdf` and `partition_image` through
the `ocr_language` kwarg. `ocr_language` corresponds to the code for the language pack
in Tesseract. You will need to install the relevant Tesseract language pack to use a
given language.

Features

* Table extraction is now possible for pdfs from `partition` and `partition_pdf`.
* Adds support for extracting attachments from `.msg` files

Fixes

* Adds an `ssl_verify` kwarg to `partition` and `partition_html` to enable turning off
SSL verification for HTTP requests. SSL verification is on by default.

Page 24 of 39

Links

Releases

Has known vulnerabilities

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.