Unstructured-ingest

Latest version: v1.0.7

Safety actively analyzes 723650 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 11 of 16

0.3.3

Enhancements

* **Add `precheck` to Milvus connector**

Fixes

* **Make AstraDB uploader truncate `text` and `text_as_html` content to max 8000 bytes**
* **Add missing LanceDb extra**
* **Weaviate cloud auth detection fixed**

0.3.2

Enhancements

* **Persist record id in mongodb data, use it to delete previous content to prevent duplicates.**


Fixes

* **Remove forward slash from Google Drive relative path field**
* **Create LanceDB test databases in unique remote locations to avoid conflicts**
* **Add weaviate to destination registry**

0.3.1

Enhancements

* **LanceDB V2 Destination Connector**
* **Persist record id in milvus, use it to delete previous content to prevent duplicates.**
* **Persist record id in weaviate metadata, use it to delete previous content to prevent duplicates.**
* **Persist record id in sql metadata, use it to delete previous content to prevent duplicates.**
* **Persist record id in elasticsearch/opensearch metadata, use it to delete previous content to prevent duplicates.**

Fixes

* **Make AstraDB precheck fail on non-existant collections**
* **Respect Pinecone's metadata size limits** crop metadata sent to Pinecone's to fit inside its limits, to avoid error responses
* **Propagate exceptions raised by delta table connector during write**

0.3.0

Enhancements

* **Added V2 kafka destination connector**
* **Persist record id in pinecone metadata, use it to delete previous content to prevent duplicates.**
* **Persist record id in azure ai search, use it to delete previous content to prevent duplicates.**
* **Persist record id in astradb, use it to delete previous content to prevent duplicates.**
* **Update Azure Cognitive Search to Azure AI Search**

Fixes

* **Fix Delta Table destination precheck** Validate AWS Region in precheck.
* **Add missing batch label to FileData where applicable**
* **Handle fsspec download file into directory** When filenames have odd characters, files are downloaded into a directory. Code added to shift it around to match expected behavior.
* **Postgres Connector Query** causing syntax error when ID column contains strings

0.2.2

Enhancements
* **Remove `overwrite` field** from fsspec and databricks connectors
* **Added migration for GitLab Source V2**
* **Added V2 confluence source connector**
* **Added OneDrive destination connector**
* **Qdrant destination to v2**
* **Migrate Kafka Source Connector to V2**

0.2.1

Enhancements

* **File system based indexers return a record display name**
* **Add singlestore source connector**
* **Astra DB V2 Source Connector** Create a v2 version of the Astra DB Source Connector.
* **Support native async requests from unstructured-client**
* **Support filtering element types in partitioner step**


Fixes

* **Fix Databricks Volumes file naming** Add .json to end of upload file.
* **Fix SQL Type destination precheck** Change to context manager "with".

Page 11 of 16

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.