Unstructured-inference

Latest version: v0.8.10

Safety actively analyzes 723929 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 11 of 19

0.5.31

* Add functionality to extract and save images from the page
* Add functionality to get only "true" embedded images when extracting elements from PDF pages
* Update the layout visualization script to be able to show only image elements if need
* add an evaluation metric for table comparison based on token similarity
* fix paddle unit tests where `make test` fails since paddle doesn't work on M1/M2 chip locally

0.5.28

* add env variable `ENTIRE_PAGE_OCR` to specify using paddle or tesseract on entire page OCR

0.5.27

* table structure detection now pads the input image by 25 pixels in all 4 directions to improve its recall

0.5.26

* support paddle with both cpu and gpu and assumed it is pre-installed

0.5.25

* fix a bug where `cells_to_html` doesn't handle cells spanning multiple rows properly

0.5.24

* remove `cv2` preprocessing step before OCR step in table transformer

Page 11 of 19

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.