Pdftext

Latest version: v0.3.10

Page 2 of 3

0.3.4

- Use line breaks from pdfium

0.3.3

- Option to keep characters with JSON/dictionary output
- Fix some bugs when interfacing with the pdfium C API (thanks mara004)

0.3.2

- Add probability threshold for block predictions

0.3.1

- Select a range of pages instead of converting the whole document
- Minor internal refactor to use docs instead of paths

0.3.0

- Fix bug where hyphens didn't show up at the end of lines
- Improve wrapping for hyphens: join words across hyphens before a newline (disable by passing `keep_hyphens`)
- Restructure output to avoid redundant info in the JSON blob: keep track of text spans with similar font info instead of individual characters
- Update model to predict blocks more accurately
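
The hyphen-joining behavior described for 0.3.0 can be illustrated with a small standalone sketch. This is not pdftext's implementation: the function name `join_hyphenated` and the regular expression are hypothetical, and only the `keep_hyphens` name is taken from the changelog entry above. It assumes that a hyphen directly followed by a newline, with a word character on each side, marks a word split across lines.

```python
import re

# Hypothetical pattern: a word character, a hyphen, a newline, then another
# word character. This is an assumption about what a line-break hyphen looks like.
_HYPHEN_BREAK = re.compile(r"(\w)-\n(\w)")


def join_hyphenated(text: str, keep_hyphens: bool = False) -> str:
    """Join words split across a line-ending hyphen.

    Illustrative only; not the pdftext API.
    """
    if keep_hyphens:
        # Leave the text untouched, hyphens and newlines included.
        return text
    # Drop the hyphen and the newline so the two halves form one word.
    return _HYPHEN_BREAK.sub(r"\1\2", text)


if __name__ == "__main__":
    sample = "The implemen-\ntation joins words."
    print(join_hyphenated(sample))
    print(join_hyphenated(sample, keep_hyphens=True))
```

A sketch this simple also merges genuine compounds that happen to break at the hyphen (for example `well-` followed by `known`), so a real implementation would likely need more context than a regular expression provides.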

0.2.1

- Switch the character box to a `loose` box, to get the full character range
