- There was a bug where pdf paths were assumed to be strings - this is not always the case
0.3.11
- Faster inference with ONNX - Remove warning when loading scikit-learn model - Flatten form fields into pdf
0.3.10
Fixed bug that didn't unnormalize bboxes properly.
0.3.9
- Optimize dictionary access and loops to get an ~10% speedup
0.3.8
Enable optional parallel workers when extracting text. This can cause a performance hit on small pdfs, but can speed things up 2x or more on larger ones. This can be done with the `--workers` flag via CLI, or via the workers kwarg.
0.3.7
Not all spans would have the right font information before. This fixes the issue.