Ocrodjvu

Latest version: v0.13.2

Safety actively analyzes 682361 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 6 of 9

0.7.2

* Don't hang if one of the threads raises an exception.
* Use the logging module for printing progress messages, errors etc.
* Produce more useful import error messages on Debian-based systems.

-- Jakub Wilk <jwilkjwilk.net> Mon, 04 Apr 2011 01:14:22 +0200

0.7.1

* Windows: guess location of the DjVuLibre DLL (requires python-djvulibre
≥ 0.3.3).
* ocrodjvu:
+ Work around a bug in Cuneiform, which mistakenly use “slo” (rather than
“slv”) as language code for Slovenian.
https://bugs.launchpad.net/cuneiform-linux/+bug/707951
+ Accept “ces”, “nld”, “slv”, “ron” as language codes for Czech, Dutch,
Slovenian and Romanian languages, even when Cuneiform internally use
different ones.
* djvu2hocr:
+ Don't flip hOCR upside-down.
https://bugs.debian.org/611460

-- Jakub Wilk <jwilkjwilk.net> Sat, 29 Jan 2011 18:14:40 +0100

0.7.0

* Correctly handle empty pages recognized by Cuneiform and Ocrad.
Thanks to Alexey Shipunov for the bug report.
* Fix crash on Cuneiform-generated hOCR with bounding boxes for whitespace
characters.
Thanks to Alexey Shipunov for the bug report.
* Fix compatibility with Tesseract 3.00.
* Fix colors in 24-bit BMP images.
* ocrodjvu:
+ Make “-e” an alias for “--engine”.
+ Make “-l” an alias for “--language”.
+ Add the -X option (for advanced users).
+ Work-around for Cuneiform returning files with control characters is now
disabled by default. Use “-X fix-html=1” to re-enable it.
+ Add the --on-error option (for advanced users).
* djvu2hocr:
+ Fix a typo, which prevented hocr2djvused from correctly parsing files
produced by it.
https://bugs.debian.org/600539
* Extend the test suite.

-- Jakub Wilk <jwilkjwilk.net> Sun, 07 Nov 2010 21:37:00 +0100

0.6.1

* Improve detection of Tesseract.
* Correctly handle unrecognized and non-ASCII characters in Ocrad ORF output.
Thanks to Heinrich Schwietering for the bug report.
* Correctly handle text that is closer than 100 pixels from the left edge in
Ocrad ORF output.
Thanks to Heinrich Schwietering for the test case.
* Fix crash on hOCR with image elements.
https://bugs.debian.org/598139
Thanks to Alexey Shipunov for the bug report.
* Fix insecure use of temporary files when using Cuneiform.
https://bugs.debian.org/598134
CVE-2010-4338

-- Jakub Wilk <jwilkjwilk.net> Sun, 26 Sep 2010 15:01:51 +0200

0.6.0

* Add support for the Tesseract OCR engine.
* Fix Cuneiform support (a regression introduced in 0.5).
Thanks to Kyrill Detinov for the bug report.

-- Jakub Wilk <jwilkjwilk.net> Thu, 16 Sep 2010 19:24:20 +0200

0.5.1

* Fix crash when listing engines/languages if OCRopus is not found.
Thanks to Kyrill Detinov for the bug report.
* lxml is no longer required for OCR engines that are not using hOCR as
output format.

-- Jakub Wilk <jwilkjwilk.net> Wed, 15 Sep 2010 18:38:00 +0200

Page 6 of 9

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.