* Strip trailing whitespace from text zones bigger than words (lines,
paragraphs, …).
* Fix compatibility with Tesseract 3.02.
Thanks to Janusz S. Bień for the bug report.
* ocrodjvu:
+ Make it possible to pass multiple languages to Tesseract ≥ 3.02.
https://github.com/jwilk/ocrodjvu/issues/3
Thanks to Janusz S. Bień for the bug report.
+ Cuneiform: rename mixed Russian-English language code:
“rus-eng” → “rus+eng”. This is for consistency with Tesseract.
+ Tesseract: fix support for Chinese language pack.
+ Tesseract: make it possible to pass the -psm option in order to
customize layout analysis. For example, to enable OSD, use:
-X extra_args='-psm 1'
+ Make --list-languages output sorted.
+ Tesseract: remove “osd” from language list.
+ Accept both ISO 639-2/T and ISO 639-2/B language codes.
+ Add the --save-raw-ocr option.
+ Add the --raw-ocr-filename-template option.
+ Improve documentation of the --ocr-only option.
* Require Python ≥ 2.6.
* Fix compatibility with nose 1.2.
Thanks to Kyrill Detinov for the bug report.
-- Jakub Wilk <jwilk@jwilk.net> Wed, 17 Apr 2013 00:59:23 +0200