* Add table of stream lengths for PDF objects containing streams to `--doc-info` output * Quote extraction API methods should use yara, not bespoke extraction * Fix bug with rich tree view of non binary streams
1.7.0
* Use `yaralyzer` as the match engine * Scan all binary streams, not just the fonts
1.6.0
* Integrate YARA scanning - all the rules I could dig up relating to PDFs * Add MD5, SHA1, SHA256 to document info section * `pdfalyzer_show_color_theme` script shows the theme * Make `README` more PyPi friendly
1.5.0
Bunch of small changes to support releasing on [pypi](https://pypi.org/project/pdfalyzer/) * Invoke with shell command `pdfalyze` instead of local python file `./pdfalyzer.py` (options are the same) * Core class renames: `PdfWalker` -> `Pdfalyzer`, `DataStreamHandler` -> `BinaryScanner` * Permanent env var configuration moved from a file called `.env` to a file called `.pdfalyzer` * Logging to a file is off unless configured by env var * To use Didier Stevens's `pdf-parser.py` you must provide the `PDFALYZER_PDF_PARSER_PY_PATH` env var
1.4.0
* Hexadecimal representation of matched bytes in decode attempts table * `--quote-type` option to limit binary scans * `--min-decode-length` option to skip decode attempts on short matches * `--file-suffix` option * Output filenames will contain some of the options used to generate them * Add runtime params to export filenames where it is material to the output * Ensure `/OpenAction` etc are not subsumed by parent/child relationships in the condensed tree view * Tweak available configuration options for logging to file.
1.3.1
* Fix bug with validating directly embedded objects