-----
* added layout preservation option for pdftotext PDF extractor (`93`_ by
  `ankushshah89`_)
* added simple support for extensionless filenames, treating them as plain
  ``.txt`` files (`85`_)
* several bug fixes, including:

  * now extracting the text in tables from ``.docx`` files at the end of the
    text extraction (`92`_ by `jsmith-mploir`_)
  * faster testing framework by only rebuilding test data when needed (`90`_)
  * fixed ``.html`` and ``.epub`` parsers to deal with beautifulsoup4
    upgrades
  * using official ``msg-extractor`` now that it has a native ``setup.py``
  * updated tests for ``.html``, ``.ogg``, ``.wav``, and ``.mp3`` file types to
    be consistent with more recent versions of the underlying packages