Uses jnelson16's version of textstat for Flesch scores, which uses textblob to measure sentence length.
Filters Flesch scores to values greater than -100 and average sentence lengths to values less than 100 by default; the filtering value can be changed with the `--threshold` CLI argument.
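As a rough illustration of the default filter, the sketch below keeps a result only when both metrics fall inside the bounds. The function name `within_threshold` is hypothetical, and the way a single threshold value maps onto the two bounds (`-threshold` for Flesch scores, `threshold` for sentence length) is an assumption based on the defaults stated above, not the library's actual code.

```python
def within_threshold(flesch_score, avg_sentence_length, threshold=100):
    """Hypothetical helper, not part of the library.

    Assumes one threshold value bounds both metrics: Flesch scores
    must be greater than -threshold and average sentence lengths
    must be less than threshold.
    """
    return flesch_score > -threshold and avg_sentence_length < threshold


# Example: a typical document passes, a garbled one does not.
results = [(62.3, 18.4), (-250.0, 310.0)]
kept = [r for r in results if within_threshold(*r)]
```

Under these assumptions, only the first pair in `results` survives the filter.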
0.6.2
This hotfix allows the user to pass arguments to the `ml.estimate` and `corpus.get_streamer` functions.
0.6.1
This hotfix catches a "division by zero" error on the sentence length analysis when no sentences are found.
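The kind of guard involved can be sketched as follows. This is a minimal illustration, not the library's implementation: the function name `mean_sentence_length`, the whitespace-based word count, and the choice to return `None` for an empty input are all assumptions.

```python
def mean_sentence_length(sentences):
    """Hypothetical helper: average number of words per sentence.

    Returns None when no sentences are found, instead of dividing
    by zero. Words are counted by splitting on whitespace, which is
    an assumption made for this sketch.
    """
    word_counts = [len(sentence.split()) for sentence in sentences]
    if not word_counts:
        # No sentences: there is nothing to average.
        return None
    return sum(word_counts) / len(word_counts)
```

Returning a sentinel such as `None` lets the caller decide whether to skip the document or report a missing value.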
0.6.0
This version introduces readability metrics, including Flesch Reading Ease.
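For reference, the standard Flesch Reading Ease formula is 206.835 - 1.015 × (words / sentences) - 84.6 × (syllables / words). The sketch below applies it to precomputed counts. The function name `flesch_reading_ease` is hypothetical, and how the library itself counts words, sentences, and syllables is not shown here.

```python
def flesch_reading_ease(total_words, total_sentences, total_syllables):
    """Hypothetical helper applying the standard Flesch Reading Ease formula.

    Higher scores indicate easier text. The counts are supplied by the
    caller; counting syllables and sentences is outside this sketch.
    """
    if total_words == 0 or total_sentences == 0:
        raise ValueError("need at least one word and one sentence")
    words_per_sentence = total_words / total_sentences
    syllables_per_word = total_syllables / total_words
    return 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word


# Example: 100 words, 5 sentences, 150 syllables.
score = flesch_reading_ease(100, 5, 150)
```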
0.5.0
This version significantly reorganizes and simplifies the library and framework:
* NLP analysis is moved from corpus metadata to the new `nlp` commands
* `estimator` functions have been moved to `ml`
* Trained estimators are now packaged as a single `.qge` file
* Snakemake is no longer a dependency; workflow management is left to the discretion of the user.

See the updated [documentation](http://docs.quantgov.org) for details.
0.4.2
This version fixes a bug for S3 Corpus Drivers on Windows.