Sumgram

Latest version: v1.0.4

Safety actively analyzes 722491 Python packages for vulnerabilities to keep your Python projects secure.

1.0.1

Major updates:

* Support for reading text from STDIN: `$ cat path/to/collection/of/text/files/*.txt | sumgram -`
* Sumgram uses an [English stopwords](https://github.com/oduwsdl/sumgram/blob/c584e7eba1fab7deac1489aefa3f48f1b0d3ab64/sumgram/util.py#L33) list by default (switch off with `--no-default-stopwords`). To include additional stopwords `--add-stopwords`
* may be used to include additional stopwords:
$ sumgram --add-stopwords stopword1 stopword2 -t 10 path/to/collection/of/text/files/
* may be used to include additional stopwords in a text file (1 stopword per line):
$ sumgram --add-stopwords my_stopwords_file.txt -t 10 path/to/collection/of/text/files/
* Extracting/processing text from URLs:
$ sumgram "http://example.com/news/article-1.html" "http://example.com/news/article-1.html".
To change the default new article boilerplate removal method (`boilerpy3.ArticleExtractor`), set `--boilerplate-rm-method` with one of the following `'boilerpy3.DefaultExtractor', 'boilerpy3.ArticleSentencesExtractor', 'boilerpy3.LargestContentExtractor', 'boilerpy3.CanolaExtractor', 'boilerpy3.KeepEverythingExtractor', 'boilerpy3.NumWordsRulesExtractor', and 'nltk'` (regular expression for stripping all HTML tags)

0.0.19

Minor update to [address](https://github.com/oduwsdl/sumgram/commit/c584e7eba1fab7deac1489aefa3f48f1b0d3ab64) [Unable to install numpy 1.17.0 with setup script](https://github.com/oduwsdl/sumgram/issues/27) issue

0.0.18

Minor changes
* Added `-v --version` command-line option
* Made `regex` default `--sentence-tokenizer`

Releases

Has known vulnerabilities