Scrape

Latest version: v0.11.3

Page 16 of 20

0.2.5

Not secure

------

- added requests to install_requires in setup.py

0.2.4

Not secure

------

- added an attributes flag that specifies which tag attributes to extract from a given page, such as text or href
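
As a rough illustration of what extracting selected tag attributes can look like, here is a minimal sketch using only Python's standard library. It is not the package's own code: the names `AttributeExtractor` and `extract` are hypothetical, and treating "text" as a pseudo-attribute meaning visible text is an assumption based on the wording above.

```python
from html.parser import HTMLParser


class AttributeExtractor(HTMLParser):
    """Collect the requested attributes (and optionally text) from a page."""

    def __init__(self, attributes):
        super().__init__()
        self.attributes = set(attributes)
        self.found = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the current tag
        for name, value in attrs:
            if name in self.attributes and value is not None:
                self.found.append(value)

    def handle_data(self, data):
        # Assumption: "text" is a pseudo-attribute meaning the visible text
        if "text" in self.attributes and data.strip():
            self.found.append(data.strip())


def extract(html, attributes):
    """Return the values of the requested attributes in document order."""
    parser = AttributeExtractor(attributes)
    parser.feed(html)
    parser.close()
    return parser.found


page = '<p>Hello <a href="https://example.com">link</a></p>'
print(extract(page, ["href"]))          # only link targets
print(extract(page, ["text", "href"]))  # text and link targets, interleaved
```

Values are returned in the order they appear in the page, so text and attribute values stay interleaved as they occur.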

0.2.3

Not secure

------

- updated flags and flag help messages
- verbose output is now the default, with fewer messages; use --quiet to silence them
- renamed the --files flag to --html for saving output as HTML
- added --text flag; text is still the default

0.2.2

Not secure

------

- fixed a character encoding issue; all text is now unicode

0.2.1

Not secure

------

- improved exception handling so that PART files are removed properly
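
One common way to guarantee that a temporary file is removed even when processing fails is a `try`/`finally` block. The sketch below shows that pattern under stated assumptions: `process_part_file` and its `process` callback are hypothetical names, and the package's actual cleanup logic may differ.

```python
import os
import tempfile


def process_part_file(html, process):
    """Write html to a temporary PART file, process it, and always remove it."""
    # Hypothetical naming: a temp file whose name starts with PART and ends in .html
    fd, path = tempfile.mkstemp(prefix="PART", suffix=".html")
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as part:
            part.write(html)
        return process(path)
    finally:
        # Runs whether process() returned or raised, so no PART file is left behind
        if os.path.exists(path):
            os.remove(path)
```

Because the removal sits in `finally`, an exception raised by `process` still propagates to the caller, but the file is gone by the time the caller sees it.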

0.2.0

Not secure

------

- pages are now saved to PART.html files as they are crawled and are processed/removed as necessary, which greatly reduces memory use
- added a page cache with a limit of 10 for greater duplicate protection
- added --files option for keeping webpages as PART.html instead of saving as text or pdf; this also organizes them into a subdirectory named after the seed url's domain
- renamed the --restrict flag to --strict for restricting the domain to the seed domain while crawling
- more messages are printed with --verbose
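
The bounded page cache mentioned above can be sketched with a fixed-length deque. This is only an illustration: the changelog does not say whether the cache is keyed on URLs or on page content, so hashing the page text is an assumption, and `PageCache` and `seen_before` are hypothetical names.

```python
from collections import deque
import hashlib


class PageCache:
    """Remember the most recent pages so that repeats can be skipped."""

    def __init__(self, limit=10):
        # deque(maxlen=...) drops the oldest entry automatically when full
        self.recent = deque(maxlen=limit)

    def seen_before(self, page_text):
        """Return True if this page is in the cache, else record it."""
        # Assumption: pages are compared by a hash of their content
        digest = hashlib.sha256(page_text.encode("utf-8")).hexdigest()
        if digest in self.recent:
            return True
        self.recent.append(digest)
        return False
```

A small limit keeps memory use flat during long crawls, at the cost of only catching duplicates that occur within the last few pages.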
