Pywebcopy

Latest version: v7.0.2

Safety actively analyzes 682404 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

7.0.0

- Object-oriented rewrite of the `pywebcopy`
- command line interface is improved.
- better reliability when using threading.
- separate modes for threaded and non-threaded tasks.

6.0.0

- `WebPage` class now doesn't take any argument **(breaking change)**
- `WebPage` class has new methods `WebPage.get` and `WebPage.set_source`
- Queuing of downloads is replaced with a barrier to manage active threads


[version 5.x]

- Optimization of existing code, upto 5x speed ups in certain cases
- Removed cluttering, improved readability

[version 4.x]

- **A complete rewrite and restructing of apis.**
- Availble apis through `from pywebcopy import *`

* `save_webpage`
* `save_website`
* `config`
* `WebPage`
* `Crawler`
* `MultiParser`
* `SESSION`
* `URLTransformer`
* `filename_present`
* `TagBase`
* `LinkTag`
* `ScriptTag`
* `ImgTag`
* `AnchorTag`
* `get`
* `new_file`

2.0.0

[changed]

- `core.setup_config` function is changed to `config.setup_config`.

[added]

- added `utils.trace` decorator, which will **print** *function_name*, *args*, *kwargs* and *return value* when debug config key is True.
- new html-parsers ('html5lib', 'lxml') are supported for better webpages.
- html-parser is now defaulted to 'lxml'. You can use any through new `config.config` key called `parser`

[fixed]

- fixed issue while changing `user-agent` key cracked webpages. You can now use any browser's user-agent id and it will get exact same page downloaded.
- fixed issue in `generators.extract_css_urls` which was caused by `str` and `bytes` difference in python3.
- fixed issues in modules importing. (Thanks "**Илья Игоревич**").
- added `errorhandling` to required functions

2.0beta

- `init` function is replaced with `save_webpage`
- three new `config` automation functions are added -
- `core.setup_config` (creates every ideal config just from url and download location)
- `config.reset_config` (resets the configuration to default state)
- `config.update_config` (manual-mode version of `core.setup_config`)
- object `structures.WebPage` added
- merged `generators.generate_style_map` and `generators.generate_relative_paths` to a single function `generators.generate_style_map`
- rewrite of majority of functions
- new module `exceptions` added

1.10

- `url` is checked and resolved of any redirection before starting any work functions.
- `init` vars : `mirrors_dir` and `clean_up` were fixed which cleaned the dir before the log was completely written.
- `init` call now takes `url` arg by default and could raise a error when not supplied
- professional looking log entries
- rewritten archiving system now uses `zipfile` and `exceptions` handling to prevent errors and eventual archive corruption

1.9

- more redundant code
- modules are now separated based on type e.g. Core, Generators, Utils etc.
- new helper functions and class `structures.WebPage`
- Compatible with Python 2.6, 2.7, 3.6, 3.7

Links

Releases

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.