Scrapinghub-autoextract

Latest version: v0.6.0

Safety actively analyzes 682416 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 2 of 2

0.2.0

------------------

* asyncio API is rewritten, to simplify use in cases where passing meta
is required. ``autoextract.aio.request_parallel_as_completed`` is added,
``autoextract.aio.request_parallel`` and ``autoextract.aio.request_batch``
are removed.
* CLI: it now shows various stats: mean response and connect time,
% of throttling errors, % of network and other errors
* CLI: new ``--intype jl`` option allows to process a .jl file
with arbitrary AutoExtract API queries
* CLI: new ``--shuffle`` option allows to shuffle input data, to spread it
more evenly across websites.
* CLI: it no longer exits on unrecoverable errors, to aid long-running
processing tasks.
* retry logic is adjusted to handle network errors better.
* ``autoextract.aio.request_raw`` and
``autoextract.aio.request_parallel_as_completed`` functions provide an
interface to return statistics about requests made, including retries.
* autoextract.Request, autoextract.ArticleRequest, autoextract.ProductRequest,
autoextract.JobPostingRequest helper classes
* Documentation improvements.

0.1.1

------------------

* allow up to 100 elements in a batch, not up to 99
* custom User-Agent header is added
* Python 3.8 support is declared & tested

0.1

----------------

Initial release.

Page 2 of 2

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.