Etl-manager

Latest version: v8.0.0

Safety actively analyzes 706267 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 4 of 6

6.0.0

Change

- Glue jobs now run using Python 3 and Spark 2.4 as default

5.0.0

Added

- ETL manager now allows use of STRUCT and ARRAY col types in your hive metadata tables.

4.0.0

Updated

- Method function in TableMeta `refresh_paritions` renamed to `refresh_partitions`.
- `refresh_partitions` function now wait for athena to complete the query. This should avoid errors where you hit limits of concurrent Athena queries (max 4) when using `refresh_all_table_partitions` (from DatabaseMeta class).

3.1.0

Added

- Two new input arguments to GlueJob method function `wait_for_completion`.
- Input `back_off_retries` now is the number of retries to boto API to avoid Throttling Error. Retries are done with exponential back off.
- `cleanup_if_successful` will delete the glue job if the `wait_for_completion` doesn't raise an error. i.e. Glue job completes successfully.

3.0.0

Added

- Fixed issue [91](https://github.com/moj-analytical-services/etl_manager/issues/91) and [92](https://github.com/moj-analytical-services/etl_manager/issues/92)
- Improved python format
- Refactored to Python 3.6
- Fixed unknown issue where arguments passed into function were not copied (same memory location)

2.2.1

Change

- Updated output from `GlueJob` class function `wait_for_job_completion()` (when verbose is set to True), now states how long Glue has been running the job.

Page 4 of 6

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.