Mrjob

Latest version: v0.7.4

0.5.12

* dropped support for Python 2.6 and 3.3
* termination protection doesn't make terminate-idle-clusters crash (1802)
* mrjob.parse.parse_s3_uri() handles s3a:// URIs (1709)
* mins_to_end_of_hour option defaults to 60.0, disabling it (1808)
* always use str in environment dictionaries (affects Python 2 on Windows)
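
As a rough illustration of the s3a:// change (1709), the sketch below splits an S3-style URI into bucket and key using only the standard library. It is a stand-in, not mrjob's own code: the helper name `split_s3_uri`, the tuple of accepted schemes, and the example bucket are all assumptions made for illustration.

```python
from urllib.parse import urlparse

# Schemes treated as S3 in this sketch (an assumption, not mrjob's list)
S3_SCHEMES = ('s3', 's3n', 's3a')


def split_s3_uri(uri):
    """Hypothetical helper: split an S3-style URI into (bucket, key)."""
    parts = urlparse(uri)
    if parts.scheme not in S3_SCHEMES:
        raise ValueError('not an S3 URI: %r' % uri)
    # netloc holds the bucket; drop the single leading "/" from the path
    return parts.netloc, parts.path[1:]


bucket, key = split_s3_uri('s3a://example-bucket/logs/part-00000')
```

Code that only accepts `s3://` would reject the URI above, which is the kind of failure the release note describes fixing.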

0.5.11

* report-long-jobs tool can exclude jobs based on tag (1636)
* mrjob won't crash when inspecting instance fleet clusters (1639)

0.5.10

* JSON protocols use rapidjson if ujson is unavailable (1579)
  * can also explicitly use RapidJSONProtocol, RapidJSONValueProtocol
* EMR runner:
  * aws_security_token option renamed to aws_session_token (1536)
* EMR and Dataproc runners:
  * bootstrapping mrjob no longer stalls if mrjob already installed (1567)
  * master bootstrap script has correct extension: .sh, not .py (1504)
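
The JSON fallback described in (1579) follows a common pattern: try the fastest library first and fall back to the next one. The sketch below shows that pattern with the standard library only. The function name `pick_json_module` and the candidate order are assumptions, and this is not how mrjob itself is written.

```python
import importlib


def pick_json_module(candidates=('ujson', 'rapidjson', 'json')):
    """Hypothetical helper: return the first importable JSON module."""
    for name in candidates:
        try:
            return importlib.import_module(name)
        except ImportError:
            continue  # not installed, try the next candidate
    raise ImportError('no JSON module found among %r' % (candidates,))


json_impl = pick_json_module()
line = json_impl.dumps({'word': 'mrjob', 'count': 3})
record = json_impl.loads(line)
```

Because the standard library's `json` is last in the list, the lookup succeeds even when neither optional package is installed.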

0.5.9

* fixes which affect Docker:
  * task_python_bin option, used by tasks but not setup script (1394)
  * local mode references mrjob/cat.py by relative path, not absolute (1540)
* EMR runner:
  * re-launch SSH tunnel when cluster pooling auto-recovers (1549)
  * get job progress using `ssh curl` when tunnel is unavailable (1547)
  * work around `sh -e` setup script bug on AMI 5.2.0+ (1548)
  * renamed emr_applications option to "applications" (1420)
  * small fix to terminate-idle-cluster command's S3 "locking" code (1545)

0.5.8

* automatically tarball and upload directories with --dir, setup hooks (23)
* specify path for inter-step output with --step-output-dir (263)
* jobs:
  * better --help printout
  * deprecated option groups in MRJobs
  * deprecated MRJob.get_all_option_groups()
  * overriding *_pre_filter() methods in MRJob works again (1521)
  * all step types accept jobconf (1447)
  * quieted warning about SORT_VALUES on Hadoop 2 (1286)
* all runners:
  * wrap tasks that require pipes with sh_bin, not bash (1330)
* local runner:
  * allows non-zero exit status from pre-filters (1524)
  * pre-filters can now handle compressed input (1061)
* EMR runner:
  * fetch logs from task nodes as well as core nodes (1400)
  * use ListInstances rather than dfsadmin to get node list (1345)
* moved mrjob.util.bunzip2_stream() to mrjob.cat
* moved mrjob.util.gunzip_stream() to mrjob.cat
* mrjob.util.parse_and_save_options() now returns dict, not defaultdict
* deprecated:
  * mrjob.util.args_for_opt_dest_subset()
  * mrjob.util.bash_wrap()
  * mrjob.util.populate_option_groups_with_options()
  * mrjob.util.scrape_options_and_index_by_dest()
  * mrjob.util.tar_and_gz()
  * SSHFilesystem.ssh_slave_hosts()
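
The switch from defaultdict to dict for parse_and_save_options() can matter to callers that index missing keys. The sketch below shows the general difference using plain standard-library objects. The key `'runner'` and the variable names are made up for illustration, and nothing here calls mrjob.

```python
from collections import defaultdict

# Stand-ins for a "saved options" mapping, before and after the change
old_style = defaultdict(list)
new_style = {}

# Indexing a missing key on a defaultdict silently creates an empty entry
old_style['runner']
created_by_lookup = 'runner' in old_style

# A plain dict raises KeyError instead, so use .get() with a default
try:
    new_style['runner']
    raised = False
except KeyError:
    raised = True

value = new_style.get('runner', [])
```

Code written against the older return type may therefore need `.get()` or an explicit membership check.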

0.5.7

* EMR and Hadoop runners:
  * full support for Spark (1320)
    * includes spark() method in MRJob and SparkStep/SparkScriptStep
  * can use environment variables and ~ in hadoop_streaming_jar option
* EMR runner:
  * default AMI version is now 4.8.2 (1486)
  * default instance type is m1.large when running Spark jobs (1465)
  * added debug logging for matching available pooled clusters (1449)
  * defaults to cheapest instance type that will work (1369)
  * master bootstrap script always created when pooling
  * no longer crashes when trying to use missing ssh binary (1474)
  * pooled clusters may have 1000 steps (1463)
  * failed jobs no longer reported as 100% complete (793)
* All runners:
  * py_files option for Spark and streaming steps (1375)
  * bootstrap mrjob with a .zip rather than a tarball
  * options refactor, added missing command-line switches (1439)
  * mrjob terminate-idle-clusters works with all step types (1363)
* log interpretation:
  * dropped unnecessary container-to-attempt-ID mapping (1487)
  * more efficient search for task log errors (1450)
  * cleaner error messages when bootstrapped mrjob won't compile
* JarSteps:
  * now support libjars, jobconf (1481)
  * JarStep.{INPUT,OUTPUT} are deprecated (use mrjob.step.{INPUT,OUTPUT})
* is_uri() now only matches URIs containing "://" (1455)
* works in Anaconda3 Jupyter Notebook (1441)
* deprecated mrjob.parse.is_windows_path()
* deprecated mrjob.parse.parse_key_value_list()
* deprecated mrjob.parse.parse_port_range_list()
* deprecated mrjob.util.scrape_options_into_new_groups()
* deprecated non-strict protocols (1452)
* deprecated python_archives (1056)
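
One plausible reason for the is_uri() change (1455) is that a scheme-only check mistakes Windows paths such as `C:\data\input.txt` for URIs. The sketch below contrasts the two checks using the standard library. Both function names are hypothetical and neither is mrjob's implementation.

```python
from urllib.parse import urlparse


def has_scheme(path):
    """Looser check: true for anything urlparse finds a scheme in."""
    return bool(urlparse(path).scheme)


def looks_like_uri(path):
    """Stricter check in the spirit of (1455): require '://'."""
    return '://' in path


windows_path = 'C:\\data\\input.txt'
s3_uri = 's3://example-bucket/input.txt'

loose = (has_scheme(windows_path), has_scheme(s3_uri))
strict = (looks_like_uri(windows_path), looks_like_uri(s3_uri))
```

With the stricter check, the Windows path is treated as a local path while the S3 URI is still recognized.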

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.