Mrjob

Latest version: v0.7.4



0.2.5

* Added hadoop_input/output_format options
* You can now specify a custom Hadoop streaming jar (hadoop_streaming_jar)
* Extra args to hadoop now come before -mapper/-reducer on EMR, so
  that e.g. -libjar will work (this has worked in hadoop mode since v0.2.2)
* hadoop mode now supports s3n:// URIs (Issue 53)
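
The ordering change matters because Hadoop streaming expects generic options to appear before streaming options such as -mapper and -reducer. A minimal sketch of that ordering follows. The helper name build_streaming_args and its parameters are hypothetical and are not part of mrjob's API, and the example spells the generic option as -libjars, which is an assumption about the Hadoop flag intended here.

```python
def build_streaming_args(streaming_jar, extra_args, mapper, reducer,
                         input_paths, output_path):
    """Assemble a Hadoop streaming argument list.

    Hypothetical helper for illustration only; not mrjob's implementation.
    """
    args = ["hadoop", "jar", streaming_jar]
    # Extra (generic) args go first, ahead of any streaming options.
    args.extend(extra_args)
    for path in input_paths:
        args.extend(["-input", path])
    args.extend(["-output", output_path])
    # Streaming options come last.
    args.extend(["-mapper", mapper, "-reducer", reducer])
    return args


example_args = build_streaming_args(
    "hadoop-streaming.jar",          # assumed jar name
    ["-libjars", "extra.jar"],       # assumed extra args
    "python job.py --mapper",
    "python job.py --reducer",
    ["s3n://example-bucket/input/"],  # assumed paths
    "s3n://example-bucket/output/",
)
```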

0.2.4

* Fix bootstrapping of mrjob in hadoop and local mode (Issue 89)
* SSH tunnels try to use the same port for the same job flow (Issue 67)
* Added mr_postfix_bounce and mr_pegasos_svm to examples.
* Retry on spurious 505s from EMR API
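
One way to get the "same port for the same job flow" behavior is to derive the port from the job flow ID with a stable hash. The sketch below is only an illustration of that idea. The function name tunnel_port_for, the port range, and the use of CRC32 are all assumptions and do not describe how mrjob actually picks ports.

```python
import zlib


def tunnel_port_for(job_flow_id, first_port=40001, num_ports=840):
    """Map a job flow ID to a stable port in
    [first_port, first_port + num_ports).

    Hypothetical helper; the port range is an assumed default.
    """
    # zlib.crc32 gives the same value on every run, unlike the built-in
    # hash() on strings, which is randomized per process.
    offset = zlib.crc32(job_flow_id.encode("utf-8")) % num_ports
    return first_port + offset


port_a = tunnel_port_for("j-EXAMPLE12345")  # made-up job flow ID
port_b = tunnel_port_for("j-EXAMPLE12345")
```

A real runner would still need a fallback for the case where the preferred port is already in use, which is presumably why the changelog says tunnels "try" to reuse the port.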

0.2.3

* Fix incompatibility with boto 2.0b4 (Issue 91)

0.2.2

* Use POST requests for most EMR queries (EMR was choking on large GETs)
* find_probable_cause_of_failure() ignores transient errors (Issue 31)
* --hadoop-arg now actually works (Issue 79)
* On Hadoop, extra args are added first, so you can set e.g. -libjar
* S3 buckets may now have . in their names
* MRJob scripts now respect --quiet (Issue 84)
* Added --no-output option for MRJob scripts (Issue 81)
* Added --python-bin option (Issue 54)
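
For the bucket-name fix, the sketch below shows one way to split an S3 URI into bucket and key without rejecting dots in the bucket name. It uses only the standard library. The helper name parse_s3_uri is hypothetical and is not mrjob's own parsing function, and the bucket and key in the example are made up.

```python
from urllib.parse import urlparse


def parse_s3_uri(uri):
    """Split an s3:// or s3n:// URI into (bucket, key).

    Hypothetical helper for illustration; not part of mrjob.
    """
    parsed = urlparse(uri)
    if parsed.scheme not in ("s3", "s3n"):
        raise ValueError("not an S3 URI: %r" % uri)
    # netloc is everything between '//' and the next '/', so dots in the
    # bucket name are preserved as-is.
    return parsed.netloc, parsed.path.lstrip("/")


bucket, key = parse_s3_uri("s3://my.bucket.name/logs/part-00000")
```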

0.2.1

* Don't assume EMR sets laststatechangereason

0.2.0

* New Features/Changes:
  * EMRJobRunner now prints % of mappers and reducers completed when you
    enable the SSH tunnel.
  * Added mr_page_rank example
  * Added mrjob.tools.emr.audit_usage script (Issue 21)
  * You can specify alternate job owners with the "owner" option. Useful for
    auditing usage. (Issue 59)
  * The job_name_prefix option has been renamed to label (the old name still
    works but is deprecated)
  * bootstrap_cmds and bootstrap_scripts no longer automatically invoke sudo
* Bugs Fixed/Cleanup:
  * bootstrap files no longer get uploaded to S3 twice (Issue 8)
  * When using add_file_option(), show_steps() can now see the local version
    of the file (Issue 45)
  * Now works on Windows (Issue 46)
  * No longer requires external jar, tar, or zip binaries (Issue 47)
  * mrjob-* scratch bucket is only created as needed (Issue 50)
  * Can now specify us-east-1 region explicitly (Issue 58)
  * mrjob.tools.emr.terminate_idle_job_flows leaves Hive jobs alone (Issue 60)
