Mlagents

Latest version: v1.1.0

Safety actively analyzes 707009 Python packages for vulnerabilities to keep your Python projects secure.

Page 13 of 14

0.3.0a

Fixes
Added the missing Ray Perception components to the agents in the BananaImitation scene.

0.2.1c

Fixes
* Require TensorFlow 1.4 to prevent incompatibilities between models built using TensorFlow 1.5 and current TensorFlowSharp bindings.

0.2.1b

Fixes & Performance Improvements
* [Python] Fixes a bug that prevented the creation of network graphs which did not contain visual observations.

0.2.1a

Features
* [Python] Adds support for training brains with multiple visual observations using PPO. Thanks to asolano for contributing this!

0.2.0

Environments

* Four new example environments added ([learn more](../master/docs/Learning-Environment-Examples.md)):
* Crawler
* Reacher
* Wall Area
* Push Area

* Environments no longer use normalized state values due to optional auto-normalizing done in PPO.

Features

Communication API Updated. Be sure both Unity project files and Python api are most current version.

Python

* PPO now optionally auto-normalizes states using running-average and running-variance (with `--normalize` flag).
* unityagents package now includes Curriculum Learning support ([learn more](../master/docs/Training-Curriculum-Learning.md)).
* Absolute path to training environments can now be used when running `UnityEnvironment()`.
* The Environment now logs errors and exceptions on the Unity side into the `unity-environment.log` file.

Unity

* New more flexible Monitor which allows for displaying arbitrary information ([learn more](../master/docs/Feature-Monitor.md)).
* Broadcast support for internal, heuristic, and player brains which allows all relevant agent information to be sent to python-side for supervised/imitation learning ([learn more](../master/docs/Learning-Environment-Design-Brains.md)).

Bug Fixes & Performance Improvements

Python

* Communication code now supports arbitrarily large observation cameras and states.

Unity

* Cumulative reward now accurately tracks reward.
* `AcademyReset()` now called before agent reset.
* `isInference` is now correctly set when running in Editor.
* Frame-rate is unlocked by default when in `isInference` is false.

Mlagents

Page 13 of 14

0.3.0a

0.2.1c

0.2.1b

0.2.1a

0.2.0

0.2.0preview

Page 13 of 14

Links

Releases