Mo-gymnasium

Latest version: v1.1.0

Safety actively analyzes 624665 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 3

1.1.0

Environments
* Add new MuJoCo environments by LucasAlegre in https://github.com/Farama-Foundation/MO-Gymnasium/pull/87
* Add mirror DST env by ffelten in https://github.com/Farama-Foundation/MO-Gymnasium/pull/79

Other improvements and utils
* Use .unwrapped to access reward_space by LucasAlegre in https://github.com/Farama-Foundation/MO-Gymnasium/pull/77
* Add rendering for fruit_tree env by tomekster in https://github.com/Farama-Foundation/MO-Gymnasium/pull/81

Documentation
* Group environments by type in docs by LucasAlegre in https://github.com/Farama-Foundation/MO-Gymnasium/pull/83
* Add mirrored DST to docs by ffelten in https://github.com/Farama-Foundation/MO-Gymnasium/pull/80
* Update citations by LucasAlegre in https://github.com/Farama-Foundation/MO-Gymnasium/pull/86

Bug fixes
* unpin mujoco by Kallinteris-Andreas in https://github.com/Farama-Foundation/MO-Gymnasium/pull/84

**Full Changelog**: https://github.com/Farama-Foundation/MO-Gymnasium/compare/v1.0.1...v1.1.0

1.0.1

Environments
* Add pygame render to breakable-bottles by LucasAlegre in https://github.com/Farama-Foundation/MO-Gymnasium/pull/75

Wrapper
* Add MOMaxAndSkipObservation Wrapper by LucasAlegre in https://github.com/Farama-Foundation/MO-Gymnasium/pull/76

Other improvements and utils
* Modify LinearReward to return reward weights as part of info_dict by ianleongudri in https://github.com/Farama-Foundation/MO-Gymnasium/pull/69
* Add warning for order of wrapping in the MORecordEpisodeStatistics Wrapper by ffelten in https://github.com/Farama-Foundation/MO-Gymnasium/pull/70
* Support Gymnasium 0.29 by LucasAlegre in https://github.com/Farama-Foundation/MO-Gymnasium/pull/73


Documentation
* Add tuto for custom env creation by ffelten in https://github.com/Farama-Foundation/MO-Gymnasium/pull/72

Bug fixes
* Fix test worker by ffelten in https://github.com/Farama-Foundation/MO-Gymnasium/pull/67
* Fix PF and CCS computation of minecart-deterministic-v0 by LucasAlegre in https://github.com/Farama-Foundation/MO-Gymnasium/pull/74

**Full Changelog**: https://github.com/Farama-Foundation/MO-Gymnasium/compare/v1.0.0...v1.0.1

1.0.0

We are thrilled to introduce the mature release of [MO-Gymnasium](https://mo-gymnasium.farama.org/), a standardized API and collection of environments designed for Multi-Objective Reinforcement Learning (MORL).

MORL expands the capabilities of RL to scenarios where agents need to optimize multiple objectives, which may potentially conflict with each other. Each objective is represented by a distinct reward function. In this context, the agent learns to make trade-offs between these objectives based on a reward vector received after each step. For instance, in the well-known Mujoco halfcheetah environment, reward components are combined linearly using predefined weights as shown in the following code snippet from [Gymnasium](https://github.com/Farama-Foundation/Gymnasium/blob/main/gymnasium/envs/mujoco/half_cheetah_v4.py#LL201C9-L206C44):

python
ctrl_cost = self.control_cost(action)
forward_reward = self._forward_reward_weight * x_velocity
reward = forward_reward - ctrl_cost


With MORL, users have the flexibility to determine the compromises they desire based on their preferences for each objective. Consequently, the environments in MO-Gymnasium do not have predefined weights. Thus, MO-Gymnasium extends the capabilities of [Gymnasium](https://gymnasium.farama.org/) to the multi-objective setting, where the agents receives a vectorial reward.

For example, here is an illustration of the multiple policies learned by an MORL agent for the `mo-halfcheetah` domain, balancing between saving battery and speed:

<img src="https://github.com/Farama-Foundation/MO-Gymnasium/assets/11799929/10796cae-6f84-4690-8e17-d23f792c32c2" width=400 />

This release marks the first mature version of MO-Gymnasium within Farama, indicating that the API is stable, and we have achieved a high level of quality in this library.

API
python
import gymnasium as gym
import mo_gymnasium as mo_gym
import numpy as np

It follows the original Gymnasium API ...
env = mo_gym.make('minecart-v0')

obs, info = env.reset()
but vector_reward is a numpy array!
next_obs, vector_reward, terminated, truncated, info = env.step(your_agent.act(obs))

Optionally, you can scalarize the reward function with the LinearReward wrapper.
This allows to fall back to single objective RL

0.3.4

Changelogs
Environments
* Add new pixel art rendering for `deep-sea-treasure-v0`, `resource-gathering-v0` and `water-reservoir-v0` by LucasAlegre in https://github.com/Farama-Foundation/MO-Gymnasium/pull/41
* Add `pareto_front` function to get known optimal front in DST, Minecart and Resource Gathering by LucasAlegre and ffelten in https://github.com/Farama-Foundation/MO-Gymnasium/pull/45, https://github.com/Farama-Foundation/MO-Gymnasium/pull/43;
* Add `deep-sea-treasure-concave-v0` by ffelten in https://github.com/Farama-Foundation/MO-Gymnasium/pull/43

Utils
* Moved evaluation utils to [MORL-Baselines](https://github.com/LucasAlegre/morl-baselines) by ffelten in https://github.com/Farama-Foundation/MO-Gymnasium/pull/47

Documentation
* Improve documentation and README by LucasAlegre in https://github.com/Farama-Foundation/MO-Gymnasium/pull/40
* Create docs/README.md to link to a new CONTRIBUTING.md for docs by mgoulao in https://github.com/Farama-Foundation/MO-Gymnasium/pull/42
* Enable documentation versioning and release notes in website by mgoulao in https://github.com/Farama-Foundation/MO-Gymnasium/pull/46

New Contributors
* mgoulao made their first contribution in https://github.com/Farama-Foundation/MO-Gymnasium/pull/42

**Full Changelog**: https://github.com/Farama-Foundation/MO-Gymnasium/compare/v0.3.3...0.3.4

0.3.3

New improvements/features

* Add EzPickle to all envs by ffelten in https://github.com/Farama-Foundation/MO-Gymnasium/pull/34
* Automatic generation of tests by LucasAlegre in https://github.com/Farama-Foundation/MO-Gymnasium/pull/37

Bugs fixed

* Fix highway env observation conversion by LucasAlegre in https://github.com/Farama-Foundation/MO-Gymnasium/pull/33
* Fix bug in eval_mo which was passing None to all weight vectors
* Fix minecart and water-reservoir ObservationSpace dtype and bounds

Documentation
* Improve documentation and readme by LucasAlegre in https://github.com/Farama-Foundation/MO-Gymnasium/pull/35

**Full Changelog**: https://github.com/Farama-Foundation/MO-Gymnasium/compare/0.3.2...v0.3.3

0.3.2

Bug fixes

* Bump highway-env version, to fix rendering
* Add assets to the pypi release package

Documentation
* Add gifs to the webpage

**Full Changelog**: https://github.com/Farama-Foundation/MO-Gymnasium/compare/0.3.1...0.3.2

Page 1 of 3

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.