Tmrl

Latest version: v0.6.5

Safety actively analyzes 626208 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 5

0.6.5

Release 0.6.5 removes the broken dependency on the `keyboard` library.

If you wish to manually decide where to start and stop reward recording in TrackMania, install the `keyboard` library manually (`pip install keyboard`) and use the `--use-keyboard` modifier on top of `--record-reward`. Then you can press `e` to start recording and `q` to stop recording. If you don't use this modifier, reward recording starts when you launch the script (wait for the confirmation to display before you start driving) and ends when you cross the finish line.

0.6.4

Fixes https://github.com/trackmania-rl/tmrl/issues/94

0.6.3

This release fixes TrackMania tools for Linux.

- `--check-environment` now works for LIDAR on Linux
- `--record-reward` does not use `keyboard` anymore on Linux

0.6.2

Release `0.6.2` introduces a mechanism for automatically healing the `tmrl` installation when the `TmrlData` folder when it is missing, as pip sometimes silently fails to download it during installation.

0.6.1

- Same as `0.6.1` but fixed the PyPI installer.

0.6.0

This release introduces support for non-real-time environments in the TMRL library, and support for Linux in the TrackMania pipeline.

Version `0.6.0` is backward-incompatible and requires a clean installation.

Major changes

- TrackMania example pipeline (see `config.json`):
- Support for Linux
- Support for saving replays automatically
- Support for reward shaping
- TMRL library:
- Support for non-real-time environments and Trainer/Worker synchronization
- Generic training pipeline (in particular, introduced a generic `Memory` class for lazy developers, compatible with random sampling in 1-step TD learning)

Minor changes
- TrackMania example pipeline (see `config.json`):
- More fine-tuning options for SAC
- Set the default Adam Betas to the RL-compatible setting described by [Mahmood et al. 2023](https://openreview.net/forum?id=m9Jfdz4ymO) in an attempt to avoid policy collapse
- The default hyperparameters changed for full vision-based training instead of LIDAR training

Page 1 of 5

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.