This release introduces support for non-real-time environments in the TMRL library, and support for Linux in the TrackMania pipeline.
Version `0.6.0` is backward-incompatible and requires a clean installation.
Major changes
- TrackMania example pipeline (see `config.json`):
- Support for Linux
- Support for saving replays automatically
- Support for reward shaping
- TMRL library:
- Support for non-real-time environments and Trainer/Worker synchronization
- Generic training pipeline (in particular, introduced a generic `Memory` class for lazy developers, compatible with random sampling in 1-step TD learning)
Minor changes
- TrackMania example pipeline (see `config.json`):
- More fine-tuning options for SAC
- Set the default Adam Betas to the RL-compatible setting described by [Mahmood et al. 2023](https://openreview.net/forum?id=m9Jfdz4ymO) in an attempt to avoid policy collapse
- The default hyperparameters changed for full vision-based training instead of LIDAR training