- Fix cartpole_swingup's `best_episode` being computed incorrectly when rewards are negative (beb163021f56eb065508441d9e4d859580c409dc). This change affects scores for this experiment, so we bump the minor version in keeping with semantic versioning.
- Fix versioning issues in Colab and `setup.py`.
- Improve `test.sh` coverage.
- Make `TAGS` accessible from `sweep` (e.g. `sweep.TAGS['exploration']` to retrieve all exploration experiments).
- Add a bootstrapped DQN baseline in JAX.
- Extract environments to their own package for easier direct access for debugging/development.
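
For context on the `best_episode` fix above, here is a minimal sketch of how this class of bug can arise. It assumes the running maximum was seeded with zero, which is one common cause and not necessarily what the referenced commit changed. The function names `best_episode_buggy` and `best_episode_fixed` are hypothetical and are not part of the library.

```python
import math


def best_episode_buggy(episode_returns):
    """Running maximum seeded with 0.0 (assumed cause, for illustration only)."""
    best = 0.0  # Wrong seed: never beaten when every return is negative.
    for episode_return in episode_returns:
        best = max(best, episode_return)
    return best


def best_episode_fixed(episode_returns):
    """Running maximum seeded with -inf, so negative returns are handled."""
    best = -math.inf
    for episode_return in episode_returns:
        best = max(best, episode_return)
    return best


# Hypothetical returns for an environment whose rewards are all negative.
returns = [-120.0, -45.5, -80.0]
buggy = best_episode_buggy(returns)
fixed = best_episode_fixed(returns)
```

With these inputs the zero-seeded version reports a best return that no episode actually achieved, while the `-inf`-seeded version reports the largest observed return. Any score derived from the best episode would shift accordingly, which is consistent with the version bump noted above.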