Rl-toolkit

Latest version: v4.1.1

Safety actively analyzes 641872 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 2

4.1.1

- update default `config.yaml`

4.1.0

Features 🔊
- .fit()
- AgentCallback

4.0.0

Features 🔊
- Render environments to WanDB
- Grouping of runs in WanDB
- SampleToInsertRatio rate limiter
- Global Gradient Clipping to avoid exploding gradients
- Softplus for numerical stability
- YAML configuration file
- LogCosh instead of Huber loss
- Critic network with Add layer applied on state & action branches
- Custom uniform initializer
- XLA (Accelerated Linear Algebra) compiler
- Optimized Replay Buffer (https://github.com/deepmind/reverb/issues/90)
- split into **Agent**, **Learner**, **Tester** and **Server**
Bug fixes 🛠️
- Fixed creating of saving path for models
- Fixed model's `summary()`

3.2.4

Features 🔊
- Reverb
- `setup.py` (package is available on PyPI)
- split into **Agent**, **Learner** and **Tester**
- Use custom model and layer for defining Actor-Critic
- MultiCritic - concatenating multiple critic networks into one network
- Truncated Quantile Critics

2.0.2

Features 🔊
- update Dockerfile
- update `README.md`
- formatted code by Black & Flake8

2.0.1

Bug fixes 🛠️
- fixed Critic model

Page 1 of 2

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.