Stable-baselines3

Latest version: v2.6.0

Safety actively analyzes 723625 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 5

2.6.0

Breaking Changes:
- Upgraded to Stable-Baselines3 >= 2.6.0
- Renamed ``_dump_logs()`` to ``dump_logs()``

New Features:
- Added support for Gymnasium v1.1.0

Bug Fixes:
- Fixed issues with ``SubprocVecEnv`` and ``MaskablePPO`` by using ``vec_env.has_attr()`` (pickling issues, mask function not present)


**Full Changelog**: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/compare/v2.5.0...v2.6.0

2.5.0

Breaking changes:
- Upgraded to PyTorch 2.3.0
- Dropped Python 3.8 support
- Upgraded to Stable-Baselines3 >= 2.5.0


New Contributors
* kplers made their first contribution in https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/pull/266

**Full Changelog**: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/compare/v2.4.0...v2.5.0

2.4.1

Bug Fixes

- Fixed a bug introduced in v2.4.0 where the ``VecVideoRecorder`` would override videos


**Full Changelog**: https://github.com/DLR-RM/stable-baselines3/compare/v2.4.0...v2.4.1

2.4.0

Breaking Changes:
- Upgraded to Stable-Baselines3 >= 2.4.0

New Features:
- Added ``CrossQ`` algorithm, from "Batch Normalization in Deep Reinforcement Learning" paper (danielpalen)
- Added ``BatchRenorm`` PyTorch layer used in ``CrossQ`` (danielpalen)
- Added support for Gymnasium v1.0

Bug Fixes:

- Updated QR-DQN optimizer input to only include quantile_net parameters (corentinlger)
- Updated QR-DQN paper link in docs (corentinlger)
- Fixed a warning with PyTorch 2.4 when loading a `RecurrentPPO` model (You are using torch.load with weights_only=False)
- Fixed loading QRDQN changes `target_update_interval` (jak3122)


Others:
- Updated PyTorch version on CI to 2.3.1
- Remove unnecessary SDE noise resampling in PPO/TRPO update
- Switched to uv to download packages on GitHub CI

New Contributors
* corentinlger made their first contribution in https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/pull/252
* jak3122 made their first contribution in https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/pull/259
* danielpalen made their first contribution in https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/pull/243

**Full Changelog**: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/compare/v2.3.0...v2.4.0

2.3.2

Bug fixes
* Reverted ``torch.load()`` to be called ``weights_only=False`` as it caused loading issue with old version of PyTorch. https://github.com/DLR-RM/stable-baselines3/pull/1913
* Cast learning_rate to float lambda for pickle safety when doing model.load by markscsmith in https://github.com/DLR-RM/stable-baselines3/pull/1901

Documentation
* Fix typo in changelog by araffin in https://github.com/DLR-RM/stable-baselines3/pull/1882
* Fixed broken link in ppo.rst by chaitanyabisht in https://github.com/DLR-RM/stable-baselines3/pull/1884
* Adding ER-MRL to community project by corentinlger in https://github.com/DLR-RM/stable-baselines3/pull/1904
* Fix tensorboad video slow numpy->torch conversion by NickLucche in https://github.com/DLR-RM/stable-baselines3/pull/1910

New Contributors
* chaitanyabisht made their first contribution in https://github.com/DLR-RM/stable-baselines3/pull/1884
* markscsmith made their first contribution in https://github.com/DLR-RM/stable-baselines3/pull/1901
* NickLucche made their first contribution in https://github.com/DLR-RM/stable-baselines3/pull/1910

**Full Changelog**: https://github.com/DLR-RM/stable-baselines3/compare/v2.3.0...v2.3.2

2.3.0

model = QRDQN("MlpPolicy", env, learning_starts=100)


New Features:

- Added ``rollout_buffer_class`` and ``rollout_buffer_kwargs`` arguments to MaskablePPO
- Log success rate ``rollout/success_rate`` when available for on policy algorithms

Others:

- Fixed ``train_freq`` type annotation for tqc and qrdqn (Armandpl)
- Fixed ``sb3_contrib/common/maskable/*.py`` type annotations
- Fixed ``sb3_contrib/ppo_mask/ppo_mask.py`` type annotations
- Fixed ``sb3_contrib/common/vec_env/async_eval.py`` type annotations

Documentation:

- Add some additional notes about ``MaskablePPO`` (evaluation and multi-process) (icheered)

**Full Changelog**: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/compare/v2.2.1...v2.3.0

Page 1 of 5

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.