Releases: araffin/sbx
v0.25.0: Increase Jax version range (using tf-nightly)
What's Changed
Full Changelog: v0.24.0...v0.25.0
v0.24.0: Add Python 3.13 support, drop Python 3.9
What's Changed
Full Changelog: v0.23.0...v0.24.0
SBX v0.23.0: CNN support for PPO
What's Changed
- Add CnnPolicy to PPO by @paulo101977 in #81
New Contributors
- @paulo101977 made their first contribution in #81
Full Changelog: v0.22.0...v0.23.0
v0.22.0: n-step returns support for off-policy algorithms via the `n_steps` argument
What's Changed
Full Changelog: v0.21.0...v0.22.0
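For readers unfamiliar with n-step returns, here is a minimal sketch of the textbook definition in plain Python. It is not taken from the SBX code: the function name `n_step_return` and its parameters are hypothetical, and how SBX computes the targets when `n_steps` is set may differ.

```python
def n_step_return(rewards, dones, bootstrap_value, gamma=0.99):
    """Discounted sum of up to n rewards plus a bootstrapped tail value.

    Hypothetical helper for illustration only (not part of SBX).
    rewards, dones: sequences of length n, starting at the sampled transition.
    bootstrap_value: value estimate for the state reached after the n-th step.
    """
    total, discount = 0.0, 1.0
    for reward, done in zip(rewards, dones):
        total += discount * reward
        discount *= gamma
        if done:
            # Episode ended inside the window: do not bootstrap past termination.
            return total
    # No termination within n steps: add the discounted value estimate.
    return total + discount * bootstrap_value
```

With `n = 1` this reduces to the usual one-step target `r + gamma * V(s')`.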
v0.21.0: KL Adaptive LR for PPO and learning rate schedule for SAC/TQC
What's Changed
Full Changelog: v0.20.0...v0.21.0
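As an illustration of what a KL-adaptive learning rate typically does, the sketch below shrinks the step size when the measured KL divergence overshoots a target and grows it when the policy barely moves. The function name, the 2x / 0.5x thresholds, the factor of 1.5, and the clipping bounds are all assumptions chosen for the example, not values confirmed by this release.

```python
def kl_adaptive_lr(lr, approx_kl, target_kl, factor=1.5, min_lr=1e-5, max_lr=1e-2):
    """Return an adjusted learning rate based on the last update's approximate KL.

    Hypothetical helper: thresholds, factor and bounds are illustrative assumptions.
    """
    if approx_kl > 2.0 * target_kl:
        # Policy changed too much: take smaller steps.
        lr = max(lr / factor, min_lr)
    elif approx_kl < 0.5 * target_kl:
        # Policy barely changed: take larger steps.
        lr = min(lr * factor, max_lr)
    return lr
```

Inside the dead band between the two thresholds the learning rate is left unchanged, which avoids oscillating on every update.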
v0.20.0: Hotfix for PPO with un-normalized env, `net_arch` support for PPO, additional fixes
What's Changed
- Update PPO to support `net_arch`, and additional fixes by @araffin in #65
  - Fixed entropy coeff wrongly logged for SAC and derivatives.
  - Fixed PPO `predict()` for envs that were not normalized (action spaces with limits != [-1, 1])
  - PPO now logs the standard deviation
Full Changelog: v0.19.0...v0.20.0
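The `predict()` fix concerns environments whose action bounds differ from [-1, 1]. The release note does not describe the fix itself, so the following is only a sketch of the standard linear rescaling between a normalized action and the environment bounds; `unscale_action` is a hypothetical helper written for this example.

```python
def unscale_action(scaled_action, low, high):
    """Map each action component from [-1, 1] to its [low, high] bounds.

    Hypothetical helper for illustration; not the SBX implementation.
    """
    return [
        lo + 0.5 * (a + 1.0) * (hi - lo)
        for a, lo, hi in zip(scaled_action, low, high)
    ]
```

For example, with bounds [0, 2] a normalized action of 0 maps to the midpoint 1. When the bounds are already [-1, 1] the mapping is the identity, which is why a missing or misapplied rescale would only show up on un-normalized envs.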
v0.19.0: SimBa Policy: Simplicity Bias for Scaling Up Parameters in DRL
What's Changed
- Add SimBa Policy: Simplicity Bias for Scaling Up Parameters in DRL by @araffin in #59
- Cleanups and update minimum version to Python 3.9
Full Changelog: v0.18.0...v0.19.0
SBX v0.18.0: Bug fix for SAC, optimize log of ent coeff to be consistent with SB3
What's Changed
- Optimize the log of the entropy coeff instead of the entropy coeff by @jamesheald in #56
New Contributors
- @jamesheald made their first contribution in #56
Full Changelog: v0.17.0...v0.18.0
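To make the difference between the two objectives concrete, here is a small sketch comparing one gradient step on the log of the entropy coefficient under each loss. It assumes the parameter being learned is `log_ent_coef` in both cases and that the losses have the form shown in the comments; the function name and the plain gradient-descent update are hypothetical simplifications, not the SBX code.

```python
import math


def ent_coef_step(log_ent_coef, entropy, target_entropy, lr=0.1, optimize_log=True):
    """One gradient-descent step on log_ent_coef (illustrative sketch only)."""
    gap = entropy - target_entropy
    if optimize_log:
        # Assumed loss: log_ent_coef * gap, so d(loss)/d(log_ent_coef) = gap.
        grad = gap
    else:
        # Assumed loss: exp(log_ent_coef) * gap, so the gradient is scaled by ent_coef.
        grad = math.exp(log_ent_coef) * gap
    return log_ent_coef - lr * grad
```

Under these assumptions, the log form gives an update whose size does not depend on the current coefficient, whereas the other form shrinks the update as the coefficient approaches zero.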
SBX v0.17.0: CNN support for DQN
What's Changed
- Fix warning and remove DroQ class in favor of SAC config by @araffin in #47
- Add CNN support for DQN by @araffin in #49
Full Changelog: v0.15.0...v0.17.0
SBX v0.15.0: Hotfix for off-policy algorithms, the pseudo-random key was not updated
Note
No performance difference should be expected (see report in #46). This bug was introduced in v0.11.0.
What's Changed
- Support for setting the target entropy by @jan1854 in #43
- Hotfix - Return the new updated key in function _train by @theovincent in #46
New Contributors
- @theovincent made their first contribution in #46
Full Changelog: v0.13.0...v0.15.0