Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Releases: araffin/sbx

v0.25.0: Increase Jax version range (using tf-nightly)

21 Dec 18:13

Choose a tag to compare

What's Changed

  • Increase Jax version range and use tf-nightly by @araffin in #84

Full Changelog: v0.24.0...v0.25.0

v0.24.0: Add Python 3.13 support, drop Python 3.9

05 Dec 18:10
1d2da71

Choose a tag to compare

What's Changed

  • Add Python 3.13 support, drop Python 3.9 by @araffin in #83

Full Changelog: v0.23.0...v0.24.0

SBX v0.23.0: CNN support for PPO

29 Sep 17:21

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.22.0...v0.23.0

v0.22.0: n-step returns support for off-policy algorithms via the `n_steps` argument

25 Jul 09:35
1e5e433

Choose a tag to compare

What's Changed

  • Add n-step returns support via the n_steps argument by @araffin in #74

Full Changelog: v0.21.0...v0.22.0

v0.21.0: KL Adaptive LR for PPO and learning rate schedule for SAC/TQC

19 May 10:47
849e908

Choose a tag to compare

What's Changed

  • KL Adaptive LR for PPO and LR schedule for SAC/TQC by @araffin in #72

Full Changelog: v0.20.0...v0.21.0

v0.20.0: Hotfix for PPO with un-normalized env, `net_arch` support for PPO, additional fixes

14 Feb 11:09
8238fcc

Choose a tag to compare

What's Changed

  • Update PPO to support net_arch, and additional fixes by @araffin in #65
  • fixed entropy coeff wrongly logged for SAC and derivatives.
  • fixed PPO predict() for env that were not normalized (action spaces with limits != [-1, 1])
  • PPO now logs the standard deviation

Full Changelog: v0.19.0...v0.20.0

v0.19.0: SimBa Policy: Simplicity Bias for Scaling Up Parameters in DRL

15 Jan 14:49
9cad1d0

Choose a tag to compare

What's Changed

  • Add SimBa Policy: Simplicity Bias for Scaling Up Parameters in DRL by @araffin in #59
  • Cleanups and update min version to python 3.9

Full Changelog: v0.18.0...v0.19.0

SBX v0.18.0: Bug fix for SAC, optimize log of ent coeff to be consistent with SB3

01 Nov 07:48
1c79684

Choose a tag to compare

What's Changed

  • Optimize the log of the entropy coeff instead of the entropy coeff by @jamesheald in #56

New Contributors

Full Changelog: v0.17.0...v0.18.0

SBX v0.17.0: CNN support for DQN

11 Jul 12:07
19c85a1

Choose a tag to compare

What's Changed

  • Fix warning and remove DroQ class in favor of SAC config by @araffin in #47
  • Add CNN support for DQN by @araffin in #49

Full Changelog: v0.15.0...v0.17.0

SBX v0.15.0: Hotfix for offpolicy algorithms, the pseudo random key was not updated

12 Apr 12:02
42caa65

Choose a tag to compare

Note

No performance difference should be expected (See report in #46), this bug was introduced in v0.11.0.

What's Changed

  • Support for setting the target entropy by @jan1854 in #43
  • Hotfix - Return the new updated key in function _train by @theovincent in #46

New Contributors

Full Changelog: v0.13.0...v0.15.0