@araffin

What's Changed

Increase Jax version range and use tf-nightly by @araffin in #84

Full Changelog: v0.24.0...v0.25.0

@araffin

What's Changed

Add Python 3.13 support, drop Python 3.9 by @araffin in #83

Full Changelog: v0.23.0...v0.24.0

@paulo101977

What's Changed

Add CnnPolicy to PPO by @paulo101977 in #81

New Contributors

@paulo101977 made their first contribution in #81

Full Changelog: v0.22.0...v0.23.0

@araffin

What's Changed

Add n-step returns support via the n_steps argument by @araffin in #74
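As a rough illustration of what an n-step return is, the sketch below sums up to n discounted rewards and then bootstraps from a value estimate for the state reached afterwards. The function name `n_step_return` and its parameters are hypothetical and only serve this example; this is not the SBX implementation.

```python
def n_step_return(rewards, bootstrap_value, gamma=0.99, dones=None):
    """Discounted sum of up to n rewards plus a discounted bootstrap value.

    rewards: the n rewards r_t, ..., r_{t+n-1}
    bootstrap_value: value estimate for the state reached after n steps
    dones: optional flags, True at step k means the episode ended there
    """
    if dones is None:
        dones = [False] * len(rewards)
    total, discount = 0.0, 1.0
    for reward, done in zip(rewards, dones):
        total += discount * reward
        discount *= gamma
        if done:
            # Episode terminated: do not bootstrap past this step.
            return total
    return total + discount * bootstrap_value


# Three rewards of 1.0 with gamma=0.5, then bootstrap from a value of 10.0.
target = n_step_return([1.0, 1.0, 1.0], bootstrap_value=10.0, gamma=0.5)
print(target)
```

With a single reward this reduces to the usual one-step target, so the number of steps only controls how many real rewards are used before bootstrapping.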

Full Changelog: v0.21.0...v0.22.0

@araffin

What's Changed

KL Adaptive LR for PPO and LR schedule for SAC/TQC by @araffin in #72
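One common way to adapt a learning rate from the KL divergence is sketched below: shrink the learning rate when the measured KL is well above a target, and grow it when the KL is well below. The function `kl_adaptive_lr`, the factor of 1.5, the thresholds and the bounds are all assumptions made for illustration; the scheme actually used in SBX may differ.

```python
def kl_adaptive_lr(lr, kl, target_kl, factor=1.5, min_lr=1e-5, max_lr=1e-2):
    """Return an adjusted learning rate based on the measured KL divergence.

    All thresholds and bounds here are illustrative assumptions.
    """
    if kl > 2.0 * target_kl:
        # The policy changed too much: take smaller steps.
        return max(lr / factor, min_lr)
    if kl < 0.5 * target_kl:
        # The policy barely changed: take larger steps.
        return min(lr * factor, max_lr)
    # KL is close enough to the target: keep the learning rate.
    return lr


lr = 3e-4
for measured_kl in (0.05, 0.01, 0.001):
    lr = kl_adaptive_lr(lr, measured_kl, target_kl=0.01)
    print(measured_kl, lr)
```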

Full Changelog: v0.20.0...v0.21.0

@araffin

What's Changed

Update PPO to support net_arch, and additional fixes by @araffin in #65
Fixed the entropy coefficient being wrongly logged for SAC and its derivatives.
Fixed PPO predict() for environments that were not normalized (action spaces with limits other than [-1, 1]).
PPO now logs the standard deviation.

Full Changelog: v0.19.0...v0.20.0

@araffin

What's Changed

Add SimBa Policy: Simplicity Bias for Scaling Up Parameters in DRL by @araffin in #59
Cleanups and update minimum version to Python 3.9

Full Changelog: v0.18.0...v0.19.0

@jamesheald

What's Changed

Optimize the log of the entropy coeff instead of the entropy coeff by @jamesheald in #56
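To make the idea concrete, here is a minimal sketch of one gradient step on the log of the entropy coefficient. It assumes a loss of the form `-log_ent_coef * mean(log_prob + target_entropy)`, which is how SB3's SAC is commonly described; the function name `update_log_ent_coef` and the plain gradient-descent update are hypothetical and not taken from the SBX code.

```python
import math


def update_log_ent_coef(log_ent_coef, log_probs, target_entropy, lr=0.1):
    """One gradient-descent step on the log of the entropy coefficient.

    Assumed loss: -log_ent_coef * mean(log_prob + target_entropy),
    so d(loss)/d(log_ent_coef) = -mean(log_prob + target_entropy).
    """
    grad = -sum(lp + target_entropy for lp in log_probs) / len(log_probs)
    return log_ent_coef - lr * grad


log_ent_coef = 0.0  # corresponds to ent_coef = exp(0) = 1
# Entropy estimate (-log_prob = -2) is below the target (-1): coefficient grows.
log_ent_coef = update_log_ent_coef(log_ent_coef, [2.0, 2.0], target_entropy=-1.0)
ent_coef = math.exp(log_ent_coef)  # always positive by construction
print(log_ent_coef, ent_coef)
```

Because the coefficient is recovered with `exp`, it stays positive whatever value the optimizer gives to the log parameter.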

New Contributors

@jamesheald made their first contribution in #56

Full Changelog: v0.17.0...v0.18.0

@araffin

What's Changed

Fix warning and remove DroQ class in favor of SAC config by @araffin in #47
Add CNN support for DQN by @araffin in #49

Full Changelog: v0.15.0...v0.17.0

@jan1854

Note

No performance difference should be expected (see report in #46). The bug fixed in #46 was introduced in v0.11.0.

What's Changed

Support for setting the target entropy by @jan1854 in #43
Hotfix - Return the new updated key in function _train by @theovincent in #46
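The hotfix concerns a key that was not returned after being updated. The sketch below illustrates, under stated assumptions, why that matters with functional (JAX-style) pseudo-random keys: if a training function splits a key but does not hand the new key back, every call starts from the same key and draws the same randomness. The `split` helper is a toy stand-in built on `hashlib`, not `jax.random.split`, and both training functions are hypothetical.

```python
import hashlib


def split(key):
    """Toy stand-in for a functional PRNG split: returns (new_key, subkey)."""
    new_key = hashlib.sha256(key + b"next").digest()
    subkey = hashlib.sha256(key + b"sub").digest()
    return new_key, subkey


def train_step_buggy(key):
    # The updated key is discarded, so the caller keeps reusing the old one.
    _, subkey = split(key)
    return subkey


def train_step_fixed(key):
    # The updated key is returned so the caller can pass it to the next call.
    key, subkey = split(key)
    return key, subkey


key = b"seed"
# Buggy version: two calls with the same key give the same subkey.
print(train_step_buggy(key) == train_step_buggy(key))

# Fixed version: threading the key through gives a fresh subkey each call.
key, first = train_step_fixed(key)
key, second = train_step_fixed(key)
print(first == second)
```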

New Contributors

@theovincent made their first contribution in #46

Full Changelog: v0.13.0...v0.15.0

Releases: araffin/sbx

v0.25.0: Increase Jax version range (using tf-nightly)

v0.24.0: Add Python 3.13 support, drop Python 3.9

SBX v0.23.0: CNN support for PPO

v0.22.0: n-step returns support for off-policy algorithms via the `n_steps` argument

v0.21.0: KL Adaptive LR for PPO and learning rate schedule for SAC/TQC

v0.20.0: Hotfix for PPO with un-normalized env, `net_arch` support for PPO, additional fixes

v0.19.0: SimBa Policy: Simplicity Bias for Scaling Up Parameters in DRL

SBX v0.18.0: Bug fix for SAC, optimize log of ent coeff to be consistent with SB3

SBX v0.17.0: CNN support for DQN

SBX v0.15.0: Hotfix for off-policy algorithms, the pseudo-random key was not updated
