- Gymnasium
- DeepMind Control Suite wrapper
- ELU activation
- Optional state-action merging layer index (Critic model)
- Optimized critic
- Optimized server
- `backend.epsilon()` from Keras backend
- update default `config.yaml`
- `.fit()`
- AgentCallback
- Render environments to WanDB
- Grouping of runs in WanDB
- SampleToInsertRatio rate limiter (sketch below)
- Global Gradient Clipping to avoid exploding gradients (sketch below)
- Softplus for numerical stability (sketch below)
- YAML configuration file
- LogCosh instead of Huber loss (sketch below)
- Critic network with Add layer applied on state & action branches (sketch below)
- Custom uniform initializer
- XLA (Accelerated Linear Algebra) compiler
- Optimized Replay Buffer (google-deepmind/reverb#90)
- split into Agent, Learner, Tester and Server
- Fixed creation of the saving path for models
- Fixed model's `summary()`
- Reverb
- `setup.py` (package is available on PyPI)
- split into Agent, Learner and Tester
- Use custom model and layer for defining Actor-Critic
- MultiCritic - concatenating multiple critic networks into one network (sketch below)
- Truncated Quantile Critics
- update Dockerfile
- update `README.md`
- formatted code with Black & Flake8
- fixed Critic model
- Add Huber loss
- In test mode, render to a video file
- Normalized observations with the min-max method (sketch below)
- Remove TD3 algorithm
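
The SampleToInsertRatio rate limiter comes from DeepMind Reverb. A minimal sketch of wiring it into a Reverb table follows; the table name, the 32:1 ratio, and the buffer sizes are placeholder assumptions, not values taken from this project's `config.yaml`.

```python
import reverb

# Hypothetical table definition; all sizes and the 32:1 ratio are placeholders.
table = reverb.Table(
    name="experience",
    sampler=reverb.selectors.Uniform(),
    remover=reverb.selectors.Fifo(),
    max_size=1_000_000,
    # Keep sampling and inserting roughly at 32 samples per insert, blocking
    # whichever side runs too far ahead once the error buffer is exceeded.
    rate_limiter=reverb.rate_limiters.SampleToInsertRatio(
        samples_per_insert=32.0,
        min_size_to_sample=1_000,
        error_buffer=1_000.0,
    ),
)

server = reverb.Server(tables=[table], port=8000)
```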
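
For global gradient clipping, a sketch using the Keras optimizer option is shown below; the clip value of 1.0 and the learning rate are assumptions, not the project's settings.

```python
import tensorflow as tf

# global_clipnorm rescales the whole gradient vector when its global norm
# exceeds the threshold (1.0 here is an assumed value).
optimizer = tf.keras.optimizers.Adam(learning_rate=3e-4, global_clipnorm=1.0)

# The same effect can be achieved manually inside a custom training step:
#   grads = tape.gradient(loss, model.trainable_variables)
#   grads, _ = tf.clip_by_global_norm(grads, 1.0)
#   optimizer.apply_gradients(zip(grads, model.trainable_variables))
```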
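
Softplus is typically used to keep a predicted scale (for example the policy's standard deviation) strictly positive without the overflow risk of `exp()`. The head below is only an illustrative sketch; the action dimension and layer sizes are assumptions.

```python
import tensorflow as tf
from tensorflow.keras import layers, Model

action_dim = 6  # placeholder

latent = layers.Input(shape=(256,), name="latent_features")
mean = layers.Dense(action_dim, name="mean")(latent)
# Softplus maps any real input to a positive value and grows linearly for
# large inputs, which is numerically safer than exponentiating a raw log-std.
std = layers.Dense(action_dim, activation="softplus", name="std")(latent)

policy_head = Model(inputs=latent, outputs=[mean, std])
```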
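
Log-cosh behaves like a squared error for small residuals and like an absolute error for large ones, similar to Huber but smooth everywhere. A minimal usage sketch with placeholder tensors:

```python
import tensorflow as tf

log_cosh = tf.keras.losses.LogCosh()

td_target = tf.constant([[1.0], [2.0], [3.0]])
q_value = tf.constant([[1.1], [1.5], [4.0]])

loss = log_cosh(td_target, q_value)  # scalar mean log-cosh error
```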
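
The critic with an Add layer on the state and action branches can be sketched with the Keras functional API as below; the layer widths, the ELU activations on both branches, and the single merge point are assumptions rather than the project's exact architecture.

```python
import tensorflow as tf
from tensorflow.keras import layers, Model

def build_critic(state_dim: int, action_dim: int) -> Model:
    state_input = layers.Input(shape=(state_dim,), name="state")
    action_input = layers.Input(shape=(action_dim,), name="action")

    # Separate branches for state and action, merged by element-wise addition
    # instead of concatenation (both branches must end with the same width).
    s = layers.Dense(400, activation="elu")(state_input)
    s = layers.Dense(300, activation="elu")(s)
    a = layers.Dense(300, activation="elu")(action_input)

    x = layers.Add()([s, a])
    x = layers.Dense(300, activation="elu")(x)
    q_value = layers.Dense(1, name="q_value")(x)

    return Model(inputs=[state_input, action_input], outputs=q_value)
```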
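
A MultiCritic that concatenates several critic sub-networks into one model can be sketched as follows; the number of critics and all layer sizes are placeholders, and a full Truncated Quantile Critics head would output several quantiles per critic rather than a single Q-value.

```python
import tensorflow as tf
from tensorflow.keras import layers, Model

def build_multi_critic(state_dim: int, action_dim: int, n_critics: int = 2) -> Model:
    state_input = layers.Input(shape=(state_dim,), name="state")
    action_input = layers.Input(shape=(action_dim,), name="action")
    merged = layers.Concatenate()([state_input, action_input])

    # Independent critic branches whose outputs are concatenated into one tensor.
    q_values = []
    for i in range(n_critics):
        x = layers.Dense(256, activation="elu")(merged)
        x = layers.Dense(256, activation="elu")(x)
        q_values.append(layers.Dense(1, name=f"q_value_{i}")(x))

    outputs = layers.Concatenate(axis=-1, name="q_values")(q_values)
    return Model(inputs=[state_input, action_input], outputs=outputs)
```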
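
Min-max normalization rescales each observation dimension into [0, 1] using the observation space bounds. A small sketch; the bounds would typically come from `env.observation_space.low` / `.high` and are assumed to be finite.

```python
import numpy as np

def min_max_normalize(obs: np.ndarray, low: np.ndarray, high: np.ndarray) -> np.ndarray:
    # Element-wise rescaling into [0, 1]; assumes finite, non-equal bounds.
    return (obs - low) / (high - low)

obs = np.array([0.0, 5.0, -1.0])
low = np.array([-1.0, 0.0, -2.0])
high = np.array([1.0, 10.0, 2.0])
print(min_max_normalize(obs, low, high))  # [0.5  0.5  0.25]
```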