Implementation of DDPG with NumPy only (without TensorFlow)
neural-network policy-gradient reinforcement-learning-algorithms stochastic-gradient-descent adam-optimizer deep-deterministic-policy-gradient actor-critic-algorithm replay-buffer target-network
Updated Mar 4, 2018 - Python