This repository implements the meta-reinforcement learning architecture proposed in "Learning to reinforcement learn" (Wang et al., 2016). We use the CartPole-v0 environment for our experiments.
This repo is inspired by Florentin's thesis and Arthur's implementation.
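
In the architecture from Wang et al., the recurrent agent receives the previous action and previous reward alongside the current observation, which lets the hidden state carry information across episodes. The following is a minimal sketch of how such an input vector could be assembled for CartPole. It is not taken from this repository's code: the function name `build_meta_rl_input` and the constants `OBS_DIM` and `NUM_ACTIONS` are hypothetical and chosen only for illustration.

```python
import numpy as np

# Assumed sizes for CartPole: 4 observation values, 2 discrete actions.
OBS_DIM = 4
NUM_ACTIONS = 2


def build_meta_rl_input(obs, prev_action, prev_reward):
    """Concatenate observation, one-hot previous action, and previous reward.

    Hypothetical helper: `prev_action` may be None at the first step,
    in which case the one-hot part is left as all zeros.
    """
    obs = np.asarray(obs, dtype=np.float32)
    if obs.shape != (OBS_DIM,):
        raise ValueError(f"expected observation of shape ({OBS_DIM},)")

    one_hot = np.zeros(NUM_ACTIONS, dtype=np.float32)
    if prev_action is not None:
        one_hot[prev_action] = 1.0

    reward = np.array([prev_reward], dtype=np.float32)
    return np.concatenate([obs, one_hot, reward])


# Example: the agent previously took action 1 and received reward 1.0.
x = build_meta_rl_input([0.0, 0.1, -0.02, 0.3], prev_action=1, prev_reward=1.0)
```

The resulting vector has `OBS_DIM + NUM_ACTIONS + 1` entries and would be fed to the recurrent network at each step. Under this sketch the hidden state would be reset between tasks, not between episodes of the same task.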