Skip to content

A simplified implementation of World Models paper, where the controller is trained with A2C and gradients of the controller are stopped from flowing in to the world model

License

Notifications You must be signed in to change notification settings

manavchoudhary/world_models_implementation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

World Models implementation

This repository contains the code for a simplified implementation of "World Models" paper, (NeurIPS version) , where the controller is trained with A2C. Note that since we train both M and C together, we stopped gradients of the controller from flowing in to the world model, to allow for learning task independent features. Also the encoder is a simple fully connected network and will be converted into VAE and the model is currently a RNN with fully connected network, instead of mixture density network.

Results

Results of this implementation on CartPole-v0 Open AI Gym environment. More results to follow. World Models experiment on Cartpole_V0

About

A simplified implementation of World Models paper, where the controller is trained with A2C and gradients of the controller are stopped from flowing in to the world model

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages