Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
Sohojoe authored Jul 6, 2018
1 parent 55f3211 commit 58bd3cc
Showing 1 changed file with 12 additions and 0 deletions.
12 changes: 12 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,18 @@
Research into controllers for 2d and 3d Active Ragdolls (using MujocoUnity+ml_agents)



### Controller003
* **Type:** Continuous 2D
* **Actions:** Forward / Backwards
* **Mujoco Model:** DeepMindHopper
* **Hypostheis**: Use an adversarial hierarchical trained agent as the controller which gets the inverse reward of the locomation agent on a slower time step. The idea is that it will push the locomoation agent to focus on its weakest areas.
* **Outcome:**
* **FAIL** - training is too heavily influenced by the number of steps the controller agent takes between decisions;
* ... it maybe better to train a seperate agent on hyper-parms (i.e. meta learning)
* ... having read more about these approaches (MAML, RL2, etc) it would be better to move to a Discreate conrtroller as ml-agents LSTM does not work well with Continuous actions.


### Controller002
* **Type:** Continuous 2D
* **Actions:** Forward / Backwards
Expand Down

0 comments on commit 58bd3cc

Please sign in to comment.