Skip to content
This repository has been archived by the owner on Apr 26, 2023. It is now read-only.

q_values/value_vector function #73

Open
MaximeBouton opened this issue Jul 2, 2018 · 6 comments
Open

q_values/value_vector function #73

MaximeBouton opened this issue Jul 2, 2018 · 6 comments

Comments

@MaximeBouton
Copy link
Contributor

I think it would be useful to have a function returning the q values at a given state.
The signature could be q_values(policy, s) or value_vector(policy, s).

Any thoughts?

@mykelk
Copy link
Member

mykelk commented Jul 2, 2018

Here is one perspective.

@MaximeBouton
Copy link
Contributor Author

It seems like action_values(policy, s) would be a more appropriate name.

@zsunberg
Copy link
Member

zsunberg commented Jul 2, 2018 via email

@zsunberg
Copy link
Member

what order would these be in? consistent with iterator(actions(mdp, s)) or ordered_actions(mdp)?

@MaximeBouton
Copy link
Contributor Author

I vote ordered_actions(mdp), it makes it clearer I think.

@zsunberg
Copy link
Member

zsunberg commented Jul 27, 2018 via email

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants