This repository has been archived by the owner on Apr 26, 2023. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 9
q_values/value_vector function #73
Comments
Here is one perspective. |
It seems like |
Seems reasonable to me.
…On Mon, Jul 2, 2018 at 7:21 AM Maxime ***@***.***> wrote:
It seems like action_values(policy, s) would be a more appropriate name.
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#73 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AEC0a4J5F99Gf3Rg6ufnGdx_6vr3BOS5ks5uCixygaJpZM4U_STr>
.
|
what order would these be in? consistent with iterator(actions(mdp, s)) or ordered_actions(mdp)? |
I vote |
What if not all the actions are in `actions(mdp, s)`? are the entries for
the missing actions undefined?
…On Thu, Jul 26, 2018 at 9:48 PM Maxime ***@***.***> wrote:
I vote ordered_actions(mdp), it makes it clearer I think.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#73 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AEC0azzZNJtsOsKj3WjuH99a7qYjWS_Tks5uKpu1gaJpZM4U_STr>
.
|
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
I think it would be useful to have a function returning the q values at a given state.
The signature could be
q_values(policy, s)
orvalue_vector(policy, s)
.Any thoughts?
The text was updated successfully, but these errors were encountered: