getMaxQValue
Obtain maximum estimated value over all possible actions from a Q-value function critic with discrete action space, given environment observations
Syntax
[maxQ,maxActionIndex] = getMaxQValue(qValueFcnObj,obs)
[maxQ,maxActionIndex,state] = getMaxQValue(___)
Description
[maxQ,maxActionIndex] = getMaxQValue(qValueFcnObj,obs) evaluates the discrete-action-space Q-value function critic qValueFcnObj and returns the maximum estimated value over all possible actions, maxQ, together with the corresponding action index, maxActionIndex, given the environment observations obs.
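As an illustration of this syntax, the following is a minimal sketch, not a definitive usage. It assumes Reinforcement Learning Toolbox and Deep Learning Toolbox are installed. The observation size, the three-element action set, and the small network are arbitrary choices made for the example, and the critic is built with rlVectorQValueFunction, which outputs one Q-value per discrete action.

```matlab
% Assumed example setup: a 4-element observation and three discrete actions.
obsInfo = rlNumericSpec([4 1]);
actInfo = rlFiniteSetSpec([-1 0 1]);

% Small illustrative network: one output per possible action.
net = [
    featureInputLayer(4)
    fullyConnectedLayer(3)
    ];
net = dlnetwork(net);

% Discrete-action-space Q-value function critic.
critic = rlVectorQValueFunction(net,obsInfo,actInfo);

% Evaluate the critic on a random observation (passed in a cell array).
obs = {rand(4,1)};
[maxQ,maxActionIndex] = getMaxQValue(critic,obs);

% Map the returned index back to the action it refers to.
bestAction = actInfo.Elements(maxActionIndex);
```

Because the network weights are initialized randomly, the values of maxQ and maxActionIndex differ from run to run. The index refers to the position of the action in actInfo.Elements.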
[maxQ,maxActionIndex,state] = getMaxQValue(___) also returns the updated state of qValueFcnObj when it contains a recurrent neural network.
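The recurrent case can be sketched in the same way. This example is an assumption-laden illustration rather than a reference implementation: the LSTM size, the observation dimensions, and the sequence length are arbitrary, and the observation array is assumed to be laid out as observation dimensions, then batch, then sequence.

```matlab
% Assumed example setup: same specifications as a non-recurrent critic.
obsInfo = rlNumericSpec([4 1]);
actInfo = rlFiniteSetSpec([-1 0 1]);

% Illustrative recurrent network: sequence input followed by an LSTM layer.
rnet = [
    sequenceInputLayer(4)
    lstmLayer(8)
    fullyConnectedLayer(3)
    ];
rnet = dlnetwork(rnet);

% Discrete-action-space critic that contains a recurrent network.
rcritic = rlVectorQValueFunction(rnet,obsInfo,actInfo);

% One batch element, sequence of 5 observations (4-by-1-by-1-by-5 array).
obsSeq = {rand(4,1,1,5)};

% The third output is the updated state of the recurrent network.
[maxQ,maxActionIndex,state] = getMaxQValue(rcritic,obsSeq);

% Store the updated state back in the critic before the next evaluation.
rcritic.State = state;
```

Assigning state to the State property lets a subsequent call continue from where the sequence left off instead of restarting from the initial state.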