The paper applies the policy iteration method to obtain the solution.
2
The system's optimal inventory allocation policy can be obtained using policy iteration or value iteration.
3
The choice of basis functions directly influences the learning performance of a policy iteration method during value function approximation.