Word | reinforcement learning algorithm |
Definition | Meaning in example sentences: 强化学习算法
1. Now reinforcement learning is widely used in agent systems, among which the Q-learning algorithm is a widely used reinforcement learning algorithm. 学习算法是最易理解和目前广为使用的一种无模型强化学习方法,但标准的Q-学习算法应用于智能体系统时本身存在一些问题。 www.dictall.com
2. In this paper, we develop a kernel-based reinforcement learning algorithm, which solves problems with continuous state spaces directly. 为克服以上不足,本文提出了一种基于核方法的强化学习算法,能直接处理具有连续状态空间的问题。 www.dictall.com
3. In section 3 the particular reinforcement learning algorithm used in this architecture is described. 第三部分介绍一种特殊的用于此网络架构的强化学习算法。 blog.sina.com.cn
4. Reinforcement learning algorithm for partially observable Markov decision processes 求解部分可观测马氏决策过程的强化学习算法 www.ilib.cn
5. A Reinforcement Learning Algorithm for Partially Observable Markov Decision Processes 一种部分可感知系统的增强学习方法 www.ilib.cn
6. Study on an Average Reward Reinforcement Learning Algorithm 平均奖赏强化学习算法研究 www.ilib.cn
7. A Truncated Multi-step Prioritized Sweeping Reinforcement Learning Algorithm 多步截断优先扫描强化学习算法 www.ilib.cn
8. Interference Solving Strategy in Multiple Robot System Based on Reinforcement Learning Algorithm 基于强化学习算法的多机器人系统的冲突消解策略 www.ilib.cn
9. Undiscounted Reinforcement Learning Algorithm Based on Performance Potentials 一种基于性能势的无折扣强化学习算法 www.ilib.cn
10. Hybrid Intelligent Control for Ship Steering Based on Reinforcement Learning Algorithm 基于增强型学习算法的船舶运动混合智能控制 service.ilib.cn
11. Congestion Control in Networks Based on Reinforcement Learning Algorithm 基于强化学习算法的网络拥塞控制 www.ilib.cn
12. A Stock Forecasting System Based On A Reinforcement Learning Algorithm 基于强化学习的股票预测系统的研究与设计 www.ilib.cn
13. A Reinforcement Learning Algorithm Based on Recursive Least-squares Methods 一种基于递归最小二乘法的强化学习算法及其应用研究 www.ilib.cn
14. A Multi-agent Cooperative Reinforcement Learning Algorithm Based on Team Markov Game 一种基于团队马尔可夫博弈的多agent协同强化学习算法 www.ilib.cn
15. Ship Steering Control Based on SA-Reinforcement Learning Algorithm 基于模拟退火-强化学习算法的船舶运动控制 www.ilib.com.cn
16. A Kind of Forgetting Reinforcement Learning Algorithm 一种激励学习遗忘算法 www.ilib.cn
17. A Routing Model Based on Reinforcement Learning Algorithm 一个基于增强学习算法的路由模型 www.ilib.cn |
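This English-Chinese bilingual dictionary contains 2,704,715 English-Chinese entries, covering the translation and usage of essentially all commonly used words, and is a useful tool for learning English.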