Search results: 1-1 of 1 record found for "Management Science and Engineering REINFORCEMENT". Query time: 0.047 seconds.
Kernel-Based Reinforcement Learning in Average-Cost Problems
Average-cost problem; dynamic programming; kernel smoothing; local averaging; Markov decision process (MDP)
2015/7/8
Reinforcement learning (RL) is concerned with the identification of optimal controls in Markov decision processes (MDPs) where no explicit model of the transition probabilities is available. Many exis...