方略学科导航

搜索结果: 1-1 共查到“管理科学与工程 REINFORCEMENT”相关记录1条 . 查询时间(0.047 秒)

Kernel-Based Reinforcement Learning in Average-Cost Problems Average–cost problem dynamic programming kernel smoothing local averaging Markov decision process (MDP) 2015/7/8

Reinforcement learning (RL) is concerned with the identification of optimal controls in Markov decision processes (MDPs) where no explicit model of the transition probabilities is available. Many exis...

存档附件原文地址

中国研究生教育排行榜-条

正在加载...

中国学术期刊排行榜-条

正在加载...

世界大学科研机构排行榜-条

正在加载...

中国大学排行榜-条

正在加载...

人　物-篇

正在加载...

课　件-篇

正在加载...

视听资料-篇

正在加载...

研招资料 -篇

正在加载...

知识要闻-篇

正在加载...

国际动态-篇

正在加载...

会议中心-篇

正在加载...

学术指南-篇

正在加载...

学术站点-篇

正在加载...