Bias-Corrected Q-Learning With Multistate Extension

Lee, D; Powell, WB

Lee, D (reprint author), Princeton Univ, Dept Comp Sci, Comp Sci, Princeton, NJ 08540 USA.

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019; 64 (10): 4011

Abstract

Q-learning is a sample-based model-free algorithm that solves Markov decision problems asymptotically, but in finite time, it can perform poorly when ......
