Two-level Q-learning: learning from conflict demonstrations

Li, M; Wei, Y; Kudenko, D

Li, M (reprint author), Univ York, Comp Sci Dept, York, N Yorkshire, England.

KNOWLEDGE ENGINEERING REVIEW, 2019; 34

Abstract

One way to address the low sample efficiency of reinforcement learning (RL) is to employ human expert demonstrations to speed up the RL process (RL f......
