A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction

This paper presents a new hybrid probably approximately correct (PAC) reinforcement learning (RL) algorithm for Markov decision processes (MDPs) that retains favorable features of both model-based and model-free methodologies. The designed algori... ...
