Behavior policy learning: Learning multi-stage tasks via solution sketches and model-based controllers
Multi-stage tasks are a challenge for reinforcement learning methods: learning them requires either task-specific knowledge (e.g., task segmentation) or a large amount of interaction time. In this paper, we propose Behavior Policy Learning (BPL) that effective... ...