Hierarchical reinforcement learning based on subgoal discovery and subpolicy specialization

B Bakker, J Schmidhuber - Proc. of the 8th Conf. on Intelligent Autonomous Systems, 2004 - Citeseer

Abstract

We introduce a new method for hierarchical reinforcement learning. High-level policies automatically discover subgoals; low-level policies learn to specialize on different subgoals. Subgoals are represented as desired abstract observations which cluster raw input data. High-level value functions cover the state space at a coarse level; low-level value functions cover only parts of the state space at a fine-grained level. Experiments show that this method outperforms several flat reinforcement learning methods in a deterministic task and in a stochastic task.
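
To make the idea of subgoals as "desired abstract observations which cluster raw input data" concrete, here is a minimal Python sketch. It is not the authors' implementation: the nearest-centroid rule, the function names `abstract_observation` and `subgoal_reached`, and the example centroids are all assumptions chosen for illustration. The abstract does not say which clustering method the paper uses.

```python
import math

# Hypothetical cluster centres over a 2-D raw observation space.
# Each index plays the role of one abstract observation (assumption).
centroids = [(0.0, 0.0), (10.0, 0.0), (10.0, 10.0)]


def abstract_observation(raw, centroids):
    """Map a raw observation to the index of its nearest centroid."""
    return min(range(len(centroids)),
               key=lambda i: math.dist(raw, centroids[i]))


def subgoal_reached(raw, subgoal, centroids):
    """A subgoal counts as reached when the raw observation falls in
    the cluster that the high-level policy asked for."""
    return abstract_observation(raw, centroids) == subgoal
```

Under these assumptions, a high-level policy would choose a target cluster index, and a low-level policy would act on raw observations until `subgoal_reached` returns true for that index.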
