Economic Hierarchical Q-learning

Schultink, Erik; Cavallo, Ruggiero; Parkes, David C.

dc.contributor.author	Schultink, Erik
dc.contributor.author	Cavallo, Ruggiero
dc.contributor.author	Parkes, David C.
dc.date.accessioned	2010-04-28T15:50:20Z
dc.date.issued	2008
dc.identifier.citation	Schultink, Erik, Ruggiero Cavallo, and David C. Parkes. 2008. Economic hierarchical Q-learning. In Proceedings of the Twenty-third AAAI Conference on Artificial Intelligence and the Twentieth Innovative Applications of Artificial Intelligence Conference: July 13-17, 2008, Chicago, Illinois, ed. American Association for Artificial Intelligence, 689-695. Menlo Park, Calif.: AAAI Press.	en_US
dc.identifier.isbn	978-1-57735-368-3	en_US
dc.identifier.uri	http://nrs.harvard.edu/urn-3:HUL.InstRepos:4000334
dc.description.abstract	Hierarchical state decompositions address the curse-of-dimensionality in Q-learning methods for reinforcement learning (RL) but can suffer from suboptimality. In addressing this, we introduce the Economic Hierarchical Q-Learning (EHQ) algorithm for hierarchical RL. The EHQ algorithm uses subsidies to align interests such that agents that would otherwise converge to a recursively optimal policy will instead be motivated to act hierarchically optimally. The essential idea is that a parent will pay a child for the relative value to the rest of the system for "returning the world" in one state over another state. The resulting learning framework is simple compared to other algorithms that obtain hierarchical optimality. Additionally, EHQ encapsulates relevant information about value tradeoffs faced across the hierarchy at each node and requires minimal data exchange between nodes. We provide no theoretical proof of hierarchical optimality but are able demonstrate success with EHQ in empirical results.	en_US
dc.description.sponsorship	Engineering and Applied Sciences	en_US
dc.language.iso	en_US	en_US
dc.publisher	Association for the Advancement of Artificial Intelligence	en_US
dc.relation.isversionof	http://portal.acm.org/citation.cfm?id=1620163.1620179	en_US
dc.relation.hasversion	http://www.eecs.harvard.edu/econcs/pubs/schultink08.pdf	en_US
dash.license	LAA
dc.title	Economic Hierarchical Q-learning	en_US
dc.type	Monograph or Book	en_US
dc.description.version	Proof	en_US
dash.depositing.author	Parkes, David C.
dc.date.available	2010-04-28T15:50:20Z
dash.contributor.affiliated	Parkes, David

Files in this item

Name:: Schultink_Economic.pdf
Size:: 437.1Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

FAS Scholarly Articles [18292]

Show simple item record