Show simple item record

dc.contributor.authorSchultink, Erik
dc.contributor.authorCavallo, Ruggiero
dc.contributor.authorParkes, David C.
dc.date.accessioned2010-04-28T15:50:20Z
dc.date.issued2008
dc.identifier.citationSchultink, Erik, Ruggiero Cavallo, and David C. Parkes. 2008. Economic hierarchical Q-learning. In Proceedings of the Twenty-third AAAI Conference on Artificial Intelligence and the Twentieth Innovative Applications of Artificial Intelligence Conference: July 13-17, 2008, Chicago, Illinois, ed. American Association for Artificial Intelligence, 689-695. Menlo Park, Calif.: AAAI Press.en_US
dc.identifier.isbn978-1-57735-368-3en_US
dc.identifier.urihttp://nrs.harvard.edu/urn-3:HUL.InstRepos:4000334
dc.description.abstractHierarchical state decompositions address the curse-of-dimensionality in Q-learning methods for reinforcement learning (RL) but can suffer from suboptimality. In addressing this, we introduce the Economic Hierarchical Q-Learning (EHQ) algorithm for hierarchical RL. The EHQ algorithm uses subsidies to align interests such that agents that would otherwise converge to a recursively optimal policy will instead be motivated to act hierarchically optimally. The essential idea is that a parent will pay a child for the relative value to the rest of the system for "returning the world" in one state over another state. The resulting learning framework is simple compared to other algorithms that obtain hierarchical optimality. Additionally, EHQ encapsulates relevant information about value tradeoffs faced across the hierarchy at each node and requires minimal data exchange between nodes. We provide no theoretical proof of hierarchical optimality but are able demonstrate success with EHQ in empirical results.en_US
dc.description.sponsorshipEngineering and Applied Sciencesen_US
dc.language.isoen_USen_US
dc.publisherAssociation for the Advancement of Artificial Intelligenceen_US
dc.relation.isversionofhttp://portal.acm.org/citation.cfm?id=1620163.1620179en_US
dc.relation.hasversionhttp://www.eecs.harvard.edu/econcs/pubs/schultink08.pdfen_US
dash.licenseLAA
dc.titleEconomic Hierarchical Q-learningen_US
dc.typeMonograph or Booken_US
dc.description.versionProofen_US
dash.depositing.authorParkes, David C.
dc.date.available2010-04-28T15:50:20Z
dash.contributor.affiliatedParkes, David


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record