Publication: Enabling Environment Design via Active Indirect Elicitation
Open/View Files
Date
2008
Authors
Published Version
Published Version
Journal Title
Journal ISSN
Volume Title
Publisher
The Harvard community has made this article openly available. Please share how this access benefits you.
Citation
Zhang, Haoqi, and David C. Parkes. 2008. Enabling environment design via active indirect elicitation. Paper presented at the 4th Multidisciplinary Workshop on Advances in Preference Handling (MRPERF'08) Chicago IL, July 2008
Research Data
Abstract
Many situations arise in which an interested party wishes to
affect the decisions of an agent; e.g., a teacher that seeks to
promote particular study habits, a Web 2.0 site that seeks to
encourage users to contribute content, or an online retailer
that seeks to encourage consumers to write reviews. In the
problem of environment design, one assumes an interested
party who is able to alter limited aspects of the environment
for the purpose of promoting desirable behaviors. A critical
aspect of environment design is understanding preferences,
but by assumption direct queries are unavailable. We work in
the inverse reinforcement learning framework, adopting here
the idea of active indirect preference elicitation to learn the reward function of the agent by observing behavior in response
to incentives. We show that the process is convergent and
obtain desirable bounds on the number of elicitation rounds.
We briefly discuss generalizations of the elicitation method to
other forms of environment design, e.g., modifying the state
space, transition model, and available actions.
Description
Other Available Sources
Keywords
Terms of Use
This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service