Title: Enabling Environment Design via Active Indirect Elicitation
Author: Zhang, Haoqi; Parkes, David

Note: Order does not necessarily reflect citation order of authors.

Citation: Zhang, Haoqi, and David C. Parkes. 2008. Enabling environment design via active indirect elicitation. Paper presented at the 4th Multidisciplinary Workshop on Advances in Preference Handling (MPREF'08), Chicago, IL, July 2008.
Abstract: Many situations arise in which an interested party wishes to affect the decisions of an agent; e.g., a teacher that seeks to promote particular study habits, a Web 2.0 site that seeks to encourage users to contribute content, or an online retailer that seeks to encourage consumers to write reviews. In the problem of environment design, one assumes an interested party who is able to alter limited aspects of the environment for the purpose of promoting desirable behaviors. A critical aspect of environment design is understanding preferences, but by assumption direct queries are unavailable. We work in the inverse reinforcement learning framework, adopting here the idea of active indirect preference elicitation to learn the reward function of the agent by observing behavior in response to incentives. We show that the process is convergent and obtain desirable bounds on the number of elicitation rounds. We briefly discuss generalizations of the elicitation method to other forms of environment design, e.g., modifying the state space, transition model, and available actions.
Published Version: http://www.eecs.harvard.edu/econcs/pubs/zp-mpref08.pdf
Terms of Use: This article is made available under the terms and conditions applicable to Other Posted Material, as set forth at http://nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of-use#LAA
