Now showing items 1-1 of 1

    • Policy Teaching Through Reward Function Learning 

      Zhang, Haoqi; Parkes, David C.; Chen, Yiling (Association for Computing Machinery, 2009)
      Policy teaching considers a Markov Decision Process setting in which an interested party aims to influence an agent's decisions by providing limited incentives. In this paper, we consider the specific objective of inducing ...