Show simple item record

dc.contributor.authorLennon, James
dc.date.accessioned2020-08-28T09:43:02Z
dc.date.created2019-05
dc.date.issued2019-08-23
dc.date.submitted2019
dc.identifier.citationLennon, James. 2019. Modeling Human Behavior in Space Invaders. Bachelor's thesis, Harvard College.
dc.identifier.urihttps://nrs.harvard.edu/URN-3:HUL.INSTREPOS:37364651*
dc.description.abstractEffective AI systems in the real world must be able to interact and cooperate effectively with the people who use and benefit from them. In order to make this possible, these systems must have a realistic model of how humans will behave in various situations; either overestimating or underestimating human performance can lead to strongly suboptimal outcomes. To this end, this thesis proposes a new algorithm for imitation learning, working in the Atari 2600 Space Invaders environment. We first modify GAIL, a state-of-the-art deep imitation learning algorithm, to work in Atari environments and verify that it scales up to more complex environments more effectively than the original version of the algorithm. We then build a framework for evaluating and comparing human imitators, developing a set of relevant statistics that consider both in-environment performance and descriptive similarity. The new method that is introduced breaks down the problem of human imitation into two subproblems: creating an agent that plays the game well and learning a "corrective" function that modifies this agent to play in a human manner. This hybrid approach is fast to train and can be easily tuned along a spectrum to make the tradeoff between more closely matching the human behavior or performing at a higher level. This approach shows promising results across the evaluation statistics; it achieves a high likelihood of the data under the learned policy, produces a score distribution matching that of the human data, and also matches the human distribution of actions as it acts in the environment.
dc.description.sponsorshipComputer Science
dc.format.mimetypeapplication/pdf
dc.language.isoen
dash.licenseLAA
dc.titleModeling Human Behavior in Space Invaders
dc.typeThesis or Dissertation
dash.depositing.authorLennon, James
dc.date.available2020-08-28T09:43:02Z
thesis.degree.date2019
thesis.degree.grantorHarvard College
thesis.degree.levelUndergraduate
thesis.degree.nameAB
dc.type.materialtext
thesis.degree.departmentComputer Science
thesis.degree.discipline-jointMind Brain Behavior
dash.identifier.vireo
dash.author.emailjameslennon321@gmail.com


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record