Show simple item record

dc.contributor.advisorKreiman, Gabriel
dc.contributor.authorKarev, Dimitar Nikolaev
dc.date.accessioned2021-07-19T03:58:12Z
dash.embargo.terms2021-12-23
dc.date.created2021
dc.date.issued2021-06-03
dc.date.submitted2021
dc.identifier.citationKarev, Dimitar Nikolaev. 2021. Context-Robust Object Recognition via Object Manipulations in a Synthetic 3D Environment. Bachelor's thesis, Harvard College.
dc.identifier.other28409894
dc.identifier.urihttps://nrs.harvard.edu/URN-3:HUL.INSTREPOS:37368550*
dc.description.abstractThe remote control is a small object that does not fly in the air and is generally found on a table, not in the sink. Such contextual regularities are ingrained in our perception of the world and previous research suggests that they can even influence human and computational models' object recognition ability. However, the exact effects of contextual information on object recognition are still unknown for both humans and machine learning models. Here, we introduce a novel way of studying the effects of different contextual cues in a qualitative and systematic way. We present a diverse synthetic dataset created via a 3D simulation engine that allows for complex object modifications. Our dataset consists of more than 15000 images across 36 object categories and it is designed specifically for studying the effects of gravity, object co-occurrence statistics, and relative size regularities. We conduct a series of psychophysics experiments to assess human performance and establish a benchmark for computational models on the dataset. Additionally, we test state-of-the-art deep learning models on the same dataset and study how contextual information influences their object recognition accuracy. Finally, we propose a context-aware recognition transformer network that integrates contextual and object information via multi-head attention mechanism. Our model captures useful contextual information that allows it to achieve human-level performance and significantly better robustness in out-of-context conditions compared to baseline models across our dataset and another existing out-of-context natural image dataset. Moreover, our model performs in a way that is consistent with human object recognition and shows similar recognition artifacts.
dc.format.mimetypeapplication/pdf
dc.language.isoen
dash.licenseLAA
dc.subjectcontext
dc.subjectmachine learning
dc.subjectobject recognition
dc.subjectrobust
dc.subjectsynthetic
dc.subjectvirtualhome
dc.subjectArtificial intelligence
dc.subjectNeurosciences
dc.subjectComputer science
dc.titleContext-Robust Object Recognition via Object Manipulations in a Synthetic 3D Environment
dc.typeThesis or Dissertation
dash.depositing.authorKarev, Dimitar Nikolaev
dash.embargo.until2021-12-23
dc.date.available2021-07-19T03:58:12Z
thesis.degree.date2021
thesis.degree.grantorHarvard College
thesis.degree.levelBachelor's
thesis.degree.levelUndergraduate
thesis.degree.nameAB
dc.type.materialtext
thesis.degree.departmentComputer Science
dash.author.emaildkarev98@gmail.com


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record