Publication:

Connecting Language Representations in Humans and Machines

Loading...
Thumbnail Image

Date

2020-06-18

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Jou, Christine. 2020. Connecting Language Representations in Humans and Machines. Bachelor's thesis, Harvard College.

Abstract

In the past decade, biologically-inspired neural network models have surprisingly uncovered ways of building upon our understanding of the human brain, especially in the field of computer vision. Of particular interest, more recently, however, are the mechanisms by which language is processed in the brain. State-of-art natural language processing models have gained greater traction because of their exceptional performance on linguistic tasks; however, there is still a significant gap in understanding how these models learn language and what parallels, if any, can be drawn between the human brain and these models. Parts I-III of this thesis introduces the language regions of interest, describes deep neural network model architectures, and outlines the theory and metrics of evaluations behind correlation methods. Parts IV and V demonstrate our exploratory analysis and experimental findings specific to data collected from natural reading, describing patterns found spatially in the brain and layer-wise in deep neural network models. We find more discriminatory metrics when decoding from model-to-brain (the traditional method of decoding in neuroscience literature) compared to brain-to-model. We also observe that pretrained word embeddings perform similarly to contextualized word embeddings when decoding from model-to-brain, and we further identify the left middle posterior temporal cortex and the left posterior temporal cortex as language regions of most interest. No conclusive evidence was found to support a relationship between language models layer-wise to brain regions spatially. Given the exploratory nature of this work, our framework holds promise in understanding how artificial neural networks represent language and lays the groundwork for further investigation in connecting brain data with model embeddings.

Description

Other Available Sources

Research Data

Keywords

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories