Publication: Linguistic Features for Readability Assessment
Date
2020-06-17
Authors
Deutsch, Tovly
Citation
Deutsch, Tovly. 2020. Linguistic Features for Readability Assessment. Bachelor's thesis, Harvard College.
Abstract
Readability assessment aims to automatically classify text by the level appropriate for learning readers. Traditional approaches to this task pair a large variety of linguistically motivated features with simple machine learning models. More recent methods have improved performance by discarding these features and using deep learning models. This thesis attempts to combine these two approaches with the goal of improving overall model performance. My primary method incorporates the output of a deep learning model as a feature in its own right, used in conjunction with linguistic features. Evaluating on two large readability corpora, I find that this fused approach is ineffective, failing to improve upon state-of-the-art performance. These results suggest that other avenues of research would be more fruitful in improving readability assessment.
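The fused approach described in the abstract can be illustrated with a minimal sketch. This is not the thesis's implementation: the three surface features, the function names (`linguistic_features`, `fuse_features`, `stub_neural_model`), and the three-level output are all assumptions chosen for illustration. The only idea taken from the abstract is that a deep learning model's output is appended to the linguistic features, and the combined vector is then handed to a simple downstream classifier.

```python
import re
from typing import Callable, List, Sequence


def linguistic_features(text: str) -> List[float]:
    """Three toy surface features (hypothetical stand-ins for a real feature set)."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text.lower())
    if not sentences or not words:
        return [0.0, 0.0, 0.0]
    avg_sentence_len = len(words) / len(sentences)          # words per sentence
    avg_word_len = sum(len(w) for w in words) / len(words)  # characters per word
    type_token_ratio = len(set(words)) / len(words)         # lexical diversity
    return [avg_sentence_len, avg_word_len, type_token_ratio]


def fuse_features(
    text: str, neural_model: Callable[[str], Sequence[float]]
) -> List[float]:
    """Append the neural model's output to the linguistic features.

    The fused vector is what a simple classifier would be trained on.
    """
    return linguistic_features(text) + list(neural_model(text))


def stub_neural_model(text: str) -> List[float]:
    """Placeholder for a trained network: fixed scores over 3 assumed levels."""
    return [0.2, 0.5, 0.3]


fused = fuse_features("The cat sat. The dog ran fast.", stub_neural_model)
```

Here `fused` holds three linguistic values followed by the three neural scores. In practice the stub would be replaced by a trained model's per-level probabilities, and the feature set would be far richer than the three values shown.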
Terms of Use
This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service