Natural Language Processing for Health System Messages: Deep Transfer Learning Approach to Aspect-Based Sentiment Analysis of COVID-19 Content
Citation
Sun, Mary Daotung. 2021. Natural Language Processing for Health System Messages: Deep Transfer Learning Approach to Aspect-Based Sentiment Analysis of COVID-19 Content. Master's thesis, Harvard University Division of Continuing Education.Abstract
Recent efforts in clinical natural language processing research have focused on mining social media data and other user-generated data relevant to COVID-19. An increasing number of machine learning tools have been developed for annotation, topic modeling, and various classification tasks, including the now widely used Bidirectional Encoder Representations from Transformer (BERT) language model. In this study, we improve the domain specificity of COVID-Twitter-BERT for secure health system messages through supervised fine-tuning on a large corpus of patient-originated messages. Following sentiment annotation with a validated rule-based heuristic tool, an auxiliary sentence approach was implemented to conduct aspect-based sentiment analysis. We find that existing machine learning tools optimized on social media tools demonstrate moderate-to-good generalizability to secure patient messages. Furthermore, significant performance improvements to a weighted average F1 score of 0.881 were realized upon implementation of fine-tuning and auxiliary sentence approaches. We also observe temporal trends in positive, negative, and compound (overall) sentiment measured on over 3.2 million patient messages related to COVID-19. We believe that secure health system messages are an increasingly valuable source of clinical text data and can be effectively mined using emerging deep transfer learning techniques.Terms of Use
This article is made available under the terms and conditions applicable to Other Posted Material, as set forth at http://nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of-use#LAACitable link to this page
https://nrs.harvard.edu/URN-3:HUL.INSTREPOS:37370747
Collections
- DCE Theses and Dissertations [1393]
Contact administrator regarding this item (to report mistakes or request changes)