Temporal Topic Modeling to Assess Associations between News Trends and Infectious Disease Outbreaks

DSpace/Manakin Repository

Temporal Topic Modeling to Assess Associations between News Trends and Infectious Disease Outbreaks

Citable link to this page

 

 
Title: Temporal Topic Modeling to Assess Associations between News Trends and Infectious Disease Outbreaks
Author: Ghosh, Saurav; Chakraborty, Prithwish; Nsoesie, Elaine O.; Cohn, Emily; Mekaru, Sumiko R.; Brownstein, John S.; Ramakrishnan, Naren

Note: Order does not necessarily reflect citation order of authors.

Citation: Ghosh, Saurav, Prithwish Chakraborty, Elaine O. Nsoesie, Emily Cohn, Sumiko R. Mekaru, John S. Brownstein, and Naren Ramakrishnan. 2017. “Temporal Topic Modeling to Assess Associations between News Trends and Infectious Disease Outbreaks.” Scientific Reports 7 (1): 40841. doi:10.1038/srep40841. http://dx.doi.org/10.1038/srep40841.
Full Text & Related Files:
Abstract: In retrospective assessments, internet news reports have been shown to capture early reports of unknown infectious disease transmission prior to official laboratory confirmation. In general, media interest and reporting peaks and wanes during the course of an outbreak. In this study, we quantify the extent to which media interest during infectious disease outbreaks is indicative of trends of reported incidence. We introduce an approach that uses supervised temporal topic models to transform large corpora of news articles into temporal topic trends. The key advantages of this approach include: applicability to a wide range of diseases and ability to capture disease dynamics, including seasonality, abrupt peaks and troughs. We evaluated the method using data from multiple infectious disease outbreaks reported in the United States of America (U.S.), China, and India. We demonstrate that temporal topic trends extracted from disease-related news reports successfully capture the dynamics of multiple outbreaks such as whooping cough in U.S. (2012), dengue outbreaks in India (2013) and China (2014). Our observations also suggest that, when news coverage is uniform, efficient modeling of temporal topic trends using time-series regression techniques can estimate disease case counts with increased precision before official reports by health organizations.
Published Version: doi:10.1038/srep40841
Other Sources: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5244405/pdf/
Terms of Use: This article is made available under the terms and conditions applicable to Other Posted Material, as set forth at http://nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of-use#LAA
Citable link to this page: http://nrs.harvard.edu/urn-3:HUL.InstRepos:30370958
Downloads of this work:

Show full Dublin Core record

This item appears in the following Collection(s)

 
 

Search DASH


Advanced Search
 
 

Submitters