Mining the Web for Medical Hypothesis: A Proof-of-Concept System
MetadataShow full item record
CitationMacLean, Diana and Margo Seltzer. Forthcoming. Mining the Web for medical hypotheses: A proof-of-concept system. In Proceedings of the 2011 International Conference on Health Informatics (HEALTHINF'11), January 26 to 29, 2011, Rome, Italy.
AbstractAs the prevalence of blogs, discussion forums, and online news services continues to grow, so too does the portion of this Web content that relates to health and medicine. We propose that everyday, medically-oriented Web content is a valuable and viable data source for medical hypothesis generation and testing, despite its being noisy. In this paper, we present a proof-of-concept system supporting this notion. We construct a corpus comprising news articles relating to the drugs Vioxx, Naproxen and Ibuprofen, that were published between 1998-2002. Using this corpus, we show that there was a signiﬁcant link between Vioxx and the concept “Myocardial Infarction” well before the drug was withdrawn from the market in 2004. Indeed, within the Vioxx-related content, the concept ranks amongst the top 3.3% in terms of importance. When compared with the Naproxen and Ibuprofen control literatures, the term occurs signiﬁcantly more frequently in the Vioxx-related content.
Citable link to this pagehttp://nrs.harvard.edu/urn-3:HUL.InstRepos:8739094
- FAS Scholarly Articles