Mining the Web for Medical Hypothesis: A Proof-of-Concept System

DSpace/Manakin Repository

Mining the Web for Medical Hypothesis: A Proof-of-Concept System

Citable link to this page


Title: Mining the Web for Medical Hypothesis: A Proof-of-Concept System
Author: Maclean, Diana; Seltzer, Margo I.

Note: Order does not necessarily reflect citation order of authors.

Citation: MacLean, Diana and Margo Seltzer. Forthcoming. Mining the Web for medical hypotheses: A proof-of-concept system. In Proceedings of the 2011 International Conference on Health Informatics (HEALTHINF'11), January 26 to 29, 2011, Rome, Italy.
Full Text & Related Files:
Abstract: As the prevalence of blogs, discussion forums, and online news services continues to grow, so too does the portion of this Web content that relates to health and medicine. We propose that everyday, medically-oriented Web content is a valuable and viable data source for medical hypothesis generation and testing, despite its being noisy. In this paper, we present a proof-of-concept system supporting this notion. We construct a corpus comprising news articles relating to the drugs Vioxx, Naproxen and Ibuprofen, that were published between 1998-2002. Using this corpus, we show that there was a significant link between Vioxx and the concept “Myocardial Infarction” well before the drug was withdrawn from the market in 2004. Indeed, within the Vioxx-related content, the concept ranks amongst the top 3.3% in terms of importance. When compared with the Naproxen and Ibuprofen control literatures, the term occurs significantly more frequently in the Vioxx-related content.
Other Sources:
Terms of Use: This article is made available under the terms and conditions applicable to Open Access Policy Articles, as set forth at
Citable link to this page:
Downloads of this work:

Show full Dublin Core record

This item appears in the following Collection(s)


Search DASH

Advanced Search