The Parable of Google Flu: Traps in Big Data Analysis

Title: The Parable of Google Flu: Traps in Big Data Analysis
Author: Lazer, David M.; Kennedy, R.; King, Gary (ORCID 0000-0002-5327-7631); Vespignani, A.

Note: Order does not necessarily reflect citation order of authors.

Citation: Lazer, D., R. Kennedy, G. King, and A. Vespignani. 2014. “The Parable of Google Flu: Traps in Big Data Analysis.” Science 343 (6176) (March 14): 1203–1205.
Abstract: Large errors in flu prediction were largely avoidable, which offers lessons for the use of big data. In February 2013, Google Flu Trends (GFT) made headlines but not for a reason that Google executives or the creators of the flu tracking system would have hoped. Nature reported that GFT was predicting more than double the proportion of doctor visits for influenza-like illness (ILI) than the Centers for Disease Control and Prevention (CDC), which bases its estimates on surveillance reports from laboratories across the United States (1, 2). This happened despite the fact that GFT was built to predict CDC reports. Given that GFT is often held up as an exemplary use of big data (3, 4), what lessons can we draw from this error?
Published Version: doi:10.1126/science.1248506
Terms of Use: This article is made available under the terms and conditions applicable to Open Access Policy Articles, as set forth at
