• Login
View Item 
  • DASH Home
  • Harvard Law School
  • HLS Scholarly Articles
  • View Item
  • DASH Home
  • Harvard Law School
  • HLS Scholarly Articles
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Browse

All of DASH
  • Communities & Collections
  • By Issue Date
  • Author
  • Title
  • Keyword
  • FAS Department
This Collection
  • By Issue Date
  • Author
  • Title
  • Keyword

Submitters

  • Login
  • Quick submit
  • Waiver Generator

About

  • About DASH
  • DASH Stories
  • DASH FAQs
  • Accessibility
  • COVID-related Research
  • Terms of Use
  • Privacy Policy

Statistics

  • By Schools
  • By Collections
  • By Departments
  • By Items
  • By Country
  • By Authors

The Paper of Record Meets an Ephemeral Web: An Examination of Linkrot and Content Drift within The New York Times

 
Thumbnail
View/Open
NYT Link Rot.pdf (473.4Kb)
Author
Zittrain, JonathanHARVARD
Bowers, JohnHARVARD
Stanton, ClareHARVARD
Metadata
Show full item record
Citation
Zittrain, Jonathan, John Bowers, and Clare Stanton. 2021. "The Paper of Record Meets an Ephemeral Web: An Examination of Linkrot and Content Drift within The New York Times." Library Innovation Lab, Harvard Law School.
Abstract
Hyperlinks are a powerful tool for journalists and their readers. Diving deep into the context of an article is just a click away. But hyperlinks are a double-edged sword; for all of the internet’s boundlessness, what’s found on the web can also be modified, moved, or entirely disappeared. This often-irreversible decay of web content is commonly known as linkrot. It comes with a similar problem of content drift, or the often-unannounced changes––retractions, additions, replacement––to the content at a particular URL. 
 
Our team of researchers at Harvard Law School has undertaken a project to gain insight into the extent and characteristics of journalistic linkrot and content drift. We examined hyperlinks in New York Times articles starting with the launch of the Times website in 1996 up through mid-2019, developed on the basis of a dataset provided to us by the Times. We focus on the Times not because it is an influential publication whose archives are often used to help form a historical record. Rather, the substantial linkrot and content drift we find here across the New York Times corpus accurately reflects the inherent difficulties of long-term linking to pieces of a volatile web.
 
Results show a near linear increase of linkrot over time, with interesting patterns emerging within certain sections of the paper or across top level domains. Over half of articles containing at least one URL also contained a dead link. Additionally, of the ostensibly “healthy” links existing in articles, a hand review revealed additional erosion to citations via content drift.
 
Terms of Use
This article is made available under the terms and conditions applicable to Other Posted Material, as set forth at http://nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of-use#LAA
Citable link to this page
https://nrs.harvard.edu/URN-3:HUL.INSTREPOS:37367405

Collections
  • Berkman Klein Center for Internet & Society Scholarly Articles [105]
  • HLS Scholarly Articles [1900]

Contact administrator regarding this item (to report mistakes or request changes)

e: osc@harvard.edu

t: +1 (617) 495 4089

Creative Commons license‌Creative Commons Attribution 4.0 International License

Except where otherwise noted, this work is subject to a Creative Commons Attribution 4.0 International License, which allows anyone to share and adapt our material as long as proper attribution is given. For details and exceptions, see the Harvard Library Copyright Policy ©2022 Presidents and Fellows of Harvard College.

  • Follow us on Twitter
  • Contact
  • Harvard Library
  • Harvard University