The Devil’s Shoehorn: A case study of EAD to ArchivesSpace migration at a large university

DSpace/Manakin Repository

The Devil’s Shoehorn: A case study of EAD to ArchivesSpace migration at a large university

Show simple item record

dc.contributor.author Mayo, William David
dc.contributor.author Bowers, Kathryn A.
dc.date.accessioned 2017-02-16T15:17:43Z
dc.date.issued 2017
dc.identifier Quick submit: 2017-02-14T15:41:00-0500
dc.identifier.citation Mayo, Dave and Kate Bowers. 2017. The Devil’s Shoehorn: A case study of EAD to ArchivesSpace migration at a large university. Code4Lib Journal 35: 2017-01-30. en_US
dc.identifier.issn 1940-5758 en_US
dc.identifier.uri http://nrs.harvard.edu/urn-3:HUL.InstRepos:30356833
dc.description.abstract A band of archivists and IT professionals at Harvard took on a project to convert nearly two million descriptions of archival collection components from marked-up text into the ArchivesSpace archival metadata management system. Starting in the mid-1990s, Harvard was an alpha implementer of EAD, an SGML (later XML) text markup language for electronic inventories, indexes, and finding aids that archivists use to wend their way through the sometimes quirky filing systems that bureaucracies establish for their records or the utter chaos in which some individuals keep their personal archives. These pathfinder documents, designed to cope with messy reality, can themselves be difficult to classify. Portions of them are rigorously structured, while other parts are narrative. Early documents predate the establishment of the standard; many feature idiosyncratic encoding that had been through several machine conversions, while others were freshly encoded and fairly consistent. In this paper, we will cover the practical and technical challenges involved in preparing a large (900MiB) corpus of XML for ingest into an open-source archival information system (ArchivesSpace). This case study will give an overview of the project, discuss problem discovery and problem solving, and address the technical challenges, analysis, solutions, and decisions and provide information on the tools produced and lessons learned. The authors of this piece are Kate Bowers, Collections Services Archivist for Metadata, Systems, and Standards at the Harvard University Archive, and Dave Mayo, a Digital Library Software Engineer for Harvard’s Library and Technology Services. Kate was heavily involved in both metadata analysis and later problem solving, while Dave was the sole full-time developer assigned to the migration project. en_US
dc.description.sponsorship Libraries/Museums en_US
dc.language.iso en_US en_US
dc.publisher Code4Lib en_US
dc.relation.isversionof http://journal.code4lib.org/articles/12239 en_US
dash.license LAA
dc.title The Devil’s Shoehorn: A case study of EAD to ArchivesSpace migration at a large university en_US
dc.type Journal Article en_US
dc.date.updated 2017-02-14T20:40:51Z
dc.description.version Accepted Manuscript en_US
dc.relation.journal Code4Lib Journal en_US
dash.depositing.author Mayo, William David
dc.date.available 2017
dc.date.available 2017-02-16T15:17:43Z
dash.affiliation.other HUIT en_US

Files in this item

Files Size Format View
davearticle.pdf 301.1Kb PDF View/Open

This item appears in the following Collection(s)

Show simple item record

 
 

Search DASH


Advanced Search
 
 

Submitters