Person: Boyd, Ceilyn
Email Address
AA Acceptance Date
Birth Date
Research Projects
Organizational Units
Job Title
Last Name
First Name
Name
Search Results
Publication Data as assemblage
(Emerald, 2022-03-03) Boyd, CeilynPurpose A definition of data called data as assemblage is presented. The definition accommodates different forms and meanings of data; emphasizes data subjects and data workers; and reflects the sociotechnical aspects of data throughout its lifecycle of creation and use. A scalable assemblage model describing the anatomy and behavior of data, datasets and data infrastructures is also introduced.
Design/methodology/approach Data as assemblage is compared to common meanings of data. The assemblage model's elements and relationships also are defined, mapped to the anatomy of a US Census dataset and used to describe the structure of research data repositories.
Findings Replacing common data definitions with data as assemblage enriches information science and research data management (RDM) frameworks. Also, the assemblage model is shown to describe datasets and data infrastructures despite their differences in scale, composition and outward appearance.
Originality/value Data as assemblage contributes a definition of data as mutable, portable, sociotechnical arrangements of material and symbolic components that serve as evidence. The definition is useful in information science and research data management contexts. The assemblage model contributes a scale-independent way to describe the structure and behavior of data, datasets and data infrastructures and supports analyses and comparisons involving them.
Publication Repository Approaches to Improving the Quality of Shared Data and Code
(MDPI AG, 2021-02-03) Trisovic, Ana; Mika, Katherine; Boyd, Ceilyn; Feger, Sebastian; Crosas, MerceSharing data and code for reuse has become increasingly important in scientific work over the past decade. However, in practice, shared data and code may be unusable, or published results obtained from them may be irreproducible. Data repository features and services contribute significantly to the quality, longevity, and reusability of datasets. This paper presents a combination of original and secondary data analysis studies focusing on computational reproducibility, data curation, and gamified design elements that can be employed to indicate and improve the quality of shared data and code. The findings of these studies are sorted into three approaches that can be valuable to data repositories, archives, and other research dissemination platforms.