Show simple item record

dc.contributor.authorLacson, Ronilda C.
dc.contributor.authorPitzer, Erik
dc.contributor.authorHinske, Christian
dc.contributor.authorGalante, Pedro
dc.contributor.authorOhno-Machado, Lucila
dc.date.accessioned2011-07-01T13:37:02Z
dc.date.issued2009
dc.identifier.citationLacson, Ronilda, Erik Pitzer, Christian Hinske, Pedro Galante, and Lucila Ohno-Machado. 2009. Evaluation of a large-scale biomedical data annotation initiative. BMC Bioinformatics 10(Suppl 9): S10.en_US
dc.identifier.issn1471-2105en_US
dc.identifier.urihttp://nrs.harvard.edu/urn-3:HUL.InstRepos:4931095
dc.description.abstractBackground: This study describes a large-scale manual re-annotation of data samples in the Gene Expression Omnibus (GEO), using variables and values derived from the National Cancer Institute thesaurus. A framework is described for creating an annotation scheme for various diseases that is flexible, comprehensive, and scalable. The annotation structure is evaluated by measuring coverage and agreement between annotators. Results: There were 12,500 samples annotated with approximately 30 variables, in each of six disease categories – breast cancer, colon cancer, inflammatory bowel disease (IBD), rheumatoid arthritis (RA), systemic lupus erythematosus (SLE), and Type 1 diabetes mellitus (DM). The annotators provided excellent variable coverage, with known values for over 98% of three critical variables: disease state, tissue, and sample type. There was 89% strict inter-annotator agreement and 92% agreement when using semantic and partial similarity measures. Conclusion: We show that it is possible to perform manual re-annotation of a large repository in a reliable manner.en_US
dc.language.isoen_USen_US
dc.publisherBioMed Centralen_US
dc.relation.isversionofdoi:10.1186/1471-2105-10-S9-S10en_US
dc.relation.hasversionhttp://www.ncbi.nlm.nih.gov/pmc/articles/PMC2745681/pdf/en_US
dash.licenseLAA
dc.titleEvaluation of a Large-Scale Biomedical Data Annotation Initiativeen_US
dc.typeJournal Articleen_US
dc.description.versionVersion of Recorden_US
dc.relation.journalBMC Bioinformaticsen_US
dash.depositing.authorLacson, Ronilda C.
dc.date.available2011-07-01T13:37:02Z
dash.affiliation.otherHMS^Radiology-Brigham and Women's Hospitalen_US
dc.identifier.doi10.1186/1471-2105-10-S9-S10*
dash.contributor.affiliatedLacson, Ronilda


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record