
dc.contributor.author: Grimmer, Justin
dc.contributor.author: King, Gary
dc.date.accessioned: 2014-08-19T20:38:16Z
dc.date.issued: 2011
dc.identifier.citation: Grimmer, Justin, and Gary King. 2011. General Purpose Computer-Assisted Clustering and Conceptualization. Proceedings of the National Academy of Sciences 108, no. 7: 2643–2650. [en_US]
dc.identifier.issn: 0027-8424 [en_US]
dc.identifier.issn: 1091-6490 [en_US]
dc.identifier.uri: http://nrs.harvard.edu/urn-3:HUL.InstRepos:12724038
dc.description.abstract: We develop a computer-assisted method for the discovery of insightful conceptualizations, in the form of clusterings (i.e., partitions) of input objects. Each of the numerous fully automated methods of cluster analysis proposed in statistics, computer science, and biology optimize a different objective function. Almost all are well defined, but how to determine before the fact which one, if any, will partition a given set of objects in an “insightful” or “useful” way for a given user is unknown and difficult, if not logically impossible. We develop a metric space of partitions from all existing cluster analysis methods applied to a given dataset (along with millions of other solutions we add based on combinations of existing clusterings) and enable a user to explore and interact with it and quickly reveal or prompt useful or insightful conceptualizations. In addition, although it is uncommon to do so in unsupervised learning problems, we offer and implement evaluation designs that make our computer-assisted approach vulnerable to being proven suboptimal in specific data types. We demonstrate that our approach facilitates more efficient and insightful discovery of useful information than expert human coders or many existing fully automated methods. [en_US]
dc.description.sponsorship: Government [en_US]
dc.language.iso: en_US [en_US]
dc.publisher: Proceedings of the National Academy of Sciences [en_US]
dc.relation.isversionof: doi:10.1073/pnas.1018067108 [en_US]
dc.relation.hasversion: http://gking.harvard.edu/files/201018067_online_1.pdf [en_US]
dash.license: LAA
dc.title: General Purpose Computer-Assisted Clustering and Conceptualization [en_US]
dc.type: Journal Article [en_US]
dc.description.version: Author's Original [en_US]
dc.relation.journal: Proceedings of the National Academy of Sciences [en_US]
dash.depositing.author: King, Gary
dash.waiver: 2011-01-03
dc.date.available: 2014-08-19T20:38:16Z
dc.identifier.doi: 10.1073/pnas.1018067108 [*]
workflow.legacycomments: From Waiver Table [en_US]
dash.identifier.orcid: 0000-0002-5327-7631 [*]
dash.contributor.affiliated: King, Gary
dc.identifier.orcid: 0000-0002-5327-7631

