Publication: Dimensionality Reduction of Dynamic Topic Models Using a Finite State Machine Representation
No Thumbnail Available
Date
2017-07-14
Authors
Published Version
Published Version
Journal Title
Journal ISSN
Volume Title
Publisher
The Harvard community has made this article openly available. Please share how this access benefits you.
Citation
Research Data
Abstract
Dynamic topic models produce semantically interpretable topics that describe data evolving through time. The number of these topics increases with the range of time fitted, diminishing the interpretability of the model output. In this work, we introduce a finite state machine representation characterizing the trajectory of the data with a smaller collection of topics. We develop a hierarchical clustering scheme to generate the states of the finite state machine as well as a transition model to describe data evolution.
The primary advantages of our model are lower-dimensional summaries of data and a more intuitive representation of text evolution. These advantages are important for human interpretation.
We will apply our model to a dataset of electronic health records (EHRs) of patients diagnosed with Autism Spectrum Disorder (ASD). We offer interpretations of model output describing disease trajectories of patients with ASD that corroborate previous findings on comorbidities common to ASD. Additionally, we show retention of predictive performance despite reduced dimension of model parameters.
Description
Other Available Sources
Keywords
Computer Science, Health Sciences, General
Terms of Use
This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service