Publication:

Interpretable selection and visualization of features and interactions using Bayesian forests

Loading...
Thumbnail Image

Date

2018

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

International Press
The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Victoriya Krakovna, Chenguang Dai, and Jun S Liu. 2018. "Interpretable selection and visualization of features and interactions using Bayesian forests". Statistics and Its Interface 11 (2018): 503-513.

Abstract

In analyses of scientific data, it is often of interest to learn which features and feature interactions are relevant to the prediction task. We present here Selective Bayesian Forest Classifier, which strikes a balance between predictive power and interpretability by simultaneously performing classification, feature selection, feature interaction detection and visualization. It builds parsimonious yet flexible models using tree-structured Bayesian networks, and samples an ensemble of such models using Markov chain Monte Carlo. We build in its feature selection capability by dividing the trees into two groups according to their relevance to the outcome of interest. Our method performed competitively compared to top classification algorithms on both simulated data sets and real data sets in terms of classification accuracy, and often outperformed these methods in terms of feature selections and interaction visualizations.

Description

Other Available Sources

Research Data

Keywords

Terms of Use

This article is made available under the terms and conditions applicable to Open Access Policy Articles (OAP), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories