

Poster

L-Shapley and C-Shapley: Efficient Model Interpretation for Structured Data

Jianbo Chen · Le Song · Martin Wainwright · Michael Jordan

Great Hall BC #79

Keywords: [ feature selection ] [ model interpretation ]


Abstract:

Instancewise feature scoring is a method for model interpretation that yields, for each test instance, a vector of importance scores associated with features. Methods based on the Shapley score have been proposed as a fair way of computing feature attributions, but incur an exponential complexity in the number of features. This combinatorial explosion arises from the definition of the Shapley value and prevents these methods from scaling to large data sets and complex models. We focus on settings in which the data have a graph structure, and the contribution of features to the target variable is well-approximated by a graph-structured factorization. In such settings, we develop two algorithms with complexity linear in the number of features for instancewise feature importance scoring on black-box models. We establish the relationship of our methods to the Shapley value and a closely related concept known as the Myerson value from cooperative game theory. We demonstrate on both language and image data that our algorithms compare favorably with other methods using both quantitative metrics and human evaluation.
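
To make the complexity contrast concrete, here is a minimal Python sketch, not the authors' implementation. It compares the exact Shapley value, which enumerates every subset of the other features, with a local variant that only enumerates subsets of each feature's neighbors on a chain graph. The function names (`shapley_exact`, `local_shapley_chain`), the neighborhood radius `k`, and the toy value function `chain_value` with its weights are all assumptions made for illustration; the restriction to a neighborhood is one reading of the graph-structured idea described in the abstract.

```python
from itertools import combinations
from math import factorial


def shapley_exact(value, n):
    """Exact Shapley values for a set function over features 0..n-1.

    Enumerates all subsets of the other n-1 features for each feature,
    so the cost grows exponentially with n.
    """
    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = frozenset(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


def local_shapley_chain(value, n, k):
    """Illustrative local approximation on a chain graph (an assumption).

    For feature i, only features within distance k on the chain are
    treated as players, so each feature needs at most 2**(2*k) subsets
    and the total cost is linear in n for fixed k.
    """
    phi = [0.0] * n
    for i in range(n):
        neighbors = [j for j in range(max(0, i - k), min(n, i + k + 1)) if j != i]
        m = len(neighbors) + 1  # players in the restricted game
        for size in range(m):
            weight = factorial(size) * factorial(m - size - 1) / factorial(m)
            for subset in combinations(neighbors, size):
                s = frozenset(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


# Hypothetical toy game: each feature has its own contribution, and
# adjacent features on the chain contribute an extra pairwise term.
UNARY = [1.0, 2.0, 0.0, 1.0, 3.0, 0.5]
PAIR = [0.5, 1.0, 2.0, 0.0, 1.5]  # PAIR[j] links features j and j+1


def chain_value(subset):
    total = sum(UNARY[j] for j in subset)
    total += sum(PAIR[j] for j in subset if j + 1 in subset)
    return total


if __name__ == "__main__":
    print(shapley_exact(chain_value, 6))
    print(local_shapley_chain(chain_value, 6, 1))
```

In this toy game the value function factorizes exactly over neighboring pairs, so the local scores with `k = 1` coincide with the exact Shapley values. For a real black-box model the factorization holds only approximately, and the local scores are an approximation whose quality depends on how well the graph structure captures feature interactions.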
