A Projected Stochastic Gradient Algorithm for Estimating Shapley Value Applied in Attribute Importance

Grah Simon; Thouvenot Vincent

doi:10.1007/978-3-030-57321-8_6

Conference Papers Year : 2020

A Projected Stochastic Gradient Algorithm for Estimating Shapley Value Applied in Attribute Importance

(1) , (1, 2)

1
2

Grah Simon

Function : Author
PersonId : 1115806

Thales SIX GTS France

Thouvenot Vincent

Function : Author
PersonId : 1115807

Thales SIX GTS France

SINCLAIR AI Lab - Saclay Industrial Lab for Artificial Intelligence Research

Abstract

Machine Learning is enjoying an increasing success in many applications: medical, marketing, defence, cyber security, transportation. It is becoming a key tool in critical systems. However, models are often very complex and highly non-linear. This is problematic, especially for critical systems, because end-users need to fully understand the decisions of an algorithm (e.g. why an alert has been triggered or why a person has a high probability of cancer recurrence). One solution is to offer an interpretation for each individual prediction based on attribute relevance. Shapley Values allow to distribute fairly contributions for each attribute in order to understand the difference between a predicted value for an observation and a base value (e.g. the average prediction of a reference population). They come from cooperative game theory. While these values have many advantages, including their theoretical guarantees, they are however really hard to calculate. Indeed, the complexity increases exponentially with the dimension (the number of variables). In this article, we propose two novel methods to approximate these Shapley Values. The first one is an optimization of an already existing Monte Carlo scheme. It reduces the number of prediction function calls. The second method is based on a projected gradient stochastic algorithm. We prove for the second approach some probability bounds and convergence rates for the approximation errors according to the learning rate type used. Finally, we carry out experiments on simulated datasets for a classification and a regression task. We empirically show that these approaches outperform the classical Monte Carlo estimator in terms of convergence rate and number of prediction function calls, which is the major bottleneck in Shapley Value estimation for our application.

Keywords

Domains

Fichier principal

497121_1_En_6_Chapter.pdf (806.57 Ko)

Origin	Files produced by the author(s)
licence	CC BY 4.0 - Attribution

Connect in order to contact the contributor

https://inria.hal.science/hal-03414720

Submitted on : Thursday, November 4, 2021-3:56:44 PM

Last modification on : Thursday, December 18, 2025-4:20:03 PM

Long-term archiving on : Saturday, February 5, 2022-7:06:34 PM

Dates and versions

hal-03414720 , version 1 (04-11-2021)

Licence

CC BY 4.0 - Attribution

Identifiers

HAL Id : hal-03414720 , version 1
DOI : 10.1007/978-3-030-57321-8_6

Cite

Grah Simon, Thouvenot Vincent. A Projected Stochastic Gradient Algorithm for Estimating Shapley Value Applied in Attribute Importance. 4th International Cross-Domain Conference for Machine Learning and Knowledge Extraction (CD-MAKE), Aug 2020, Dublin, Ireland. pp.97-115, ⟨10.1007/978-3-030-57321-8_6⟩. ⟨hal-03414720⟩

A Projected Stochastic Gradient Algorithm for Estimating Shapley Value Applied in Attribute Importance

Abstract

Keywords

Domains

Dates and versions

Licence

Identifiers

Cite

Export

Collections

Altmetric

Share