Oral
in
Workshop: Workshop on Spurious Correlation and Shortcut Learning: Foundations and Solutions
Mitigating Spurious Correlations in Image Recognition Models using Performance-Based Feature Sampling
Aarav Monga · Sonia Zhang · Rajakrishnan Somou · Antonio Ortega
Keywords: [ Spurious Correlations ] [ Clustering ] [ Representation Learning ] [ Image Recognition ] [ Resampling ]
Existing methods for detecting and correcting spurious correlations in image recognition models often fail to identify biasing features due to incoherent groupings of biased images. There is also little exploration of targeted removal of spurious correlations in a low-dimensional feature space. To address these gaps, we propose Performance-Based Feature Sampling (PBFS), a systematic method for producing image recognition models that are de-biased w.r.t a given feature space. We introduce a method for producing coherent bias group proposals (i.e., semantically related images potentially sharing biasing feature(s)) and decorrelating biasing features from the target label using adaptive resampling. We demonstrate that our framework is able to correct for known spurious correlations, and through both established and our proposed metrics, we show that our method is able to de-bias image recognition models both w.r.t a high-dimensional feature space capturing complex representations and w.r.t low-dimensional feature spaces representing simple physical properties.