Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Workshop on Spurious Correlation and Shortcut Learning: Foundations and Solutions

Cross-Instance Contrastive Masking in Vision Transformers for Self-Supervised Hyperspectral Image Classification

Abhiroop Chatterjee · Susmita Ghosh · Ashish Ghosh

Keywords: [ Self-supervision ] [ Hyperspectral Image ] [ Vision Transformer ] [ Cross-Instance Contrastive Masking ]


Abstract:

This article presents a novel Cross-Instance Contrastive Masking-Enhanced Vision Transformer (CICM-ViT) for hyperspectral image (HSI) classification, which attempts to reduce shortcut learning through Cross-Instance Contrastive Masking (CICM) to enhance spectral-spatial feature extraction through self-supervision. Using the dependencies between instances, CICM-ViT dynamically masks spectral patches across instances, promoting the learning of discriminative features while reducing redundancy, especially in low-data settings. This approach reduces shortcut learning by focusing on global patterns rather than relying on local spurious correlations. CICM-ViT achieves state-of-the-art performance on HSI datasets, with 99.91% OA on Salinas, 96.88% OA on Indian Pines, and 98.88% OA on Botswana, outperforming fourteen SOTA CNN- and transformer-based approaches in both accuracy and efficiency, with only 89,680 parameters.

Chat is not available.