

Oral in Workshop: 3rd ICLR Workshop on Machine Learning for Remote Sensing

Do Satellite Tasks Need Special Pretraining?

Ani Vanyan · Alvard Barseghyan · Hakob Tamazyan · Tigran Galstyan · Vahan Huroyan · Hrant Khachatrian


Abstract:

Foundation models have advanced machine learning across various modalities, including images. Recently, multiple teams have trained foundation models specialized for remote sensing applications. This line of research is motivated by the distinct characteristics of remote sensing imagery, its specific applications, and the types of robustness useful for satellite image analysis. In this work, we systematically challenge the idea that remote-sensing-specific foundation models are more useful than general-purpose vision foundation models, at least at small scale. First, we design a simple benchmark that measures how well remote sensing models generalize to lower-resolution images on two downstream tasks. Second, we train iBOT, a self-supervised vision encoder, on MillionAID, an ImageNet-scale satellite imagery dataset, with several modifications specific to remote sensing. We show that none of these pretrained models brings consistent improvements over general-purpose baselines at the ViT-B scale.
