Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery

Xavier Bou*, Gabriele Facciolo, Rafael Grompone von Gioi, Jean-Michel Morel, Thibaud Ehret

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

The goal of this paper is to perform object detection in satellite imagery with only a few examples, thus enabling users to specify any object class with minimal annotation. To this end, we explore recent methods and ideas from open-vocabulary detection for the remote sensing domain. We develop a few-shot object detector based on a traditional two-stage architecture, where the classification block is replaced by a prototype-based classifier. A large-scale pre-trained model is used to build class-reference embeddings or prototypes, which are compared to region proposal contents for label prediction. In addition, we propose to fine-tune prototypes on available training images to boost performance and learn differences between similar classes, such as aircraft types. We perform extensive evaluations on two remote sensing datasets containing challenging and rare objects. Moreover, we study the performance of both visual and image-text features, namely DINOv2 and CLIP, including two CLIP models specifically tailored for remote sensing applications. Results indicate that visual features are largely superior to vision-language models, as the latter lack the necessary domain-specific vocabulary. Lastly, the developed detector outperforms fully supervised and few-shot methods evaluated on the SIMD and DIOR datasets, despite minimal training parameters. © 2024 IEEE.
Original languageEnglish
Title of host publicationProceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024
Place of PublicationLos Alamitos, Calif.
PublisherIEEE Computer Society
Pages430-439
ISBN (Electronic)9798350365474
ISBN (Print)9798350365481
DOIs
Publication statusPublished - 2024
Event2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW 2024) - Seattle, United States
Duration: 16 Jun 202422 Jun 2024

Publication series

NameIEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
ISSN (Print)2160-7508
ISSN (Electronic)2160-7516

Conference

Conference2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW 2024)
PlaceUnited States
CitySeattle
Period16/06/2422/06/24

Funding

This work was funded by AID-DGA (l’Agence de l’Innovation de Defense a la Direction ´ Gen´ erale de l’Armement—Minit ´ ere des Armees), and ` was performed using HPC resources from GENCI-IDRIS (grants 2023-AD011011801R3, 2023-AD011012453R2, 2023-AD011012458R2) and from the “Mesocentre” ´ computing center of CentraleSupelec and ENS Paris- ´ Saclay supported by CNRS and Region ´ ˆIle-de-France (http://mesocentre.universite-paris-saclay.fr). Centre Borelli is also with Universite Paris Cit ´ e, SSA and ´ INSERM.

Fingerprint

Dive into the research topics of 'Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery'. Together they form a unique fingerprint.

Cite this