Rethinking the One-shot Object Detection : Cross-Domain Object Search

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

View graph of relations

Author(s)

  • Yupeng Zhang
  • Shuqi Zheng
  • Ruize Han
  • Yuzhong Feng
  • Wei Feng
  • Liang Wan

Related Research Unit(s)

Detail(s)

Original languageEnglish
Title of host publicationMM '24
Subtitle of host publicationProceedings of the 32nd ACM International Conference on Multimedia
Place of PublicationNew York, NY, United States
PublisherAssociation for Computing Machinery
Pages9573-9581
Number of pages9
ISBN (print)979-8-4007-0686-8
Publication statusPublished - 28 Oct 2024

Conference

Title32nd ACM International Conference on Multimedia (MM 2024)
PlaceAustralia
CityMelbourne
Period28 October - 1 November 2024

Abstract

One-shot object detection (OSOD) uses a query patch to identify the same category of object in a target image. As the OSOD setting, the target images are required to contain the object category of the query patch, and the image styles (domains) of the query patch and target images are always similar. However, in practical application, the above requirements are not commonly satisfied. Therefore, we propose a new problem namely Cross-Domain Object Search (CDOS), where the object categories of the query patch and target image are decoupled, and the image styles between them may also be significantly different. For this problem, we develop a new method, which incorporates both foreground-background contrastive learning heads and a domain-generalized feature augmentation technique. This makes our method effectively handle the object category gap and domain distribution gap, between the query patch and target image in the training and testing datasets. We further build a new benchmark for the proposed CDOS problem, on which our method shows significant performance improvements over the comparison methods.

Bibliographic Note

Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s)

Citation Format(s)

Rethinking the One-shot Object Detection: Cross-Domain Object Search. / Zhang, Yupeng; Zheng, Shuqi; Han, Ruize et al.
MM '24: Proceedings of the 32nd ACM International Conference on Multimedia. New York, NY, United States: Association for Computing Machinery, 2024. p. 9573-9581.

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review