Rethinking the One-shot Object Detection : Cross-Domain Object Search
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Title of host publication | MM '24 |
Subtitle of host publication | Proceedings of the 32nd ACM International Conference on Multimedia |
Place of Publication | New York, NY, United States |
Publisher | Association for Computing Machinery |
Pages | 9573-9581 |
Number of pages | 9 |
ISBN (print) | 979-8-4007-0686-8 |
Publication status | Published - 28 Oct 2024 |
Conference
Title | 32nd ACM International Conference on Multimedia (MM 2024) |
---|---|
Place | Australia |
City | Melbourne |
Period | 28 October - 1 November 2024 |
Link(s)
DOI | DOI |
---|---|
Document Link | |
Link to Scopus | https://www.scopus.com/record/display.uri?eid=2-s2.0-85209809741&origin=recordpage |
Permanent Link | https://scholars.cityu.edu.hk/en/publications/publication(f3f95662-1415-4ec5-b31e-a4203e730d4e).html |
Abstract
One-shot object detection (OSOD) uses a query patch to identify the same category of object in a target image. As the OSOD setting, the target images are required to contain the object category of the query patch, and the image styles (domains) of the query patch and target images are always similar. However, in practical application, the above requirements are not commonly satisfied. Therefore, we propose a new problem namely Cross-Domain Object Search (CDOS), where the object categories of the query patch and target image are decoupled, and the image styles between them may also be significantly different. For this problem, we develop a new method, which incorporates both foreground-background contrastive learning heads and a domain-generalized feature augmentation technique. This makes our method effectively handle the object category gap and domain distribution gap, between the query patch and target image in the training and testing datasets. We further build a new benchmark for the proposed CDOS problem, on which our method shows significant performance improvements over the comparison methods.
Bibliographic Note
Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s)
Citation Format(s)
Rethinking the One-shot Object Detection: Cross-Domain Object Search. / Zhang, Yupeng; Zheng, Shuqi; Han, Ruize et al.
MM '24: Proceedings of the 32nd ACM International Conference on Multimedia. New York, NY, United States: Association for Computing Machinery, 2024. p. 9573-9581.
MM '24: Proceedings of the 32nd ACM International Conference on Multimedia. New York, NY, United States: Association for Computing Machinery, 2024. p. 9573-9581.
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review