Learning a Deep Color Difference Metric for Photographic Images
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Title of host publication | Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition CVPR 2023 |
Publisher | Institute of Electrical and Electronics Engineers, Inc. |
Pages | 22242-22251 |
ISBN (electronic) | 979-8-3503-0129-8 |
ISBN (print) | 979-8-3503-0130-4 |
Publication status | Published - 2023 |
Publication series
Name | Proceedings - IEEE Computer Society Conference on Computer Vision and Pattern Recognition |
---|---|
ISSN (Print) | 1063-6919 |
ISSN (electronic) | 2575-7075 |
Conference
Title | 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023) |
---|---|
Location | Vancouver Convention Center |
Place | Canada |
City | Vancouver |
Period | 18 - 22 June 2023 |
Link(s)
DOI | DOI |
---|---|
Document Link | |
Permanent Link | https://scholars.cityu.edu.hk/en/publications/publication(14e2bf0c-a223-431b-ae9a-2ab0899701e9).html |
Abstract
Most well-established and widely used color difference (CD) metrics are handcrafted and subject-calibrated against uniformly colored patches, which do not generalize well to photographic images characterized by natural scene complexities. Constructing CD formulae for photographic images is still an active research topic in imaging/illumination, vision science, and color science communities. In this paper, we aim to learn a deep CD metric for photographic images with four desirable properties. First, it well aligns with the observations in vision science that color and form are linked inextricably in visual cortical processing. Second, it is a proper metric in the mathematical sense. Third, it computes accurate CDs between photographic images, differing mainly in color appearances. Fourth, it is robust to mild geometric distortions (e.g., translation or due to parallax), which are often present in photographic images of the same scene captured by different digital cameras. We show that all these properties can be satisfied at once by learning a multi-scale autoregressive normalizing flow for feature transform, followed by the Euclidean distance which is linearly proportional to the human perceptual CD. Quantitative and qualitative experiments on the large-scale SPCD dataset demonstrate the promise of the learned CD metric. Source code is available at https://github.com/haoychen3/CD-Flow. © 2023 IEEE.
Bibliographic Note
Research Unit(s) information for this publication is provided by the author(s) concerned.
Citation Format(s)
Learning a Deep Color Difference Metric for Photographic Images. / Chen, Haoyu; Wang, Zhihua; Yang, Yang et al.
Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition CVPR 2023. Institute of Electrical and Electronics Engineers, Inc., 2023. p. 22242-22251 (Proceedings - IEEE Computer Society Conference on Computer Vision and Pattern Recognition).
Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition CVPR 2023. Institute of Electrical and Electronics Engineers, Inc., 2023. p. 22242-22251 (Proceedings - IEEE Computer Society Conference on Computer Vision and Pattern Recognition).
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review