Abstract
Non-local patch-based methods were until recently the state of the art for image denoising but are now outperformed by CNNs. In video denoising, however, they are still competitive with CNNs, as they can effectively exploit the video temporal redundancy, which is a key factor to attain high denoising performance. The problem is that CNN architectures are not compatible with the search for self-similarities. In this work, we propose a simple, yet efficient way to feed video self-similarities to a CNN. The non-locality is incorporated into the network via a first non-trainable layer which finds for each patch in the input image its most similar patches in a search region. The central values of these patches are then gathered in a feature vector which is assigned to each image pixel. This information is presented to a CNN which is trained to predict the clean image. We apply the proposed method to image and video denoising. In the case of video, the patches are searched for in a 3D spatiotemporal volume. The proposed method achieves state-of-the-art results. © 2020, Springer Science+Business Media, LLC, part of Springer Nature.
| Original language | English |
|---|---|
| Pages (from-to) | 73-88 |
| Journal | Journal of Mathematical Imaging and Vision |
| Volume | 63 |
| Issue number | 1 |
| Online published | 31 Oct 2020 |
| DOIs | |
| Publication status | Published - Jan 2021 |
| Externally published | Yes |
Research Keywords
- CNN
- Denoising
- Non-local
- Patch-based methods
- Video denoising
Fingerprint
Dive into the research topics of 'Video Denoising by Combining Patch Search and CNNs'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver