Fast Non-Local Adaptive In-Loop Filter Optimization on GPU

Research output: Journal Publications and Reviews (RGC: 21, 22, 62) > 21_Publication in refereed journal > peer-review

Author(s)

  • Chuanmin Jia
  • Falei Luo
  • Xinfeng Zhang
  • Shiqi Wang
  • Shanshe Wang
  • Siwei Ma

Detail(s)

Original language: English
Article number: 9040278
Pages (from-to): 39-51
Journal / Publication: IEEE Transactions on Multimedia
Volume: 23
Online published: 18 Mar 2020
Publication status: Published - 2021

Abstract

The non-local adaptive in-loop filter (NALF) for video coding achieves significant coding gain by exploiting image non-local self-similarity (NSS) to reduce compression artifacts efficiently. However, the intensive computation of NALF hinders its practical deployment in video coding standards. In this paper, we propose a fast NALF optimization algorithm in a parallel-computing framework that leverages the massively parallel execution resources of the GPU. First, the computational complexity of the original NALF is analyzed in depth; then the pipelines of the computationally intensive modules are redesigned to suit general-purpose GPUs, with parallel-friendliness in mind. Specifically, we speed up NALF by optimizing thread allocation to maximize the degree of parallelism and by carefully designing the GPU block dimensions to avoid access conflicts. Group-level and pixel-level parallelization are designed for the collaborative filtering and patch matching modules, respectively. To reduce the cost of data transmission, the whole filtering process is implemented on the GPU, taking advantage of the low data dependency in NALF. Extensive experimental results show that the proposed fast NALF optimization on GPU architecture achieves high-speed processing while maintaining the significant coding performance of the original NALF, which shows the potential of NALF for future video coding standards.
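
The two stages named in the abstract, patch matching and filtering of the matched group, can be illustrated with a small CPU-side sketch. This is not the authors' implementation: the function names, patch size, search radius, and group size are hypothetical, and a plain element-wise average stands in for the actual collaborative filter. The sketch is meant only to show why the data dependency is low: each candidate distance and each group can be computed independently, which is the property a GPU mapping would exploit.

```python
# Illustrative sketch only. All names and parameters are assumptions,
# and the averaging filter is a stand-in for NALF's collaborative filter.

def ssd(img, ay, ax, by, bx, p):
    """Sum of squared differences between two p-by-p patches."""
    return sum(
        (img[ay + i][ax + j] - img[by + i][bx + j]) ** 2
        for i in range(p) for j in range(p)
    )

def match_patches(img, ry, rx, p, radius, k):
    """Return top-left corners of the k patches most similar to the reference.

    Every candidate distance is independent of the others, so the
    distances could be evaluated by separate GPU threads (an assumed mapping).
    """
    h, w = len(img), len(img[0])
    cands = []
    for y in range(max(0, ry - radius), min(h - p, ry + radius) + 1):
        for x in range(max(0, rx - radius), min(w - p, rx + radius) + 1):
            cands.append((ssd(img, ry, rx, y, x, p), y, x))
    cands.sort()  # smallest distance first
    return [(y, x) for _, y, x in cands[:k]]

def filter_group(img, group, p):
    """Stand-in group filter: average the matched patches element-wise.

    Groups do not depend on each other, so each group could be handled
    by its own GPU block (again an assumed mapping).
    """
    n = len(group)
    return [
        [sum(img[y + i][x + j] for y, x in group) / n for j in range(p)]
        for i in range(p)
    ]

# Tiny synthetic 8x8 "frame" with repeating structure.
img = [[(3 * y + 5 * x) % 7 for x in range(8)] for y in range(8)]
group = match_patches(img, ry=2, rx=2, p=2, radius=2, k=4)
patch = filter_group(img, group, p=2)
```

Here `group` holds the four closest patches to the reference at (2, 2), and `patch` is their element-wise mean. In a GPU version, the two loops in `match_patches` and the per-group call to `filter_group` are the natural candidates for parallel execution.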

Research Area(s)

  • adaptive in-loop filter, GPU, NALF, NSS, Video coding

Citation Format(s)

Fast Non-Local Adaptive In-Loop Filter Optimization on GPU. / Jia, Chuanmin; Luo, Falei; Zhang, Xinfeng; Wang, Shiqi; Wang, Shanshe; Ma, Siwei.

In: IEEE Transactions on Multimedia, Vol. 23, 9040278, 2021, p. 39-51.
