TY - JOUR
T1 - Characterizing Generalized Rate-Distortion Performance of Video Coding
T2 - An Eigen Analysis Approach
AU - Duanmu, Zhengfang
AU - Liu, Wentao
AU - Li, Zhuoran
AU - Ma, Kede
AU - Wang, Zhou
PY - 2020
Y1 - 2020
N2 - Rate-distortion (RD) theory is at the heart of lossy data compression. Here we aim to model the generalized RD (GRD) trade-off between the visual quality of a compressed video and its encoding profiles (e.g., bitrate and spatial resolution). We first define the theoretical functional space \mathcal {W} of the GRD function by analyzing its mathematical properties. We show that \mathcal {W} is a convex set in a Hilbert space, inspiring a computational model of the GRD function, and a method of estimating model parameters from sparse measurements. To demonstrate the feasibility of our idea, we collect a large-scale database of real-world GRD functions, which turn out to live in a low-dimensional subspace of \mathcal {W}. Combining the GRD reconstruction framework and the learned low-dimensional space, we create a low-parameter eigen GRD method to accurately estimate the GRD function of a source video content from only a few queries. Experimental results on the database show that the learned GRD method significantly outperforms state-of-the-art empirical RD estimation methods both in accuracy and efficiency. Last, we demonstrate the promise of the proposed model in video codec comparison.
AB - Rate-distortion (RD) theory is at the heart of lossy data compression. Here we aim to model the generalized RD (GRD) trade-off between the visual quality of a compressed video and its encoding profiles (e.g., bitrate and spatial resolution). We first define the theoretical functional space \mathcal {W} of the GRD function by analyzing its mathematical properties. We show that \mathcal {W} is a convex set in a Hilbert space, inspiring a computational model of the GRD function, and a method of estimating model parameters from sparse measurements. To demonstrate the feasibility of our idea, we collect a large-scale database of real-world GRD functions, which turn out to live in a low-dimensional subspace of \mathcal {W}. Combining the GRD reconstruction framework and the learned low-dimensional space, we create a low-parameter eigen GRD method to accurately estimate the GRD function of a source video content from only a few queries. Experimental results on the database show that the learned GRD method significantly outperforms state-of-the-art empirical RD estimation methods both in accuracy and efficiency. Last, we demonstrate the promise of the proposed model in video codec comparison.
KW - Quadratic programming
KW - Rate-distortion function
KW - Video quality assessment
UR - http://www.scopus.com/inward/record.url?scp=85084818073&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85084818073&origin=recordpage
U2 - 10.1109/TIP.2020.2988437
DO - 10.1109/TIP.2020.2988437
M3 - RGC 21 - Publication in refereed journal
SN - 1057-7149
VL - 29
SP - 6180
EP - 6193
JO - IEEE Transactions on Image Processing
JF - IEEE Transactions on Image Processing
M1 - 9079615
ER -