TY - JOUR
T1 - The training of Karhunen-Loève transform matrix and its application for H.264 intra coding
AU - Gao, Yi
AU - Chen, Jiazhong
AU - Yu, Shengsheng
AU - Zhou, Jingli
AU - Po, Lai-Man
PY - 2009/1
Y1 - 2009/1
N2 - In H.264/AVC, 4 × 4 discrete cosine transform (DCT) is performed on the residual signals after intra prediction for decorrelation. Actually, residual blocks with different prediction modes exhibit different frequency characteristics. Therefore, the fixed transform matrix cannot match the energetic distribution of residual signals very well, which degrades the decorrelation performance. Fortunately, the energetic distributions of residual blocks with the same mode are relatively coincident, which makes it possible to train a universally good Karhunen-Loève transform (KLT) matrix for each mode. In this paper, an optimal frequency matching (OFM) algorithm is proposed to train KLT matrices for residual blocks and nine KLT matrices corresponding to nine prediction modes of 4 × 4 intra blocks are trained. Experimental results show that KLT with trained matrices yields a persistent gain over H.264 using 4 × 4 DCT with an average peak signal-to-noise ratio (PSNR) enhancement of 0.22dB and a maximum enhancement of 0.33dB. © 2008 Springer Science+Business Media, LLC.
AB - In H.264/AVC, 4 × 4 discrete cosine transform (DCT) is performed on the residual signals after intra prediction for decorrelation. Actually, residual blocks with different prediction modes exhibit different frequency characteristics. Therefore, the fixed transform matrix cannot match the energetic distribution of residual signals very well, which degrades the decorrelation performance. Fortunately, the energetic distributions of residual blocks with the same mode are relatively coincident, which makes it possible to train a universally good Karhunen-Loève transform (KLT) matrix for each mode. In this paper, an optimal frequency matching (OFM) algorithm is proposed to train KLT matrices for residual blocks and nine KLT matrices corresponding to nine prediction modes of 4 × 4 intra blocks are trained. Experimental results show that KLT with trained matrices yields a persistent gain over H.264 using 4 × 4 DCT with an average peak signal-to-noise ratio (PSNR) enhancement of 0.22dB and a maximum enhancement of 0.33dB. © 2008 Springer Science+Business Media, LLC.
KW - Discrete cosine transform
KW - H.264/AVC
KW - Intra coding
KW - Karhunen-Loève transform
UR - http://www.scopus.com/inward/record.url?scp=57849104345&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-57849104345&origin=recordpage
U2 - 10.1007/s11042-008-0221-4
DO - 10.1007/s11042-008-0221-4
M3 - RGC 21 - Publication in refereed journal
SN - 1380-7501
VL - 41
SP - 111
EP - 123
JO - Multimedia Tools and Applications
JF - Multimedia Tools and Applications
IS - 1
ER -