TY - GEN
T1 - Decoder-side Secondary Transform Derivation for Video Coding beyond AVS3
AU - Zhang, Yuhuai
AU - Ren, Huiwen
AU - Liu, Xin
AU - Zhao, Lu
AU - Wang, Shiqi
AU - Ma, Siwei
PY - 2024
Y1 - 2024
N2 - Secondary transform was adopted into the third generation Audio Video coding Standard (AVS3) to improve intra-coded residual coding by applying a 4×4 secondary transform kernel. However, the adaptability of a single 4×4 transform kernel is limited across diverse residual data. To achieve higher residual coding gains, we propose a Decoder-side Secondary Transform Derivation (DSTD) method. Specifically, DSTD expands the maximum range of the secondary transform from 4×4 to 8×8, introducing an 8×8 transform kernel to further enhance the capability of compacting residuals. In particular, three flipped secondary transform types (horizontal, vertical, and diagonal flipping) are employed to extend the transform candidates. Boundary continuity is used to derive the transform type. Experimental results show that the proposed method achieves 0.51% and 0.18% BD-rate savings on average under All Intra (AI) and Random Access (RA) configurations, respectively. DSTD has been adopted into the Exploration Video Model (EVM) for AVS4. © 2024 IEEE.
AB - Secondary transform was adopted into the third generation Audio Video coding Standard (AVS3) to improve intra-coded residual coding by applying a 4×4 secondary transform kernel. However, the adaptability of a single 4×4 transform kernel is limited across diverse residual data. To achieve higher residual coding gains, we propose a Decoder-side Secondary Transform Derivation (DSTD) method. Specifically, DSTD expands the maximum range of the secondary transform from 4×4 to 8×8, introducing an 8×8 transform kernel to further enhance the capability of compacting residuals. In particular, three flipped secondary transform types (horizontal, vertical, and diagonal flipping) are employed to extend the transform candidates. Boundary continuity is used to derive the transform type. Experimental results show that the proposed method achieves 0.51% and 0.18% BD-rate savings on average under All Intra (AI) and Random Access (RA) configurations, respectively. DSTD has been adopted into the Exploration Video Model (EVM) for AVS4. © 2024 IEEE.
KW - AVS
KW - decoder-side derivation
KW - secondary transform
KW - transform coding
UR - http://www.scopus.com/inward/record.url?scp=85194832066&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85194832066&origin=recordpage
U2 - 10.1109/DCC58796.2024.00049
DO - 10.1109/DCC58796.2024.00049
M3 - RGC 32 - Refereed conference paper (with host publication)
T3 - Data Compression Conference Proceedings
SP - 412
EP - 421
BT - Proceedings - DCC 2024
PB - IEEE
T2 - 2024 Data Compression Conference (DCC 2024)
Y2 - 19 March 2024 through 22 March 2024
ER -