Abstract
We propose a structural similarity (SSIM)-motivated two-pass variable bit rate control algorithm for High Efficiency Video Coding. Given a bit rate budget, the available bits are optimally allocated at group of pictures (GoP), frame, and coding unit (CU) levels by hierarchically constructing a perceptually uniform space with an SSIM-inspired divisive normalization mechanism. The Lagrange multiplier λ, which controls the tradeoff between perceptual distortion and bit rate, is adopted as the GoP level complexity measure. To derive λ, Laplacian distribution-based rate and perceptual distortion models are established after the first pass encoding, and the target bits are dynamically allocated by maintaining a uniform Lagrange multiplier level for each GoP through λ equalization. Within each GoP, rate control is further performed at frame and CU levels based on SSIM-inspired divisive normalization, aiming to transform the prediction residuals into a perceptually uniform space. Experiments show that the proposed scheme achieves high accuracy rate control and superior rate-SSIM performance, which is further verified by subjective visual testing.
| Original language | English |
|---|---|
| Article number | 7490407 |
| Pages (from-to) | 2189-2203 |
| Journal | IEEE Transactions on Circuits and Systems for Video Technology |
| Volume | 27 |
| Issue number | 10 |
| Online published | 13 Jun 2016 |
| DOIs | |
| Publication status | Published - Oct 2017 |
Research Keywords
- Divisive normalization
- High Efficiency Video Coding (HEVC)
- structural similarity (SSIM) index
- two-pass rate control
- variable bit rate (VBR) coding