Title
Video Coding Using Spatially Varying Transform
Abstract
In this paper, a novel algorithm called spatially varying transform (SVT) is proposed to improve the coding efficiency of video coders. SVT enables video coders to vary the position of the transform block, unlike state-of-art video codecs where the position of the transform block is fixed. In addition to changing the position of the transform block, the size of the transform can also be varied within the SVT framework, to better localize the prediction error so that the underlying correlations are better exploited. It is shown in this paper that by varying the position of the transform block and its size, characteristics of prediction error are better localized, and the coding efficiency is thus improved. The proposed algorithm is implemented and studied in the H.264/AVC framework. We show that the proposed algorithm achieves 5.85% bitrate reduction compared to H.264/AVC on average over a wide range of test set. Gains become more significant at medium to high bitrates for most tested sequences and the bitrate reduction may reach 13.50%, which makes the proposed algorithm very suitable for future video coding solutions focusing on high fidelity video applications. The gain in coding efficiency is achieved with a similar decoding complexity which makes the proposed algorithm easy to be incorporated in video codecs. However, the encoding complexity of SVT can be relatively high because of the need to perform a number of rate distortion optimization (RDO) steps to select the best location parameter (LP), which indicates the position of the transform. In this paper, a novel low complexity algorithm is also proposed, operating on a macroblock and a block level, to reduce the encoding complexity of SVT. Experimental results show that the proposed low complexity algorithm can reduce the number of LPs to be tested in RDO by about 80% with only a marginal penalty in the coding efficiency.
Year
DOI
Venue
2009
10.1109/TCSVT.2011.2105595
PSIVT
Keywords
Field
DocType
h.264/avc,bitrate reduction,novel algorithm,spatially varying transform (svt),prediction error characteristic,location parameter,varying transform,rate distortion optimization,high bit-rates,bit-rate reduction,data compression,encoding,high fidelity application,state-of-art video codecs,transform block,best portion,discrete cosine transforms,variable block-size transforms (vbt),spatially varying transform,prediction error,h.264-avc framework,video coding,spatial varying transform,proposed algorithm,low complexity algorithm,encoding complexity,video coder,decoding complexity,coding efficiency improvement,svt framework,transform,avc framework,video codecs,transform coding
Context-adaptive variable-length coding,Computer science,Coding (social sciences),Artificial intelligence,Context-adaptive binary arithmetic coding,Algorithmic efficiency,Pattern recognition,Coding tree unit,Algorithm,Transform coding,Speech recognition,Sub-band coding,Decoding methods
Conference
Volume
Issue
ISSN
21
2
1051-8215
Citations 
PageRank 
References 
13
0.91
14
Authors
4
Name
Order
Citations
PageRank
Cixun Zhang1469.22
Kemal Ugur246252.75
Jani Lainema341539.62
Moncef Gabbouj43282386.30