Fast Depth Video Coding Method Using Adaptive Edge Classification


Da-Hyun Yoon · Yo-Sung Ho

Received: 15 February 2011 / Revised: 18 September 2012 / Published online: 9 November 2012 © Springer Science+Business Media New York 2012

Abstract In this paper, we propose a fast mode decision algorithm for both intra and inter prediction in depth videos. The proposed algorithm reduces the computational complexity of depth video coding. According to the depth variation, a depth video can be classified into depth-continuity and depth-discontinuity regions. From experiments, we determine a threshold value for classifying these regions. Since the mode distribution in the depth-continuity region is imbalanced, we limit the mode candidates to reduce the complexity of the mode decision process. Experimental results show that our proposed algorithm reduces the encoding time by up to 78 % and 97 % for the intra and inter frames, respectively, compared to JMVC 8.3, with negligible PSNR loss and bitrate increase.

Keywords Depth video coding · Macroblock mode decision · Depth compression
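The region classification summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the variation measure (maximum minus minimum depth value in a block), the threshold value, and every mode name other than SKIP are assumptions made for illustration, since this excerpt gives neither the measure nor the experimentally determined threshold.

```python
from typing import List, Sequence

# Hypothetical threshold. The paper determines its value experimentally,
# and that value does not appear in this excerpt.
THRESHOLD = 10


def depth_variation(block: Sequence[Sequence[int]]) -> int:
    """Assumed variation measure: max minus min depth value in the block."""
    values = [v for row in block for v in row]
    return max(values) - min(values)


def classify_block(block: Sequence[Sequence[int]], threshold: int = THRESHOLD) -> str:
    """Label a block 'continuity' or 'discontinuity' by its depth variation."""
    if depth_variation(block) > threshold:
        return "discontinuity"
    return "continuity"


def candidate_modes(
    block: Sequence[Sequence[int]],
    all_modes: Sequence[str],
    reduced_modes: Sequence[str],
    threshold: int = THRESHOLD,
) -> List[str]:
    """Return the reduced candidate list for depth-continuity blocks only."""
    if classify_block(block, threshold) == "continuity":
        return list(reduced_modes)
    return list(all_modes)


# A flat 16x16 block and a block containing a sharp vertical depth edge.
flat_block = [[100] * 16 for _ in range(16)]
edge_block = [[50] * 8 + [200] * 8 for _ in range(16)]

# "MODE_A" and "MODE_B" are placeholder names, not JMVC identifiers.
ALL_MODES = ["SKIP", "MODE_A", "MODE_B"]
REDUCED_MODES = ["SKIP"]
```

With these inputs, the flat block receives the reduced candidate list and the edge block keeps the full list, which is where the encoding-time saving would come from under these assumptions.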

D.-H. Yoon · Y.-S. Ho
Gwangju Institute of Science and Technology (GIST), 261 Cheomdan-gwagiro, Buk-gu, Gwangju 500-712, Korea
e-mail: [email protected]
Y.-S. Ho
e-mail: [email protected]
url: http://vclab.gist.ac.kr

Circuits Syst Signal Process (2013) 32:803–813

1 Introduction

Due to advances in three-dimensional (3D) display technologies, the development of 3DTV has realized the human dream of viewing scenes as if in the real world. Via 3DTV, the interactive selection of viewpoint and direction becomes possible within a certain operational range. This is referred to as free viewpoint TV (FTV). FTV has been widely utilized because it transmits and records all spatiotemporal information from the real world [1]. Multiview plus depth (MVD) is another framework used to represent 3D scenes. MVD has been used to synthesize intermediate views from captured images and depth maps. At a recent Moving Picture Experts Group (MPEG) meeting, MVD received increased attention and was discussed as a next-generation FTV format [10]. Since the amount of data and the complexity of MVD are proportional to the number of cameras, an efficient encoding method for 3D video scenes must be developed.

An efficient multiview video coding (MVC) scheme is needed to transmit and store multiview content. The MPEG 3D Audio/Video (3DAV) group is working on the standardization of MVC. The joint MVC (JMVC) was released by the MPEG 3DAV group as the reference software. JMVC uses a rate-distortion optimization (RDO) technique [8] to select the optimal coding mode, where the rate-distortion (RD) cost is calculated by

J = D + λR    (1)

where J is the RD cost of the current mode, D denotes the distortion between the original and reconstructed macroblock (MB), R stands for the total bits of the MB header, motion vectors, and DCT coefficients, and λ is the Lagrange multiplier. JMVC 8.3 [3] is then used to select the best mode among 14 different macroblock modes: SKIP, Dir
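As a hedged illustration of Eq. (1), the sketch below evaluates J = D + λR for a handful of candidate modes and keeps the one with the lowest cost. It does not use the JMVC API: the function names, the placeholder mode names "MODE_A" and "MODE_B", and the distortion and rate figures are all invented for the example.

```python
from typing import Dict, Tuple


def rd_cost(distortion: float, rate_bits: float, lam: float) -> float:
    """Eq. (1): J = D + lambda * R."""
    return distortion + lam * rate_bits


def select_best_mode(
    measurements: Dict[str, Tuple[float, float]], lam: float
) -> Tuple[str, float]:
    """Return (mode, cost) for the candidate with the smallest RD cost.

    `measurements` maps a mode name to its (distortion, rate in bits) pair.
    """
    best_mode = None
    best_cost = float("inf")
    for mode, (distortion, rate_bits) in measurements.items():
        cost = rd_cost(distortion, rate_bits, lam)
        if cost < best_cost:
            best_mode, best_cost = mode, cost
    if best_mode is None:
        raise ValueError("no candidate modes given")
    return best_mode, best_cost


# Invented (distortion, rate) pairs, for illustration only.
example = {
    "SKIP": (120.0, 2.0),
    "MODE_A": (40.0, 30.0),
    "MODE_B": (60.0, 12.0),
}

best_mode, best_cost = select_best_mode(example, lam=4.0)
```

Because the loop computes D and R for every candidate, its cost grows with the number of modes tested; restricting the candidate list is what shortens this search.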
