Fast intra-mode decision for depth map coding in 3D-HEVC

  • PDF / 1,795,422 Bytes
  • 10 Pages / 595.276 x 790.866 pts Page_size
  • 73 Downloads / 214 Views

DOWNLOAD

REPORT


ORIGINAL RESEARCH PAPER

Fast intra‑mode decision for depth map coding in 3D‑HEVC Ruyi Zhang1,2 · Kebin Jia1,2 · Pengyu Liu1,2 · Zhonghua Sun1,2 Received: 12 October 2018 / Accepted: 25 September 2019 © Springer-Verlag GmbH Germany, part of Springer Nature 2019

Abstract 3D-high efficiency video coding (3D-HEVC) contains more encoding viewpoints than traditional HEVC, resulting in a significant increase of coding complexity. In this paper, we propose a low complexity intra mode decision algorithm to reduce the number of intra modes by detecting the flat area and texture direction of the depth map. The corresponding intra prediction modes are skipped when the flat region condition is satisfied. Otherwise, the direction of the edge is detected to decrease the number of angle modes in rough mode decision, which can reduce the intra-coding complexity and coding time cost. Experimental results demonstrate that the proposed algorithm achieves on average 36.48% time saving with negligible degradation of coding performance. Keywords  3D-HEVC · Depth map coding · Edge detection · Rough mode decision (RMD) · Intra mode decision

1 Introduction With the rise of computer multimedia technology, video technologies such as movies and television have been continuously developed and improved. Digital video technology has become a research hotspot in modern academic and industrial circles. Three-dimensional video not only gives people a higher sense of experience on the scene but also has a wider range of applications, such as medicine, education and other fields. To meet the demand of various applications in video compression and transmission, many domestic and foreign scholars and research institutions have conducted a lot of scientific research in the high-efficiency video coding of 3D video. Up to now, the most up-to-date 3D video coding standard formally promulgated is the 3D-high efficiency video coding (3D-HEVC) standard [1] based on the highefficiency video coding (HEVC) [2]. 3D video is represented using the Multi-view Video plus Depth (MVD) [3] format, which associates depth map to texture video. After decoding the video and depth data, * Kebin Jia [email protected] 1



Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China



Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing 100124, China

2

additional intermediate views, which are suitable for displaying the 3D content on an auto-stereoscopic display, can be synthesized using depth-image-based rendering (DIBR) [4] techniques. In this way, fewer coded viewpoints can be used to obtain a larger number of intermediate views, thereby effectively improving video compression and transmission efficiency. Each viewpoint encoded is obtained by photographing the same scene from different angles at the same time, including video pictures and depth maps. Figure 1a shows a texture video extracted from Kendo video sequence. Figure 1b is its depth map in which the pixel value re