SE-MD: a single-encoder multiple-decoder deep network for point cloud reconstruction from 2D images

Abdul Mueed Hafiz, Rouf Ul Alam Bhat, Shabir Ahmad Parah, M. Hassaballah

Research output: Contribution to journal › Article › peer-review

6 Scopus citations

Abstract

Reconstructing a 3D model from a single 2D RGB image is a challenging and actively researched computer vision task. Several techniques based on conventional network architectures have been proposed for it. However, the body of research is limited, and existing approaches suffer from several issues: inefficient 3D representation formats, weak 3D model reconstruction backbones, inability to reconstruct dense point clouds, dependence on post-processing to reconstruct dense point clouds, and dependence on silhouettes in RGB images. In this paper, a new technique for converting a 2D RGB image to a point cloud is proposed. It improves on the state of the art through an efficient, robust and simple model that applies parallelization in the network architecture. It not only uses the efficient and rich 3D representation offered by point clouds, but also uses a new robust point cloud reconstruction backbone to address the prevalent issues. The backbone is a single-encoder multiple-decoder deep network architecture in which each decoder reconstructs certain fixed viewpoints. All the viewpoints are then fused to reconstruct a dense point cloud. Various experiments are conducted to evaluate the proposed technique and to compare its performance with that of state-of-the-art methods, and impressive gains in performance are demonstrated.
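
The single-encoder multiple-decoder idea in the abstract can be illustrated with a toy forward pass. This is only a minimal sketch under stated assumptions, not the authors' network: the layers are random linear maps rather than trained convolutional layers, every size and name (`IMG_SHAPE`, `LATENT_DIM`, `NUM_DECODERS`, `POINTS_PER_DECODER`, `encode`, `decode`, `reconstruct`) is hypothetical, and fusion is assumed to be plain concatenation of per-decoder point sets that already share one coordinate frame. The fusion step used in the paper may differ.

```python
import numpy as np

rng = np.random.default_rng(0)

# All sizes below are hypothetical, chosen only to keep the sketch small.
IMG_SHAPE = (32, 32, 3)     # assumed RGB input resolution
LATENT_DIM = 64             # assumed size of the shared latent code
NUM_DECODERS = 4            # assumed number of parallel decoders
POINTS_PER_DECODER = 256    # assumed number of points each decoder emits


def init_params():
    """Random (untrained) weights: one encoder, several independent decoders."""
    in_dim = int(np.prod(IMG_SHAPE))
    enc_w = rng.normal(0.0, 0.01, (in_dim, LATENT_DIM))
    dec_ws = [
        rng.normal(0.0, 0.1, (LATENT_DIM, POINTS_PER_DECODER * 3))
        for _ in range(NUM_DECODERS)
    ]
    return enc_w, dec_ws


def encode(image, enc_w):
    """Map the flattened image to a single latent vector."""
    return np.tanh(image.reshape(-1) @ enc_w)


def decode(latent, dec_w):
    """Map the shared latent vector to one partial cloud of (x, y, z) points."""
    return np.tanh(latent @ dec_w).reshape(POINTS_PER_DECODER, 3)


def reconstruct(image, enc_w, dec_ws):
    """Encode once, decode in parallel branches, then fuse by concatenation."""
    latent = encode(image, enc_w)  # computed once and shared by every decoder
    partial_clouds = [decode(latent, w) for w in dec_ws]
    return np.concatenate(partial_clouds, axis=0)


enc_w, dec_ws = init_params()
image = rng.random(IMG_SHAPE)
cloud = reconstruct(image, enc_w, dec_ws)
print(cloud.shape)  # (1024, 3)
```

In this sketch the density of the fused cloud grows with the number of decoders while the image is encoded only once, which is the structural point of sharing a single encoder across parallel decoders.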

Original language: English
Pages (from-to): 1291-1302
Number of pages: 12
Journal: Pattern Analysis and Applications
Volume: 26
Issue number: 3
State: Published - Aug 2023

Keywords

  • 2D images
  • 3D convolutional networks
  • 3D model reconstruction
  • 3D shape reconstruction
  • Point clouds
  • ShapeNet
