SE-MD: a single-encoder multiple-decoder deep network for point cloud reconstruction from 2D images

Abdul Mueed Hafiz; Rouf Ul Alam Bhat; Shabir Ahmad Parah; M. Hassaballah

doi:10.1007/s10044-023-01155-x

Computer Sciences

University of Kashmir

Research output: Contribution to journal › Article › peer-review

6 Scopus citations

Abstract

3D model reconstruction from a single 2D RGB image is a challenging and actively researched computer vision task. Several techniques based on conventional network architectures have been proposed for it. However, the body of research is limited, and existing methods suffer from inefficient 3D representation formats, weak reconstruction backbones, an inability to reconstruct dense point clouds, dependence on post-processing to obtain dense point clouds, and dependence on silhouettes in the RGB images. This paper proposes a new technique for converting a 2D RGB image to a point cloud, which improves on the state of the art through an efficient, robust, and simple model that applies parallelization in the network architecture. It uses point clouds as an efficient and rich 3D representation, together with a new, robust point cloud reconstruction backbone that addresses the prevalent issues. The backbone is a single-encoder multiple-decoder deep network in which each decoder reconstructs certain fixed viewpoints. All viewpoints are then fused to reconstruct a dense point cloud. Experiments evaluate the proposed technique against state-of-the-art methods and demonstrate gains in performance.
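The data flow described in the abstract (one shared encoder, several parallel decoders, then fusion) can be illustrated with a toy sketch. This is not the authors' network: the linear layers, random untrained weights, layer sizes, the class name `SingleEncoderMultiDecoder`, and fusion by simple concatenation are all assumptions made for illustration, since the abstract does not specify them.

```python
import math
import random


def linear(vec, weights):
    """Multiply a vector by a weight matrix stored as a list of columns."""
    return [sum(v * w for v, w in zip(vec, col)) for col in weights]


def random_matrix(rng, n_in, n_out, scale):
    """Return n_out columns of n_in Gaussian weights (untrained placeholders)."""
    return [[rng.gauss(0.0, scale) for _ in range(n_in)] for _ in range(n_out)]


class SingleEncoderMultiDecoder:
    """Toy stand-in: one shared encoder feeding several independent decoders."""

    def __init__(self, in_dim=192, latent_dim=16, num_decoders=4,
                 points_per_decoder=32, seed=0):
        rng = random.Random(seed)
        # Assumption: a single linear layer plays the role of the encoder.
        self.encoder = random_matrix(rng, in_dim, latent_dim, 0.05)
        # Assumption: each decoder is a linear layer emitting its own xyz points.
        self.decoders = [
            random_matrix(rng, latent_dim, points_per_decoder * 3, 0.5)
            for _ in range(num_decoders)
        ]

    def encode(self, pixels):
        """Map a flattened image to one latent code shared by every decoder."""
        return [math.tanh(h) for h in linear(pixels, self.encoder)]

    def decode_all(self, latent):
        """Run every decoder on the same latent code; one partial cloud each."""
        partial_clouds = []
        for weights in self.decoders:
            flat = linear(latent, weights)
            points = [tuple(flat[i:i + 3]) for i in range(0, len(flat), 3)]
            partial_clouds.append(points)
        return partial_clouds

    def reconstruct(self, pixels):
        """Fuse the partial clouds (assumption: plain concatenation)."""
        fused = []
        for points in self.decode_all(self.encode(pixels)):
            fused.extend(points)
        return fused


# Usage: a random 8x8 RGB "image", flattened to 192 values.
rng = random.Random(1)
image = [rng.random() for _ in range(8 * 8 * 3)]
model = SingleEncoderMultiDecoder()
cloud = model.reconstruct(image)
print(len(cloud), len(cloud[0]))
```

With the default sizes, four decoders each contribute 32 points, so the fused cloud holds 128 three-coordinate points. The point of the sketch is only that the decoders share one encoding and are independent of each other, which is what allows them to run in parallel.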

Original language	English
Pages (from-to)	1291-1302
Number of pages	12
Journal	Pattern Analysis and Applications
Volume	26
Issue number	3
DOIs	https://doi.org/10.1007/s10044-023-01155-x
State	Published - Aug 2023

Keywords

2D images
3D convolutional networks
3D model reconstruction
3D shape reconstruction
Point clouds
ShapeNet

Access to Document

10.1007/s10044-023-01155-x

Cite this

@article{757e5953228849569be47d18e88cc8d2,

title = "SE-MD: a single-encoder multiple-decoder deep network for point cloud reconstruction from 2D images",

abstract = "3D model reconstruction from single 2D RGB images is a challenging and actively researched computer vision task. Several techniques based on conventional network architectures have been proposed for the same. However, the body of research work is limited and there are some issues like using inefficient 3D representation formats, weak 3D model reconstruction backbones, inability to reconstruct dense point clouds, dependence of post-processing for reconstruction of dense point clouds and dependence on silhouettes in RGB images. In this paper, a new 2D RGB image to point cloud conversion technique is proposed, which improves the state-of-the-art in the field due to its efficient, robust and simple model by using the concept of parallelization in network architecture. It not only uses efficient and rich 3D representation of point clouds, but also uses a new robust point cloud reconstruction backbone to address the prevalent issues. This involves using a single-encoder multiple-decoder deep network architecture wherein each decoder reconstructs certain fixed viewpoints. This is followed by fusing all the viewpoints to reconstruct a dense point cloud. Various experiments are conducted to evaluate the proposed technique and to compare its performance with those of the state-of-the-arts and impressive gains in performance are demonstrated.",

keywords = "2D images, 3D convolutional networks, 3D model reconstruction, 3D shape reconstruction, Point clouds, ShapeNet",

author = "Hafiz, {Abdul Mueed} and Bhat, {Rouf Ul Alam} and Parah, {Shabir Ahmad} and Hassaballah, M.",

note = "Publisher Copyright: {\textcopyright} 2023, The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature.",

year = "2023",

month = aug,

doi = "10.1007/s10044-023-01155-x",

language = "English",

volume = "26",

pages = "1291--1302",

journal = "Pattern Analysis and Applications",

issn = "1433-7541",

publisher = "Springer London",

number = "3",

}

TY - JOUR

T1 - SE-MD

T2 - a single-encoder multiple-decoder deep network for point cloud reconstruction from 2D images

AU - Hafiz, Abdul Mueed

AU - Bhat, Rouf Ul Alam

AU - Parah, Shabir Ahmad

AU - Hassaballah, M.

PY - 2023/8

Y1 - 2023/8

AB - 3D model reconstruction from single 2D RGB images is a challenging and actively researched computer vision task. Several techniques based on conventional network architectures have been proposed for the same. However, the body of research work is limited and there are some issues like using inefficient 3D representation formats, weak 3D model reconstruction backbones, inability to reconstruct dense point clouds, dependence of post-processing for reconstruction of dense point clouds and dependence on silhouettes in RGB images. In this paper, a new 2D RGB image to point cloud conversion technique is proposed, which improves the state-of-the-art in the field due to its efficient, robust and simple model by using the concept of parallelization in network architecture. It not only uses efficient and rich 3D representation of point clouds, but also uses a new robust point cloud reconstruction backbone to address the prevalent issues. This involves using a single-encoder multiple-decoder deep network architecture wherein each decoder reconstructs certain fixed viewpoints. This is followed by fusing all the viewpoints to reconstruct a dense point cloud. Various experiments are conducted to evaluate the proposed technique and to compare its performance with those of the state-of-the-arts and impressive gains in performance are demonstrated.

KW - 2D images

KW - 3D convolutional networks

KW - 3D model reconstruction

KW - 3D shape reconstruction

KW - Point clouds

KW - ShapeNet

UR - https://www.scopus.com/pages/publications/85151502163

U2 - 10.1007/s10044-023-01155-x

DO - 10.1007/s10044-023-01155-x

M3 - Article

AN - SCOPUS:85151502163

SN - 1433-7541

VL - 26

SP - 1291

EP - 1302

JO - Pattern Analysis and Applications

JF - Pattern Analysis and Applications

IS - 3

ER -
