3DRecNet: A 3D Reconstruction Network with Dual Attention and Human-Inspired Memory

Shoukat, M. A.; Sargano, A. B.; You, Lihua; Habib, Z.

3DRecNet: A 3D Reconstruction Network with Dual Attention and Human-Inspired Memory.

Tools

Shoukat, M. A., Sargano, A. B., You, L. and Habib, Z., 2024. 3DRecNet: A 3D Reconstruction Network with Dual Attention and Human-Inspired Memory. Electronics, 13 (17), 3391.

Full text available as:

Preview

PDF (OPEN ACCESS ARTICLE)
electronics-13-03391-v2.pdf - Published Version
Available under License Creative Commons Attribution.
78MB

Copyright to original material in this document is with the original owner(s). Access to this content through BURO is granted on condition that you use it only for research, scholarly or other non-commercial purposes. If you wish to use it for any other purposes, you must contact BU via BURO@bournemouth.ac.uk.

Any third party copyright material in this document remains the property of its respective owner(s). BU grants no licence for further use of that third party material.

DOI: 10.3390/electronics13173391

Abstract

Humans inherently perceive 3D scenes using prior knowledge and visual perception, but 3D reconstruction in computer graphics is challenging due to complex object geometries, noisy backgrounds, and occlusions, leading to high time and space complexity. To addresses these challenges, this study introduces 3DRecNet, a compact 3D reconstruction architecture optimized for both efficiency and accuracy through five key modules. The first module, the Human-Inspired Memory Network (HIMNet), is designed for initial point cloud estimation, assisting in identifying and localizing objects in occluded and complex regions while preserving critical spatial information. Next, separate image and 3D encoders perform feature extraction from input images and initial point clouds. These features are combined using a dual attention-based feature fusion module, which emphasizes features from the image branch over those from the 3D encoding branch. This approach ensures independence from proposals at inference time and filters out irrelevant information, leading to more accurate and detailed reconstructions. Finally, a Decoder Branch transforms the fused features into a 3D representation. The integration of attention-based fusion with the memory network in 3DRecNet significantly enhances the overall reconstruction process. Experimental results on the benchmark datasets, such as ShapeNet, ObjectNet3D, and Pix3D, demonstrate that 3DRecNet outperforms existing methods.

Item Type:	Article
ISSN:	2079-9292
Uncontrolled Keywords:	3DRecNet; human-inspired memory network; 3D estimation; dual attention mechanism; fusion-based 3D reconstruction; point clouds
Group:	Faculty of Media & Communication
ID Code:	40362
Deposited By:	Symplectic RT2
Deposited On:	24 Sep 2024 05:26
Last Modified:	24 Sep 2024 05:26

Downloads

Downloads per month over past year

More statistics for this item...

Repository Staff Only -