Skip to main content

PointGame: Geometrically and Adaptively Masked Auto-Encoder on Point Clouds.

Liu, Y., Yan, X., Li, Z., Chen, Z., Wei, Z. and Wei, M., 2023. PointGame: Geometrically and Adaptively Masked Auto-Encoder on Point Clouds. IEEE Transactions on Geoscience and Remote Sensing, 61, 5705312.

Full text available as:

[img] PDF
PointGame_arxiv.pdf - Accepted Version
Restricted to Repository staff only until 9 November 2025.
Available under License Creative Commons Attribution Non-commercial.

10MB

DOI: 10.1109/TGRS.2023.3331748

Abstract

Self-supervised learning is attracting large attention in point cloud understanding. However, exploring discriminative and transferable features still remains challenging due to their nature of irregularity. We propose a geometrically and adaptively masked auto-encoder on point clouds for self-supervised learning, termed <italic>PointGame</italic>. PointGame contains two core components: GATE and EAT. GATE stands for the geometrical and adaptive token embedding module; it not only absorbs the conventional wisdom of geometric descriptors that captures the surface shape effectively, but also exploits adaptive saliency to focus on the salient part of a point cloud. EAT stands for the external attention-based Transformer encoder with linear computational complexity, which increases the efficiency of the whole pipeline. Unlike cutting-edge unsupervised learning models, PointGame leverages geometric descriptors to perceive surface shapes and adaptively mines discriminative features from training data. PointGame showcases clear advantages over its competitors on various downstream tasks under both global and local fine-tuning strategies. The code and pre-trained models will be publicly available.

Item Type:Article
ISSN:0196-2892
Uncontrolled Keywords:Point cloud compression; Feature extraction; Three-dimensional displays; Task analysis; Transformers; Representation learning; Shape; Geometric descriptors; geometrical and adaptive token embedding (GATE); masked autoencoder; representation learning; self-supervised learning
Group:Faculty of Media & Communication
ID Code:39219
Deposited By: Symplectic RT2
Deposited On:04 Dec 2023 10:32
Last Modified:04 Dec 2023 10:32

Downloads

Downloads per month over past year

More statistics for this item...
Repository Staff Only -