HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Masked Autoencoders for Point Cloud Self-supervised Learning

Pang Yatian ; Wang Wenxiao ; Tay Francis E. H. ; Liu Wei ; Tian Yonghong ; Yuan Li

Masked Autoencoders for Point Cloud Self-supervised Learning

Abstract

As a promising scheme of self-supervised learning, masked autoencoding hassignificantly advanced natural language processing and computer vision.Inspired by this, we propose a neat scheme of masked autoencoders for pointcloud self-supervised learning, addressing the challenges posed by pointcloud's properties, including leakage of location information and uneveninformation density. Concretely, we divide the input point cloud into irregularpoint patches and randomly mask them at a high ratio. Then, a standardTransformer based autoencoder, with an asymmetric design and a shifting masktokens operation, learns high-level latent features from unmasked pointpatches, aiming to reconstruct the masked point patches. Extensive experimentsshow that our approach is efficient during pre-training and generalizes well onvarious downstream tasks. Specifically, our pre-trained models achieve 85.18%accuracy on ScanObjectNN and 94.04% accuracy on ModelNet40, outperforming allthe other self-supervised learning methods. We show with our scheme, a simplearchitecture entirely based on standard Transformers can surpass dedicatedTransformer models from supervised learning. Our approach also advancesstate-of-the-art accuracies by 1.5%-2.3% in the few-shot object classification.Furthermore, our work inspires the feasibility of applying unifiedarchitectures from languages and images to the point cloud.

Code Repositories

liujia99/tpm
pytorch
Mentioned in GitHub
lmd0311/pointmamba
pytorch
Mentioned in GitHub
Pang-Yatian/Point-MAE
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-point-cloud-classification-on-modelnet40Point-MAE
Overall Accuracy: 94.0
3d-point-cloud-classification-on-scanobjectnnPoint-MAE
OBJ-BG (OA): 90.02
OBJ-ONLY (OA): 88.29
Overall Accuracy: 85.2
few-shot-3d-point-cloud-classification-on-1Point-MAE
Overall Accuracy: 96.3
Standard Deviation: 2.5
few-shot-3d-point-cloud-classification-on-2Point-MAE
Overall Accuracy: 97.8
Standard Deviation: 1.8
few-shot-3d-point-cloud-classification-on-3Point-MAE
Overall Accuracy: 92.6
Standard Deviation: 4.1
few-shot-3d-point-cloud-classification-on-4Point-MAE
Overall Accuracy: 95.0
Standard Deviation: 3.0
point-cloud-segmentation-on-pointcloud-cPointMAE
mean Corruption Error (mCE): 0.927

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp