HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

R2-MLP: Round-Roll MLP for Multi-View 3D Object Recognition

Chen Shuo ; Yu Tan ; Li Ping

R2-MLP: Round-Roll MLP for Multi-View 3D Object Recognition

Abstract

Recently, vision architectures based exclusively on multi-layer perceptrons(MLPs) have gained much attention in the computer vision community. MLP-likemodels achieve competitive performance on a single 2D image classification withless inductive bias without hand-crafted convolution layers. In this work, weexplore the effectiveness of MLP-based architecture for the view-based 3Dobject recognition task. We present an MLP-based architecture termed asRound-Roll MLP (R$^2$-MLP). It extends the spatial-shift MLP backbone byconsidering the communications between patches from different views. R$^2$-MLProlls part of the channels along the view dimension and promotes informationexchange between neighboring views. We benchmark MLP results on ModelNet10 andModelNet40 datasets with ablations in various aspects. The experimental resultsshow that, with a conceptually simple structure, our R$^2$-MLP achievescompetitive performance compared with existing state-of-the-art methods.

Code Repositories

shanshuo/R2-MLP
Official
pytorch
Mentioned in GitHub
shanshuo/MVT
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-object-recognition-on-modelnet40R2-MLP-36
Accuracy: 97.7%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp