HyperAI
Home
News
Latest Papers
Tutorials
Datasets
Wiki
SOTA
LLM Models
GPU Leaderboard
Events
Search
About
English
HyperAI
Toggle sidebar
Search the site…
⌘
K
Home
SOTA
Semantic Segmentation
Semantic Segmentation On Sun Rgbd
Semantic Segmentation On Sun Rgbd
Metrics
Mean IoU (test)
Results
Performance results of various models on this benchmark
Columns
Model Name
Mean IoU (test)
Paper Title
Repository
DFormer-L
48.17
Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation
CMX (B5)
-
Efficient RGB-D Semantic Segmentation for Indoor Scene Analysis
-
EMSANet (2x ResNet-34 NBt1D, PanopticNDT version, finetuned)
-
Context Contrasted Feature and Gated Multi-Scale Aggregation for Scene Segmentation
CMX (B4)
-
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers
FSFNet
-
Deep feature selection-and-fusion for RGB-D semantic segmentation
-
TokenFusion (S)
-
Spatial Information Guided Convolution for Real-Time RGBD Semantic Segmentation
TokenFusion (S)
-
Depth-aware CNN for RGB-D Segmentation
PSD-ResNet50
-
ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-D Semantic Segmentation
DFormer-B
-
RDFNet: RGB-D Multi-Level Residual Feature Fusion for Indoor Semantic Segmentation
-
EMSANet (2x ResNet-34 NBt1D, PanopticNDT version, finetuned)
-
PanopticNDT: Efficient and Robust Panoptic Mapping
GeminiFusion (MiT-B5)
-
GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer
DFormer-L
-
Attention-guided Chained Context Aggregation for Semantic Segmentation
TokenFusion (S)
-
Multimodal Token Fusion for Vision Transformers
TokenFusion (Ti)
-
Bi-directional Cross-Modality Feature Propagation with Separation-and-Aggregation Gate for RGB-D Semantic Segmentation
DFormer-L
-
DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation
DPLNet
-
Self-Supervised Model Adaptation for Multimodal Semantic Segmentation
DPLNet
-
Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning
TokenFusion (Ti)
-
Multimodal Token Fusion for Vision Transformers
CMX (B4)
-
Pixel Difference Convolutional Network for RGB-D Semantic Segmentation
-
DPLNet
-
Recurrent Scene Parsing with Perspective Understanding in the Loop
0 of 39 row(s) selected.
Previous
Next