HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Bimodal SegNet: Instance Segmentation Fusing Events and RGB Frames for Robotic Grasping

Sanket Kachole Xiaoqian Huang Fariborz Baghaei Naeini Rajkumar Muthusamy Dimitrios Makris Yahya Zweiri

Bimodal SegNet: Instance Segmentation Fusing Events and RGB Frames for Robotic Grasping

Abstract

Object segmentation for robotic grasping under dynamic conditions often faces challenges such as occlusion, low light conditions, motion blur and object size variance. To address these challenges, we propose a Deep Learning network that fuses two types of visual signals, event-based data and RGB frame data. The proposed Bimodal SegNet network has two distinct encoders, one for each signal input and a spatial pyramidal pooling with atrous convolutions. Encoders capture rich contextual information by pooling the concatenated features at different resolutions while the decoder obtains sharp object boundaries. The evaluation of the proposed method undertakes five unique image degradation challenges including occlusion, blur, brightness, trajectory and scale variance on the Event-based Segmentation (ESD) Dataset. The evaluation results show a 6-10\% segmentation accuracy improvement over state-of-the-art methods in terms of mean intersection over the union and pixel accuracy. The model code is available at https://github.com/sanket0707/Bimodal-SegNet.git

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
semantic-segmentation-on-event-basedBimodal SegNet
mIoU: 87.05

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp