8 months ago

Object Detection

Depth Estimation

3D Machine Vision

Computer Vision

Yuxuan Liu Lujia Wang Ming Liu

Abstract

Object detection in 3D with stereo cameras is an important problem in computer vision, and is particularly crucial for low-cost autonomous mobile robots without LiDARs. Nowadays, most of the best-performing frameworks for stereo 3D object detection are based on dense depth reconstruction from disparity estimation, making them extremely computationally expensive. To enable real-world deployments of vision detection with binocular images, we take a step back to gain insights from 2D image-based detection frameworks and enhance them with stereo features. We incorporate knowledge and the inference structure from real-time one-stage 2D/3D object detectors and introduce a light-weight stereo matching module. Our proposed framework, YOLOStereo3D, is trained on a single GPU and runs at more than ten fps. It demonstrates performance comparable to state-of-the-art stereo 3D detection frameworks without the use of LiDAR data. The code will be published at https://github.com/Owen-Liuyuxuan/visualDet3D.
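To make the idea of stereo matching on feature maps concrete, the following is a minimal NumPy sketch of a correlation cost volume, together with the standard pinhole relation depth = focal length × baseline / disparity. It is an illustration under stated assumptions, not the YOLOStereo3D module itself: the function names, the `(C, H, W)` feature layout, and the camera values in the usage note are all hypothetical.

```python
import numpy as np

def correlation_cost_volume(left_feat, right_feat, max_disp):
    """Illustrative correlation cost volume (not the paper's implementation).

    left_feat, right_feat: arrays of shape (C, H, W) from a shared backbone.
    Returns an array of shape (max_disp, H, W) whose entry [d, y, x] is the
    channel-averaged product of the left feature at column x and the right
    feature at column x - d.
    """
    c, h, w = left_feat.shape
    volume = np.zeros((max_disp, h, w), dtype=left_feat.dtype)
    for d in range(max_disp):
        # In a rectified pair, a left pixel at column x corresponds to the
        # right pixel at column x - d; columns with x < d stay at zero.
        volume[d, :, d:] = (left_feat[:, :, d:] * right_feat[:, :, :w - d]).mean(axis=0)
    return volume

def disparity_to_depth(disparity, focal_px, baseline_m):
    """Pinhole stereo relation: depth = focal length * baseline / disparity."""
    return focal_px * baseline_m / disparity
```

As a usage example, calling `correlation_cost_volume(left, right, max_disp=5)` on two `(C, H, W)` arrays returns a `(5, H, W)` volume, and taking `argmax` over its first axis gives a coarse per-pixel disparity. With a hypothetical focal length of 720 px and a baseline of 0.54 m, `disparity_to_depth(3.0, 720.0, 0.54)` converts a 3-pixel disparity to metres. A correlation volume of this kind has one channel per candidate disparity, which is one reason such modules can be cheaper than dense depth reconstruction.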
