A Novel Regional Fusion Network for 3D Object Detection based on RGB Images and Point Clouds

doi:10.5121/csit.2021.111812

Volume 11, Number 18, November 2021

A Novel Regional Fusion Network for 3D Object Detection based on RGB Images and Point Clouds

Authors

Hung-Hao Chen¹, Chia-Hung Wang¹, Hsueh-Wei Chen¹, Pei-Yung Hsiao², Li-Chen Fu¹ and Yi-Feng Su³, ¹National Taiwan University, Taiwan, ²National University of Kaohsiung, Kaohsiung, Taiwan, ³Automotive Research and Testing Center (ARTC), Taiwan

Abstract

The current fusion-based methods transform LiDAR data into bird’s eye view (BEV) representations or 3D voxel, leading to information loss and heavy computation cost of 3D convolution. In contrast, we directly consume raw point clouds and perform fusion between two modalities. We employ the concept of region proposal network to generate proposals from two streams, respectively. In order to make two sensors compensate the weakness of each other, we utilize the calibration parameters to project proposals from one stream onto the other. With the proposed multi-scale feature aggregation module, we are able to combine the extracted regionof-interest-level (RoI-level) features of RGB stream from different receptive fields, resulting in fertilizing feature richness. Experiments on KITTI dataset show that our proposed network outperforms other fusion-based methods with meaningful improvements as compared to 3D object detection methods under challenging setting.

Keywords

Machine Learning, 3D Object Detection, Data Fusion, Autonomous Driving.

Subscription Membership AIRCC CSCP Contact Us
All Rights Reserved ® AIRCC