9 August 2022 Faster object detector for drone-captured images
Lei Zhao, Quan Zhang, Bo Peng, Yangyi Liu
Author Affiliations +
Abstract

Object detection based on computer vision is becoming popular in drone-captured images. However, real-time object detection in unmanned aerial vehicle (UAV) scenarios is a huge challenge for low-end devices. To deal with the problem, we have improved YOLOv3-tiny in the following aspects. First, the label rewriting problem, which is caused by network structure and dataset of YOLOv3-tiny in drone-captured images detection, is very serious. The method of increasing the size of the predicted feature map is used to reduce the ratio of label rewriting. Second, the features of small targets will be reduced in a small feature map, but the context information with large receptive fields in it can improve the performance of small target detection. So we use dilated convolution to expand the receptive field without reducing the size of the feature map. Third, multiscale feature fusion is very helpful for small target detection. The multidilated module is adopted to merge features in earlier layer and deeper layers. Finally, a pretraining strategy combining copy-paste data augmentation method is proposed to learn more features from categories with a small number of samples. We evaluated our model on the VisDrone2019-Det test set. It achieves compelling results compared to the counterparts of YOLOv3-tiny, including ∼86.1 % decline in model size, increasing ∼19.2 % AP50. Although our model is slower than YOLOv3-tiny, it is 2.96 times faster than YOLOv3. The results of experiments verify that our network is more effective than YOLOv3-tiny. It is more suitable for UAV object detection applications on low-end devices.

© 2022 SPIE and IS&T
Lei Zhao, Quan Zhang, Bo Peng, and Yangyi Liu "Faster object detector for drone-captured images," Journal of Electronic Imaging 31(4), 043033 (9 August 2022). https://doi.org/10.1117/1.JEI.31.4.043033
Received: 7 April 2022; Accepted: 22 July 2022; Published: 9 August 2022
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Convolution

Target detection

Sensors

Image sensors

Unmanned aerial vehicles

Detection and tracking algorithms

Data modeling

Back to Top