Paper
22 October 2024 Two accurate backbones for object detection models in video-frames-based transfer learning: ConvNeXtBase, ConvNeXtXLarge
Author Affiliations +
Proceedings Volume 13274, Sixteenth International Conference on Digital Image Processing (ICDIP 2024); 132741U (2024) https://doi.org/10.1117/12.3040824
Event: Sixteenth International Conference on Digital Image Processing (ICDIP 2024), 2024, Haikou, HI, China
Abstract
Various applications such as urban monitoring, security, and autonomous systems rely heavily on object classification in video imagery. In this paper we present a backbone for an object detection model that uses ConvNeXt architectures with transfer learning to focus specifically on vehicle classification. By adapting the ConvNeXtBase and ConvNeXtXLarge models, and utilizing “Car Object Detection” dataset which consists of numerous videos captured in different environmental settings including varying traffic densities, weather changes and light intensities. To improve the classification capabilities to match vehicles, specialists are incorporated into these adaptations who have developed special convolutional and fully connected layers. This is accomplished through our transfer learning approach that helps the model produce distinctive features needed for accurate detection. Our models are systematically evaluated using standard performance metrics. For instance, ConvNeXtBase achieves 97.91% accuracy with validation accuracy being 97.82%, while ConvNeXtXLarge has an accuracy of 98.34% with validation accuracy at 98.11%. These results not only outperform numerous baseline models but also demonstrate that our models are effective in real world scenarios. The results obtained from this study constitute a significant contribution towards the development of intelligent transport systems as well as provide a solid foundation for future improvements in object classification via transfer learning methods. That’s why you should highly value methodologies provided in this article because they will be useful for any further findings in enhancing intelligent transportation systems by means of deep learning techniques applied to video surveillance tasks one of many applications where transfer learning can be employed successfully for more efficient outcomes.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Sara Bouraya and Abdessamad Belangour "Two accurate backbones for object detection models in video-frames-based transfer learning: ConvNeXtBase, ConvNeXtXLarge", Proc. SPIE 13274, Sixteenth International Conference on Digital Image Processing (ICDIP 2024), 132741U (22 October 2024); https://doi.org/10.1117/12.3040824
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Object detection

Education and training

Data modeling

Video

Performance modeling

Feature extraction

Visual process modeling

Back to Top