TY - GEN
T1 - A Curriculum Learning Framework to Boost Object Detection of Unmanned Aerial Vehicles
AU - Aslan, Emre Can
AU - Naddeo, Kyle
AU - Bouhsine, Taha
AU - Polikar, Robi
AU - Ditzler, Gregory
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Accurate and efficient detection of objects in complex environments, especially the detection of aerial objects, remains a significant challenge in surveillance. Conventional object detection models often struggle with the intricacies of aerial imagery, such as varying sizes of the objects, distances, and environmental factors, leading to subpar performance. Additionally, simply using a state-of-the-art object detector tuned on aerial objects tends to perform poorly on smaller objects. This paper addresses these challenges by presenting a novel approach to enhance object detection accuracy and efficiency. Using a modified YOLOv7 architecture, we introduce a mixed curriculum learning (CL)-based framework tailored for object detection in aerial imagery. The core idea of CL, drawn from the phased learning process in human education, involves the strategic ordering of training data from simpler to more complex forms. In our methodology, the difficulty criterion is based on the size of the bounding boxes, with the hypothesis that smaller bounding boxes correlate with higher detection challenges. YOLOv7 is first pre-trained on a generalized object detection dataset, then our approach systematically uses CL to improve the performance on small aerial objects. Further, we show that our CL protocol can effectively learn from a combination of synthetic and real-world data, with the former addressing the concern with the scarcity of data when such data are challenging to collect. Our findings reveal that CL can provide a generic object detector with significant performance improvement. This improvement was particularly notable in detecting smaller and more distant objects.
AB - Accurate and efficient detection of objects in complex environments, especially the detection of aerial objects, remains a significant challenge in surveillance. Conventional object detection models often struggle with the intricacies of aerial imagery, such as varying sizes of the objects, distances, and environmental factors, leading to subpar performance. Additionally, simply using a state-of-the-art object detector tuned on aerial objects tends to perform poorly on smaller objects. This paper addresses these challenges by presenting a novel approach to enhance object detection accuracy and efficiency. Using a modified YOLOv7 architecture, we introduce a mixed curriculum learning (CL)-based framework tailored for object detection in aerial imagery. The core idea of CL, drawn from the phased learning process in human education, involves the strategic ordering of training data from simpler to more complex forms. In our methodology, the difficulty criterion is based on the size of the bounding boxes, with the hypothesis that smaller bounding boxes correlate with higher detection challenges. YOLOv7 is first pre-trained on a generalized object detection dataset, then our approach systematically uses CL to improve the performance on small aerial objects. Further, we show that our CL protocol can effectively learn from a combination of synthetic and real-world data, with the former addressing the concern with the scarcity of data when such data are challenging to collect. Our findings reveal that CL can provide a generic object detector with significant performance improvement. This improvement was particularly notable in detecting smaller and more distant objects.
UR - https://www.scopus.com/pages/publications/105009986150
UR - https://www.scopus.com/pages/publications/105009986150#tab=citedBy
U2 - 10.1109/CISDB64969.2025.11010781
DO - 10.1109/CISDB64969.2025.11010781
M3 - Conference contribution
AN - SCOPUS:105009986150
T3 - 2025 IEEE Symposium on Computational Intelligence in Security, Defence and Biometrics, CISDB 2025
BT - 2025 IEEE Symposium on Computational Intelligence in Security, Defence and Biometrics, CISDB 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2025 IEEE Symposium on Computational Intelligence in Security, Defence and Biometrics, CISDB 2025
Y2 - 17 March 2025 through 20 March 2025
ER -