Authors :
Bhargavi Vijendra Sangam; Swetha V.; Ananya T.
Volume/Issue :
Volume 11 - 2026, Issue 6 - June
Google Scholar :
https://tinyurl.com/evm6ju8t
Scribd :
https://tinyurl.com/4v5spjtv
DOI :
https://doi.org/10.38124/ijisrt/26jun198
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
Object detection has become one of the most critical fields of study in computer vision due to its broad application
range including surveillance, health care, autonomous driving, agriculture, robotics, and industrial automation. YOLO is considered one of the most efficient object detection frameworks due to its unique feature of being able to localize and classify
objects in one feedforward network pass. Starting with its inception in 2016, YOLO architecture underwent multiple changesto
improve accuracy, inference speed, and efficiency. This review paper provides an overview of how YOLO architectures have
evolved from YOLO v1 to YOLO v8. The advancements, strengths, weaknesses, and enhancements of each iteration are
discussed in detail. In addition, some of the applications of YOLO for object detection are analyzed. The paper concludes with
potential future challenges and trends for real-time object detection.
Keywords :
YOLO, Object Detection, Deep Learning, Computer Vision, Real-Time Detection, CNN.
References :
- J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, “You only look once: Unified, real-time object detection,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 779–788.
- J. Redmon and A. Farhadi, “Yolo9000: Better, faster, stronger,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017, pp. 7263–7271.
- ——, “Yolov3: An incremental improvement,” arXiv preprint arXiv:1804.02767, 2018.
- A. Bochkovskiy, C.-Y. Wang, and H.-Y. M. Liao, “Yolov4: Op-timal speed and accuracy of object detection,” arXiv preprint arXiv:2004.10934, 2020.
- C.-Y. Wang, A. Bochkovskiy, and H.-Y. M. Liao, “Yolov7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 7464–7475.
- D. Reis, J. Hong, J. Kupec, and A. Daoudi, “Real-time flying object detection with yolov8,” arXiv preprint arXiv:2305.09972, 2024.
- S. fSathish and A. Sangeetha, “Real time object detection using yolov8,” International Journal of Scientific Research & Engineering Trends, vol. 11, no. 6, Nov-Dec 2025.
- D. Reis, J. Kupec, J. Hong, and A. Daoudi, “Real-time flying object detection with yolov8,” arXiv preprint arXiv:2305.09972, 2024.
- M. Pavan, N. SaiKiran, and B. Sangam, “Bio-inspired tri modal pest defense system with predictive analytics,” KS School of Engineering and Management, 2025.
- K. Neha, A. Arya, and R. Singh, “Wheat disease detection using yolo and drone-captured images,” Journal of Graphic Era University, vol. 13, no. 02, pp. 411–438, 2025. [Online]. Available: https://doi.org/10.13052/jgeu0975-1416.1327
Object detection has become one of the most critical fields of study in computer vision due to its broad application
range including surveillance, health care, autonomous driving, agriculture, robotics, and industrial automation. YOLO is considered one of the most efficient object detection frameworks due to its unique feature of being able to localize and classify
objects in one feedforward network pass. Starting with its inception in 2016, YOLO architecture underwent multiple changesto
improve accuracy, inference speed, and efficiency. This review paper provides an overview of how YOLO architectures have
evolved from YOLO v1 to YOLO v8. The advancements, strengths, weaknesses, and enhancements of each iteration are
discussed in detail. In addition, some of the applications of YOLO for object detection are analyzed. The paper concludes with
potential future challenges and trends for real-time object detection.
Keywords :
YOLO, Object Detection, Deep Learning, Computer Vision, Real-Time Detection, CNN.