ARCHIVES
Development and Validation of a Real-Time YOLOv4-Based Multi-Fruit Detection Model for Autonomous Robotic Harvesting
Published Online: September-December 2025
Pages: 89-93
Cite this article
↗ https://www.doi.org/10.59256/indjcst.20250403017Abstract
Background: The global agricultural sector is facing unprecedented challenges with the world's food requirements projected to increase by 70% by 2050, along with persistent labor shortages in the harvesting processes worldwide. Computer vision technology equipped autonomous harvesting machines present a shining vision, but their profitability is entirely dependent upon robust real-time fruit detection capability. Problem Statement: Current fruit detection systems are susceptible to environmental fluctuations, including varying lighting levels, occlusions, and the computational burden of real-time execution within field environments. Current models either are not computationally fast enough to be used in real time or sacrifice accuracy for speed, limiting their realistic use within autonomous harvesting systems. Objectives: In this study, a YOLOv4 deep learning model was trained and evaluated using the Google Open Images Dataset for real-time multi-fruit detection, its accuracy for eight classes of fruit at various environmental conditions was evaluated, and its effectiveness was compared with other detection structures. Methodology: A quantitative experimental approach was employed with 7,700 annotated images from the Google Open Images Dataset split into training (70%), validation (15%), and test (15%) sets. Data augmentation techniques and custom anchor boxes were employed to fine-tune the YOLOv4 architecture, and the performance was evaluated using precision, recall, mean Average Precision ([email protected]), and inference speed metrics. Results: The network reached an average mAP of 0.889 across eight fruit classes with precision and recall of 0.85-0.94 and 0.78-0.90, respectively. Real-time speeds of 45 FPS were shown on GPU hardware, significantly higher than Faster R-CNN (5 FPS) while maintaining comparable accuracy. Environmental tests confirmed robust performance in normal lighting with modest degradation under high occlusion and low light. Recommendations: The future steps will include efforts at domain adaptation techniques to bridge the training-deployment gap, utilize model optimization for edge deployment, and conduct field trials on actual real-world robotic harvesting systems for test cases in real-world scenarios
Related Articles
2025
Transforming Cyber-Physical Systems: Machine Learning for Secure and Efficient Solutions
2025
Exploring AI Techniques for Quantum Threat Detection and Prevention
2025
Maturity Models for Business Intelligence: An Overview
2025
INSPIRO: An AI Driven Institution Auditor
2025
Adaptive AI Framework for Anomaly Detection and DDoS Mitigation in Distributed Systems
2025