NWU Institutional Repository

Machine learning and deep learning for crop disease diagnosis: performance analysis and review

Publisher

MDPI

Abstract

Crop diseases pose a significant threat to global food security, with both economic and environmental consequences. Early and accurate detection is essential for timely intervention and sustainable farming. This paper presents a review of machine learning (ML) and deep learning (DL) techniques for crop disease diagnosis, focusing on Support Vector Machines (SVMs), Random Forest (RF), k-Nearest Neighbors (KNN), and deep models such as VGG16, ResNet50, and DenseNet121. The review method includes an in-depth analysis of algorithm performance using key metrics such as accuracy, precision, recall, and F1 score across various datasets. We also highlight the data imbalances in commonly used datasets, particularly PlantVillage, and discuss the challenges these imbalances pose. A primary challenge identified is the imbalance in the PlantVillage dataset, which has a high number of healthy images and a strong bias toward certain disease categories such as fungi, leaving other categories such as mites and molds underrepresented. This imbalance complicates model generalization, indicating a need for preprocessing steps to enhance performance. This study also shows that combining Vision Transformers (ViTs) with Green Chromatic Coordinates and hybridizing these with SVM achieves high classification accuracy, emphasizing the value of advanced feature extraction techniques in improving model efficacy. In terms of comparative performance, DL architectures such as ResNet50, VGG16, and convolutional neural networks (CNNs) demonstrated robust accuracy (95-99%) across diverse datasets, underscoring their effectiveness in managing complex image data. Traditional ML models exhibited varied strengths; for instance, SVM performed better on balanced datasets, while RF excelled with imbalanced data. Preprocessing methods such as K-means clustering, Fuzzy C-Means, and PCA, along with ensemble approaches, further improved model accuracy. Lastly, the study underscores that high-quality, well-labeled datasets, stakeholder involvement, and comprehensive evaluation metrics such as F1 score and precision are crucial for optimizing ML and DL models, making them more effective for real-world applications in sustainable agriculture.
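The abstract's point about evaluating with precision, recall, and F1 score rather than accuracy alone can be illustrated with a small sketch. The class names, the class counts, and the always-"healthy" classifier below are invented toy assumptions for illustration only; they are not data or results from the paper, and the helper functions are hypothetical names.

```python
from collections import Counter

def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def per_class_metrics(y_true, y_pred):
    """Return {label: (precision, recall, f1)} computed one-vs-rest."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p, but it was wrong
            fn[t] += 1  # true class t was missed
    metrics = {}
    for label in sorted(set(y_true) | set(y_pred)):
        pred_pos = tp[label] + fp[label]
        true_pos = tp[label] + fn[label]
        precision = tp[label] / pred_pos if pred_pos else 0.0
        recall = tp[label] / true_pos if true_pos else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        metrics[label] = (precision, recall, f1)
    return metrics

def macro_f1(metrics):
    """Unweighted mean of per-class F1, so rare classes count equally."""
    return sum(f1 for _, _, f1 in metrics.values()) / len(metrics)

# Toy imbalanced label set (assumed counts): mostly healthy leaves.
y_true = ["healthy"] * 90 + ["fungal"] * 8 + ["mite"] * 2
# A degenerate classifier that always predicts the majority class.
y_pred = ["healthy"] * 100

metrics = per_class_metrics(y_true, y_pred)
acc = accuracy(y_true, y_pred)
mf1 = macro_f1(metrics)
print(f"accuracy = {acc:.3f}, macro F1 = {mf1:.3f}")
```

In this toy case accuracy looks high even though the two minority classes are never detected, while macro-averaged F1 exposes the failure. That is one reason per-class metrics matter on imbalanced datasets.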

Sustainable Development Goals

Responsible Consumption and Production

Description

Journal Article, Faculty of Agriculture -- North-West University, Potchefstroom Campus

Citation

Ngugi, H.N.; Akinyelu, A.A.; Ezugwu, A.E. Machine Learning and Deep Learning for Crop Disease Diagnosis: Performance Analysis and Review. Agronomy 2024, 14, 3001. https://doi.org/10.3390/agronomy14123001
