Ijraset Journal For Research in Applied Science and Engineering Technology
Authors: Ritika Chouksey, Preety D. Swami
DOI Link: https://doi.org/10.22214/ijraset.2022.45579
Certificate: View Certificate
Agriculture productivity is highly important in the world to survive. There are lots of involvements of artificial intelligence in agriculture to help the productivity. Automatic leaf disease detection is one of them. It is hard to diagnose the leaf disease by normal vision because it looks quite natural. If care is not taken properly then it directly affects the quality of the production. So, it is important to detect the disease at early stage through which production can be improved and proper care can be taken place. There are so many researches that have been done in this field but there are certain flaws present in the resulting system. Proposed system is based on Polynomial SVM (Support Vector Machine) and an Euclidean Distance Metric. Polynomial SVM is a classifier that can handle the non-linear data in a very effective manner. Euclidean Distance Metric calculates distance between two different clusters or points; through which decision can be made easily. Dataset has been taken from kaggle for four different categories; such as Alternaria Alternata, Bacterial Blight, Cercospora Leaf Spot and Healthy Leaves. The proposed method provides 97.30% of accuracy which is bit higher than the KNN classifier.
I. INTRODUCTION
Agricultural is something that requires to be taken care of today on an emergent basis. Indian economy is exceptionally reliant of agricultural efficiency. Hence in field of agriculture, discovery of disease in plants assumes a significant part. To distinguish a plant disease in exceptionally introductory stage, utilization of programmed disease recognition method is advantageous. The current strategy for plant disease detection is essentially unaided eye perception by specialists through which identification of plant diseases is done. For doing this, many specialists as well as nonstop checking of plants is required which increases with increase in the farm size.In many places, farmers don't have contacts with the specialists. Also the counseling specialists even charge high and the process is tedious as well. In such circumstances, the proposed strategy ends up being useful in checking enormous fields of harvests. Finding the nature and location of the diseases simply by seeing the pictures of the plant leaves makes it simpler as well as less expensive [1].Distinguishing between plant diseases by visual ways is a relentless assignment and simultaneously less exact and is possible just in restricted regions. While assuming that programmed identification procedure is utilized, it will take less time and will be more precise. In plants, a few general diseases seen are brown and yellow spots, early and late singe, and others are parasitic, viral and bacterial diseases. Picture handling is utilized for estimating impacted area of disease and to decide the distinction between the various diseases [2].
Fig. 1shows the leaf disease image of potato and tomato leaf.Any imaging test does not confirm that any leafpertains any disease but it gives lot of information through which any disease can be predicted with certain level of accuracy. Every disease has very less symptoms at the early stage and it is required to diagnose that disease in early stage to treat well. Artificial Neural Networks (ANN) are still being used for classification provided the network is trained through features that have been properly selected [3].Automatic diagnosis is very helpful source for diagnosing leaf disease. So, many researches rely on machine learning technique like CNN, RNN, ResNet and many more. But all are very deep neural networks that require lots of samples to train and they take more computational time for training and testing the datasets. A network model should be light in weight and use small intelligent filters in hidden layers through which effective result can be found with high level of accuracy. There is an alternative approach i.e. classifier that can classify the normal and abnormal pictures and can take a decision accordingly. There are so many classifiers available through which classification can be performed for better results like Naive Bayes, K-means clustering, SVM and many more. But SVM is considered as the best classifier among them because it has better prediction level to classifydifferent patterns, textures and variations [4].
Fig 2 shows the basic process model for leaf disease detection where theleaf images dataset is to be downloaded first and imagesare pre-processed. Later, the system extracts the features and classification is done.
II. RELATED WORKS
Many researchers tried to extract the lesion from leaf images and obtained good accuracy with certain false alarm rate. Jaskaran Singh et al. [5] proposed a research which is based on Region-Based Segmentation and KNN Classifier.In their work, the GLCM calculation is applied for the textural highlight examination, k-means clustering is applied for the region based division, and KNN classifier is applied for the disease classification. Eftekhar Hossain et al. [6] proposed a research which is based on KNN classifier that detects the disease on the basis of color and textures. Diseases, for example, alternaria alternata, anthracnose, bacterial curse, leaf spot, and blister of plant leaves are considered for the trial. The division of the disease segment is done by utilizing the k-nearest neighbor classifier and the GLCM surface elements are utilized for the grouping. The KNN classifier based division result gives certain ideal precision in plant disease identification and the quantitative performance of the proposed calculation is acquired by estimating the DSC, MSE and SSIM boundaries.Aamir Yousuf et al. [7] proposed a research which is based on ensemble classifier. Their method uses ensemble classifier that combines two classifiers and obtains the result accordingly usingKNN and Random Forest. But SVM is much modern and better classifier as compared to other classifiers. SVM is able to deal with linear as well as non linear data with high prediction rate. SVM is considered as the best classifier to diagnose diseases or faults related to image processing [8]. GLCM is also used to extract the textual feature of the image and later classifiers detect the disease accordingly.
Ch. Usha Kumari et al. [9] proposed a research which is based on K-means clustering and ANN. In this paper the leaf disease location is foundby utilizing a neural network classifier. The distinction is done utilizing k-means clustering.
Different elements like Mean,Standard Deviation,Variance, Energy,Correlation,Contrast, and Homogeneityare computed for cotton and tomato diseases. The diseased leaves considered for reproduction are bacterial leaf spot, target spot septoria leaf spot and leaf form disease. Highlights are processed from disease impacted bunches 1 and 3. The highlights are taken care of for the classifier to perceive and characterize the diseases. Out of twenty cotton tests nine examples are arranged accurately as bacterial leaf spot and one example is misclassified as target spot. Eight examples are delegated as target spot and two examples are misclassified as bacterial leaf spot.Image processing based approaches are quite popular in this area [10-13]. Abirami Devaraj et al. [14] also proposed an image processing based approach. It includes stacking an image, image preprocessing, image division, highlight extraction and order. Improvement of programmed detection framework utilizing cutting edge innovation like image process work with to help the ranchers inside the recognizable proof of diseases at an early or beginning stage and supply supportive information for its administration. Image denoising can also be treated as an important preprocessing step in image processing. Training a neural network or a classifier with features selected from denoised images produces far better results than training with the features of a noisy image. Many image denoising methods are available and from themthe method that is less computationally expensive can be chosen [15].
Table I shows the drawbacks of certain models through which leaf disease can however be diagnosed with satisfying precision. A comparison between VGG-16, Densenet, AlexNet, CNN and Ensemble Classifier is presented that have been used in various researches for implemention of automatic leaf disease detection. This table tells where each model suffers and what kind of perspective should be retained in mind to develop an ideal system that can diagnose leaf diseases. A better approach is required that can train the system with less filters or light weight filters with proper training through which results with high precision can be obtained with less false alarm rate.
III. IMPLEMENTATION DETAILS
Proposed system is based on Polynomial Support Vector Machine and Euclidean Distance Metric. This method is able to diagnose leaf disease automatically with classifying its category. In leaf image, there is presence ofnoise and to classify the lesion area, it is required to mask or erode the background information. Image pre-processing helps to enhance the image and classify the distinct regions. SVM is a method through which similar kind of cells can form a group or cluster and classify them as per the patterns. The proposed method uses polynomial SVM which is a non linear SVM that can classify non linear data. Leaf image is bit complicated in structure, so it is better to use non linear classifier to obtain high precision in classification.
Fig.4 shows the original image of leaf and Fig.5 shows the histogram equalization of the affected image where it can be seen that the visibility of the image has been enhanced that will help the model to obtain good accuracy.
B. Support Vector Machine
Support Vector Machine is a tool for classifying data on the basis of their patterns or appearance. SVM is considered as the most robust prediction technique that can classify data with more precision. The proposed method uses nonlinear SVM to deal with the nonlinear data. Most of the images with leaf diseases belong to the non-linear classes because of their complex structure. Fig. 6 shows the separation of data with a hyperplane.
C. Euclidean Distance Metric
The Euclidean distance between the input image and the segmented image is to be calculated. Afterwards, the Euclidean distance between the input leaf and the next segmented image and so on is calculated till all the distances have been found. For the computation of the Euclidean distance, following formula can be used.
Fig.7 shows the flowchart of proposed system where system firstly loads the dataset image as an input data. Then pre-processing module has been initiated for enhancing the visibility of the images. Histogram is one of them, it is responsible to balance the brightness and contrast of the system, once the visibility increase then features gets extracted and after feature selection SVM classification can be initiated to classify the data points. User is required to select the cluster which has been created by the system. Then system calculates the entropy of the extracted lesion. It decides the density of the lesion that is later compared with the threshold value. There are four switch cases in the system and comparison can be made accordingly. If entropy satisfies the case 1, 2, 3 then it will be considered as respective disease i.e. Alternaria Alternata, Bacterial Blight or Cercospora Leaf Spot. If entropy satisfies case 4, then it would be considered as healthy leaf image. So, system also classifies the affected region as per the density of the lesion.Table II tabulates the various steps involved in the proposed algorithm.
IV. EXPERIMENTAL RESULT
Experimental resultsare based on four metrices;that are True Positive (TP), False Positive (FP), True Negative (TN) and False Negative(FN).
Truepositive means if an image belongs to either class 1, 2 or 3and system diagnosed it positively, True Negative means if an image does not belong to either Class 1, 2 or 3 and system diagnosed it as healthy.
False Positive means if an image class 4 and system diagnosed it as Class 1, 2 or 3, False Negative means if an image belongs to the Class 1, 2 or 3but system diagnosed it as normal.
There are total 74 testing imageswhere 34 images belong to class 1 (Alternaria Alternata), 18 images from class 2 (Bacterial Blight), 7 images belong from class 3 (Cercospora Leaf Spot) and 15 images from normal class in Kagglebenchmark.
Proposed leaf disease detection method is based on Polynomial Support Vector Machine that classifies the abnormal and normal leaves and on the basis of that decision can be make effectively with high accuracy. System has been tested with kaggle benchmark dataset and has proved to achieve better results as compared to the KNN classifier. The comparison of the results of the proposed method with the KNN classifier has been done on the basis of four parameters: precision, recall, F-1 score and accuracy. It has been observed that percentage increase in these four parameters as compared to the KNN classifier are 2.61, 8.44, 5.89 and 0.54 respectively. Hence, it could be concluded that the proposed classification technique is bit powerful in classifing the leaves in normal and abnormal categories and in takingood decisions about the image or disease belonging to the respective category of the disease. In future, the system can be tested with different benchmarks that contain several images or data. The accuracy can be enhanced furhther by using certain modern image pre-processing approaches.
Proposed leaf disease detection method is based on Polynomial Support Vector Machine that classifies the abnormal and normal leaves and on the basis of that decision can be make effectively with high accuracy. System has been tested with kaggle benchmark dataset and has proved to achieve better results as compared to the KNN classifier. The comparison of the results of the proposed method with the KNN classifier has been done on the basis of four parameters: precision, recall, F-1 score and accuracy. It has been observed that percentage increase in these four parameters as compared to the KNN classifier are 2.61, 8.44, 5.89 and 0.54 respectively. Hence, it could be concluded that the proposed classification technique is bit powerful in classifing the leaves in normal and abnormal categories and in takingood decisions about the image or disease belonging to the respective category of the disease. In future, the system can be tested with different benchmarks that contain several images or data. The accuracy can be enhanced furhther by using certain modern image pre-processing approaches.
Copyright © 2022 Ritika Chouksey, Preety D. Swami. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Paper Id : IJRASET45579
Publish Date : 2022-07-13
ISSN : 2321-9653
Publisher Name : IJRASET
DOI Link : Click Here