Ijraset Journal For Research in Applied Science and Engineering Technology
Authors: Prof. Dipali M. Mane, Gunja Gupta, Akash Shilimkar, Jaai Patwardhan, Trupti Shitole
DOI Link: https://doi.org/10.22214/ijraset.2023.53304
Certificate: View Certificate
Research related to agriculture is growing rapidly, the challenges that lie ahead is solved with the help of advancement in technology. It is found that it is very beneficial for the economic growth and development of any nation. Especially in India, it requires the need for good research in order to improve agricultural productivity. In order to enhance the growth and solve the problems in the agricultural sector, the scientists use a variety of data mining methods. Different data mining techniques, like classification or prediction can be used to predict diseases in crops, and losses incurred as a result of these diseases. The diseases can be bacterial, fungal or viral. Some of the common plant diseases are bacterial wilt, black knot, curly top, etc. These diseases are caused by a variety of insects and micro- organisms. Our main focus in this research is on early identification of the diseases and helps the farmers in taking the decision to use the fertilizers that helps to protect the crops so that the diseases are eliminated in the early stage of production and so the farmers can get maximum yield. Ensemble method combines several classifiers to produce one finest predictive model and it is a very important technique in Machine Learning. In this paper, ensemble methods are used to predict the crop disease and an analyse has been done with the help of different classifiers such as Decision Trees, Naive Bayes Classifier, Random Forests, Support Vector Machine and K- Nearest Neighbour. Ensemble models, improves the performance of the classifiers that are weak. Te proposed machine learning approach that aims at predicting the best yielded crop for a particular region by analysing various atmospheric factors like rainfall, temperature, etc., and land factors like soil type including past records of crops grown. Finally, our system is expected to predict the best yielded crop based on dataset we have collected.
I. INTRODUCTION
In India, Agriculture plays an important role for the economic development of the country. Crop prediction is a complicated process in agriculture and multiple models have been proposed and tested to this end. In recent times, modern people don’t have the complete awareness about the cultivation of the crops in a right season and at a right location. Due to which modern farmers lack the knowledge of proper selection of crops. Selecting a wrong crop for cultivation may lead to loss in achieving high yield rate and also simultaneously leads to shortage of food. These difficulties imply the need of smart farming which can be achieved with various machine learning algorithm. Machine Learning approaches are used in many fields, ranging from supermarkets to evaluate the behaviour of customers to the prediction of customers’ phone use. Machine learning is also being used in agriculture for several years. One of the most difficult challenges in agriculture is predicting crop production, and numerous models have been presented and confirmed to far. Since agricultural output varies on a variety of variables, including climate, weather, soil, and fertiliser application, this problem necessitates the use of several datasets. This indicates that crop yield prediction is not a trivial task; instead, it consists of several complicated steps. Nowadays, crop yield prediction models can estimate the actual yield reasonably, but a better performance in yield prediction is still desirable. been used for training is used for the performance evaluation purpose.
Machine learning, which is a branch of Artificial Intelligence (AI) focusing on learning, is a practical approach that can provide better yield prediction based on several features. Machine learning (ML) can determine patterns and correlations and discover knowledge from datasets. The models need to be trained using datasets, where the outcomes are represented based on past experience. The predictive model is built using several features, and as such, parameters of the models are determined using historical data during the training phase. For the testing phase, part of the historical data that has not been used for training is used for the performance evaluation purpose.
II. LITERATURE SURVEY
III. SYSTEM METHODOLOGY
We have proposed a system which determines the best-suited crop depending on the percentage of nitrogen, phosphorous, and potassium in the soil; it
Figure 2 shows the main functionalities of the Farm Track system.
IV. IMPLEMENTATION
Here, label is the target variable, i.e., predicted crop. Figure 3 shows the first 10 rows of the dataset.
2. Data Pre-processing: The subsequent step is to pre-process the data. Pre-processing converts the raw data into clean data. The missing values are removed by imputing the null values, if any from the dataset. Also, label encoding method converts word labels into numbers to let the algorithms work on them. For the structured dataset this is an important pre-processing step in supervised learning. This ensures that the data in the dataset are in the specified format for usage in the algorithm.
3. Splitting the Dataset: We need to split the dataset into training and testing before training the dataset for crop prediction.
4. Predicting the Crop: The prediction of suitable crop is dependent on various factors such as N, P, K, temperature, humidity, pH, and rainfall values in order to predict the crop accurately. These factors are given as input to the model. Based on the classification made by the algorithm, it provides a suitable crop to be cultivated by using random forest classifier giving 99.09% accuracy amongst 3 other algorithms which includes Logistic Regression, Decision Tree, Random Forest, and Naïve Bayes. These four classifiers were trained on the dataset.
V. ALGORITHMS
A wide range of regression and classification-based prediction algorithms have been utilized to forecast crop yield. In crop yield prediction K-Nearest Neighbours (K-NN), Naïve Bayes, Decision tree (DT), Random Forest (RF), have been employed.
A. Machine Learning Algorithms
The K-NN is used for classification and regression that provides more weightage to close neighbours in making the prediction so that they relate more to the average than distant ones. DT is a model of supervised machine learning model which can be applied to both regression and classification. It consists of three
nodes, namely root node (no incoming edge), decision node (both incoming and outgoing edges) and leaf node (no outgoing edges). In a decision tree, each attribute is divided by each outgoing node into two or more branches according to the splitting criteria.
Breiman et al developed Classification and Regression Trees (CART) which is method of DT induction. which is supposed to nonparametric and generates binary trees from such data by employing the discrete and continuous features.
RF is a powerful tool for the prediction of yield, which has been applied to agricultural research. It generates a wide range of regression trees that are produced by a large set of decision trees for computing regression. The RF is superior to any other decision tree since it performs more precisely, and the bias is compensated for by the single decision tree due to the randomness. Extremely randomized trees (extra tree) (ERT) is an ensemble model as same as RF, but it utilizes unpruned decision trees. It splits the nodes by randomly chosen cut-points and incorporates the complete learning sample.
The number of trees and the number of variables utilized to divide the nodes are set to be the same as those of RF. An ANN is the most commonly utilized machine learning technique to predict crop yield by which the complex nonlinear relationship between input and output can be modelled. It comprises of three layers, including the input layer, hidden layer and output layer. There are numerous factors that have an impact on the performance of ANN, including the number of nodes in the hidden layer, the learning rate, and the training tolerance. The learning rate determines the amount by which the weights change during a series of iterations to bring the predicted value within an acceptable range of the observed value.
Agriculture is the most vital field in the emerging countries like India. Accurate crop yield prediction is still considered as a challenging task due to uncertainty with many factors related to crop production. Machine learning approaches in agriculture will amendment the situation of farmers and supports to improve the yield. For many problems associated with agriculture field, machine learning plays a significant role to overcome these problems. Disease detection is one of the major aspects which is to be addressed at early stage of the crop. So, various techniques of early disease detection techniques are surveyed for different crops. Using Agrico we can perform crop prediction Weather conditions, Soil condition, Diseases information. Also, we provide Market Price analysis and tractor rental services through GPS service. Agrico centralizes, manages, and optimizes the production activities and operations of farms. Crop selection is still remaining as a challenging issue for farmers. This paper focuses on the prediction of crop with the help of machine learning techniques. Several machine learning methodologies were used for the calculation of accuracy of different models. The Random Forest classifier gave the best accuracy. Thus, we have proposed a model that helps the user to predict a suitable crop to cultivate in a wider way and cost efficiently use the system for managing the farm activities as well as getting the live weather conditions of a particular place.
[1] Pooja Patil, Shivani Turamari. “Weather forecast Prediction for agriculture” in International Journal of Recent Technology and Engineering (IJRTE), 2022. [2] Rishi Gupta, Akhilesh Sharma. Weather Based Crop Prediction in India Using Big Data Analytics” in IEEE,2021. [3] Vaishnavi and Kishore. “Crop Prediction using Machine Learning” in Institute of Electrical and Electronics Engineers (IEEE), 2020. [4] Agila N, Senthil Kumar P. “An Efficient Crop Identification Using Deep Learning” in International Journal of Recent Technology and Engineering (IJSRTE),2020. [5] Mohammed Ahmed. “Car rental and tracking web-based system using GPS” in IJISCS,2020. [6] Anamika Chauhan “Smart Real Time Weather Forecasting System” in Institute of Electrical and Electronics Engineers (IEEE),2021 [7] Shubham Madan, Pravin Kumar “Prediction of crop and yield in agriculture using machine learning technique” in Institute of Electrical and Electronics Engineers (IEEE),202 [8] Kunal Pahwa, Neha Agarwal “Stock Market Analysis using Supervised Machine Learning” in Institute of Electrical and Electronics Engineers (IEEE) Published Year-2019 [9] Shubham Madan; Praveen Kumar “Analysis of Weather Prediction using Machine Learning & Big Data” in Institute of Electrical and Electronics Engineers (IEEE),2018 [10] Hind Abdalsalam Abdallah Dafallah “Design and implementation of an accurate real time GPS tracking system “in Institute of Electrical and Electronics Engineers (IEEE),2018
Copyright © 2023 Prof. Dipali M. Mane, Gunja Gupta, Akash Shilimkar, Jaai Patwardhan, Trupti Shitole. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Paper Id : IJRASET53304
Publish Date : 2023-05-29
ISSN : 2321-9653
Publisher Name : IJRASET
DOI Link : Click Here