Artificial intelligence and machine learning are quite significant in the modern world. They are present in every sector, including the medical sector and autonomous vehicles. The medical industry produces vast amounts of patient data, which is then processed in a variety of ways. Consequently, we were able to create a prediction system that can recognize numerous diseases at once using machine learning. Many of the methods in use now can predict only one disease at a time, and that too with reduced accuracy. Reduced precision can seriously jeopardize a patient\'s health. The three illnesses we are now considering are diabetes, liver, and heart, but there may be many more added in the future. Several parameters must be entered by the user. When the user enters many disease-related parameters, the system will output whether the user has the disease or not. This initiative can help a lot of people because it makes it possible to monitor someone\'s health and take the appropriate steps to extend their lives.
Introduction
I. INTRODUCTION
In the digital age, data is a valuable resource, and enormous amounts of data were generated across all businesses. The data for the healthcare industry includes all patient-related information. A general framework for disease prediction in the healthcare industry has been presented here. Most of the current models analyse only one disease at a time. For instance, a single analysis might be done for diabetes, cancer, and skin conditions. There isn't a system in place that can examine numerous diseases simultaneously. We are concentrating on giving customers quick and accurate disease predictions based on the symptoms they enter as a result. Therefore, we provide a technique that can be used to predict various diseases using Django. We'll look at heart, diabetes, and malaria problems in this system. Later, a great deal more diseases might be included. We will build numerous illness prediction systems using Django and machine learning algorithms. Python pickling is used to protect the behavior of the model. This system analysis is significant because it considers every element that contributes to the onset of the diseases under study, enabling more accurate and effective disease detection. The finished model's behavior will be saved in a Python pickle file.
EXISTING SYSTEM
Currently, a lot of machine learning models used for healthcare analysis focus on just one disease at a time. For instance, the liver is examined first, followed by tests for lung and cancer problems, respectively. A person must visit various websites if they want to predict multiple illnesses. A single analysis cannot forecast several diseases using the same system. The lesser accuracy of some of the models may have major adverse effects on the patient's health.
When a business wants to examine patient health data, it must use multiple models, which in turn drives up costs and lengthens turnaround times. Some systems in use today take into account very little, which can lead to erroneous findings.
II. PROPOSED SYSTEM
Using multiple disease prediction, it is possible to predict several diseases at once. In order to anticipate diseases, the consumer does not need to visit numerous websites. Diabetes, liver, and heart conditions are the main topics of our attention. Given the connection between the three illnesses to implement different sickness analyses, we will use machine learning techniques and Django.
When making a request for access to this API, the user must provide both the name of the illness and its specifications. Django will make a call to the pertinent model and provide the patient's status.
III. LITERATURE REVIEW
The focus of the study is diabetes because it is one of the deadliest diseases in the world and because it can cause a wide range of illnesses, including blindness. Since it is easy to predict whether a patient will be ill or not, machine learning techniques were used in this study to identify the diabetic condition. To help patients manage their illness, the researchers hoped to create a system that could reliably identify a patient's diabetes. They compared the precision of four main algorithms used in this study: Decision Tree, Nave Bayes, and SVM.
These algorithms' respective levels of accuracy were 85%, 77%, and 77.3%. They also used the ANN algorithm after the training phase to check whether the disease had been appropriately recognized or not. They compared each model's precision, F1 score support, and accuracy in this instance. The accuracy of machine learning for predicting heart disease using k-nearest neighbour, decision tree, linear regression, and SVM is therefore calculated in this work utilizing the UCI repository dataset for training and testing. Additionally, they compared the SVM (83%), Decision Tree (79%), Linear Regression (78%), and K-Nearest Neighbour (87%) algorithms' accuracy. The system states that liver disease is a leading cause of death in India and is also regarded as a serious health issue on a global scale. because it might be challenging to detect liver disease in its early stages. Therefore, using automated software and machine learning algorithms, we can precisely detect liver disease. They employed and compared the SVM, Decision Tree, and Random Forest algorithms to assess precision, accuracy, and recall measures quantitatively and analysis helps us to predict and to take preventive steps.
IV. SYSTEM ANALYSIS
A. Functional Requirements
The patient is able to forecast the disease thanks to the system.
The user enters the information for the specific ailment, and the output is displayed based on the trained model of the user input.
B. Non Functional Requirements
The website will provide a range of values when the disease is predicted.
The website must be reliable and respectable.
C. Architecture Design
We conducted experiments on three diseases—heart, diabetes, and liver in the below diagram because they are related to one another. The first step is to import the UCI dataset, PIMA dataset, and Indian liver dataset, respectively, for the datasets for heart disease, diabetes disease, and liver disease. After the dataset has been imported, each inputted piece of data is visualized. After pre-processing the data for visualization, which involves looking for outliers, missing values, and scaling the dataset, the data is divided into training and testing. Next, we used the knn, xgboost, and random forest algorithms on the training dataset before applying what we learned about the classified algorithm to the testing dataset. In order to preserve the model's behaviour, Python pickling is employed. The significance of this system analysis is that it considers all the factors that contribute to the development of the diseases under study, making it possible to detect them more effectively and precisely. A Python pickle file will be used to store the behavior of the final model.
IX. FUTURE SCOPE
In order to lower the mortality rate, we can add more diseases to the current API and work to increase forecast accuracy.
Conclusion
In order to utilize the online application, the user must enter information that will be stored in a model of a flask and used as input by taught machine learning algorithms. The program will detect the multiple diseases of the patient by giving the input values correct. Here, we employed the more accurate Random Forest and svm classification methods. Finally, the output was displayed in a new html page, saying things like \"You are healthy\" if the patient is fine otherwise the page will be displayed with a serious problem.The program has been thoroughly tested and debugged.
References
[1] \"DIABETES PREDICTION USING DIFFERENT MACHINE LEARNING APPROACHES\", 2019 IEEE, 3rd International Conference on Computing Methodologies and Communication (ICCMC), by Priyanka Sonar and Prof. K. JayaMalini.
[2] D. W. Bates, S. Saria, L. Ohno-Machado, A. Shah, and G. Escobar, \"Big data in health care: using analytics to identify and manage high-risk and high-cost patients,\" Health Affairs, vol. 33, no. 7, 2018; pp. 1123–1131.
[3] \"Predicting Disease By Using Data Mining Based on Healthcare Information System\" by Feixiang Huang, Shengyong Wang, and Chien-Chung Chan was published in IEEE 2012.
[4] \"Diagnosis of Liver Disease Using Machine Learning Models\" was presented at the 2020 Fourth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud; I-SMAC). Authors include A.Sivasangari, Baddigam Jaya Krishna Reddy, Annamareddy Kiran, and P. Ajitha.