Ijraset Journal For Research in Applied Science and Engineering Technology
Authors: M. Nirmala, V. Saravanan, A. R. Jayasudha, Princess Maria John, P. Privietha, L. Mahalakshmi
DOI Link: https://doi.org/10.22214/ijraset.2022.46087
Certificate: View Certificate
Cardio vascular diseases are the number one cause of death globally, taking an estimated 17.9 million lives each year, which accounts for 31% of all deaths worldwide. Heart failure is a common event caused by CVDs and this dataset contains 11 features that can be used to predict mortality by heart failure. In this project, a model is built using the Random Forest Classifier Algorithm using AutoAI and a web application is created using Node Red Application and it showcases the prediction of heart failure in a web based format. The usage of IBM cloud environment for implementing the Machine learning Model using IBM Auto AI and and Node Red Flows are created for the display of Web Application Structure. The complete project explains the coordination among the Auto AI and Node red in the Cloud Platform.
I. INTRODUCTION
Machine Learning [1] commonly deals with big data where the size of the data is massive and the data can be both in structured and unstructured format It is difficult to identify heart disease because of several contributory risk factors such as diabetes, high blood pressure, high cholesterol, abnormal pulse rate and many other factors. Various techniques in data mining and neural networks have been employed to find out the severity of heart disease among humans. The nature of heart disease is complex and hence, the disease must be handled carefully. The proposed method which we use has 10 attributes for heart disease prediction and the problem is carried out using IBM AUTO AI service.
II. LITERATURE SURVEY / RELATED WORK
Lippi et al. [2] focused on the possibility of cardiovascular disease during the COVID-19 pandemic. ,e nationwide quarantine has compelled the government to implement various forms of lockdown to reduce the transmission of COVID-19. As a result of these restrictions, all citizens remain at home, resulting in physical inactivity. Although the WHO has established clear guidelines on the amount of physical activity required to maintain adequate health, strict quarantine, on the other hand, has increased the risk of cardiovascular mortality. After quarantine, negative health effects are observed. As a result, the authors proposed the fact that it is necessary to maintain physical exercise even during quarantine to avoid unfavourable cardiovascular consequences, has influenced the current research study’s Computational Intelligence and Neuroscience design
The random forest algorithm was used in the study [3] to predict the occurrence of heart disease in patients. A total of 303 samples from the Kaggle dataset were considered and the metrics used to evaluate performance were accuracy, sensitivity, and specificity. In the classification of heart disease, the algorithm achieved a prediction rate of 93.3%.
S. Krishnan J. Geetha S [4] has made a system that predicts the developing potential results of Heart Disease. Their aftereffects of this system give the chances of happening heart disease to the extent rate. They have considered datasets used are organized similar to therapeutic parameters. Their structure evaluates those parameters using the information mining plan strategy. Their datasets were set up in python programming using two standard Machine Learning Algorithm to be explicit Decision Tree Algorithm and Naive Bayes Algorithm and have exhibited the best estimation among these two to the extent the precision level of heart illness.
K.G Dinesh, K.A.raj, K.D.Santhosh, V. M.eswari [5] has talked about heart illness expectation and performed information pre-preparing utilizes strategies like the removal of noisy data, removal of missing data, filling default values if applicable and classification of attributes for prediction and decision making at different levels. Their exhibition of the finding model is acquired by utilizing techniques like order, exactness, affectability and particularity examination.
This has proposed a forecast model to anticipate whether people have heart illness or not and to give mindfulness or finding on that. They have done examination by comparing the accuracies of applying rules with the individual consequences of Support Vector Machine, Gradient Boosting, Random backwoods, Naive Bayes classifier and calculated relapse on the dataset taken in a district to display an exact model of foreseeing cardiovascular ailment.
Senthil Kumar Mohan et al,[6] proposed Effective Heart Disease Prediction Using Hybrid Machine Learning Techniques in which strategy that objective is to finding critical includes by applying Machine Learning bringing about improving the exactness in the expectation of cardiovascular malady. The expectation model is created with various blends of highlights and a few known arrangement strategies. We produce an improved exhibition level with a precision level of 88.7% through the prediction model for heart disease with hybrid random forest with a linear model (HRFLM) they likewise educated about Diverse data mining approaches and expectation techniques, Such as, KNN, LR, SVM, NN, and Vote have been fairly famous of late to distinguish and predict heart disease.
Avinash Golande et al,[7] proposed Heart Disease Prediction Using Effective Machine Learning Techniques in which Specialists utilize a few data mining strategies that are available to support the authorities or doctors distinguish the heart disease. Usually utilized methodology utilized are decision tree, k- closest and Naïve Bayes. Other unique characterization-based strategies utilized are packing calculation, Part thickness, consecutive negligible streamlining and neural systems, straight Kernel selfarranging guide and SVM (Bolster Vector Machine). The following area obviously gives subtleties of systems that were utilized in the examination.
Lakshmana Rao et al,[8] Machine Learning Techniques for Heart Disease Prediction in which the contributing elements for heart disease are more (circulatory strain, diabetes, current smoker, high cholesterol, etc..). So, it is difficult to distinguish heart disease. Different systems in data mining and neural systems have been utilized to discover the seriousness of heart disease among people. The idea of CHD ailment is bewildering, in addition, in this manner, the disease must be dealt with warily. Not doing early identification, may impact the heart or cause sudden passing. The perspective of therapeutic science furthermore, data burrowing is used for finding various sorts of metabolic machine learning a procedure that causes the framework to gain from past information tests, models without being expressly customized. Machine learning makes rationale dependent on chronicled information
Marimuthu M et al [9], proposed Heart disease is one of the prevalent disease that can lead to reduce the lifespan of human beings nowadays. Each year 17.5 million people are dying due to heart disease. Life is dependent on component functioning of heart, because heart is necessary part of our body.
Heart disease is a disease that effects on the function of heart. An estimate of a person’s risk for coronary heart disease is important for many aspects of health promotion and clinical medicine. A risk prediction model may be obtained through multivariate regression analysis of a longitudinal study
Balakrishnan et al [10] proposed Machine learning is a technique converts the raw clinical data into an informational data that helps for decision making and prediction. Cardiovascular disease is one of the major causes of mortality around the world. It is considered in a large scale, so prediction of cardiovascular disease is more important in the clinical survey analysis as day by day it gets increased. The amount of data in the health club is huge. As cardiovascular is one of the major causes for death there are some data analytical techniques that predicts the occurrence of cardiovascular disease. It can be achieved through selecting a correct combination of prediction models and features.
Prediction models were developed using different classification techniques based on feature selection and there are certain algorithms which provide varied and improved accuracy.
Here prediction model is developed using Random Forest classification technique - Method for classification, regression by constructing a multitude of decision trees at training time. Developed by aggregating tree Avoids over fitting can deal with large number of features. Helps with feature selection based on importance where necessary features only classified. Pre-processing will be done first considering the clinical data. It will be spited into train and test data with which accuracy can be achieved.
III. RESEARCH METHODOLOGY
The various methods adopted during the research process have been portrayed. This is a Descriptive Research problem where the study of Medical data set is explored. It performs the prediction of Heart Disease of the patients from the medical data set by applying various methodologies with respect to Machine Learning using IBM Auto AI.
A. Research Data
The data collected from secondary data sources are tabulated in the Table 1.
???????B. Proposed System Method Of Analysis
The project helps us to predict the heart failure of human being given various parameters like the details listed in
C. Block diagram
4. Node Red Service: It is a programming tool for wiring together hardware devices, API’s and Online services. It is a browser based editor that makes it easy to wire together flows using wide range of nodes in the palette that can be deployed in its run time in a single click. It helps in deploying the ML model as a web server
D. Hardware and Software Specification
IV. EXPERIMENTAL INVESTIGATIONS
V. EXPERIMENTAL RESULTS
The transformed data set is partitioned into training data set and the test data set where the training data is 80% of the whole data set and the remaining unused 20% is used as Test data set. The random state is set as 0. The parameters applied for various algorithms are depicted in
VI. FUTURE SCOPE
The present study predicted whether patient is at risk of heart failure. The problem has to support with more parameters to be suitable for a generalized application. The web page creation can be further enhanced to make it a user friendly format.
The Machine learning methodology is rapidly increasing and the impact of the machine able to predict the result of a system by itself and also it is able to train a data over a period of time and also test the trained model with a different set of data to prove that the model is working efficiently and effectively. The problem is able to predict whether a person is at risk of heart failure. The data is clean and does not require any exploratory data analysis. The problem can be applied in a hospital environment to identify with the given parameters whether the person will have a heart disease or not. In this research study it has been apparently proved that Random Forest got the accuracy of 87% and Logistic Regression has got a accuracy of 79% and the model is working efficiently when connected with the node red application to display the output in the browser window.
[1] Smola, Alex, and S.V.N. Vishwanathan. Introduction to Machine Learning. Cambridge University Press, 2008. N.p., 2008. Web. [2] G. Lippi, B. M. Henry, and F. Sanchis-Gomar, “Physical inactivity and cardiovascular disease at the time of coronavirus disease 2019 (COVID-19),” European journal of preventive cardiology, vol. 27, no. 9, pp. 906–908, 2020. [3] M. Pal and S. Parija, “Prediction of heart diseases using random forest,” in Journal of Physics: Conference Seriesvol. 1817, no. 1, IOP Publishing, Article ID 012009, 2021 [4] S.Krishnan J. Geetha S, \\\\\\\"Forecast of Heart Disease Using Machine Learning Algorithms\\\\\\\",First International Conference on Innovations in Information and Communication Technology ,2019. [5] K.G.Dinesh, K A.garaj, K.D.Santhosh, V M.eswari. \\\\\\\"Forecast of Cardiovascular Disease Utilizing Machine Learning Algorithms\\\\\\\", 2018 International Conference on Current Trends towards Converging Technologies ,2018 [6] Senthilkumar Mohan, Chandrasegar Thirumalai, Gautam Srivastava ?Effective Heart Disease Prediction Using Hybrid Machine Learning Techniques, Digital Object Identifier 10.1109/ACCESS.2019.2923707, IEEE Access, VOLUME 7, 2019 [7] Avinash Golande, Pavan Kumar T,” Heart Disease Prediction Using Effective Machine Learning Techniques”, International Journal of Recent Technology and Engineering (IJRTE) ISSN: 2277-3878, Volume-8, Issue-1S4, June 2019 [8] A.Lakshmanarao, Y.Swathi, P.Sri Sai Sundareswar,” Machine Learning Techniques For Heart Disease Prediction”, International Journal Of Scientific & Technology Research Volume 8, Issue 11, November 2019 [9] Marimuthu M, Abinaya A, 2018, A Review on Heart Disease Prediction using Machine Learning and Data Analytics Approach, International Journal of Computer Applications, Volume 181 – No. 18, pp20-25 . [10] M. Balakrishnan, A.B. Arockia Christopher, P. Ramprakash and A. Logeswari, Prediction of Cardiovascular Disease using Machine Learning, Journal of Physics: Conference Series, Volume 1767, International E- Conference on Data Analytics, Intelligent Systems and Information Security & ICDIIS 2020 11-12 December 2020, Pollachi, India
Copyright © 2022 M. Nirmala, V. Saravanan. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Paper Id : IJRASET46087
Publish Date : 2022-07-30
ISSN : 2321-9653
Publisher Name : IJRASET
DOI Link : Click Here