Different Disease Prediction

Authors: Aayush Mankar, Atharv Pawar, Akash Pore, Amar Waghmare

DOI Link: https://doi.org/10.22214/ijraset.2023.56897

Abstract

One of the most significant subjects of society is human healthcare. It is looking for the best one and robust disease diagnosis to get the care they need as soon as possible.The task of following new approaches is challenging these disciplines, moving beyond the conventional ones.The actual number of new techniques makes it possible to provide a broad overview that avoids particular aspects. To this end, we suggest a systematic analysis of human diseases related to machine learning. This research concentrates on existing techniques related to machine learning growth applied to the diagnosis of human illnesses in the medical field to discover exciting trends, make unimportant predictions, and help decision-making.This paper analyzes unique machine learning algorithms used for healthcare applications to create adequate decision support.This paper intends to reduce the research gap in creating a realistic decision support system for medical applications. .Datasets are used in this project for predictions. Random Forest algorithm is to create multiple decision trees during training and then combine their predictions to obtain a more accurate and robust model. Ensemble models based on deep learning have made significant contributions to the medical field, particularly in the area of disease prediction. Breast cancer is a highly aggressive disease with a high mortality rate. Timely and effective prediction of breast cancer can reduce the risk of it progressing to later stages and the need for unnecessary medications. This study proposes a novel machine learning algorithm to predict the AD progression utilising a multi-task ensemble learning approach. Specifically, we present a novel tensor multi-task learning (MTL) algorithm based on similarity measurement of spatio-temporal variability of brain biomarkers to model AD progression. In this model, the prediction of each patient sample in the tensor is set as one task, where all tasks share a set of latent factors obtained through tensor decomposition. Furthermore, as subjects have continuous records of brain biomarker testing, the model is extended to ensemble the subjects’ temporally continuous prediction results utilising a gradient boosting kernel to find more accurate predictions

Introduction

I. INTRODUCTION

In an era characterized by unprecedented advancements in technology and an ever-increasing volume of healthcare data, the field of disease prediction has witnessed a remarkable transformation. Predicting diseases before they manifest clinically is a pivotal aspect of modern healthcare that holds the promise of early intervention, improved patient outcomes, and more efficient resource allocation within the healthcare system. The conventional methods of disease diagnosis and prediction, heavily reliant on the expertise of healthcare professionals and manual processes, often fall short in terms of accuracy, speed, and scalability. This is where the convergence of healthcare and machine learning emerges as a game-changer.

Machine learning, a subset of artificial intelligence, has revolutionized various industries, and healthcare is no exception. Its capacity to harness the power of big data, extract meaningful patterns, and make data-driven predictions has opened up new horizons for disease prevention and management. This report delves into the exciting intersection of healthcare and machine learning, where predictive algorithms are employed to anticipate disease onset, progression, and outcomes with increasing accuracy.

Nowadays, Healthcare services are so expensive so everybody can’t afford it So there is need to develop cost effective system to detect the diseases.The main advantage of this project is that we can get the test result immediately at our home with a just few clicks.Many of existing machine learning models are concentrating on one disease per analysis.For example,one for diabetes,one for heart disease ,one for lung disease and many such.If user wants to predict more than one disease ,he/she has to go through different sites.There is no common site where one analysis can perform more disease prediction.Some models have inconsistent accuracy which can seriously affect the patients

II. PROCESSED SYSTEM

User Registration: Users start by registering on the platform, providing essential details such as name, email,aadhar number password and mobile number.
Security measures: After registration the user will get the conformation mail on their mail id which consist of login information.
Choose Action: Upon registration user needs to enter the login information to access the website where they will check their disease.
Select Disease: User needs to select the disease which he/she wants to check among the available diseases.
Enter details: The user will require to enter his details which are being asked by the system.
System Details: User will be required to enter his name,gender, age ,values or upload the required files.
User Interface: After completion of all the steps the user will get a pop up screen which will consist of the result with all the details of User which he/she has entered.

III. LITERATURE REVIEW

IV. METHODOLOGY

A. Algorithms:

Machine learning: Machine learning is the most sought-after approach, nowadays, which is based on several statistical algorithms that make the machine learn about the data and the relationships between features of the data, in which, primary task is to gain the knowledge, predict or classify the data, all the process is based on the data, known as ‘training data’. Machine learning has variety of applications in almost all the fields of the businesses and research.
Supervised Machine Learning: The labeled output indicates usage of supervised method of learning. These algorithms learn from previously inputted data, known as training data, execute analyses, and utilize the results to predict future occurrences of any new data within the existing categories. To accurately forecast test data, you'll need a lot of data with better data pattern analysis. Having the comparative analysis of predicted vs real or ground truth values and thereby refining the methods to maximize the accuracy based on the previous mistakes the ML algorithms further can be developed and trained.
COMPUTER VISION: In the field of machine learning known as computer vision, pictures and videos are analysed and understood. It is used to educate computers how to "see" and how to use visual data to do tasks that humans can do with visuals.In order to translate visual data, computer vision models need features and contextual data that are discovered during training. In order to perform prediction or decision-making tasks, models can now understand photos and videos.
RANDOM FOREST: Popular machine learning algorithm Random Forest is a part of the supervised learning methodology. It can be applied to ML issues involving both classification and regression. It is built on the idea of ensemble learning, which is a method of integrating various classifiers to address difficult issues and enhance model performance. According to what its name implies, "Random Forest is a classifier that contains a number of decision trees on various subsets of the given dataset and takes the average to improve the predictive accuracy of that dataset." Instead than depending on a single decision tree, the random forest uses forecasts from each tree and predicts the result based on the votes of the majority of predictions.
XGBOOST: XGBoost is a distributed gradient boosting library that has been optimised for quick and scalable machine learning model training. A number of weak models' predictions are combined using this ensemble learning technique to get a stronger prediction. Extreme Gradient Boosting, or XGBoost, is one of the most popular machine learning algorithms because it can handle big datasets and perform at the cutting edge in many machine learning tasks like regression and classification.Its effective handling of missing values, which enables it to handle real-world data with missing values without requiring a lot of pre-processing, is one of the key characteristics of XGBoost. Additionally, XGBoost includes integrated parallel processing capability, allowing you to train models on huge datasets quickly.

Conclusion

Multiple diseases prediction projects have the potential to revolutionize healthcare by shifting the focus from treatment to prevention.While there are challenges and complexities to address, the benefits of early disease detection, personalized healthcare, and improved health outcomes make these projects a valuable pursuit in the field of medicine and data science. By carefully navigating the associated issues, we can work toward a future where healthcare is more proactive,individualized and effective. Intelligent data processing is a social necessity for identifying, as soon as possible, of useful and robust disease detections to provide patients with appropriate care within the shortest possible time. This detection has been carried out in recent decades by detecting exciting patterns in databases. Smart data processing is emerging as a requirement for effective and robust diseases to be found by society. Detection of patients providing the necessary treatment as soon as possible within the shortest possible period. This identification has been achieved in recent decades through the method of identifying exciting patterns in databases. A comprehensive overview of intelligent data analysis tools in the medical sector is given in this paper. Some examples of some algorithms used in these medical field areas are also presented, examining potential patterns based on the target searched, the methodology used, and the application field. Given the pace at which new works emerge in this emerging field, a systematic analysis such as the one we have just presented may become obsolete in a short period.

References

[1] G. Chugh, S. Kumar, and N. Singh, ‘‘Survey on machine learning and deep learning applications in breast cancer diagnosis,’’ Cognit. Comput., vol. 13, no. 6, pp. 1451–1470, Nov. 2021. [2] S. Kumar and S. Sharma, ‘‘Sub-classification of invasive and non-invasive cancer from magnification independent histopathological images using hybrid neural networks,’’ Evol. Intell., vol. 15, no. 3, pp. 1531–1543, Sep. 2022. [3] N. Arya and S. Saha, ‘‘Multi-modal classification for human breast cancer prognosis prediction: Proposal of deep-learning based stacked ensemble model,’’ IEEE/ACM Trans. Comput. Biol. Bioinf., vol. 19, no. 2, pp. 1032–1041, Mar. 2022. [4] Z. S. Khachaturian, ‘‘Diagnosis of Alzheimer’s disease,’’ Arch. Neurol., vol. 42, no. 11, pp. 1097–1105, 1985. [5] S. Tabarestani et al., ‘‘A distributed multitask multimodal approach for the prediction of Alzheimer’s disease in a longitudinal study,’’ NeuroImage, vol. 206, pp. 1–15, Feb. 2020. [6] X. Wang, X. Zhen, Q. Li, D. Shen, and H. Huang, ‘‘Cognitive assessment prediction in Alzheimer’s disease by multi-layer multi-target regression,’’ Neuroinformatics, vol. 16, nos. 3–4, pp. 285–294, Oct. 2018. [7] R. S. Doody, V. Pavlik, P. Massman, S. Rountree, E. Darby, and W. Chan, ‘‘Predicting progression of Alzheimer’s disease,’’ Nature BMC, vol. 2, no. 1, pp. 1–9, 2010. [8] F. Islam, R. Ferdousi, S. Rahman, and H. Y. Bushra, Computer Vision and Machine Intelligence in Medical Image Analysis. London, U.K.: Springer, 2019. [9] WorldHealth Organization(WHO).(2020).WHORevealsLeading Causes of Death and Disability Worldwide: 2000–2019. Accessed: Oct. 22, 2021. [Online]. Available: https://www.who.int/news/item/09-12-2020-whoreveals-leading-causes-of-death-and-disability-worldwide-2000-2019 [10] A. Frank and A. Asuncion. (2010). UCI Machine Learning Repository. Accessed: Oct. 22, 2021. [Online]. Available: http://archive.ics.uci.edu/ml

Copyright

Copyright © 2023 Aayush Mankar, Atharv Pawar, Akash Pore, Amar Waghmare. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download Paper

Paper Id : IJRASET56897

Publish Date : 2023-11-21

ISSN : 2321-9653

Publisher Name : IJRASET

DOI Link : Click Here