Ijraset Journal For Research in Applied Science and Engineering Technology
Authors: Mr. Aniket G., Ms. Ankita D., Ms. Deepika N. K., Ms. Priyanka R., Dr. Aruna M G
DOI Link: https://doi.org/10.22214/ijraset.2023.53175
Certificate: View Certificate
Parkinson\'s disease (PD) is a complex neurodegenerative disorder that affects millions of people worldwide. Accurate diagnosis and monitoring of PD are essential for effective treatment and management of the disease. In recent years, machine learning algorithms have shown great promise in assisting with the analysis of PD data and aiding in diagnosis and prognosis. This study presents a comparative analysis of various machine learning algorithms for PD analysis, with the objective of identifying the most effective approach for detecting and predicting PD progression. Multiple machine learning algorithms, including decision trees, support vector machines, random forests, neural networks, and ensemble methods, are evaluated using a comprehensive dataset of PD patients and healthy individuals. The study in corporates feature selection and dimensionality reduction techniques to enhance the algorithms\' performance and reduce computational complexity. The results of the comparative analysis reveal the strengths and weaknesses of each algorithm in PD analysis. In conclusion, this comparative study showcases the effectiveness of machine learning algorithms in the field of PD research. It emphasizes the importance of selecting appropriate algorithms and features for accurate diagnosis and prediction of PD, ultimately leading to improved patient outcomes and better management of the disease.
I. INTRODUCTION
Parkinson's disease (PD) is a chronic and progressive neurodegenerative disorder that is characterized by both motor and non-motor symptoms. The prevalence of PD is high in older adults, with a global population affected more than doubling from 1990 to 2016. At the onset of the disease, patients exhibit motor symptoms such as tremors, stiffness, and other motor deficits. These symptoms are primarily caused by the degeneration of dopaminergic neurons in the basal ganglia, a region of the brain that plays a crucial role in the control of movement. As the disease progresses, non-motor symptoms such as cognitive changes, sleep disturbances, and sensory abnormalities may also be observed. These non-motor symptoms may not be specific to PD and can vary from patient to patient, making it difficult to diagnose the disease based on these symptoms alone. However, early identification of non-motor symptoms is important for effective treatment and management of PD. Currently, the diagnosis of PD is primarily based on the observation of motor symptoms. However, rating scales used to assess disease severity have not been fully validated, and there is a need for more accurate diagnostic tools. Machine learning algorithms have been developed to identify patterns in clinical and genetic data that may predict the development of non-motor symptoms such as impulse control disorders (ICDs) in PD. These algorithms have shown promise in predicting the occurrence of ICDs in PD patients using longitudinal data from two independent cohorts, but further studies are needed to determine their clinical relevance
II. EXISTING SYSTEM
Parkinson's disease (PD) is a prevalent neurodegenerative disorder, affecting approximately 1-2 individuals per 1,000 in the population above 60 years old, with a global prevalence rate of 1% (Tysnes and Storstein, 2017). The incidence of PD has significantly increased over the years, more than doubling from 2.5 million to 6.1 million between 1990 and 2016 due to the aging population (Dorsey et al., 2018).PD is characterized by both motor and non-motor symptoms, impacting various aspects of movement, such as planning, initiation, and execution (Contreras-Vidal and Stelmach, 1995).The diagnosis of PD traditionally relies on motor symptoms, although many rating scales used for assessing disease severity lack comprehensive evaluation and validation (Jankovic, 2008).
A. Disadvantages of Existing System
Misdiagnosis rate for Parkinson's disease by non-specialists is high, up to 25%.
The disease can go undiagnosed for many years, highlighting the need for early prediction.
Existing diagnostic methods are not effective in early prediction and accurate medicinal diagnosis.
The below table I gives a literature summary about the papers being reviewed for this project work.
III. PROPOSED SYSTEM
Proposed system combines voice and spiral drawing data to provide accurate results for Parkinson's disease detection using machine learning algorithms including Logistic regression, Random Forest, SVM, Decision Tree, and K-NN. Doctors can use the combined results to diagnose and prescribe medication.
IV. TRAINED DATA AND PRE-PROCESSING
Data has been gathered from a variety of online platforms, including Kaggle, the UCI library, Coda lab, Driven Data, and the Google Dataset Search Engine.
The size of the datasets is –
Spiral Dataset – 77 Observations, 29 Parameters
Voice Dataset – 757 Observations,729 Parameters.
A. Pre- Processing
In this step the info is visualized well to identify the connection between the parameters present within the data soon take the advantage of also as to get the data imbalances. With this, we need to separate the info into two parts. The first part for training the model like in our model we have used 70 percent of knowledge for training and 30 percentage for testing. The following are key aspects of data preprocessing in the comparative study:
B. Feature Extraction
This component will involve identifying the most important features that can be used to detect Parkinson's disease. Some of the features that have been found to be useful in detecting Parkinson's disease include tremors, gait, and voice patterns. The process of feature extraction begins with acquiring a comprehensive dataset that includes a wide range of variables and measurements related to Parkinson's disease. These variables can include clinical assessments, demographic information, genetic markers, imaging data, and various motor and non-motor symptoms.
C. Trained data:
The trained data typically includes a combination of features and corresponding target variables. The features represent various measurements, assessments, or characteristics associated with Parkinson's disease, such as clinical evaluations, demographic information, genetic markers, imaging data, and motor or non-motor symptoms. The target variable indicates the class or label, distinguishing between PD patients and healthy individuals.
The trained data is utilized to train the machine learning algorithms to learn the underlying patterns and relationships between the features and the target variable. Various supervised learning algorithms, including decision trees, support vector machines, random forests, neural networks, and ensemble methods, can be trained using this data.
D. Prediction
The prediction phase involves taking the trained models and applying them to the test or validation dataset, which consists of unseen instances that were not used during the training process. The models use the learned patterns and relationships from the training data to make predictions on these new instances.
The prediction task can vary depending on the specific objective of the study. It may involve predicting whether an individual has Parkinson's disease or not, based on their features and symptoms. Additionally, the models can be used to predict the progression or severity of Parkinson's disease for existing patients, assisting in prognosis and treatment planning
IV. SYSTEM ARCHITECTURE
The below figure represent the System Architecture of Detection of Parkinson’s Disease structure is used in System.
VI. SYSTEM IMPLEMENTATION
For this project, the system requirements entail using a Windows 64-bit operating system. The chosen technology is Python, and the preferred integrated development environment (IDE) is Python IDLE. The recommended tool for managing packages and environments is Anaconda. It is essential to ensure that Python version 3.6 is installed. In terms of the front-end development, HTML and CSS will be utilized. For the back-end, the project will rely on OpenCV, Keras, and TensorFlow. These software components and frameworks will collectively contribute to the successful execution of the project.
To meet the hardware requirements for this project, it is recommended to have an Intel Core i5 processor. An 8GB RAM capacity will ensure smooth performance and efficient multitasking. The project also requires a minimum of 80GB of hard disk space for storing files and data.
A processor speed of 2.4GHz or higher is preferable to handle the computational demands effectively. These hardware specifications will provide a solid foundation for running the project smoothly and efficiently.
A. Algorithms
For the prediction, multiple supervised learning algorithms are trained using the training set, after which using the testing set performance evaluation occurs.
B. K-NN Algorithm
C. SVM Algorithm
D. Logistic Regression Algorithm
E. Decision Tree Algorithm
F. Random Forest Classifier Algorithm
VIII. FUTURE ENHANCEMENT
These enhancements can contribute to improving the accuracy, interpretability, and practicality of the analysis. Some potential areas for future development include:
After conducting experiments and analysing the results, it can be concluded that machine learning algorithms, specifically k-Nearest Neighbour (KNN), Support Vector Machines (SVM), Random Forest, Decision Tree, Logistic Regression, and Random Forest, have been effective in diagnosing Parkinson\'s disease while considering both time and space efficiency. While Decision Trees have shown efficiency in this context, other algorithms such as KNN, SVM, Random Forest, and Logistic Regression should still be considered, as they might offer better accuracy or performance under different circumstances. In summary, while all the tested machine learning algorithms have shown effectiveness in diagnosing Parkinson\'s disease, KNN, SVM (with a linear kernel), and Random Forest have exhibited superior efficiency in terms of both time and space requirements. Decision Tree have also demonstrated satisfactory efficiency with increment of 9%, our system is achieving from 87% to 96%, although they may be slightly less optimal compared to the former algorithms. Ultimately, the choice of algorithm depends on the specific requirements and constraints of the application.
[1] M. McHenry, ‘‘Symptoms and possible causes cures for parkinsons disease,’’ Brain Matters, vol. 3, no. 1, pp. 8–10, 2021. [2] J. L. Ernfors, ‘‘Heredity of parkinsons disease,’’ Students final thesis, Riga Stradins Univ., Latvia, 2021. [3] B. Mulhall and A. Tietjen, ‘‘The effectiveness of physical activity to increase strength and motor control during daily occupations in adults diagnosed with parkinsons disease,’’ Creighton Univ., Omaha, NE, USA, Tech. Rep., 2021. [Online]. Available: http://hdl.handle.net/10504/130306 [4] M. H. Monje, G. Foffani, J. Obeso, and A. Sánchez-Ferro, ‘‘New sensor and wearable technologies to aid in the diagnosis and treatment monitoring of Parkinson’s disease,’’ Annu. Rev. Biomed. Eng., vol. 21, pp. 111–143, Sep. 2019. [5] R. Lu, Y. Xu, X. Li, Y. Fan, W. Zeng, Y. Tan, K. Ren, W. Chen, and X. Cao, ‘‘Evaluation of wearable sensor devices in Parkinson’s disease: A review of current status and future prospects,’’ Parkinson’s Disease, vol. 2020, pp. 1–8, Dec. 2020. [6] R. Deb, G. Bhat, S. An, U. Ogras, and H. Shill, ‘‘Trends in technology usage for Parkinson’s disease assessment: A systematic review,’’ MedRxiv, Jan. 2021. [7] G. AlMahadin, A. Lotfi, E. Zysk, F. L. Siena, M. M. Carthy, and P. Breedon, ‘‘Parkinson’s disease: Current assessment methods and wearable devices for evaluation of movement disorder motor symptoms—A patient and healthcare professional perspective,’’ BMC Neurol., vol. 20, no. 1, pp. 1–13, Dec. 2020. [8] S. Abbas, J. Condell, P. Gardiner, M. McCann, S. Todd, and J. Connolly, ‘‘Can multiple wearable sensors be used to detect the early onset of Parkinson’s disease?’’ in Proc. 31st Irish Signals Syst. Conf. (ISSC), Jun. 2020, pp. 1–6. [9] M. U. Sarwar and A. R. Javed, ‘‘Collaborative health care plan through crowdsource data using ambient application,’’ in Proc. 22nd Int. Multitopic Conf. (INMIC), Nov. 2019, pp. 1–6. [10] J. G. V. Habets, M. Heijmans, A. F. G. Leentjens, C. J. P. Simons, Y. Temel, M. L. Kuijf, P. L. Kubben, and C. Herff, ‘‘A long-term, reallife parkinson monitoring database combining unscripted objective and subjective recordings,’’ Data, vol. 6, no. 2, p. 22, Feb. 2021
Copyright © 2023 Mr. Aniket G., Ms. Ankita D., Ms. Deepika N. K., Ms. Priyanka R., Dr. Aruna M G. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Paper Id : IJRASET53175
Publish Date : 2023-05-27
ISSN : 2321-9653
Publisher Name : IJRASET
DOI Link : Click Here