A Review on Melanoma Cancer Detection Using Artificial Intelligence

Authors: Harshada Mhaske, Mandar Patil, Jeevan Thote, Ajaykumar Shendage, Rutuja Tallapalli

DOI Link: https://doi.org/10.22214/ijraset.2023.49231

Abstract

Abstract: The melanoma skin cancer is the most dangerous cancer detected till the date. The reason is as it is difficult for dermatologists or physicians to detect it at early stages, an AI based system is required to detect the melanoma skin cancer at early stage. Skin cancer is one of the fatal diseases of which patients are increasing day by day. It can be easily cured if identified in early stages. Skin cancer is primarily brought on by the abnormal proliferation of melanocytic cells. Skin cancer can happen due to genetic disorder or UV exposure on skin which result in black and brown spot on the skin. The three cancers are : squamous cell cancer, melanoma cancer, and basal cell cancer. With early detection, this skin cancer can be completely cured. Before this the traditional method is the biopsy method for diagnosing melanoma which is very painful one and a time-consuming process. This study gives a computer-aided detection system for the early identification of melanoma. In this study, the image processing techniques and algorithms like Support vector machine (SVM), K-Nearest Neighbor (KNN), Convolution Neural Network and Random Forest are used to design an diagnosing system which is efficient.

Introduction

I. INTRODUCTION

Our concept is based on providing the patients and doctors a simple way of assistance from a Machine Learning model to detect melanoma cancer. Detecting cancer early can help prepare patient and the doctors during the treatment procedure. In rural areas, people don’t have access to high-tech facilities and labs to detect cancer in its early stage. We want to provide a comfortable way for people to get a better diagnosis to receive a better treatment plan from early stages of cancer.

Biopsy is the long-established method used to detect a skin cancer which is more painful. This technique consumes extra time and cost. That’s why, the diagnosis of person’s skin cancer is carried out based on computer technologies to solve above issues. In Our proposed system there are four processes to find out the skin cancer. First of all, it takes images from the dataset HAM10000. Second is image pre-processing using Median Filter. Later, Segmentation using K-means clustering. And it is followed by Extraction of features in which using i) LBP, ii) ABCDE rule, iii) GLCM feature. Fourth step is to classify the given image of the dataset either it is a normal or a cancer affected image using different classifiers.

Many studies have shown the effectiveness of using Machine Learning algorithms for detecting Melanoma Cancer from the HAM10000 Dataset images.

Research works show the importance of segmentation and pre-processing the available images for better results from the model. An ensemble approach has also been proposed earlier for predicting the risk of Melanoma Cancer. All these relevant works have formed a base for our work and practical application of a machine learning model to help patients and doctors in detecting melanoma cancer.

II. LITERATURE REVIEW

Muhammad Ali Farooq, Muhammad Aatif Mobeen Azhar, The system was developed based on computer vision algorithms such as ABCDE rule of melanoma, seven-point checklist, CASH algorithm and Menzies method and it is based on modern image processing. The use of these algorithms significantly improved the accuracy of detection and analysis of suspicious skin lesions compared to visual inspection. Recently, with the development of imaging technology and computer vision algorithms, In medical field the interest has increased to minimize the errors and ambiguity in investigation procedure and to provide reliable secondary findings to doctors.

Maen Takruri, Maram W. Rashad have developed a non-invasive automated system based on Support vector machine classifier for detecting melanoma cancer. The system uses features which were extracted from grayscale concurrency matrix (GLCM) of grayscale images of skin lesions and the color features were obtained from the original color image.

Evaluation performed on dataset which consist of digital images both benign and malignant. The accuracy results of the test achieved by the support vector machine classifier used in this experiment is 82.7% for GLCM features, 81.48% for GLCM and color features using ROI segmentation. Moreover, the proposed system using ROI segmentation resulted in a sensitivity of 83.6% for GLCMs and 83.33% for GLCMs and color features. We also obtained a specificity of 80% using GLCM and a specificity of 76.19% for GLCM and color using ROI segmentation.

F.Ercal, M.Moganti, proposed the technique based on the border extraction of the images. The noise presented in the image is removed by using median filter. In addition to that they implemented histogramming as well as rough color segmentation method.one of the conclusion was that chromaticity and spherical transformation had given best result in precision. Thresholding technique is used to distinguish the various and different image components present in image. Boundary detecting algorithm for image is stated which is helpful for color images of the skin cancer that are provided.

Mustafa Qays Hatem, developed the an automated system based on machine learning technique which is going to serve a perfect helper for the physicians that are working in classifying skin lesions. There are four major steps that are implemented given as : image preprocessing, segmentation, feature extraction, and classification. Graphical User Interface (GUI) is used for better user friendly environment which helps in better visualization of statistical data. For this, Matlab is used as the one of the tool to do coding part. A morphological closing is the technique used for preprocessing.

Sushant Kumar, S. Nandhini, Adnan Afridi, Mohammed Abdul Sofiyan, the proposed system classifies human skin cancer according to the dermatoscopic images into seven different types. It solves this problem using the HAM10000 (Human-Against-Machine) dataset. This dataset contains 10000 training images. This technique uses a Random Forest Algorithm to classify skin cancer into its different types.

Chi Mai Luong, Tri Cong Pham, Thi Phuong Nghiem, Van-Dung Hoang, Antoine Doucet, Giang Son Tran, proposed in this paper involves 4 main steps and that are Data processing, feature extraction, melanoma classification, result analysis. Random forest is one of the techniques used in this paper for melanoma classification. Datasets used in this paper are ISIC 2016 and HAM10000.

Kassem MA, Hosny KM, Fouad MM, in this paper author proposed the classification of the human skin wounds(legions) into eight different classes. Those eight classes are squamous cell carcinoma, actinic keratosis, melanocytic nevus, melanoma, vascular lesion, benign keratosis, dermatofibroma and basal cell carcinoma. The proposed system in the study achieves accuracy near about (94.9%), sensitivity (79.8%), specificity (97.0%) and precision approx. (80.3%).

Yap J, Yolland W, Tschandl P, Dermatoscopy is one of the maximum important methods for detecting and classifying pores and skin cancer via imaging. computerized evaluation of these ensuing images can be executed as a technique to assist dermatologists make better choices. this is primarily based on information to make sure that the most efficient course of movement is taken closer to the affected person. This analysis can be facilitated by using new technology such as convolutional neural networks (CNNs).

Rosadi R, Hadi, S.,B. Y., Irawan, B., and Tumbelaka, suggested a simple, effective, and integrated computer vision method used to recognize and analyze the early stages of melanoma. The segmentation, filtering, and localization phases are the three stages that form the foundation of the structure development. The user can divide the entity initially using a variety of color spaces and appealing learning and non-learning strategies. Morphological filters have been correlated for the purpose of removing image noise during the stage of filtering. K-means algorithm and the Associated constituent categorizing is used to categorize items during the localization phase. Melanoma tumor types are determined by an ABCD feature-based score. Skin cancer research has been successfully controlled using online skin cancer tumor photographs.

III. METHODOLOGY

A. Proposed System

In this proposed system, For the purpose of classification of skin lesions of melanoma skin cancer various techniques are used. These techniques helped to build intelligent as well as precise decision support system. This System contains four main stages: Image preprocessing, skin lesion segmentation, feature extraction. The whole architecture of model is given in fig .This architecture contains flowchart of the system in which all the processes are going to be carried out. The very first step which is image preprocessing on test images consist of noise removal, augmentation, hair removal and many more.

For the image pre-processing purpose, Median Filter method is used. Whereas K-means clustering technique is used for the image segmentation. It uses different types of distance formulae for calculating distance. Augmentation and morphological techniques are used to reduce noise, enhancement of targeted area, and making it easier to spot by selecting important details. Feature extraction is done by GLCM which stands for Gray Level Co-occurrence Matrix. The GLCM is an arithmetic approach that works mainly on dimensional relationship of small unit of image which is pixel.

The removal of skin lesion is done by ABCD algorithm. The ABCD stands for A - Asymmetry, B - Border, C - Color, and D – Diameter. Feature extraction in done by using ABCD rule. Next part is classification, Classification is done by using 4 various algorithms which are KNN , SVM and RF. KNN stands for K-Nearest Neighbor, CNN stands for Convolutional neural network, RF stands for random forest and SVM stands for Support Machine Vector. Hyperparameter tunning can be done to get more precise result.

B. Techniques used

Pre-Processing: The median filter is used for pre-processing in the initial stage. The images are processed through a median filter to remove extra hair, bubbles, and noise. Usually, the image of skin cancer includes fine hair, noise, and bubbles. These are being removed using a median filter because they do not contribute to cancer. The location and amplitude of edges are preserved by median filters. The median filter reduces the variation of the image's intensity by utilizing the neighborhood median to smooth the image.
Median Filter: The median channel reduces disorder in an image in a manner similar to that of the mean channel. The median channel can be distinguished for two descriptions as in equation median

The operations of two images are designated by S(x) and A(x). The specific channels have statistics that are really precise and minute. Only the central significance of all surrounding pixel standards is represented by the median. It is highly possible to omit distinct types of disturbance using median filtering.

3. Segmentation: The image can be segmented after processing so that it can be used. The segmentation stage accepts the pre-processed image as an input. The segmentation process examines the ridges via k-mean clustering and the outcome is presented.

4. Feature Extraction: The next phase in the recognition and classification process is feature extraction. The segmented image isolates the features and thus the GLCM and ABCD rule methodology are used to extract the features.

5. ABCDE Rule: The extraction of the cutaneous lesion is done using ABCDE rule-based recognition. The five features stand for A- Asymmetry, B- Border, C- Color, D-Diameter, E-Evolution are retrieved in the subsequent part from the pre-processed image during feature extraction. In the computational analysis of skin cancer, features are extracted using the ABCDE characteristics.

a. Asymmetry: Melanoma lesions have an asymmetrical appearance. The asymmetry index has an impact on the entity's degree of symmetry. By dividing the parallel or upright image, this is produced.

b. Border: The border of melanoma is crooked, ragged and indistinct. To determine the boundary abnormality, one uses the compactness index.

c. Color: Melanomas rarely have a color that is similar to a normal mole. The normalized Euclidean distance between each pixel is

d. Diameter: The melanoma lesion is larger than 6mm in diameter. The diameter in the image is determined and measured at 6mm.

e. Evolution: The spot appears distinct from the others or is evolving in terms of size, shape, or color.

6. Shape Feature: The abnormality index, irregularity index, and distance from the lesion in the binary image are the characteristics of shape.

7. Classification: The final stage of classification divides the images into benign and melanoma categories. Melanoma refers to a tumor image, while benign refers to a normal image. A plan for utilizing support vector machines is offered.

a. Support Vector Machine: Support Vector Machine(SVM) is a supervised learning technique. It is a machine learning technique which is used for not only Classification but also Regression problems. But it is mostly used in Classification problem. We design all data input as a position in n-structural hole with significance of each characteristic being value of an explicit correlation in this SVM algorithm. Support Vector Machine is also known as front line algorithm. SVM is used to finely separate two classes :- 1) hyper-plane and 2) contour.

b. Random Forest: Random Forest is the supervised learning technique. It is a machine learning technique which is used for not only Classification but also Regression problems. For solving a complex problem and improvement the performance of a system . A classifier called random forest uses a variety of decision trees on different subsets of a present dataset to alter the projected accuracy of that particular dataset. The random forest takes the forecast from every tree and bases it on the majority votes of predictions, rather than relying on one or more constrained decision trees. Additionally, it interprets the result. If there are more trees in the forest, the accuracy will increase. It avoids overfitting issues.

c. K-Nearest Neighbor: It is the simplest classification model to use is K Nearest Neighbor. This method recognizes images in the test set by labelling the nearest point in the learning set, where distances are quantified in image space. The Euclidean distance metric is frequently used to gauge how near the data points are to one another in KNN. Every pixel in a dataset is has assigned distance. The distance is the Euclidean distance between two pixels. A KNN classifier by default uses this Euclidean distance. Following the feature extraction procedure, the retrieved features are immediately fed into the classifiers, the machine learning tools, to be divided into two distinct groups. There are two stages to the process: the training phase and the testing phase.

d. Convolutional Neural Networks (CNN): In computer vision, we have convolutional neural networks that are very common in tasks of computer vision such as object detection, image segmentation, and image classification. Image classification is one of the most in-demand technologies today and is used in various fields such as healthcare, business, and more. AI Convolutional Neural Network (CNN ) is a type of neural network for processing images. This type of neural network takes input from images, extracts features from images, and provides learnable parameters to efficiently perform classification, recognition, and many other tasks.

C. Plan Of Activation

Initially when an image is transmitted to the system image will be pre-processed using the median filtering method. To segment the pre-processed image K-Mean Clustering will be used. After that, features are extracted from the segmented images using the GLCM feature method, ABCD rule, and shape feature. To find the best results techniques of different classification are used. The classification are divided into three types. The suggested architecture for our research project is shown in Figure 1.

Conclusion

This paper discusses the classification and segmentation of skin cancer. Segmentation is a technique used to group input images into regions that are similar to each other. The increased precision of K-means Clustering technology can help section skin lesions cleanly. In this study a new classification technique with the increased stage evaluation is presented. The algorithm which is proposed here uses a supervised learning algorithm, such as a SVM or Random Forest, to compare the performance of three different classifiers, in this case, the SVM, the KNN and the Random Forest. The SVM and Random Forest performed significantly better than the KNN in terms of performance. SVM is effective in detecting bias in training data, even when the sample exhibits some initial bias. Given that the optimality problem is convex, it provides a unique solution. An effective out-of-sample generalisation is offered by artificial intelligence (AI) tools.

References

[1] Soniya Mane, Dr. Swati Shinde , “A Method for Melanoma Skin Cancer Detection Using Dermoscopy Images”, IEEE, Fourth International Conference on Computing Communication Control and Automation (ICCUBEA)., vol.3, (2018), pp.22-28. [2] R. Garnavi, M. Aldeen, M. E. Celebi, A. Bhuiyan, C. Dolianitis, G. Varigos, “Automatic segmentation of dermoscopy images using histogram thresholding on optimal color channels”, International Journal of Medicine and Medical Sciences., vol.1, no.2, (2010) ,pp. 126– 134. [3] G. Argenziano, H. Soyer, S. Chimenti, R. Talamini, R. Corona, F. Sera, and M. Binder, “Dermoscopy of pigmented skin lesions: Results of consensus meeting via the Internet”, Journal of the American Academy of Dermatology., vol.48, pp. (2003),679-693.. [4] .M.E. Celebi, H. Iyatomi, G. Schaefer, and W. V. Stoecker,” Lesion border detection in dermoscopy images”,Computerised Medical Imaging and Graphics., vol.33, no.2, (2009),pp. 148-153. [5] C.Grana,G.Pellacani, R.Cucchiara,and S.Seidenari, “A new algorithm for borderdescription of polarized light surface microscopic images of pigmented skin lesions,” IEEE Trans Med Imaging., vol. 22, no. 8, (2003), pp. 959–964. [6] A. Bono, S. Tomatis, and C. Bartoli, “The ABCD system of melanoma detection: A spectrophotometric analysis of the asymmetry,border, color, and dimension”, Cancer.,vol.85,no.1, (1999), pp. 72–77. [7] Dr. Prakash B., R Lilly Kamari, M Navya Niharika, Pulamarasetti Poornima “DETECTION OF MELANOMA IN SKIN CANCER USING DEEP LEARNING” ISSN: 2278-4632, Vol-11 Issue-07 No.01 July 2021. [8] A. Murugan , Dr. S. Anu H Nair , Dr. A. Angelin Peace Preethi , Dr. K. P. Sanal Kumar “Diagnosis of skin cancer using machine learning techniques” 18 December 2020, 0141-9331/© 2020 Elsevier B.V [9] Tri Cong Pham, Giang Son Tran, Thi Phuong Nghiem, Antoine Doucet, Chi Mai Luong, Van-Dung Hoang, “A Comparative Study for Classification of Skin Cancer”,pp.267-272, 10.1109/ICSSE.2019.8823124 [10] Enakshi Jana, Dr. Ravi, S. Saraswathi “Research on skin cancer cell detection using image processing”, IEEE, 2017 [11] Linlin Wu1, Saurabh Kumar Garg2, and Rajkumar Buyya1” Service Level Agreement(SLA) based SaaS Cloud Management System” , IEEE 21st International Conference on Parallel and Distributed Systems, 2015. [12] Scott E.Umbaugh, Randy H.Moss, and William V.Stoecker,"Applying Artificial [13] Intelligence to the Identification of Variegated Coloring in Skin Tumors", IEEE Engineering in Medicine And Biology, Magazine, Vol.10, No.4, pp 57-62, 1991. [14] O. Abuzaghleh, B. D. Barkana, and M. Faezipour, “SKINcure: A real time image analysis system to aid in the malignant melanoma prevention and early detection”in Proc. IEEE Southwest Symp. Image Anal. Interpretation (SSIAI),Apr. 2014, pp.85_88. [15] Hadi, S., Tumbelaka, B. Y., Irawan, B., and Rosadi, R., “Implementing DEWA Framework for Early Diagnosis of Melanoma” International Conference on Computer Science and Computational Intelligence (ICCSCI 2015). Procedia Computer Science 59:410–418, 2015 [16] T.D. Srividya, V.Arulmozhi, “A Review of Threshold based Segmentation for Skin Cancer with Image Processing”, IJRTE, ISSN: 2277-3878, Volume-7, Issue-5C, February 2019. [17] Mohd Afizi, Mohd Shukran, Nor Suraya Mariam Ahmad, Suzaimah Ramli, Farhana Rahmat, “Melanoma Cancer Diagnosis Device Using Image Processing Techniques”, IJRTE, ISSN: 2277-3878, Volume-7, Issue-5S4, February 2019. [18] Yang, J., Xie, F., Fan, H., Jiang, Z., & Liu, J. (2018). “Classification for dermoscopy images using convolutional neural networks based on region average pooling”. IEEE Access, 6, 65130–65138. [19] Vedanti Chintawar, Jignyasa Sanghavi, “A Review on Computer Aided Melanoma Skin Cancer Detection using Image Processing”, EasyChair Print, No. 584, October 24,2018. [20] Nidhal K. EL Abbadi, Zahraa Faisal, “Detection and Analysis of Skin Cancer from Skin Lesions”, International Journal of Applied Engineering Research ISSN 0973-4562 Volume 12, Number 19 (2017) pp. 9046-9052.

Copyright

Copyright © 2023 Harshada Mhaske, Mandar Patil, Jeevan Thote, Ajaykumar Shendage, Rutuja Tallapalli. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download Paper

Paper Id : IJRASET49231

Publish Date : 2023-02-23

ISSN : 2321-9653

Publisher Name : IJRASET

DOI Link : Click Here