Generative Data Augmentation and ARMD Classification

Authors: N. Sree Bhavani, G. Narendra Babu Reddy, Y Sravani Devi , M. Bhavani, P. Chandana Reddy, V. Abhignya Reddy

DOI Link: https://doi.org/10.22214/ijraset.2023.54178

Abstract

Age Related Macular Degeneration (ARMD) is a type of eye disease which normally have an effect on the central vision of a person. This Disease might sometimes lead to permanent vision loss for some people. It affects the people over the age of 50. So, basically there are 2 different types of ARMD i.e., Dry and Wet. Dry ARMD will generate a tiny amount of protein deposits called drusen, whereas Wet ARMD occurs whenever any abnormal blood vessel is developed under the retina, so sometimes this blood vessels might leak blood fluid, this type of ARMD is very severe and can even lead to permanent central vision loss. Therefore, it is necessary for early detection of the disease. Generative Data Augmentation for ARMD Classification is deep learning based which uses Convolutional Neural Network (CNN) model for generating images to accurately identify the disease. Deep Learning Diagnostic models require expertly graded images from extensive data sets obtained in large scale clinical trials which may not exist. Therefore, (Generative Adversarial Networks) GAN-based generative data augmentation method called Style GAN is used for generating the images. Generative deep learning techniques is used to synthesize new large datasets of artificial retinal images from different stages of ARMD using the images from the already existing datasets. The performance of ARMD diagnostic DCNNs will be trained on the combination of both real and synthetic datasets. Images obtained by using GAN appear to be realistic, and increase the accuracy of the model. It then continues with classifying the retinal images into one of the three classes i.e., dry, wet or normal using CNN model. It also compares the accuracy against the model with traditional augmentation techniques, towards improving the performance of real-world ARMD classification tasks.

Introduction

I. INTRODUCTION

Age-related macular degeneration (ARMD) is the most common cause of severe vision loss in elderly persons in developed countries and accounts for one-third of cases of untreatable vision loss. ARMD is a painless, irreversible, degenerative eye condition associated with the damage and ultimate death of photoreceptors. There are two types of ARMD, dry and wet; dry ARMD is far more common, but wet ARMD is usually a more advanced disease state and is associated with rapid distortion and sudden loss of central vision. Various agents are used for treatment, and lifestyle changes and dietary constituents are important for preventing ARMD and halting its progression. As new therapies become available, early identification of patients with risk factors for ARMD will be increasingly important. On the same note, retinal fundus images are exceptionally valuable source of information for ophthalmologists to be able to recognize retina problems.

As prevention is better than cure, early detection could improve the chances of cure and be able to prevent blindness. Retinal problems like diabetic retinopathy and retinitis pigmentosa can be diagnosed using retinal fundus images by medical experts. In the most recent times, machine learning research is going on for diagnosing diseases like diabetic retinopathy [6], glaucoma by extracting features and then classifying the image. Researchers are trying to come up with the optimal ways of being able to automatically classify retinal problems from those of the healthy ones with the help of feature extraction. But most importantly, to be able to classify retinal fundus images accurately, a decent size of the dataset is necessary. When the dataset size does not match with the user requirements, generation of more images is needed. Hence the usage of Generative Adversarial networks became necessary.

Generative Adversarial Networks (GAN) are powerful classes which consist of generative model and discriminator model. Generative models generate realistic images that are hard to differentiate from real images. Using Generative adversarial networks, the dataset that is currently being used could be increased as it synthesizes fake images that are very hard to separate apart from the actual images.

II. LITERATURE SURVEY

Neural Networks and Deep Learning: A neural network comprises several key elements that work together to process information and make predictions. These include neurons, which act as functions that receive input from the preceding layer and generate an output in the form of a binary value. The input layer and its neurons receive input data, while hidden layers contain numerous neurons that process the input and transmit it to the output layer. Synapses connect the neurons and layers to allow the flow of information within the network. The complexity and depth of the neural network depend on the number of hidden layers, and the amount of input data required to solve a problem increases with network size.
Generative Adversarial Networks: A generative adversarial network [1], also known as a GAN. It is an unsupervised deep learning framework that can learn from a set of training data and produce new images [2] that shares same properties as training data.

Two neural networks comprise a generative adversarial network:

a. Generator: which learns to generate realistic fictitious data from a random seed The generator's fake examples are used as negative examples to train the discriminator.

b. Discriminator: which learns to distinguish between fake and real data The discriminator penalizes the generator if it produces implausible results.

Both the Generator and Discriminator work opposite to each other to make themselves learn about the image accurately and try to predict it real or fake [3].

3. Style GAN: Style GAN is a type of adversarial generative network. It borrows from the style transfer literature and employs an alternative generator architecture for generative adversarial networks, specifically adaptive instance normalization. Otherwise, it is like Progressive GAN in that it employs a progressively increasing training regimen.

4. Convolutional Neural Networks: They have three layers, they are: Convolutional layer, Pooling layer Fully connected layer. Convolutional layer, which serves as the primary building block and performs the bulk of the computation. The feature detector, also known as a kernel or filter, plays a crucial role in this layer, as it scans the receptive fields of an image to identify specific features. By applying the filter to different areas of the image, a dot product is computed between the input pixels and the filter. This process generates a feature map, activation map, or convolved feature matrix multiplication, which is the output obtained by convolving the input with the filter in a convolutional neural network."

III. METHODOLOGY

The set of principles, practices, and procedures that guide the way research, investigations, or analyses are carried out. It includes concepts relevant to a branch of science and the conceptual study of the body of methods.

A. Architectural Design

Architecture is defined as the art of constructing the basic design of a software system. Architectural design is the process of creating a plan or blueprint for the construction or renovation of a structure.

B. Modules

Dataset

The dataset in the project was collected from a private repository and consist of two classes of images each class consists of 50 images each, Dry and Wet. Along with these two classes, we have also used No ARMD/Normal people eye dataset combined with other two classes. Below are some sample images of 3 classes.

2. Algorithm-1 (AUGMETATION)

Generator and Discriminator Models:

Generative Adversarial Networks (GAN) [4] constitutes two models- generator and discriminator. Both generator and discriminator models should be trained using training data. Discriminators are trained using 100% real and 100% fake images. Generator is trained using discriminator loss. Error of the discriminator is back propagated, and the generator model is updated. Initially, the dataset is divided into various batches. The generative adversarial network model is trained using these batches one after the other.

First, the discriminator model is trained with 100% real examples. Then 100% fake samples are generated using the generator model and discriminator is trained using these 100% fake samples. After the completion of this step, discriminator model is completely trained with both real and fake samples. Next, calculate discriminator loss and update the weights of generator model by back propagating the error. The backpropagation steps will see a large error and will update the model weights accordingly. This makes the generator model better at generating good samples. Weights of the discriminator model are marked as not trainable while updating the generator model. After every 7 epochs, a summary of the model is displayed.

Summary of the model contains the Fréchet Inception Distance (FID) metric and this value is analyzed. A point should be reached where FID metric value is as minimum as possible. This is the point where error rates of both generator and discriminator are equal. An equilibrium is established when the discriminator model cannot distinguish a real sample and the sample generated by the generator.

3. Data Pre-Processing

The process of cleaning data that are taken from different sources and converting it to a meaningful and understandable format. Hence, there are several steps that dataset into meaningful way, where our model can understand like resizing, gaussian blur[8], thresholding, dilating, and eroding.

4. Algorithm-2(CNN)

Convolutional neural network

CNN model [5] is created using 23 layers they are ---3-convolution layers, 4-batch_normalisation layers, 5-activation layer,3-max_pooling layers, 4-dropout layer ,3-dense layers, 1-flatten layer.

B. Comparison of Traditional Augmentation and GAN Based Augmentation

Deep learning models [8] perform better when they are trained with a considerably large dataset. Hence it is advisable to generate new images when the size of the dataset is small. Generative Adversarial Networks (GAN) is an algorithm that generates new images altogether. Data augmentation techniques or Traditional augmentation techniques [9-11] just modify the existing dataset, they do not contribute more to the betterment of the model. Even though Generative Adversarial Networks (GAN) takes more time, it is a powerful model. It is gaining popularity due to its ability to create new images.

IV. RESULTS AND DISCUSSIONS

A. GAN

GAN model is evaluated based on FID value. This value between real and fake images should be close to zero.

We have trained the generator model for about 7-8 times with a learning-rate of 0.0025 and a batch-size of 16 for each class of dataset i.e., dry, wet, and normal and achieved the following FID metric values for each class

Wet -> 70.2618

Dry -> 70.5584

Normal -> 77.9778

Below are the sample of generated images at different FID values

Conclusion

This research focused designing Age related Macular Degeneration prediction which require descent amount of data is acquired using Style GAN on given dataset. Synthesized images dataset is then prepared by selecting the images at some random snap seed value and save into a folder. This step is followed for every class (Dry, Wet, Normal). Later these images were combined and used for classification using CNN model and acquired 96 percentage of validation accuracy.

References

[1] C. Shorten and T. M. Khoshgoftaar, “A survey on image data augmentation for deep learning,” Journal of Big Data, vol. 6, no. 1, p. 60, 2019. [2] https://www.researchgate.net/publication/335521197_Generative_Adversarial_Network_in_Medical_Imaging_A_Review. [3] J. E. W. Koh, E. Y. K. Ng, S. V. Bhandary, A. Laude, and U. R. Acharya, “Automated detection of retinal health using PHOG and SURF features extracted from fundus images,” Applied Intelligence, pp. 1–15, 2017. [4] L. S. Lim, P. Mitchell, J. M. Seddon, F. G. Holz, and T. Y. Wong, “Age-related macular degeneration,” Lancet, vol. 379, no. 9827, pp. 1728–1738, 2012. [5] Tan, Jen Hong, et al. \"Age-related macular degeneration detection using deep convolutional neural network.\" Future Generation Computer Systems 87 (2018): 127- 135. [6] Yi Zhou, Member, IEEE, Boyang Wang, Xiaodong He, Shanshan Cui and Ling Shao, “DR-GAN: Conditional Generative Adversarial Network for Fine-Grained Lesion Synthesis on Diabetic Retinopathy Images”, IEEE, Vol 26, January, 2022. [7] Y. Sravani Devi and S. Phani Kumar, A Scoping review of diabetic retinopathy detection techniques using deep learning: Taxonomy, methods, and recent developments\" High Technol. Lett. 26(11), 392–406 (2020). [8] M. Seetha, N. Kalyani and Y. Sravani Devi, An ensemble cnn model for identication of diabetic retinopathy eye disease, in S. C. Satapathy, V. Bhateja, M. N. Favorskaya and T. Adilakshmi (eds.), Smart Intelligent Computing and Applications, Vol. 2, Smart Innovation, Systems and Technologies, Vol. 283. Springer, Singapore, https://doi.org/ 10.1007/978-981-16-9705-0 19 [9] Y. Devi and S. Kumar, A deep transfer learning approach for identication of diabetic retinopathy using data augmentation, IAES Int. J. Artif. Intell. 11, 1287–1296 (2022), 10.11591/ijai.v11.i4.pp1287-1296. [10] S. Maddala, K. Nara, S. D. Yerrarapu and S. Vanam, Classication of Fundus Images Captured using D-Eye Smartphone Retinal Imaging System,\" 2022 Int. Conf. Emerging Trends in Computing and Engineering Applications (ETCEA), Karak, Jordan, 2022, pp. 1–7, doi: 10.1109/ETCEA57049.2022.10009691. [11] Diabetic Retinopathy (DR) Image Synthesis Using DCGAN and Classification of DR Using Transfer Learning Approaches,Devi, Y.S., Phani Kumar, S., International Journal of Image and Graphics, 2023, 2340009.DOI: 10.1142/S0219467823400090

Copyright

Copyright © 2023 N. Sree Bhavani, G. Narendra Babu Reddy, Y Sravani Devi , M. Bhavani, P. Chandana Reddy, V. Abhignya Reddy. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download Paper

Paper Id : IJRASET54178

Publish Date : 2023-06-18

ISSN : 2321-9653

Publisher Name : IJRASET

DOI Link : Click Here