Ijraset Journal For Research in Applied Science and Engineering Technology
Authors: Sakshi Wakchaure, Gaurav Rathi, Sneha Patil, Mr. H. S. Thakar
DOI Link: https://doi.org/10.22214/ijraset.2023.52479
Certificate: View Certificate
Automated chromosome classification is a critical task in cytogenetics and has been traditionally performed manually. Recent advancements in image processing and machine learning have opened new avenues for automated chromosome classification. This paper presents a novel method for straightening and classifying chromosomes, utilizing a blend of image-processing techniques and a convolutional neural network (CNN) algorithm. The proposed approach involves pre-processing chromosome images, which includes straightening them to eliminate any curvature caused by their natural shape and extracting features with the aid of a CNN. The design of the method allows for the training and testing of images, facilitating the prediction of chromosomal abnormalities with high accuracy. The performance of the method was evaluated using a dataset of chromosome images, demonstrating exceptional accuracy and robustness. The proposed approach was evaluated using publicly available datasets of human chromosomes and achieved an accuracy of 93% without the straightening of chromosomes and 96% after the straightening of chromosomes. This method has the potential to be applied in cytogenetics and medical diagnosis, where precise and efficient chromosome classification is vital, and the ability to straighten chromosomes can enhance the accuracy of the analysis.
I. INTRODUCTION
A. Introduction
DNA is the building block of the human body which holds the genes of humans. Chromosomes are found in the center of the cell which carries DNA. Humans have 23 pairs of Chromosomes which include 22 pairs of Autosomes and 1 pair of Sex chromosomes. Each pair contains 2 chromosomes, one coming from each parent that children inherit from each of their parents Biologically males have 1 X and 1 Y Chromosome and females have 2 X chromosomes. Chromosomes vary in shape and number among humans. Humans as well as animals have linear chromosomes. The human X chromosome is about 3 times larger than the human Y chromosome, containing about 900 genes while Y has about 55 genes. The unique structure of chromosomes keeps DNA tightly bound around histones(proteins), without such packing DNA molecules would be too long, to fit inside the cell. Chromosome straightening and classification are important tasks in the field of biology, particularly in genetics and genomics research. Chromosomes are structures within cells that carry genetic information, and their appearance and characteristics can provide valuable insights into various biological processes and diseases [10]. Deep learning is a powerful tool in image analysis and recognition and has been successfully applied to various biological image analysis tasks, including chromosome analysis [1]. In this project, we aim to develop a deep learning-based system that can automatically straighten and classify chromosomes from images. The system will consist of two main components: the straightening and classification modules. The correction module uses deep learning algorithms to automatically detect and correct kinks or kinks in chromosome images to create straightened images. The classification module then uses another set of deep learning algorithms to classify the straightened chromosomes into different categories based on their shape and characteristics [7]. The proposed system greatly increases the efficiency and accuracy of chromosome analysis, enabling a better understanding of various genetic and disease-related processes.
Top of Form
B. Background
Chromosomes are essential structures in the cell nucleus that carry genetic information in the form of DNA. The analysis of chromosome structure and composition plays a pivotal role in fields such as cytogenetics, genetics, and cancer research. The traditional method for chromosome straightening and classification involves manual processing and analysis of chromosome images by a trained cytogeneticist or biologist. This approach comprises a series of steps, such as the collection of chromosome samples from cells and their microscopic analysis after staining and fixation. The manually selected chromosomes are then corrected by a specialist using specialized software tools or manually. The sorted chromosomes are subsequently analyzed and classified based on size, shape, and banding pattern using a standardized classification system.
Although traditional methods have been widely used, they are subject to various limitations, including subjectivity and variability between analysts, time and labor intensiveness, error-proneness, and dependence on the skill and experience of the analyst. Moreover, traditional methods are not scalable and cannot efficiently handle large amounts of data. To overcome these limitations, automated chromosome analysis techniques have emerged as an essential area of research. Automated chromosome analysis using computer vision techniques can aid in the efficient and accurate analysis of large amounts of chromosome data. However, one of the challenges in automated chromosome analysis is the natural curvature of chromosomes, which can make accurate classification based on their features challenging. The use of CNN algorithms for straightening and classifying chromosomes has emerged as a promising research avenue, as CNNs have demonstrated impressive results in various fields, including medical image analysis. The combination of image processing techniques and CNNs can enable the accurate classification of chromosomes, even in the presence of noise and other artifacts. This has significant implications for medical diagnosis and genetic research, where the accurate and efficient analysis of chromosomes is essential.[3]
II. LITERATURE SURVEY
III. RESEARCH METHODOLOGY
A. Database used
The dataset used in this research was sourced from openCV. The dataset comprises 6000 chromosome images, which include 4800 curved chromosome images. The chromosome images in the dataset were pre-processed and used for training and testing the proposed method for straightening and classifying chromosomes. The performance of the method was evaluated using standard metrics such as accuracy and loss. The results demonstrated high accuracy and robustness in predicting chromosomal abnormalities. The proposed method has potential applications in cytogenetics and medical diagnosis, where accurate and efficient classification of chromosomes is necessary.
B. Straightening of Chromosomes
Chromosome correction is the process of reorienting chromosome images to align them horizontally or vertically. This is usually done to improve the accuracy of chromosome analysis, as it allows for better visualization and measurement of chromosomes. Traditionally, chromosome corrections have been performed manually or by hand by an experienced cytogeneticist or biologist using specialized software tools. However, this process is time-consuming and subjective, leading to errors and inconsistencies. Deep learning algorithms have recently been developed to automate the process of chromosome correction. These algorithms use Convolutional Neural Networks (CNNs) to learn and extract features from chromosome images, then apply transformation algorithms to correct chromosome orientation. This approach can improve the efficiency and accuracy of chromosome analysis and can be particularly useful for analyzing large data sets of chromosome images. This algorithm is used for straightening the presented highly curved chromosome [2]. The proposed algorithm is based on the calculation and analysis of the vertical and horizontal projection vectors of the binary image of the chromosome at different rotation angles. The binary image is obtained by thresholding the input image after histogram correction. A suitable threshold is automatically identified by analyzing the histogram of the image. A rotation score is defined based on the amplitude of the main peak of the horizontal projection vector of the rotated image. We use this to identify the most appropriate rotation image and define the bending axis. This image is used to identify the bending axis, or bending center, of the chromosome and is used to artificially straighten the bent chromosome. The resulting straightened chromosomes were now ready for processing by any of the automated chromosome classification methods previously described [7].
C. Classification of Chromosomes
Chromosomal sorting is the process of classifying chromosomes based on their size, shape, and banding pattern. Chromosomes are thread-like structures in the nucleus that carry genetic information. There are two types of chromosomes: autosomes and sex chromosomes. Autosomes are chromosomes that are not involved in determining the sex of an organism whereas sex chromosomes determine sex. In humans, there are 23 pairs of chromosomes, each pair consisting of two chromosomes - one from each parent. The first 22 pairs are autosomes and the 23rd pair is the sex chromosomes. Females have two X chromosomes (XX), while males have one X chromosome and one Y chromosome (XY). Chromosome classification is a fundamental task in medical image analysis that involves identifying and categorizing chromosomes based on their type. In recent years, Convolutional Neural Networks (CNNs) have shown remarkable success in image classification tasks. In this study, we propose a CNN-based approach for chromosome classification [4].
D. Convolutional Neural Network
A convolutional neural network (CNN) is an artificial neural network primarily used in image recognition and processing. The key components of a CNN are the convolutional, pooling, and fully connected layers. In a convolutional layer, the network applies a set of filters to the input image to extract features such as edges, corners, and texture. The clustering layer uses the feature map generated by the convolutional layer to reduce the dimensionality of the data and make the network more computationally efficient. Finally, a fully connected layer performs a classification or regression task based on the features extracted from the previous layer. The CNN architecture would consist of multiple convolutional layers that would extract relevant features from the chromosome images. The output of the last convolutional layer would be flattened and passed through several fully connected layers to perform the final classification. The trained CNN could be used to classify chromosomes in real-time [8].
E. System Requirements
a. VS Code
b. Library
c. Module
2. Hardware Requirements
a. Processor (i5 or higher)
b. GPU: Integrated or Dedicated GPU for processing of dataset and training of the model
c. RAM: 8GB minimum for processing of dataset and training of the model.
IV. RESULTS/OBSERVATIONS
Convolutional neural networks were employed to train the models, with careful consideration of the number of epochs required to regulate the network training. The number of epochs was varied to obtain the highest validation accuracy. The model was compiled using an optimization algorithm, and the loss type was specified based on the number of classes. The learning rate, which determines the speed of the update of weights, played a crucial role in model training. Higher learning rates resulted in fewer epochs required for training, whereas smaller learning rates necessitated more iterations for convergence to the best values. Typically, learning rates were assigned at random based on the user. All models were trained using convolutional neural networks, with a sufficiently large number of epochs provided to regulate the network training for each model. The models were trained for different values of epochs, and the validation results, loss, and accuracy were observed at each epoch to obtain the highest validation accuracy. The fit() function was used to train the model, and the results were stored for further analysis. This methodology, involving the use of deep learning, provides a more objective and accurate diagnosis of chromosome images, thus advancing the field of automated chromosome analysis. The result is evaluated in two parts, first without the straightening of chromosomes and then after the straightening of chromosomes. The straightening of chromosomes results in an increase in the accuracy of the model.
A. Results For Chromosome Classification Without Straightening Of Chromosomes
The CNN algorithm was trained on a dataset of 6,000 chromosome images and achieved a classification accuracy of 93% without straightening the chromosomes. The model was trained for 15 epochs with a batch size of 128, using the Adam optimization algorithm and a learning rate of 0.001. The loss function used was categorical cross-entropy, and the number of output classes was set to 23, representing the different types of chromosomes. In this study, we developed a CNN-based method for the classification of chromosomes without the need for straightening. The proposed method achieved an overall accuracy of 93% in the classification of chromosomes, which is comparable to the state-of-the-art methods that require straightening.
The results show that CNNs can effectively capture the features of chromosome images and classify them accurately. These results highlight the need for straightening of chromosomes in some cases, particularly when dealing with complex chromosome abnormalities or when there are overlapping chromosomes. In these situations, it may be necessary to use a combination of straightened and unstraightened images to achieve accurate classification
B. Results Of Chromosome Straightening
The process of chromosome straightening has been shown to improve the accuracy of automated chromosome classification, reduce noise and distortion, and enable standardized comparisons across different studies and datasets. The process of chromosome straightening involves several steps, including image acquisition, chromosome segmentation, chromosome alignment, chromosome straightening, and chromosome normalization.
The success of the chromosome straightening process depends on the effectiveness of each step, and several techniques can be used to accomplish each step. These techniques allow the chromosomes to be straightened while preserving their shape and avoiding distortion. Overall, chromosome straightening is a crucial pre-processing step for automated chromosome classification using image processing and machine-learning techniques such as CNNs.
C. Results For Chromosome Classification With Straightening Of Chromosomes
The CNN algorithm was trained on a dataset of 6,000 chromosome images and achieved a classification accuracy of 96% without straightening the chromosomes. The model was trained for 15 epochs with a batch size of 128, using the Adam optimization algorithm and a learning rate of 0.001. The results obtained from the classification of chromosomes with straightening using CNN algorithm showed a significant improvement compared to the classification without straightening. The model achieved an accuracy of 96% on the test set, indicating the effectiveness of using straightened images for chromosome classification. In addition, the confusion matrix showed that the model was able to accurately classify each chromosome class, with only a few misclassifications between similar chromosome types. This demonstrates the potential for using deep learning techniques for accurate and efficient chromosome classification. Overall, the results suggest that straightening of chromosomes prior to classification can greatly enhance the performance of the model. This approach could potentially be applied in various fields, such as genetic research and medical diagnostics, to aid in the identification and classification of chromosomes.
Convolutional neural networks have proven to be effective in training models for the straightening and classification of chromosomes. By varying the number of epochs and learning rate, the models can achieve high validation accuracy. This approach provides a more objective and accurate diagnosis of chromosome images using deep learning techniques. The proposed method has potential applications in cytogenetics and medical diagnosis, where accurate and efficient classification of chromosomes is necessary. The use of deep learning techniques and automation can improve the efficiency and accuracy of chromosome analysis, which can lead to more effective diagnosis and treatment of genetic disorders. To elaborate further, the use of convolutional neural networks in training models for the straightening and classification of chromosomes has demonstrated promising results. The approach involves varying the number of epochs and learning rate during the training process to achieve high validation accuracy. By employing deep learning techniques, the method provides a more objective and accurate diagnosis of chromosome images, which is essential in fields such as cytogenetics and medical diagnosis. The benefits of the proposed method extend beyond just improving the efficiency and accuracy of chromosome analysis. Automation and the use of deep learning techniques can also increase the amount of data that can be analyzed, which is especially important in cases where large datasets need to be processed. Additionally, by reducing the need for manual selection and classification of chromosomes, the proposed method can potentially save time and resources while minimizing errors. The potential applications of the proposed method in medical diagnosis and genetic research are vast. Accurate and efficient classification of chromosomes is crucial in the identification of genetic disorders, which can lead to improved diagnosis and treatment. Furthermore, the use of deep learning techniques and automation can allow for faster and more efficient analysis of chromosome images, leading to more effective decision-making in clinical settings. Overall, the use of convolutional neural networks and deep learning techniques in the analysis of chromosome images offers significant potential benefits and has promising applications in the field of cytogenetics and medical diagnosis.
[1] Bashmail, H., Al-Muhtaseb, H., Al-Hiary, H., & Faris, H. (2019). Chromosome Classification Using Convolutional Neural Networks and Hidden Markov Models. IEEE Access, 7, 113663-113675 [2] Hu, H., Guo, Y., Zhang, J., Guan, J., & Li, Y. (2017). Deep Learning Based Chromosome Classification. IEEE Access, 5, 8831-8837. [3] Saleh, R., Mahbub, U., & Murshed, M. (2019). Chromosome Classification Using Convolutional Neural Networks with Transfer Learning Approach. IEEE Access, 7, 146005-146014 [4] Altinsoy, M. E., Özkan, G., & K?zrak, M. A. (2019). Chromosome Classification Using Deep Learning with Morphological Operations and Convolutional Neural Networks. Journal of Medical Systems, 43(5), 120. [5] Ning, J., Wang, L., Zhang, Y., Li, C., & Sun, Y. (2019). Deep Learning Approaches for Automated Chromosome Classification and Karyotyping. IEEE Access, 7, 105644-105653. doi: 10.1109/access.2019.2934542 [6] Wang, T., Liu, L., Zhao, H., Chen, W., & Wang, L. (2018). An efficient approach for chromosome classification using hybrid features and neural network. Journal of Medical Systems, 42(12), 248. [7] Tashima, D., Saito, Y., & Aoki, K. (2018). Automatic straightening of chromosome images using convolutional neural networks. Journal of Imaging, 4(11), 136. [8] Karakaslar, S. B., Senliol, B., & Gunduz-Demir, C. (2019). Automatic chromosome classification using deep learning and handcrafted features. Pattern Recognition Letters, 125, 194-200. [9] S. Gagulapalalic and M. Can, “Human chromosome classification using competitive neural network teams (cnnt) and nearest neighbour,” in Ieee-Embs International Conference on Biomedical and Health Informatics, 2014, pp. 626–629. [10] M. Sharma, O. Saha, A. Sriraman, R. Hebbalaguppe, L. Vig, and S. Karande, “Crowdsourcing for chromosome segmentation and deep classification,” in Computer Vision and Pattern Recognition Workshops, 2017, pp. 786–793. [11] C. Bishop, Pattern Recognition and Machine Learning. Springer, 2006. [12] N. Srivastava, G. E. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, “Dropout:a simple way to prevent neural networks from overfitting,” Journal of Machine Learning Research, vol. 15, pp. 1929–1958, 2014. [13] Truong K et al 2004 Quantitative fish determination of chromosome 3 arm imbalances in lung tumors by automated image cytometry Med. Sci. Monit. 10 426–32 [14] Boehm D et al 2004 Rapid detection of subtelomeric deletion/duplication by novel realtime quantitative PCR using SYBR-Green dye Human Mutation 23 368–78 [15] Kyan M J, Guan L, Amison M R and Cogswell C J 1999 Feature extraction of chromosomes from 3D confocal microscope images 1999 Int. Conf. on Image Processing vol 2, pp 24–8 [16] Harnden D G, Klinger H P, Jensen J T and Kaelbling M (ed) 1985 An international system for human cytogenetic nomenclature (ISCN1985) Report of the Standing Committee on Human Cytogenetic Nomenclature (Basel, Switzerland: KAEGER) [17] Graham J and Piper J 1994 Automated karyotype analysis Methods in Molecular Biology: Chromosome Analysis Protocols ed J R Gosden (Totowa, NJ: Humana) [18] Wang Y, Wu Q, Castleman K R and Xiong Z 2001 Image enhancement using multiscale differential operators 2001 IEEE Int. Conf. on Acoustics, Speech and Signal Processing (7–11 May 2001) vol 3, pp 1853–6 [19] Gaybay C 1986 Image structure representation and processing: a discussion of some segmentation methods in cytology IEEE Trans. Pattern Anal. Mach. Intell. 8 140–6 [20] S. Delshadpour, “Reduced size multi layer perceptron neural network for human chromosome classification,” in International Conference of the IEEE Engineering in Medicine & Biology Society, vol. 3, 2003, pp. 2249–2252 [21] B. C. Oskouei and J. Shanbehzadeh, “Chromosome classification based on wavelet neural network,” in International Conference on Digital Image Computing: Techniques and Applications, 2010, pp. 605– 610 [22] Wu, Q. and K.R. Castleman. Automated chromosome classification using wavelet-based band pattern descriptors. in Computer-Based Medical Systems, 2000. CBMS 2000. Proceedings. 13th IEEE Symposium on. 2000. IEEE. [23] Madian, N. and K.B. Jayanthi. Overlapped chromosome segmentation and separation of touching chromosome for automated chromosome classification. in Engineering in Medicine and Biology Society. 2012. [24] Swati, G. Gupta, M. Yadav, M. Sharma, and L. Vig, “Siamese networks for chromosome classification,” in IEEE International Conference on Computer Vision Workshop, 2018, pp. 72–81. [25] Altan, Novruz Allahverdi, Yakup Kutlu Diagnosis of Coronary Artery Disease Using Deep Belief Networks, European journal of engineering and natural sciences, Volume 2, Issue 1, pp. 29-36, 2017. [26] Maximo E. Drets, Margery W. Shaw, “Specific banding patterns of human chromosomes (heterochromatin/ Giemsa stain/ chromosome bands)”, 1971, Proc. Nat. Acad. Sci. USA, Vol. 68, No. 9, pp. 2073 2077. [27] Nair, R.M., R. Remya, and K. Sabeena, Karyotyping techniques of chromosomes: a survey. Int J Comput Trends Technol, 2015. 22(1). [28] Sandeep Kumar, Zeeshan Khan, Anurag Jain, A Review of Content Based Image Classification using Machine Learning Approach, International Journal of Advanced Computer Research, Volume-2 Number-3 Issue-5, 2012
Copyright © 2023 Sakshi Wakchaure, Gaurav Rathi, Sneha Patil, Mr. H. S. Thakar, Dr. M. V. Munot. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Paper Id : IJRASET52479
Publish Date : 2023-05-18
ISSN : 2321-9653
Publisher Name : IJRASET
DOI Link : Click Here