Ijraset Journal For Research in Applied Science and Engineering Technology
Authors: Kunal P Raghuwanshi, Shikha Singh J.P.Thakur, Sumit A Wath, Sakshi P Dafe, Abhishek S Deshmukh
DOI Link: https://doi.org/10.22214/ijraset.2024.65629
Certificate: View Certificate
An extensive analysis of the combination of Long Short-Term Memory (LSTM) networks with Convolutional Neural Networks (CNN) is presented in this research. within data mining applications. CNNs, optimized for spatial data processing, and LSTMs, made to manage data in a sequential fashion, are combined to leverage their respective strengths in feature extraction and temporal pattern recognition. By analyzing over 60 published studies, we evaluate the effectiveness of CNN-LSTM models across diverse domains such as medical image analysis, financial time series forecasting, stock price prediction, and sentiment analysis. The review reveals that CNN-LSTM hybrids significantly outperform traditional methods in activities requiring both temporally and spatially understanding. However, challenges such as model interpretability, hyperparameter optimization, and the need for diverse datasets remain. Future research should focus on optimizing these models for specific applications, improving scalability, and addressing limitations like data scarcity. This review underscores the significant potential of CNN-LSTM integration in advancing data mining techniques and solving complex problems in various fields. We present the related work of some of the most important publication in the field and classify over 60 published papers from different perspective. This review intends to classify the major techniques, methods, algorithms in the domain of medical images, financial time series analysis, stock price and social media.
I. INTRODUCTION AND BACKGROUND
Deep Learning (DL) has profoundly transformed artificial intelligence, driving advancements across a spectrum of tasks from image classification to natural language processing. In dealing with sequential data, such as text or time series, specialized models are crucial for effectively capturing temporal dependencies. Traditional feed-forward neural networks often struggle with these tasks, Conversely, Recurrent Neural Networks (RNNs) are made expressly to process sequential data by preserving a recollection of previous inputs.
Convolutional Neural Networks (CNNs) are essential in deep learning, especially for handling grid-like data such as images. They consist of several layers—including Convolutional, Activation (like ReLU), Pooling, and Fully Connected layers—that work together to automatically identify and analyze key features in the data. This reduces the need for manual feature extraction.
Fig. 1. Layers of CNN
The Long Short-Term Memory (LSTM) model, introduced by Hochreiter and Schmidhuber in 1997, addresses the challenge of learning long-term dependencies. LSTMs feature specialized units that manage and retain information over extended periods. Building upon this, Cho et al. (2014) developed the Gated Recurrent Unit (GRU), which incorporates gating mechanisms similar to LSTMs but simplifies the model by eliminating separate memory cells.
Fig. 2. Working Of LSTM
This paper aims to explore the integration of CNNs and LSTMs, highlighting the benefits and challenges of combining these models for data mining applications. By examining how CNNs and LSTMs operate and their effectiveness across different domains. The paper offers valuable insights into the synergistic use of these techniques for addressing complex data mining tasks.
II. LITREATURE REVIEW
The collection of papers showcases significant advancements in applying deep learning techniques, particularly Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTM) networks, across a range of domains.
Table 1 RELATED WORK
Sr. No |
Author Name |
Conclusion |
Source |
Year |
1. |
Ishaani Priyadarshini, Chase Cotton |
Develop integrated grid search, convolutional neural networks (CNNs), and long short-term memory (LSTM) networks for sentiment analysis, and the results are excellent. With an accuracy that exceeds 96%, your model outperforms other baseline models. |
Springer [1] |
2021 |
2. |
Tzuu-Hseng S. Li et al |
Proposed the employing of a CNN-LSTM model for humanoid robots' facial expression analysis, which substantially enhanced human-robot interaction. |
IEEE [2] |
2017 |
3. |
Jimmy Ming?Tai Wu, et al. |
created a graph-based CNN-LSTM stock price prediction system which incorporates multiple data sources and optimizes prediction accuracy. |
Springer [3]
|
2021 |
4. |
Burak Gülmez0 |
You showcased the improved accuracy of the LSTM-ARO model—an advanced LSTM network enhanced with the ARO algorithm—for stock price prediction on DJIA data. |
Elsevier [4] |
2023 |
5. |
Greg Van Houdt, et al. |
examined the uses of LSTM models, pointing out the advantages of mixing LSTMs with CNNs while stressing that no one model is ideal for every situation. |
Research Gate[5] |
2020 |
6. |
Md. Zabirul Islam, et al. |
presented a CNN-LSTM model that achieved excellent accuracy in COVID-19 detection from X-ray pictures, but it had drawbacks such as a small sample size and a lack of radiologists to contrast the results with. |
Elsevier [6] |
2020 |
7. |
Sidra Mehtab, Jayadip Sen |
LSTM- and CNN-based share price forecasting algorithms were analyzed; the models demonstrated good accuracy with different execution speeds. |
Research Gate[7] |
2020 |
8. |
Hailun Xie, Li Zhang, (Senior et al. |
created an enhanced CNN-LSTM model for time series forecasting with the Grey Wolf Optimizer, showing increased effectiveness and performance. |
IEEE [8] |
2020 |
9. |
Tariq Mahmood, et al. |
reviewed developments in deep learning for the categorization and segmentation of medical images, emphasizing difficulties and making recommendations for enhancements for clinical use. |
IEEE [9] |
2023 |
10. |
Felipe Viel et al. |
Evaluated CNNs and Transformers for hyperspectral image classification, recommending 2D and 3D CNNs for better accuracy and future exploration of Transformer architectures. |
IEEE [10] |
2023 |
11. |
Aji Prasetya Wibawa et al. |
Improved CNN performance in time-series forecasting by optimizing the smoothing factor and incorporating Lucas numbers, recommending further exploration with advanced models |
Springer [11] |
2022 |
12. |
Md Maruf Hossain et al. |
Using an integrated CNN-LSTM model with explainable AI, it was possible to identify cardiovascular illness with 74.15% accuracy and pinpoint important predictor variables. |
Elsevier [12] |
2023 |
13. |
Rikiya Yamashita et al. |
Reviewed CNN applications in radiology, emphasizing their effectiveness in image classification and object detection while acknowledging their limitations. |
Springer [13] |
2018 |
14. |
Jing Chen et al. |
Developed a VGG16-CNN and LSTM model for action recognition from video data, showing high classification accuracy for traditional Chinese exercises. |
IEEE [14] |
2023 |
15. |
Sreelekshmy Selvin et al. |
CNNs showed better trend understanding when compared to RNN and CNN-Sliding Window systems for stock price prediction. |
IEEE [15] |
2017 |
16. |
Subhasis Dasgupta et al. |
CNN-based models outperformed LSTM-based models in terms of speed and accuracy when used for everyday price of shares predictions. |
IEEE [16] |
2020 |
17. |
Abhinav Narula, Naveen Kumar Vaegae |
Introduced a CNN-LSTM model for COVID-19 detection using chest X-rays, achieving high performance and recommending further validation. |
Springer [17] |
2023 |
18. |
Nima Hatami et al. |
Proposed a CNN-LSTM model for multimodal MRI and clinical data fusion to predict stroke outcomes, reducing computational cost and suggesting future extensions |
Research Gate [18] |
2022 |
19. |
Shengbin Liang et al. |
Developed a Double Channel CNN-LSTM model for text classification, effectively handling complex long-sequence data and emotional nuances |
IEEE [19] |
2020 |
20. |
Priya Singh et al. |
Proposed a CNN-LSTM+MV model for financial portfolio performance, improving prediction accuracy but dependent on data quality. |
IEEE [20] |
2023 |
21. |
Tawsifur Rahaman et al. |
Used a stacking CNN-LSTM model for detecting congenital heart disease from fetal echocardiogram videos, showing potential for early detection. |
IEEE [21] |
2023 |
22. |
Jheng-Long Wu et al. |
Developed a Bi-directional LSTM-CNN model for emotion labeling in psychiatric texts, achieving superior results with pretrained word embeddings. |
IEEE [22] |
2020 |
23. |
Sidratul Montaha et al. |
Proposed a TD-CNN-LSTM model for brain tumor classification on 3D MRI scans, outperforming 3D CNN models and validating through cross-validation. |
IEEE [23] |
2022 |
24. |
Yuanyuan Chen et al. |
Applied deep learning to detect traffic-related information from social media, with CNN and LSTM methods showing superior performance with word embeddings. |
IEEE [24] |
2018 |
III. PROPOSED WORK
Data mining has become a critical tool for uncovering valuable insights from large and complex datasets. In this paper, we propose a novel data mining approach that leverages advanced artificial intelligence techniques to enhance both pattern recognition and feature extraction. Our method integrates traditional data mining techniques with deep learning architectures, resulting in a hybrid model that combines Convolutional Neural Networks (CNNs) for spatial feature learning and Long Short-Term Memory (LSTM) networks for capturing temporal dependencies. We will evaluate the effectiveness of this approach across various real-world datasets, including financial time series, medical imaging, and social media text. The results will be the notable improvements in both accuracy and efficiency when compared to conventional data mining methods.
A. Synopsis of the Suggested Algorithm
We present a novel hybrid architecture that combines the strengths of Long Short-Term Memory (LSTM) networks with Convolutional Neural Networks (CNNs). This method works well with complicated datasets that have both spatial and temporal components, which makes it adaptable to a wide range of data kinds and mining applications.
B. Architecture of Algorithms
There are three major parts to the algorithm:
In this survey paper, we introduced different techniques, methods, and algorithms for efficient integration of CNN and LSTM for data mining observation algorithms in the field of medicine, financial time series analysis, stock price, and social media. This review explores the integration of Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) networks in various data mining applications. The combination of CNN’s strength in spatial data processing and LSTM’s ability to manage sequential information has led to enhanced task performance like image classification, time series forecasting, and sentiment analysis. Despite challenges like model interpretability and the need for larger datasets, the CNN-LSTM approach shows significant potential for future advancements, particularly in fields such as healthcare, finance, and natural language processing. Future research should focus on optimizing and scaling these models for specific applications. CNN and LSTM together enhanced the feature extraction and sequence learning, improved performance on video and image sequence data, sentiment analysis, improved accuracy and generalization. This also helps in better handling of complex data structures for applications like NLP, CNN can extract semantic features from text while LSTM can process the sequential nature of language, improving tasks such as sentiment analysis or language translation.
[1] Ishaani Priyadarshini, Chase Cotton, “A novel LSTM–CNN–grid search based deep neural network for sentiment analysis”, The Journal of Supercomputing (2021), May 2021 [2] Tzuu-Hseng S. Li, Member, IEEE, Ping-Huan Kuo, Ting-Nan Tsai, Po-Chien Luan, “CNN and LSTM Based Facial Expression Analysis Model for a Humanoid Robot”, IEEE Access, 2017 [3] Jimmy Ming Tai Wu, Zhongcui Li, Norbert Herencsar, Bay Vo, Jerry Chun Wei Lin, “A graph based CNN LSTM stock price prediction algorithm with leading indicators”, Multimedia Systems (2023) 29:1751–1770, Feb 2021 [4] Burak Gülmez, “Stock price prediction with optimized deep LSTM network with artificial rabbit optimization algorithm”, DOI: https://doi.org/10.1016/j.eswa.2023.120346, May 2023 [5] Greg Van Houdt, Gonzalo Napoles, Carlos Mosquera, “A Review on the Long Short-Term Memory Model” [6] Md. Zabirul Islam, Md. Milon Islam, Amanullah Asraf, “A combined deep CNN-LSTM network for the detection of novel coronavirus (COVID-19) using X-ray images\", Aug 2020, DOI: https://doi.org/10.1016/j.imu.2020.100412, [7] Sidra Mehtab, Jayadip Sen, “Analysis and Forecasting of Financial Time Series Using CNN and LSTM-Based Deep Learning Models”, Nov 2020 [8] Hailun Xie, Li Zhang, (Senior Member, IEEE), and Chee Peng Lim, “Evolving CNN-LSTM Models for Time Series Prediction Using Enhanced Grey Wolf Optimizer.”, vol. 8, Sept 2020, DOI: 10.1109/ACCESS.2020.3021527 [9] Tariq Mahmood, Amjad Rehmani, (Senior Member, IEEE), Tanzila Saba, (Senior Member, IEEE), Lubna Nadeem, and Saeed Ali Omer Bahaj, “Recent Advancements and Future Prospects in Active Deep Learning for Medical Image Segmentation and Classification”, Sep 2023, DOI: 10.1109/ACCESS.2023.3313977 [10] Felipe Viel(Student Member, IEEE), Renato Cotrim Maciel, Laio Oriel Seman, Cesar Albenes Zeferino, (Member, IEEE), Eduardo Augusto Bezerra, (Member, IEEE), and Valderi Reis Quietinho Leithardt (Senior Member, IEEE), “Hyperspectral Image Classification: An Analysis Employing CNN, LSTM, Transformer, and Attention Mechanism”, Mar 2023, vol. 11, DOI: 10.1109/ACCESS.2023.3255164 [11] Aji Prasetya Wibawa, Agung Bella Putra Utama, Hakkun Elmunsyah, Utomo Pujianto , Felix Andika Dwiyanto and Leonel Hernandez , “Time series analysis with smoothed Convolutional Neural Network”, Wibawa et al. Journal of Big Data (2022) 9:44, DOI: https://doi.org/10.1186/s40537-022-00599-y, [12] Md Maruf Hossain, Md Shahin Ali, Md Mahfuz Ahmed, Md Rakibul Hasan Rakib, Moutushi Akter Kona a, Sadia Afrin, Md Khairul Islam a , Md Manjurul Ahsan, Sheikh Md Razibul Hasan Raj , Md Habibur Rahman, “Cardiovascular disease identification using a hybrid CNN-LSTM model with explainable AI”, DOI: https://doi.org/10.1016/j.imu.2023.101370, Oct 2023 [13] Rikiya Yamashita & Mizuho Nishio & Richard Kinh Gian Do& Kaori Togashi , “Convolutional neural networks: an overview and application in radiology”, Insights into Imaging (2018) 9:611–629, June 2018, DOI: https://doi.org/10.1007/s13244-018-0639-9, [14] Jing Chen, Jiping Wang, Qun Yuan, and Zhao Yang, “CNN-LSTM Model for Recognizing Video-Recorded Actions Performed in a Traditional Chinese Exercise”, vol. 11, Jun 2023, DOI: 10.1109/JTEHM.2023.3282245 [15] Sreelekshmy Selvin, Vinayakumar R, Gopalakrishnan E.A, Vijay Krishna Menon, Soman K.P , “Stock price prediction using LSTM, RNN and CNN-Sliding Window Model”, 978-1-5090-6367, 2017, IEEE [16] Subhasis Dasgupta,Jaydip Sen,Sidra Mehtab , “Robust Analysis of Stock Price Time Series Using CNN and LSTM-Based Deep Learning Models” [17] Abhinav Narula, Naveen Kumar Vaegae, “Development of CNN LSTM combinational architecture for COVID 19 detection”, Journal of Ambient Intelligence and Humanized Computing (2023) 14:2645–2656, Dec 2022, DOI: https://doi.org/10.1007/s12652-022-04508-2, [18] Nima Hatami, Tae-Hee Cho, Laura Mechtouff, Omer Faruk Eker, David Rousseau and Carole Frindel, “CNN-LSTM Based Multimodal MRI and Clinical Data Fusion for Predicting Functional Outcome in Stroke Patients”, DOI: 10.1109/EMBC48229.2022.9871735, https://www.researchgate.net/publication/363394340 [19] Shengbin Liang, Bin Zhu, Suying Cheng, Jiangyong Jin, “A Double Channel CNN-LSTM Model for Text Classification”, DOI: 10.1109/HPCC-SmartCity-DSS50907.2020.00169, Dec 2020, https://www.researchgate.net/publication/351839538, [20] Priya Singh, Manoj Jha, Mohamed Sharaf, Mohammed A. El-Meligy, and Thippa Reddy Gadekallu, “Harnessing a Hybrid CNN-LSTM Model for Portfolio Performance: A Case Study on Stock Selection and Optimization”, vol. 11, Sept 2023, DOI: 10.1109/ACCESS.2023.3317953 [21] Tawsifur Rahman, Mahmoud Khatib A. A. Al-Ruweidi, MD. Shaheenur Islam Sumon, Reema Yousef Kamal, Muhammad E. H. Chowdhury, (Senior Member, IEEE), and Huseyin C. Yalcin, “Deep Learning Technique for Congenital Heart Disease Detection Using Stacking-Based CNN-LSTM Models from Fetal Echocardiogram: A Pilot Study”, vol. 11, Sept 2023 DOI: 10.1109/ACCESS.2023.3316719 [22] Jheng-Long Wu, Yuanye He, Liang-Chih Yu, and K.Robert La, “Identifying Emotion Labels from Psychiatric Social Texts Using a Bi-Directional LSTM-CNN Model”, vol. 8, 2020, DOI: 10.1109/ACCESS.2020.2985228 [23] Sidratul Montaha, Sami Azam, A.K.M.Rakibul Haque Rafid, MD. Zahid Hasan, (Member, IEEE), Asif Karim, (Member, IEEE), and Ashraful Islam, “Time Distributed-CNN-LSTM: A Hybrid Approach Combining CNN and LSTM to Classify Brain Tumor on 3D MRI Scans Performing Ablation Study”, vol. 10, June 2022, DOI: 10.1109/ACCESS.2022.3179577 [24] Yuanyuan Chen, Yisheng Lv, Senior Member, IEEE, Xiao Wang, Member, IEEE, Lingxi Li, Senior Member, IEEE, and Fei-Yue Wang, Fellow, IEEE, “Detecting Traffic Information from Social Media Texts with Deep Learning Approaches” [25] Raghuwanshi Kunal Prakashsingh, Dr. Muthulakshmi P, “AI based data mining observation algorithm”, Computer Integrated Manufacturing Systems, vol.30, No.7, DOI: 10.24297/j.cims.2024.7.1
Copyright © 2024 Kunal P Raghuwanshi, Shikha Singh J.P.Thakur, Sumit A Wath, Sakshi P Dafe, Abhishek S Deshmukh. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Paper Id : IJRASET65629
Publish Date : 2024-11-28
ISSN : 2321-9653
Publisher Name : IJRASET
DOI Link : Click Here