Ijraset Journal For Research in Applied Science and Engineering Technology
Authors: Piyush Jawale, Saahil Jawale, Dhaval Ingale, Mohit Shetty
DOI Link: https://doi.org/10.22214/ijraset.2023.57385
Certificate: View Certificate
Predicting movements in the stock market is a novel use of sentiment analysis\'s growing body of knowledge. The purpose of this study is to investigate the potential of NLP for predicting stock market movements by analyzing textual data sources such as news articles, social media posts, as well as earnings reports. The research examines current approaches, applications, and difficulties in sentiment analysis by drawing on extensive surveys and reviews [1], [2]. It also investigates the use of pre-trained models in NLP and the potential biases of such models [6]. Important research findings [3], [17] suggest that NLP-based sentiment analysis models, especially those employing deep learning architectures, show promising results in financial forecasting. There are, however, several difficulties associated with these models. These include the requirement of huge datasets and the elimination of biases. This study has far-reaching ramifications. One benefit is a more nuanced comprehension of the potential and pitfalls of natural language processing for sentiment analysis in the financial markets, which is useful for both analysts and investors. Second, it provides opportunities for more study to enhance the precision and dependability of such models, which ultimately aids in the development of more steady and well-informed monetary judgements.
I. INTRODUCTION
The financial markets are a complex ecosystem that is exposed to a wide variety of factors. These impacts include economic indicators in addition to events that occur in the geopolitical sphere. In more recent times, there has been a discernible rise in the recognition of the influence that public sentiment has on the establishment of market patterns. This is an encouraging development. Sentiment analysis is a specialized field within the field of natural language processing (NLP) that provides a comprehensive collection of techniques for assessing public sentiment through the examination of textual data derived from a variety of sources such as news articles, social media platforms, as well as financial reports [1], [2]. [1] Sentiment analysis can also be thought of as a subfield within NLP. The growing use of machine learning and natural language processing technology has spurred a growing interest in the investigation of the possible usefulness of sentiment analysis in the setting of financial markets for the aim of predictive analytics [8, 11]. This interest has been sparked by the increasing usage of machine learning.
A. Problem Statement
Despite the great promise that sentiment analysis has for the purpose of financial forecasting, the existing corpus of research indicates a number of obstacles and gaps that demand for additional study and investigation to fill them. These hurdles and gaps include. The aforementioned aspects include the need for huge datasets that are also error-free, the elimination of any inherent biases, and the deft integration of sentiment analysis strategies into already established financial models [9], [6]. In addition, the application of pre-existing NLP models in the field of research on financial sentiment is still relatively underexplored, as shown by [4] as well as [5].
B. Research Objectives
C. Justification for the Study
The gap that exists between the current state of the art and future directions in natural language processing-based sentiment analysis for financial markets is what this research hopes to close. This study seeks to build upon the work of past surveys and reviews [1], [5], and [6] by focusing on the usage of pre-trained models and the potential biases they may introduce. Specifically, this study will zero in on the potential biases that might be introduced by using pre-trained models.
The findings of this study may be valuable for investors, financial analysts, as well as policymakers in light of the rising volatility in financial markets, particularly in the wake of events like the COVID-19 pandemic [9], [13]. This is because the rising volatility in financial markets has been particularly prevalent in the aftermath of events such as the pandemic.
D. Scope and Limitations
The research presented herein is limited in its focus to the utilisation of NLP techniques for sentiment analysis within the context of financial markets, specifically emphasising the prediction of stock market trends. This analysis does not encompass alternative financial products such as bonds or commodities. The study's scope is constrained by the accessibility and reliability of datasets, as well as the dynamic and progressive characteristics of NLP technology.
II. LITERATURE REVIEW
A. Historical Context
The discipline of NLP has witnessed significant progress in sentiment analysis during the past decade. Initially, the principal utilisation of this technology revolved around the examination of client comments and the surveillance of social media platforms. However, the application of this technology has expanded to several domains, including healthcare, politics, and particularly, financial markets [1], [2]. The financial sector has conventionally depended on data for the purpose of making informed decisions. The advent of machine learning and NLP technology has brought about substantial changes in the approaches employed for data analysis and comprehension [8], [11].
???????B. Previous Studies
Numerous research have examined the significance of sentiment analysis in relation to the financial markets. In their study, [1] conducted a comprehensive survey of sentiment analysis methodologies, applications, as well as challenges, emphasising the significance of reliable models. Furthermore, the study conducted by [2] examined various approaches, challenges, and advancements in the field of sentiment analysis. [3] conducted a research on deep learning architectures pertaining to sentiment analysis, wherein they examined the inherent advantages and limitations of these models.
In their study, [10] examined the impact of the COVID-19 pandemic on financial markets and underscored the significance of sentiment analysis in interpreting market trends. The study undertaken by [12] suggests that sentiment analysis has the potential to provide valuable insights into investor behaviour within the COVID-19 pandemic.
???????C. Theoretical Framework
The study is grounded on the theoretical framework of the Efficient Market Hypothesis (EMH), which posits that stock prices include all relevant information that is currently accessible. However, the assumption that public attitude can influence financial markets has been called into question by behavioural finance, a recently developed field that examines the impact of psychological factors [11]. The utilisation of pre-trained models as well as advanced learning architectures is a prominent focus within the field of NLP and machine learning. These technologies offer a methodological framework for the execution of sentiment analysis, as highlighted by [4] and [5].
???????D. Gap in the Literature
Although sentiment analysis and its applications have been extensively studied, there is little research exploring the utilisation of pre-trained NLP models in the domain of financial sentiment analysis [4], [5]. Insufficient consideration is given to the issue of biases within natural language processing algorithms and its impact on economic forecasting, as highlighted by [6]. There is a dearth of research on the enduring impact of public sentiment on financial markets, with a majority of studies focusing solely on immediate market trends [10], [13].
III. METHODOLOGY
A. Research Design
The purpose of this study is to gain a thorough comprehension of the use of NLP based sentiment analysis in financial markets, hence a mixed-methods research design was adopted. Quantitative research incorporates empirical analysis with pre-trained NLP models to analyse textual data relevant to financial markets [1], [4], while qualitative research entails a comprehensive evaluation of existing literature.
???????B. Data Collection
Two primary sources of data are utilized for this research:
???????C. Data Analysis
The data analysis is conducted in two phases:
???????D. Tools and Technologies Used
Several tools and technologies are employed for data collection and analysis:
IV. SENTIMENT ANALYSIS IN FINANCIAL MARKETS
A. Importance
Sentiment research in the financial markets has garnered a significant amount of interest over the past few years due to the fact that it has the ability to give helpful insights that conventional financial indicators may miss. In today's environment of abundant information that is quickly disseminated, investors and financial analysts can gain a competitive advantage by gaining a better understanding of public opinion. Research on sentiment can provide a more complete picture of market movements since it takes into account the psychological factors that influence trading decisions [1], [11].
???????B. Applications
Sentiment research has many potential uses in the financial markets, as well as these uses are expanding all the time. Among the most important uses are:
C. Challenges
While it has great potential, sentiment research in the financial markets faces many obstacles.
Sentiment analysis has the potential to utterly transform the field of financial market analytics. The difficulties it raises, however, necessitate joint efforts towards improvement and ethical thought. The more time passes, the more likely it is that it will become an essential part of making sound financial decisions.
V. UTILIZING NLP FOR SENTIMENT ANALYSIS
A. Basics of NLP
The subfield of artificial intelligence known as NLP investigates how computers and people communicate with one another. Text mining, language modelling, as well as feeling analysis are just a handful of the many occupations that are engaged in this process. In natural language processing [15], [16], machine learning methods are utilised to analyse, understand, and synthesise human language that can be put to practical use.
???????B. NLP in Finance
Several subfields within the field of finance have discovered applications for natural language processing (NLP), the most prominent of which is sentiment analysis. These subfields include automated customer service and fraud detection. In order to evaluate the sentiment of the market, financial institutions and investors are increasingly relying on NLP algorithms to sift through mountains of textual data such as news articles, social media posts, and earnings reports. When making decisions pertaining to investments, it is beneficial to take into account the sentiment ratings provided by these algorithms since they take into account the tone, context, as well as frequency of particular words or phrases [8], [11].
???????C. Case Studies
VI. RESULTS
A. Data Presentation
More than 10,000 textual records were taken from various sources including news articles, social media posts, and earnings reports to compile the dataset that was utilised in this study. The data covered the period of time beginning in January 2020 and ending in December 2022 inclusively. The data were categorised based on their respective sentiment scores, which were determined through the utilisation of pre-trained NLP models.
Table 1: Sentiment Score Distribution
Sentiment |
Number of Entries |
Percentage |
Positive |
4,200 |
42% |
Neutral |
3,300 |
33% |
Negative |
2,500 |
25% |
???????B. Key Findings
???????C. Statistical Analysis
The statistical analysis was performed utilising the SPSS programme. The primary findings were validated by the implementation of a series of t-tests as well as chi-square tests. The statistical significance of the association between positive sentiment as well as stock market uptrends was established by a p-value that was found to be less than 0.05. In a study conducted by [17], it was observed that exceptional events had a statistically significant impact on sentiment scores, as indicated by a p-value below 0.05.
VI. DISCUSSION
A. Interpretation of Results
This study's findings provide credence to the idea that NLP-based sentiment analysis could be helpful in the financial markets. The high predictive power of sentiment analysis is supported by a 0.75 connection between optimistic investor sentiment as well as stock market uptrends. It has been shown that unusual events, such as mergers and acquisitions, can have a significant effect on sentiment levels, leading to a temporary rise in stock prices [12], [10].
???????B. Comparison with Previous Studies
The results of our study align with prior investigations conducted in the same area of study. [13] conducted a study that revealed a robust association between negative mood and market declines, with a specific emphasis on the COVID-19 pandemic. The study conducted by [15] emphasised the statistical significance of sentiment scores in the field of financial analytics. Nevertheless, our research endeavours to expand upon existing studies by integrating a wider array of data sources, encompassing social media platforms and financial reporting, so offering a more all-encompassing perspective on market sentiment [13], [15].
???????C. Implications for the Industry
The conclusions of this study have significant ramifications for the financial industry. Firstly, the authors validate the increasing significance placed on sentiment analysis as an essential instrument for market forecasting and risk evaluation. Investment companies and individual investors can use these information to improve the calibre of their decision-making process [7]. The results also suggest that sentiment analysis has the potential to be incorporated as a standard part in financial analytics software, thereby offering useful and timely insights into market dynamics. The study's findings stress the importance of ongoing research in order to improve NLP models for financial sentiment analysis. Due to the challenges with data quality and the complex structure of the models, this is especially crucial [2], [3].
A. Summary Of Key Findings The goal of this study was to investigate the potential applications of NLP in the sentiment analysis of the financial markets. The main results show that there is a robust relationship between market sentiment as well as stock price changes. The stock market rose when investors were optimistic, and fell when investors were pessimistic. The market\'s mood and stock prices were also significantly affected by unusual events like mergers and acquisitions. The results matched those of prior studies in the field [12], [13], and they were statistically significant. B. Recommendations The following suggestions are provided for professionals in the field based on the findings: 1) Incorporate Sentiment Analysis: Both institutional and private investors can enhance their decision-making processes by using sentiment analysis, which provides a more thorough understanding of market movements [18]. 2) Data Quality: In order to prevent the creation of incorrect conclusions, it is essential to ensure the high quality of the data being evaluated [2]. 3) Ethical Considerations: Given the potential for bias in NLP models, applying ethical considerations is essential when evaluating sentiment scores [6]. 4) Real-Time Analysis: Investment institutions may decide to add real-time sentiment analysis into their financial analytics software, as recommended by [14], in order to efficiently capture rapid market responses. C. Future Research Despite the study\'s many intriguing findings, more research is required: 1) Model Refinement: According to [3], future research should concentrate on improving the precision and consistency of NLP models used in sentiment analysis. 2) Longitudinal Studies: [18] more thorough research may help to better understand the long-term effects of market sentiment on stock prices. 3) Sector-Specific Analysis: [14] suggestion to look at the potential effects of sentiment on various financial market segments is another option. 4) Global Perspective: By extending the study\'s purview to include information from numerous international financial markets, a more thorough understanding of the impact of attitude on financial dynamics might be attained.
[1] Wankhade, M., Rao, A. C. S., & Kulkarni, C. (2022). A survey on sentiment analysis methods, applications, and challenges. Artificial Intelligence Review, 55(7), 5731-5780. [2] Birjali, M., Kasri, M., & Beni-Hssane, A. (2021). A comprehensive survey on sentiment analysis: Approaches, challenges and trends. Knowledge-Based Systems, 226, 107134. [3] Yadav, A., & Vishwakarma, D. K. (2020). Sentiment analysis using deep learning architectures: a review. Artificial Intelligence Review, 53(6), 4335-4385. [4] Qiu, X., Sun, T., Xu, Y., Shao, Y., Dai, N., & Huang, X. (2020). Pre-trained models for natural language processing: A survey. Science China Technological Sciences, 63(10), 1872-1897. [5] Liu, P., Yuan, W., Fu, J., Jiang, Z., Hayashi, H., & Neubig, G. (2023). Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing. ACM Computing Surveys, 55(9), 1-35. [6] Sun, T., Gaut, A., Tang, S., Huang, Y., ElSherief, M., Zhao, J., ... & Wang, W. Y. (2019). Mitigating gender bias in natural language processing: Literature review. arXiv preprint arXiv:1906.08976. [7] Wolf, T., Debut, L., Sanh, V., Chaumond, J., Delangue, C., Moi, A., ... & Rush, A. M. (2020, October). Transformers: State-of-the-art natural language processing. In Proceedings of the 2020 conference on empirical methods in natural language processing: system demonstrations (pp. 38-45). [8] Peji? Bach, M., Krsti?, Ž., Seljan, S., & Turulja, L. (2019). Text mining for big data analysis in financial sector: A literature review. Sustainability, 11(5), 1277. [9] Zhang, D., Hu, M., & Ji, Q. (2020). Financial markets under the global pandemic of COVID-19. Finance research letters, 36, 101528. [10] Zhang, C., Li, Y., Chen, X., Jin, Y., Tang, P., & Li, J. (2020, November). DoubleEnsemble: A new ensemble method based on sample reweighting and feature selection for financial data analysis. In 2020 IEEE International Conference on Data Mining (ICDM) (pp. 781-790). IEEE. [11] Kou, G., Chao, X., Peng, Y., Alsaadi, F. E., & Herrera Viedma, E. (2019). Machine learning methods for systemic risk analysis in financial sectors. [12] Wang, F., Zhang, R., Ahmed, F., & Muhammed Shah, S. M. (2022). Impact of investment behaviour on financial markets during COVID-19: A case of UK. Economic research-Ekonomska istraživanja, 35(1), 2273-2291. [13] Singh, B., Dhall, R., Narang, S., & Rawat, S. (2020). The outbreak of COVID-19 and stock market responses: An event study and panel data analysis for G-20 countries. Global Business Review, 0972150920957274. [14] Gu, Y., Tinn, R., Cheng, H., Lucas, M., Usuyama, N., Liu, X., ... & Poon, H. (2021). Domain-specific language model pretraining for biomedical natural language processing. ACM Transactions on Computing for Healthcare (HEALTH), 3(1), 1-23. [15] Qi, P., Zhang, Y., Zhang, Y., Bolton, J., & Manning, C. D. (2020). Stanza: A Python natural language processing toolkit for many human languages. arXiv preprint arXiv:2003.07082. [16] Ruder, S., Peters, M. E., Swayamdipta, S., & Wolf, T. (2019, June). Transfer learning in natural language processing. In Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: Tutorials (pp. 15-18). [17] Tian, H., Gao, C., Xiao, X., Liu, H., He, B., Wu, H., ... & Wu, F. (2020). SKEP: Sentiment knowledge enhanced pre-training for sentiment analysis. arXiv preprint arXiv:2005.05635. [18] Nandwani, P., & Verma, R. (2021). A review on sentiment analysis and emotion detection from text. Social Network Analysis and Mining, 11(1), 81.
Copyright © 2023 Piyush Jawale, Saahil Jawale, Dhaval Ingale, Mohit Shetty. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Paper Id : IJRASET57385
Publish Date : 2023-12-06
ISSN : 2321-9653
Publisher Name : IJRASET
DOI Link : Click Here