The World has experienced the new pandemic caused by COVID-19 virus and several countries are affected by this disease specially India. Due to socio-economic problems of this disease, it is required to predict the trend of the outbreak and propose a beneficial method to find out the correct trend. In this paper, we have analysed the COVID-19 progression in India and developed a system to forecast the behaviour of COVID-19 spread in the future months using machine learning.
The objective of this paper is to use prophet model in Machine Learning for analysing and predicting the information of COVID-19. The user or consumer can glimpse a machine learning formula that analyses the data and how the machine learning algorithm predicts the data to facilitate in future health care mechanism.
Introduction
I. INTRODUCTION
On 30 January 2020, Director-General WHO declared that the outbreak of novel coronavirus (2019-nCoV) constitutes a Public Health Emergency of International Concern (PHEIC) as per the advice of International Heath Regulations (IHR) Emergency Committee et al [1] .As on 31th January 2020, a total of 9720 confirmed cases and 213 deaths have been reported in China et al [3].The epicenter of the outbreak was initially in Wuhan City, Hubei province but has rapidly extended to all other provinces of China et al [2]. Outside of China, 19 countries have reported a total of 106 confirmed cases, most with travel history from China. These countries are Australia (9), Cambodia (1), Canada (3), Finland (1), France (6), Germany (5), India (1), Italy (2), Japan (14), Malaysia (8), Nepal (1), Philippines (1), Singapore (13), South Korea (11), Sri Lanka (1), Thailand (14), UAE (4), USA (6), and Vietnam (5).On 30 January 2020, a laboratory confirmed case of 2019-nCoV was reported in Kerala.
II. LITERATURE REVIEW
Narayana Darapaneni et al [12] has used FbProphet Model which is for time-series forecasting which was made open source by Facebook in 2017. The work in this paper is focusing on COVID-19 Data analysis prediction using Machine learning with FbProphet algorithm approach. The main is prediction work in this project i.e., the future rarely repeat itself in the same way as the past specifically cases of covid - 19 it will be difficult to predict accurately. Arman Behnam et al [13] used predictive analytics which is based on dataset, active cases are derived reveals the number of confirmed, recovered, and death cases per day. A tremendous rise in the case number is obvious though freshly the active cases number is decreasing. Death cases trend has a gentle slope.
Hamzah, F. Binti, et al [7] analyzed the sentiments
from news extracted by CoronaTracker to further understand people’s reaction towards this outbreak. COVID-19 is still an infectious disease with some unclear or unknown properties, which means accurate SEIR prediction can only be obtained once the outbreak has been successfully contained. The outbreak spreads are largely influenced by each country’s policy and social responsibility.
III. MATERIALS AND METHODS
A. Data Collection and Preparation
Variables in our research include number of newly found cases, new death cases, and newly found recovered cases in India. All information has been collected and classified from reputable sources such as WHO (World Health Organization). These variables are chosen for use in our prediction methods because of their numerical nature. The high prevalence rate of COVID19 and the need for estimates necessitate the collection of the necessary datasets from reliable sources including WHO and Worldometer. Examination data, including observational data, are from a three-part collection of sample reports (i.e., death, confirmation, and recovery). This daily information at national level is confirmed by WHO.
Machine learning involves structural data that we see in a table. Algorithms for this comprise both linear and nonlinear varieties. Linear algorithms train more quickly, while nonlinear are better optimized for the problems they are likely to face (which are often nonlinear).
B. Prophet Model
Prophet is a procedure for forecasting time series data based on an additive model where non-linear trends are fit with yearly, weekly, and daily seasonality, plus holiday effects. It works best with time series that have strong seasonal effects and several seasons of historical data. Prophet isrobust to missing data and shifts in the trend, and typically handles outliers well. (Narayana Darapaneni et al 2020)
The Prophet algorithm is used in the time series and Forecast models. It is an open-source algorithm developed by Facebook, used internally by the company for forecasting. The Prophet algorithm is of great use in capacity planning, such as allocating resources and setting sales goals. Owing to the inconsistent level of performance of fully automated forecasting algorithms, and their inflexibility, successfully automating this process has been difficult. On the other hand, manual forecasting requires hours of labor by highly experienced analysts.
Prophet isn’t just automatic; it’s also flexible enough to incorporate heuristics and useful assumptions. The algorithm’s speed, reliability and robustness when dealing with messy data have made it a popular alternative algorithm choice for the time series and forecasting analytics models. Both expert analysts and those less experienced with forecasting find it valuable.
C. Our Analysis
We have done our analysis on the data gathered from the official website of the Ministry of Health and Family Welfare, Government of India. Also, for the sake of our analysis, considering the huge population of our country, we have assumed that not the whole population is likely to be infected with this disease.
The above graph shows the death rate in India in a date-wise manner. On the x-axis, it shows the time duration between March 2020 to July 2021, and on the y-axis, it shows the death rate due to covid 19.This graph shows that the COVID mortality graph has grown exponentially since September 2020.
The above graph shows the total cases in India in a state wise manner, on x- axis there is name of states in India and on y- axis there is total cases of covid-19 found in India. Among states, Andhra Pradesh, Kerala, Maharashtra, Tamil Nadu, and Delhi are the hotspots for Covid-19 cases. The above graph shows the total cases in India in a state wise manner.The Asian country was divided into three zones particularly 1) Red zones (Hotspots) 2) Orange zones (non-hotspots) and 3) inexperienced zone (Area while not ensure cases for 3 uninterrupted weeks). Machine Learning approaches are used and there is 2 resolutions, one for Analysing the information and the alternative to predict the chances of being infected and the other to predict the number of positive cases. According to these zones, Andhra Pradesh, Kerala, Maharashtra, Tamil Nadu, and Delhi are the hotspots for Covid-19 cases.
The above graph shows the total cases in India in state wise manner, on x- axis there is name of states in India and on y- axis there is total cases of covid-19 in India This graph shows the total cases in India in state wise manner, Whichever state has the maximum number of total cases, according to that, this graph has been shown in ascending order, and its shows Kerala, Maharashtra, Andhra Pradesh are the main hotspots for Covid-19 cases.
IV. PREDICTION
Prediction refers to the output of an algorithm after it has been trained on a particular dataset and applied to new data when forecasting the likelihood of a particular outcome, and based on our dataset, future active cases are derived. It reveals the number of confirmed, recovered, and death cases per day. (Arman Behnam et al 2 January 2021)
This Graph shows the total cases on y-axis and dates on x-axis. The dots represent the total cases on the date mentioned in the graph. The blue line represents the prediction done by the fbProphet Model.
Conclusion
The pandemic caused by COVID-19 is responsible for the advanced mortality rate and lower recovery rate. In this study, timewise patterns of the rise and fall of confirmed, deaths and recovery cases have been presented visually. The scope is to study the trend of Covid-19 Outbreak and give a predictive system to stop the outbreak. Thus, in this project the data of Covid is analysed a mere prediction has been done if the cases will grow or not. Accordingly, the action can be taken to stop this outbreak.
References
[1] Vardhini, PA Harsha, S. Shiva Prasad, and Seena Naik Korra. \"Medicine Allotment for COVID-19 Patients by Statistical Data Analysis.2021 International Conference on Emerging Smart Computing and Informatics (ESCI). AISSMS Institute of Information Technology, Pune, India. Mar 5-7, 2021, pp. 665-669.
[2] Saini, Sumindar Kaur, et al. \"Visual exploratory data analysis of covid-19 pandemic.\" 2020 5th IEEE International Conference on Recent Advances and Innovations in Engineering (ICRAIE). UIET, Panjab University, Chandigarh, India. 2020 Dec 1, pp. 1-6.
[3] Maghraby, Ashwag, et al. \"Software to Assist a Health Practitioner in Caring of Covid-19 Home Isolated Patients.\" 2021 National Computing Colleges Conference (NCCC). Makkah, Kingdome of Saudi Arabia.2021 Mar 27, pp. 1-4.
[4] Joseph, Nancy, et al. \"A Case Study on using Unstructured Data Analysis Methods to identify local Covid-19 Hotspots.\"SoutheastCon 2021.InSoutheastCon 2021 Mar 10, pp. 1-4.
[5] Krithika, D. R., and K. Rohini. \"Comparative Intrepretation of Machine Learning Algorithms In Predicting The Cardiovascular Death Rate For Covid-19 Data.\" 2021 International Conference on Computational Intelligence and Knowledge Economy (ICCIKE).2021 Mar 17, pp.394-400.
[6] Junlan, Zhu, and Yang Chengke. \"Data-analysis-based discussion on COVID-19 Pandemic Shocks to the Economy and Policy Responses: Cases in the United Kingdom.\" 2020 Management Science Informatization and Economic Innovation Development Conference (MSIEID). Chicago, 2020 Dec 18 pp. 538-541.
[7] Hamzah, F. Binti, et al. \"Corona Tracker: worldwide COVID-19 outbreak data analysis and prediction.\" Bull World Health Organ 1.32, 2020 Mar 19.
[8] Soomro, Zainab Tariq, Sardar Haider Waseem Ilyas, and Ussama Yaqub. \"Sentiment, Count and Cases: Analysis of Twitter discussions during COVID-19 Pandemic.\" 2020 7th International Conference on Behavioural and Social Computing (BESC), 2020 Nov 5 pp. 1-4.
[9] Kumari, Pratima, and Durga Toshniwal. \"Real-time estimation of COVID-19 cases using machine learning and mathematical models-The case of India.\" 2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS),2020 Nov 26 pp. 369-374.
[10] Kumar, S. Lokesh. \"Predictive Analytics of COVID-19 Pandemic: Statistical Modelling Perspective.\" Walailak Journal of Science and Technology (WJST).2021 Aug 15;18(16) pp. 15583-14.
[11] Singh, Sudhansu Sekhar, and Dinakrushna Mohapatra. \"Predictive Analysis for COVID-19 Spread in India by Adaptive Compartmental Model.\", medRxiv. 2020 Jan 1.
[12] Narayana Darapaneni, Praphul Jain,”Analysis and Prediction of COVID-19 Pandemic in India” 2020 2nd International Conference on Advances in Computing, Communication Control a nd Networking (ICACCCN)
[13] Arman Behnam, Roohollah Jahanmahin “A data analytics approach for COVID?19 spread and end prediction (with a case study in Iran)” Industrial Engineering Department, Iran University of Science and Technology (IUST), Narmak, 16846?13114 Tehran, Iran.