Ijraset Journal For Research in Applied Science and Engineering Technology
Authors: Shreya Sharma, Vikash Singh Rajput
DOI Link: https://doi.org/10.22214/ijraset.2023.55984
Certificate: View Certificate
Predicting the academic achievement of pupils is a significant component that needs to be taken into mind anytime concerns involving higher education or more in-depth schooling, particularly the connections between the two, are brought up for discussion. Students are able to choose the classes with future study plans that will be most useful to them with the assistance of the capability to predict their achievement. This is made possible by the availability of this skill. It gives teachers and administrators the opportunity to monitor pupils, which in turn enables them to provide more support for students, combine training programs for the highest potential results, and anticipate how successfully students will complete their education. One of the many benefits of student forecasting is that it leads to a decline in the number of official warning signals for school expulsions that are caused by inefficiency. This is only one of the many benefits of student forecasting. Students are able to see their own futures if they take the time to select their courses thoughtfully and come up with study methods that make the most of their unique sets of abilities and areas of interest. As it had values of 0.888 for accuracy, precision, recall, & f1 accordingly for each of those categories, the Support Vector Classifier was the most beneficial tool for this inquiry. This was due to the fact that it was able to correctly classify the data. These values are proof that the data were categorized with a high degree of accuracy. Throughout the course of this inquiry, several different methods of machine learning, such as ensemble, logistic regression, random forest, AdaBoost, & XG Boost, were applied.
I. INTRODUCTION
"Students" are people who have been accepted to universities and colleges. Education helps these people develop and become more employable. This review uses "students" and "those attending higher education institutions" interchangeably. Supporting Nigerian College Students' Mental Health with Technology Pandemics: A Global History has more. The Cambridge Intermediate Learner's Dictionary defines a "student" as someone who attends a higher education institution. This investigation included all Kenyan undergraduates in degree programs at state or private universities.[1]. Active Learning using Tools in Blended/Hybrid Courses provides details. College and university students are meant. "The Use of a Whatsapp Register for Students Contact: A Case of the Zimbabwe Local College, Mashonaland West Campus" provides details. Case method courses can assess students' progress through class participation, individual writing for essays and examinations, group projects, and presentations. Because coursework is crucial to the topic and often counts toward a student's grade, we'll focus on it here. A skilled particular instance instructor considers each student's altruistic contribution to class discussions while assessing involvement [2]. Developing impartial assessments of their contributions is not always easy. Both the substance and delivery of each student's contribution to the class discussion matter for its overall quality. While it's great to have more people chip in, doing so too frequently may lead to subpar contributions and signal a lack of interest in listening. It is possible for specific students' judgments of engagement to be significantly influenced by the calling patterns, inquiries, and follow-ups used by their instructors with those particular students [3]. The efficiency of the instructor's system for tracking student engagement may also have a major bearing on the reliability of the overall rating. The participant-centered research approach, in contrast to lecture-based pedagogies, encourages greater expectations and feedback. When students participate in class discussions, they receive rapid feedback in the form of opinions on their ideas from both the instructor and their peers. However, this type of feedback is sometimes imprecise and hard to interpret, leaving students wondering if and how their contributions will be valued. It's not always a good idea to encourage kids to cultivate their capacities for self-reflection and evaluation. Aside from formal assessments, students may actively seek out negative feedback from others[4]. Teachers must be able to give feedback that is both evaluative & helpful so that students can gain insight into their own strengths and areas for growth. The term "evaluation" is used to describe the method of rating the quality of student learning in relation to predetermined criteria.
Evaluation effectively summarizes and conveys to parents, other educators, employers, institutes of higher education and the students themselves will be responsible for assessing students' knowledge and abilities in accordance to the overall curriculum requirements. Evidence of student achievement is gathered through assessments administered at key points in the grading/teaching cycle, most frequently at the conclusion of a unit of study[5].
The determination of which specific expectations should be employed to evaluate the fulfillment of the overall expectations will be determined by the teachers [6]. In order to gather information on their students' levels of comprehension, teachers employ a wide variety of assessment methods. Observations, conversations between instructors and pupils, and the creation of work by kids all make up the three legs of this methodology's triangulated structure. The Student Performance Evaluation was created with an eye toward the future in order to provide greater clarity as you concentrate on improving the aforementioned competencies throughout the length of your work term. The three primary areas of strength & the top three areas for improvement will be the focus of these competencies[7].
The following strategy is one that we propose implementing so that you may get the most out of the Student Performance Evaluation:
Your "Work Term Record" may be found on the bottom right side of the WaterlooWorks dashboard. During the fourth month of the work term, we are going to get in touch with the person who is listed as your contact on my "Work Term Record.", and we are going to give that contact repeated reminders to complete a Student Performance Evaluation[10]. As a student participating in a co-op, it is your obligation to make certain that this is finished and turned in within the allotted amount of time by your employer. It would be to benefit you if you could remember to hand in your pupil performance Evaluation BEFORE you conclude your work term with the organization. This would be something that would be beneficial to your employer. Please do not hesitate to contact your co-op adviser if you have any inquiries regarding the process of completing your Student Performance Evaluation in collaboration with your employer.
II. LITERATURE REVIEW
Sáiz 2023 et. al conducted research with a representative sample of 57 college students, including 42 undergraduates and 15 graduate students majoring in Health Sciences. An interdisciplinary approach to research was taken. The quantitative study investigated the influence of the variable's schooling (bachelor's degree vs. master's degree) including the combined level of previous expertise on the rate of chatbot usage (low vs. average), learning outcomes, or satisfaction with the chatbot's usefulness. [Chatbot use rate] [Low vs. average] [Learning results] [Satisfaction with the chatbot's usefulness] [Low vs. average] . Specifically, the study looked at how educational level and prior knowledge affect how frequently people use chatbots. In addition, we investigated whether or not the frequency of students' use of chatbots was dependent on the metacognitive tactics they employed. The ideas that the students had for improving the chatbot and the kind of questions that it asked were analyzed as part of the qualitative research. There were no findings that could be considered definitive in regard to the frequency of use of chatbots and the levels of metacognitive methods exhibited by the pupils. On the basis of the students' recommendations for areas of development, additional research is required to direct this research [11].
Beckham 2023 et. al might generally be the greatest approach for fixing some problems, as a result of the fact as students are responsible for shaping the future of their country, which will have repercussions for a great number of aspects of life in the future, each student may be forced to deal with extremely difficult challenges. To determine whether or not a particular element genuinely has an impact on student grades, our MLP 12-Neuron model makes use of ML models to make predictions. First place goes to our MLP 12-Neuron model, which achieves the best results with an RMSE score of 4.32; second place goes to Random Forest, which achieves the best results with an RMSE value of 4.52; and third place goes to Decision Tree, which achieves the best results with a R [12].
Agarwal 2023 et. al has received a lot of attention as of late in association with its effects, which have been observed in students attending universities in India. The information for the dataset was gathered over the course of several years through the use of a questionnaire. Millions of people suffer from mental diseases that are frequently overlooked, and the vast majority of those people fulfill the criteria for the Likert scale measuring instrument. The individuals who participated in the study were university engineering students. The majority of respondents to this inquiry were young people, specifically students from a variety of educational institutions. The goal of this research was to ascertain the level of anxiety that was experienced by each of the 127 engineering students that participated in the study. As a direct result of this research, the degree of anxiety was quantified, and its causes and effects were evaluated in connection with its impact, as noticed by Indian university students. [Causes] and [effects] were analyzed in conjunction with [its] effects, as observed in [Indian] university students. The precision for the year 2023 It was discovered that the Authors are based on the fact that Cronbach's alpha value for the entire dataset was found to be 0.723 and the Pearson's correlation coefficient was found to be 0.823. This led to the conclusion that the Authors are. Elsevier (B.V.) was responsible for the printing and distribution. The accuracy of the naive Bayes technique was 71.05%, while the accuracy of the decision tree method was 71.05%, the accuracy of the random forest method was 78.9%, and the accuracy of the support vector machine method was 75.5% [13].
Nasser 2023 et. al According to the findings of the study, a Machine Learning (ML) technique was presented to improve the quality assurance of online education courses offered through the Maharat platform at Taif University. These programs are modeled after those offered in the Kingdom of Saudi Arabia (KSA), which has established criteria for online training. The outcomes of the research served as the foundation for these justifications. By taking into account the participants' participation in a virtual school environment, the primary objective of this study is to make an educated guess as to the level of academic success that each participant will achieve. After the selection of the relevant features by the application of hybrid optimization, the classification procedure was then carried out. The technique known as Support Vector Machine was used in order to make the predictions. For the purpose of determining and identifying the degree of achievement of the quality monitoring of online training program criteria, we made use of a technique that included descriptive and analytical components. This was accomplished by conducting an analysis of the sample opinions regarding the quality guarantee of online courses that were acquired using the Maharat platform that is situated at Taif University [14].
Ortin 2023 et. alhas demonstrated a noteworthy increase throughout the course of the past few years. These repositories house valuable information that may be extracted and used for a wide number of applications, including, but not limited to, the following: using them to teach programming; using them to identify poor programming practices; and constructing programming tools such as decompilers, developer environments, and intelligent tutoring systems. documenting recurrent syntactic constructs; evaluating the particular builds used by experts & beginners; using them to teach programming; using used to detect poor programming practices.
The fact that the syntactic information of source code is stored with tree structures, yet the datasets used by typical machine learning methods are n-dimensional, presents a challenge that is inherent to the format of the source code. These approaches involve learning in supervised as well as unsupervised settings. These findings reveal some interesting knowledge, such as the Java builds that are rarely used (in addition to widely used) (for example, bitwise operators, union types, as well as static blocks), various language features and patterns that are most commonly used by beginners (indeed barely used by experts), the discovery of certain kinds of source code (for example, helper or utility lessons, data transfer objects, and too complex abstractions), as well as the way that complexity is an inherent characteristic of the programming language [15].
Table 1 literature summary
Author/year |
Method/models |
Metrics |
References |
Martín/2023 |
Harmon’s one-factor test (Harmon’s single-factor test) by using IBM SPSS |
Variables= 28.7%, |
[16] |
Kunhoth/2023 |
K-Nearest Neighbor, Support Vector Machine, Random Forest, & AdaBoost are some of the algorithms that can be used. |
Accuracy= 80.8% |
[17] |
Gencoglu/2023 |
latent Dirichlet allocation (LDA) |
Qualitative data= 173,858 |
[18] |
Iqbal/2023 |
Artificial Intelligence (AI) & Machine Learning (ML) |
Average= 2.5, median= 3 |
[19] |
Asish/2022 |
RF, kNN, and extreme gradient boost (XGBoost) model |
Accuracy= 98.88% |
[20] |
III. PROPOSED METHODOLOGY
This page utilizes the open-source and totally free Kaggle tool to aggregate information on Portuguese with mathematics into one convenient location. focuses on the final rating that is used to classify pupils and offer them labels for their predicted and actual performance. There is a wide variety of design options available for columns or features. After that, the data ought to be prepared. If you choose to view the final score row, you will see that you have the option to utilize the score to divide the qualities into one of three categories: Good, Fair, or Poor.
A. Data Collection
Data collection involves testing hypotheses, answering research questions, and assessing results. Data collection is the systematic gathering and analysis of vital information. Arts & humanities, business, social sciences, & physical and applied sciences use the same research process. Data collection is the most important part of any research project. Different study fields require different data collection methodologies.
B. Pre-processing
Data preparation in data mining makes raw data more valuable and manageable. Since raw data may contain a lot of irrelevant information, it is important to clean it. Data preparation techniques have changed due to inferences drawn against machine learning and AI models and training them. Data preparation changes data structure, making data mining, machine learning, and other data science tasks faster and more efficient. To verify results, machine neural networks and artificial intelligence developers use the methods early on.
C. Data Splitting
The train-test split method tests machine learning algorithms' predictions using untrained data. The test set is the model's untrained data. Using outcomes, you can analyze how machine learning algorithms fared on your predictive modeling work. Despite its simplicity, the strategy should not be used in several situations. This difficulty occurs when the dataset is small and needs additional configuration or is utilized for classification but is unbalanced.
D. EDA
Exploratory data analysis (EDA) is a statistical method that compares data sets to find their commonalities. Statistics and other data visualization methods are used frequently. EDA seeks to discover what data may reveal that formal models cannot. Statistical models are yours to use. Since 1970, John Tukey has promoted exploratory data analysis to persuade statisticians to examine the data and generate ideas that could lead to greater data collection & testing.
E. Machine Learning and Modeling
Data science's fast-growing field includes machine learning. Using statistical methods, data mining projects train algorithms to classify or forecast and reveal key insights. These insights drive company application activities, which should affect important growth indicators. Data scientists will be in demand as big data usage grows. They'll help discover the company's most important questions and the data needed to address them.
IV. RESULT AND DISCUSSION
The findings of the research are presented in this part of the report. The viability of the proposed model was evaluated throughout the entirety of the experiment making use of a Python simulator in conjunction with a variety of performance measures. The metrics that were used for the assessment are presented down below.
A. TN/TP/FN/FP
B. Confusion Matrix
The first thing that we do with the data after organizing, cleansing, & otherwise modifying it is to feed it into a wonderful model, which, by its very nature, produces outcomes in the form of probabilities. This is where the Confusion Matrix is exposed for all to see. The confusion matrix is a tool that can be used to evaluate how well machine learning can classify things.
???????
Predicting student success is crucial when addressing higher education, in-depth education, and relationships. Students can choose courses with the best study strategies by forecasting their performance. It allows teachers and administrators to monitor students and give better support, combine training packages for optimal results, and forecast student performance. Student forecasting reduces inefficient warning signals for student expulsions, among other benefits. If they choose classes and study strategies that fit their strengths and interests, students can predict their futures. The Support Vector Classifier became the most useful tool for this inquiry due to its 0.888 accuracy, precision, recall, & f1 scores. These values demonstrate data categorization precision. Ensemble, logistic regression, random forest, AdaBoost, & XG Boost were used in this study.
[1] M. H. Kaspersen, K.-E. K. Bilstrup, M. Van Mechelen, A. Hjort, N. O. Bouvin, and M. G. Petersen, “High school students exploring machine learning and its societal implications: Opportunities and challenges,” Int. J. Child-Computer Interact., vol. 34, p. 100539, 2022, doi: 10.1016/j.ijcci.2022.100539. [2] S. Rajendran, S. Chamundeswari, and A. A. Sinha, “Predicting the academic performance of middle- and high-school students using machine learning algorithms,” Soc. Sci. Humanit. Open, vol. 6, no. 1, p. 100357, 2022, doi: 10.1016/j.ssaho.2022.100357. [3] V. Sheth, U. Tripathi, and A. Sharma, “A Comparative Analysis of Machine Learning Algorithms for Classification Purpose,” Procedia Comput. Sci., vol. 215, pp. 422–431, 2022, doi: 10.1016/j.procs.2022.12.044. [4] P. Meena and G. Kumar, “Online food delivery companies’ performance and consumers expectations during Covid-19: An investigation using machine learning approach,” J. Retail. Consum. Serv., vol. 68, no. May, p. 103052, 2022, doi: 10.1016/j.jretconser.2022.103052. [5] W. Rujuan and W. Lei, “Research on E-learning Behavior Evaluation of Students Based on Three-way Decisions Classification Algorithm,” Procedia Comput. Sci., vol. 208, pp. 367–373, 2022, doi: 10.1016/j.procs.2022.10.052. [6] L. Hasenbein et al., “Learning with simulated virtual classmates: Effects of social-related configurations on students’ visual attention and learning experiences in an immersive virtual reality classroom,” Comput. Human Behav., vol. 133, no. October 2021, 2022, doi: 10.1016/j.chb.2022.107282. [7] S. Roegiers, E. Corneillie, F. Lievens, F. Anseel, P. Veelaert, and W. Philips, “Distinctive features of nonverbal behavior and mimicry in application interviews through data analysis and machine learning,” Mach. Learn. with Appl., vol. 9, no. May, p. 100318, 2022, doi: 10.1016/j.mlwa.2022.100318. [8] M. 202, M. Habes, E. Youssef, S. A. Salloum, R. Alfaisal, and A. Ziani, “SEM-ANN-based approach to understanding students’ academic-performance adoption of YouTube for learning during Covid,” Heliyon, vol. 8, no. 4, p. e09236, 2022, doi: 10.1016/j.heliyon.2022.e09236. [9] M. Shanbehzadeh, H. Kazemi-Arpanahi, M. Bolbolian Ghalibaf, and A. Orooji, “Performance evaluation of machine learning for breast cancer diagnosis: A case study,” Informatics Med. Unlocked, vol. 31, no. June, p. 101009, 2022, doi: 10.1016/j.imu.2022.101009. [10] S. Gupta, K. Saluja, A. Goyal, A. Vajpayee, and V. Tiwari, “Comparing the performance of machine learning algorithms using estimated accuracy,” Meas. Sensors, vol. 24, no. September, p. 100432, 2022, doi: 10.1016/j.measen.2022.100432. [11] M. C. Sáiz-Manzanares, R. Marticorena-Sánchez, L. J. Martín-Antón, I. González Díez, and L. Almeida, “Perceived satisfaction of university students with the use of chatbots as a tool for self-regulated learning,” Heliyon, vol. 9, no. 1, p. e12843, 2023, doi: 10.1016/j.heliyon.2023.e12843. [12] N. R. Beckham, L. J. Akeh, G. N. P. Mitaart, and J. V Moniaga, “Determining factors that affect student performance using various machine learning methods,” Procedia Comput. Sci., vol. 216, no. 2022, pp. 597–603, 2023, doi: 10.1016/j.procs.2022.12.174. [13] J. Agarwal and J. Agarwal, “ScienceDirect ScienceDirect Detection and classification of anxiety students through the application of machine learning Detection and classification of anxiety in university students through the application of , machine learning,” Procedia Comput. Sci., vol. 218, no. 2022, pp. 1542–1550, 2023, doi: 10.1016/j.procs.2023.01.132. [14] M. Nasser Alsubaie, “Predicting student performance using machine learning to enhance the quality assurance of online training via Maharat platform,” Alexandria Eng. J., vol. 69, pp. 323–339, 2023, doi: 10.1016/j.aej.2023.02.004. [15] F. Ortin, G. Facundo, and M. Garcia, “Analyzing syntactic constructs of Java programs with machine learning,” Expert Syst. Appl., vol. 215, no. October 2022, p. 119398, 2023, doi: 10.1016/j.eswa.2022.119398. [16] J. L. Martín-Núñez, A. Y. Ar, R. P. Fernández, A. Abbas, and D. Radovanovi?, “Does intrinsic motivation mediate perceived artificial intelligence (AI) learning and computational thinking of students during the COVID-19 pandemic?,” Comput. Educ. Artif. Intell., vol. 4, no. January, p. 100128, 2023, doi: 10.1016/j.caeai.2023.100128. [17] J. Kunhoth, S. Al Maadeed, M. Saleh, and Y. Akbari, “Exploration and analysis of On-Surface and In-Air handwriting attributes to improve dysgraphia disorder diagnosis in children based on machine learning methods,” Biomed. Signal Process. Control, vol. 83, no. February, p. 104715, 2023, doi: 10.1016/j.bspc.2023.104715. [18] B. Gencoglu, M. Helms-Lorenz, R. Maulana, E. P. W. A. Jansen, and O. Gencoglu, “Machine and expert judgments of student perceptions of teaching behavior in secondary education: Added value of topic modeling with big data,” Comput. Educ., vol. 193, no. September 2022, p. 104682, 2023, doi: 10.1016/j.compedu.2022.104682. [19] M. Z. Iqbal and A. G. Campbell, “Telematics and Informatics Reports AGILEST approach?: Using machine learning agents to facilitate kinesthetic learning in STEM education through real-time touchless hand interaction,” Telemat. Informatics Reports, vol. 9, no. December 2022, p. 100034, 2023, doi: 10.1016/j.teler.2022.100034. [20] S. M. Asish, A. K. Kulshreshth, and C. W. Borst, “Detecting distracted students in educational VR environments using machine learning on eye gaze data,” Comput. Graph., vol. 109, pp. 75–87, 2022, doi: 10.1016/j.cag.2022.10.007.
Copyright © 2023 Shreya Sharma, Vikash Singh Rajput. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Paper Id : IJRASET55984
Publish Date : 2023-10-03
ISSN : 2321-9653
Publisher Name : IJRASET
DOI Link : Click Here