Ijraset Journal For Research in Applied Science and Engineering Technology
Authors: Omkar Khade, Yash Kadam, Ashish Ruke, Suyash Yeolekar
DOI Link: https://doi.org/10.22214/ijraset.2023.50023
Certificate: View Certificate
The National Institutional Ranking Framework (NIRF) is an annual ranking system initiated by the Indian government to rank higher education institutions based on several parameters such as teaching, research, and outreach activities. In this project, we propose to develop a machine learning model that can predict the NIRF rank of an institution. Here we have used 2020 NIRF ranking dataset from Kaggle. Then based on the score of previous years, we predict the rank by giving the performance indicators to the model. The paper focuses on the use of Random Forest Regressor based Machine learning technique to predict NIRF rank. Factors considered are Teaching, Learning and Resources (TLR) score, Research and Professional Practice (RPC) score, Graduation Outcome (GO) score, Outreach and Inclusivity (OI) score and Perception Score for particular college. The model is evaluated using standard strategic indicator: Root Mean Square Error. The low value of this indicator show that the model is efficient in predicting NIRF rank. We got score of 93% and RMSE of 15.47. We have completed ML model save and load operations using Joblib. We have created a flask server for model deployment and deployed on Render as web service. We conducted comprehensive evaluations on frequently used machine learning models and conclude that our proposed solution outperforms due to the comprehensive feature engineering that we built. The system achieves overall high accuracy for College NIRF rank prediction.
I. INTRODUCTION
A. National Institutional Ranking Framework (NIRF)
National Institutional Ranking Framework (NIRF) is a methodology adopted by the Ministry of Education, Government of India, to rank institutions of higher education in India. The Framework was approved by the MHRD and launched by Minister of Human Resource Development on 29 September 2015. Depending on their areas of operation, institutions have been ranked under 11 different categories – overall, university, colleges, engineering, management, pharmacy, law, medical, architecture, dental and research. The Framework uses several parameters for ranking purposes like resources, research, and stakeholder perception. These parameters have been grouped into five clusters and these clusters were assigned certain weightages. The weightages depend on the type of institution. About 3500 institutions voluntarily participated in the first round of rankings.
The methodology draws from the overall recommendations and broad understanding by a Core Committee set up by MHRD to identify the broad parameters for ranking institutions of Higher Education. The parameters covered are:
B. NIRF Rank Prediction
The NIRF ranking is determined by a complex process that involves the analysis of various performance metrics of educational institutions. These metrics include teaching, research, graduation outcomes, outreach, and perception. The institutions are then ranked based on their overall score, which is calculated using a weighted average of these metrics. Predicting the NIRF rank of an educational institution can be a challenging task as it involves analysing various performance metrics and their relative importance in determining the final rank. Machine learning algorithms can be used to build predictive models that can accurately predict the NIRF rank of educational institutions.
By predicting the NIRF rank of educational institutions, stakeholders such as students, parents, and educational institutions can make informed decisions about which institutions to choose or collaborate with. It can also help educational institutions identify areas where they need to improve to increase their ranking in the future.
II. PROBLEM STATEMENT
The NIRF (National Institutional Ranking Framework) rank prediction problem statement involves predicting the rank of various educational institutions in India based on certain parameters such as teaching, research, graduation outcomes, outreach, and perception. Given a dataset containing the performance metrics of various educational institutions, the goal is to build a predictive model that can accurately predict the NIRF rank of these institutions. This can be framed as a regression problem where the target variable is the NIRF rank, and the input features are parameters like Teaching Learning and Resources (TLR) score, Research and Professional Practice (RPC) score, Graduation Outcome (GO) score, Outreach and Inclusivity (OI) score, and Perception Score. The model can be trained using various machine learning algorithms such as linear regression, decision trees, random forests, or neural networks. The performance of the model can be evaluated using metrics such as mean squared error, root mean squared error, or R-squared value. The final model can be used to predict the NIRF rank of new institutions based on their performance metrics.
A. Goals and Objectives
B. Statement of scope
The scope of NIRF rank prediction involves developing predictive models to accurately predict the ranking of educational institutions in India based on various performance metrics. The performance metrics may include factors such as teaching, research, graduation outcomes, outreach, and perception.
The scope of NIRF rank prediction includes the following:
The scope of NIRF rank prediction is limited to the Indian higher education system and the performance metrics used in the NIRF ranking framework. The predictive models developed through this process can help stakeholders make informed decisions about the choice of educational institutions and incentivize educational institutions to improve their performance in various areas.
C. Software context
To develop the front end of the project we require HTML, CSS, and bootstrap. To develop the back end, we need a flask framework, Flask can be used to serve machine learning models through APIs. To deploy the model, we require account on render.
D. Major constraints
NIRF rank prediction faces several major constraints that can affect the accuracy and reliability of the predictions. Some of these constraints include:
To address these constraints, it is essential to use appropriate data pre-processing techniques, carefully select and evaluate machine learning algorithms, and ensure transparency and objectivity in the methodology used to calculate the NIRF rank. Additionally, it is important to recognize the limitations of predictive models and use them as a tool to support decision-making rather than relying on them as the sole basis for decision-making.
III. RELATED WORK
There have been several studies and research papers that have focused on NIRF rank prediction using various machine learning algorithms and performance metrics. Here are a few examples of related work:
These studies and others like them demonstrate the potential of machine learning algorithms and performance metrics to predict the NIRF ranking of Indian educational institutions. However, there is still a need for further research to develop more accurate and reliable predictive models and address the limitations and constraints associated with this type of prediction.
IV. METHODOLOGY
The methodology for predicting the NIRF rank of Indian educational institutions using machine learning algorithms typically involves the following steps:
The evaluation helps to determine the accuracy and reliability of the model and identify areas for improvement.
6. Model Evaluation: The trained model is evaluated using various performance metrics such as root mean square error (RMSE), mean absolute error (MAE), and R-squared. The evaluation helps to determine the accuracy and reliability of the model and identify areas for improvement.
7. Model Deployment: Once the predictive model has been trained and evaluated, it can be deployed for NIRF rank prediction. The model can be integrated into an existing educational analytics platform or developed as a standalone application.
8. Continuous Improvement: Predictive models require continuous improvement to keep up with changes in performance metrics and to address any limitations and constraints associated with NIRF rank prediction. This involves regularly updating the model with new data and evaluating its performance to ensure accuracy and reliability.
A. Random Forest Regression
Every decision tree has high variance, but when we combine all of them together in parallel then the resultant variance is low as each decision tree gets perfectly trained on that sample data, and hence the output does not depend on one decision tree but on multiple decision trees. In the case of a classification problem, the final output is taken by using the majority voting classifier. In the case of a regression problem, the final output is the mean of all the outputs. This part is called Aggregation.
Random Forest is an ensemble technique capable of performing both regression and classification tasks with the use of multiple decision trees and a technique called Bootstrap and Aggregation, commonly known as bagging. The basic idea behind this is to combine multiple decision trees in determining the final output rather than relying on individual decision trees.
Random Forest has multiple decision trees as base learning models. We randomly perform row sampling and feature sampling from the dataset forming sample datasets for every model. This part is called Bootstrap.
Ensemble uses two types of methods:
B. Flask
Flask is a lightweight web application framework in Python that can be used for deploying machine learning models for NIRF rank prediction. Here are the steps involved in deploying a Random Forest Regression model using Flask:
Overall, deploying a Random Forest Regression model using Flask allows the model to be easily integrated into web applications or APIs, providing a scalable and accessible solution for NIRF rank prediction. It also provides an opportunity to further optimize and improve the model's performance by gathering real-time data and monitoring its predictions.
C. Scikit-Learn
Scikit-learn is a popular machine learning library in Python that can be used for NIRF rank prediction using Random Forest Regression. Here are the steps involved in building a Random Forest Regression model using scikit-learn:
V. PROJECT PURPOSE
The purpose of the NIRF rank prediction project is to develop a machine learning model that can predict the National Institutional Ranking Framework (NIRF) rank of Indian educational institutions based on various parameters such as research output, student and faculty quality, infrastructure, outreach, and perception. The project aims to provide insights into the factors that contribute to an institution's NIRF rank and to help identify areas for improvement. It also aims to provide a tool for policymakers, educators, and other stakeholders to make informed decisions about higher education institutions in India. By building a predictive model for NIRF ranking, the project can potentially help institutions better understand how they can improve their standing in the rankings, and guide policymakers in allocating resources to enhance the overall quality of higher education in India. Overall, the project's purpose is to contribute to the improvement of the Indian higher education system by leveraging machine learning techniques to predict NIRF rankings, and provide actionable insights for institutions and policymakers.
VI. EXPERIMENTAL RESULT
VII. FUTURE SCOPE
The future scope of the NIRF rank prediction project is vast and encompasses several potential avenues for further development and improvement. Here are some possible directions for future work:
In conclusion, the NIRF rank prediction project aims to leverage machine learning techniques to predict the National Institutional Ranking Framework (NIRF) rank of Indian higher education institutions. The project\'s purpose is to provide insights into the factors that contribute to an institution\'s NIRF rank, identify areas for improvement, and help policymakers allocate resources to enhance the overall quality of higher education in India. By building a Random Forest Regression model using scikit-learn, the project demonstrates the potential of machine learning to predict NIRF rankings with a high degree of accuracy. The model has been trained and evaluated using a large dataset of Indian educational institutions, and its performance has been measured using evaluation metrics such as root mean squared error (RMSE). The future scope of the project is vast and encompasses several potential avenues for further development, such as incorporating more data sources, enriching data with text analysis, incorporating temporal trends, exploring alternative machine learning models, and building a user-friendly interface. Overall, the NIRF rank prediction project is a valuable contribution to the improvement of the Indian higher education system, and its predictive model provides actionable insights for institutions and policymakers.
[1] National Institutional Ranking Framework (NIRF) official website: https://www.nirfindia.org/ [2] Bhatia, A., & Singh, S. P. (2021). Predicting NIRF Ranking using Machine Learning. In Proceedings of the 3rd International Conference on Computing Methodologies and Communication (pp. 547-553). Springer. [3] Jha, P. C., & Aggarwal, M. (2019). Predicting NIRF Ranking of Indian Universities and Institutes using Machine Learning Techniques. Journal of Data Science, 17(4), 611-626. [4] Scikit-learn documentation: https://scikit-learn.org/stable/documentation.html [5] Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning: data mining, inference, and prediction. Springer. [6] Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5-32. [7] Friedman, J. H. (2001). Greedy function approximation: a gradient boosting machine. Annals of Statistics, 1189-1232.2011. [8] Chollet, F. (2018). Deep learning with Python. Manning Publications. [9] Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning. MIT press. [10] Kingma, D. P., & Ba, J. (2014). Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980. [11] Nigam, A., & Singh, S. (2020). Predicting NIRF Ranking of Indian Engineering Institutions using Machine Learning Techniques. International Journal of Engineering Research and Technology, 13(2), 96-102 [12] Kumar, A., & Kumar, M. (2021). NIRF Ranking Prediction using Ensemble Machine Learning Techniques. In 2021 4th International Conference on Computing, Communication and Networking Technologies (ICCCNT) (pp. 1-6). IEEE. [13] Jain, A., & Sood, S. K. (2020). NIRF Ranking Prediction of Indian Universities using Machine Learning Algorithms. International Journal of Computer Applications, 180(7), 1-5. [14] Agrawal, A., & Singh, S. P. (2020). Predicting NIRF Ranking of Indian Universities and Institutes using Supervised Learning Techniques. In 2020 3rd International Conference on Computing, Communication and Security (ICCCS) (pp. 1-6). IEEE. [15] Géron, A. (2019). Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Concepts, tools, and techniques to build intelligent systems. O\'Reilly Media, Inc.
Copyright © 2023 Omkar Khade, Yash Kadam, Ashish Ruke, Suyash Yeolekar. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Paper Id : IJRASET50023
Publish Date : 2023-04-01
ISSN : 2321-9653
Publisher Name : IJRASET
DOI Link : Click Here