Early prediction Students' Graduation Rank Using LAGT: Enhancing Accuracy with GCN and Transformer on Small Datasets
Author affiliations
DOI:
https://doi.org/10.15625/1813-9663/21095Keywords:
Deep learning, Transformer, classìfication model, Graph convolutional network, Early prediction of graduation classificationAbstract
Recent efforts to predict students' graduation ranks using machine learning and deep learning methods have faced challenges, particularly with small sample sizes which limit accuracy. This paper introduces the LAGT (Learning Analysis by Graph Convolutional Network and Transformer) method, a novel approach for early predicting of students' graduation ranks. LAGT integrates a Graph Convolutional Network (GCN) to enhance the training set with labeled samples and utilizes a Transformer to forecast graduation ranks. This method harnesses the semi-supervised learning capabilities of GCN to automatically label data, addressing the constraints of small sample sizes in training sets. Additionally, the Transformer leverages its proficiency in handling long sequences and capturing contextual information, thereby demonstrating superior effectiveness in models trained on larger datasets. We evaluated this method on three datasets from some universities (HNMU1, HNMU2, VNU) and achieved a maximum accuracy of 92.73%. Results indicate that the integrated LAGT method outperforms comparable approaches across multiple metrics including accuracy, prediction precision, and model sensitivity, achieving up to a 35.73% improvement. Notably, on the same HNMU1 dataset, the accuracy increased from 85% (reported by Son et al. [1] to 90.91% with this model. Experimental comparisons underscore the superior performance of LAGT over alternative methodologies in similar scenarios.
Metrics
References
. ACT. (2012). National collegiate retention and persistence to degree rates from http://www.act.org/research/policymakers/pdf/retain_2012.pdf
. Alfy, S. El., Gómez, J.M.& Dani, A. (2019). Exploring the benefits and challenges of learning analytics in higher education institutions: a systematic literature review. Information Discovery and Delivery, 47(1), 25-34.
. Arnold, K. E., & Pistilli, M. D. (2012). Course signals at Purdue: Using learning analytics to increase student success. In Proceedings of the 2nd international conference on learning analytics and knowledge (pp. 267-270). ACM. https://doi.org/10.1145/2330601.2330666.
. Bendangnuksung, E. & Prabu, D. (2018), Student’s Performance Prediction Using Deep Neural Network, International Journal of Applied Engineering Research ISSN 0973-4562 Volume 13, Number 2 (2018) pp. 1171-1176.
. Bienkowski, M., Feng, M. and Means, B. (2012), Enhancing Teaching and Learning through Educational Data Mining and Learning Analytics, Washington D.C.
. Corrigan, O. & Smeaton, A. F. (2017). A Course Agnostic Approach to Predicting Student Success from VLE Log Data Using Recurrent Neural Networks. European Conference on Technology Enhanced Learning, 545–548. Springer.
. Du, X., Yang, J., Hung, J-L. and Shelton, B. (2020). Educational data mining: A systematic review of research and emerging trends. Information Discovery and Delivery, 48(4), 225-236.
. Dutt, A., Ismail, M.A. & Herawan, T. (2017). A systematic review on educational data mining. IEEE Access, 5, 15991-16005.
. Fei, M. and Yeung, D-Y. (2015). Temporal Models for Predicting Student Dropout in Massive Open Online Courses. 2015 IEEE International Conference on Data Mining Workshop (ICDMW), 256-263.
. Hussain, S., Muhsion, Z. F., Salal, Y. K., Theodorou, P., Kurtoglu, F., and Hazarika, G. (2019), Prediction Model on Student Performance based on Internal Assessment using Deep Learning, International Journal of Emerging Technologies in Learning (iJET).
. Iatrellis, O., Savvas, I.K., Fitsilis, P. & Gerogiannis, V.C. (2021), A two-phase machine learning approach for predicting student outcomes. Educ Inf Technol 26, 69-88 (2021).
. Khalil, M., & Ebner, M. (2015). Learning analytics: Principles and constraints. EdMedia: World Conference on Educational Media and Technology, 1789-1799. Association for the Advancement of Computing in Education (AACE).
. T. N. Kipf and M. Welling (2017), Semi-supervised classification with graph convolutional networks. In: 5th international conference on learning representations, ICLR 2017. Conference Track Proceedings, Toulon.
. Fok, W. W. T., He, Y. S., Yeung, H. H. A., Law, K. Y., Cheung, K., Ai, Y., & Ho, P. (2018). Prediction model for students’ future development by deep learning and tensorflow artificial intelligence engine. 2018 4th International Conference on Information Management (ICIM).
. Krizhevsky, A., Sutskever, I., and Hinton, G. E. (2012a). ImageNet Classification with Deep Convolutional Neural Networks. Curran Associates, Inc.
. Mubarak, A.A., Cao, H. and Hezam, I.M. (2022), Modeling student’s performance using graph convolutional networks. Complex Intell. Syst. 8, 2183–2201 (2022).
. N.T.K. Son, C.C. Tho, B.T.T. Huong, P.T. Anh, N. Q. Tri, (2021), The application of machine learning in education science research, VNU JOURNAL of SCIENCE, Vol. 37, No. 4 (2021) 19-26.
. N.T.K. Son, N.V. Bien, N. H. Quynh, C.C. Tho (2022). Machine learning based admission data processing for early forecasting students’ learning outcomes, International Journal of Data Warehousing and Mining, 18(1), 1-14.
. N.T.K. Son, N.T. Thong, N.H. Quynh, H.V. Long (2024). Implementation neutrosophy in deep learning models for prediction student outcomes: An application in Hanoi Metropolitan University, SOCO, DOI: 10.1007/s00500-023-09625-4.
. Siemens, G., & Long, P. (2011). Penetrating the fog: Analytics in learning and education. EDUCAUSE Review, 46(5), 31-40.
. Thai-Nghe, N., Horváth, T., & Schmidt-Thieme, L. (2011), Factorization models for forecasting student performance, EDM 2011 - Proceedings of the 4th International Conference on Educational Data Mining, (January).
. Tinto, V. (2006). Research and practice of student retention: What next? Journal of College Student Retention: Research, Theory & Practice, 8(1), 1-19.
. Waheed, H., Hassan, S., Aljohani, N. R., Hardman, J., Nawaz, R. (2019). Predicting Academic Performance of Students from VLE Big Data using Deep Learning Models, Computers in Human Behavior (2019), https://doi.org/10.1016/j.chb.2019.106189
. Xu, X., Wang, J., Peng, H., & Wu, R. (2019). Prediction of academic performance associated with internet usage behaviors using machine learning algorithms. Computers in Human Behavior, 98, 166–173.
. B. K. Yousafzai, S.A. Khan, T.Rahman, I. Khan, I.Ullah, A.U. Rehman, M. Baz, H.H. Hamam, O. Cheikhrouhou (2021), Student-Performulator: Student Academic Performance Using Hybrid Deep Neural Network, Sustainability, https://doi.org/10.3390/su13179775
Downloads
Published
How to Cite
Issue
Section
License
1. We hereby assign copyright of our article (the Work) in all forms of media, whether now known or hereafter developed, to the Journal of Computer Science and Cybernetics. We understand that the Journal of Computer Science and Cybernetics will act on my/our behalf to publish, reproduce, distribute and transmit the Work.2. This assignment of copyright to the Journal of Computer Science and Cybernetics is done so on the understanding that permission from the Journal of Computer Science and Cybernetics is not required for me/us to reproduce, republish or distribute copies of the Work in whole or in part. We will ensure that all such copies carry a notice of copyright ownership and reference to the original journal publication.
3. We warrant that the Work is our results and has not been published before in its current or a substantially similar form and is not under consideration for another publication, does not contain any unlawful statements and does not infringe any existing copyright.
4. We also warrant that We have obtained the necessary permission from the copyright holder/s to reproduce in the article any materials including tables, diagrams or photographs not owned by me/us.