Early prediction Students' Graduation Rank Using LAGT: Enhancing Accuracy with GCN and Transformer on Small Datasets

Son Nguyen Thi Kim; Huy Quynh Nguyen; Tuan Minh Bui

doi:10.15625/1813-9663/21095

Author affiliations

Authors

Nguyen Thi Kim Son Hanoi University of Industry, Hanoi, 298 Cau Dien Street, Nord Tu Liem District, Ha Noi, Viet Nam
Nguyen Huu Quynh CMC University, 84 Nguyen Thanh Binh Street, Ha Dong District, Ha Noi, Viet Nam
Bui Tuan Minh Thuyloi University, 175 Tay Son Street, Dong Da District, Ha Noi, Viet Nam

DOI:

https://doi.org/10.15625/1813-9663/21095

Keywords:

Deep learning, Transformer, classìfication model, Graph convolutional network, Early prediction of graduation classification

Abstract

Recent efforts to predict students' graduation ranks using machine learning and deep learning methods have faced challenges, particularly with small sample sizes which limit accuracy. This paper introduces the LAGT (Learning Analysis by Graph Convolutional Network and Transformer) method, a novel approach for early predicting of students' graduation ranks. LAGT integrates a Graph Convolutional Network (GCN) to enhance the training set with labeled samples and utilizes a Transformer to forecast graduation ranks. This method harnesses the semi-supervised learning capabilities of GCN to automatically label data, addressing the constraints of small sample sizes in training sets. Additionally, the Transformer leverages its proficiency in handling long sequences and capturing contextual information, thereby demonstrating superior effectiveness in models trained on larger datasets. We evaluated this method on three datasets from some universities (HNMU1, HNMU2, VNU) and achieved a maximum accuracy of 92.73%. Results indicate that the integrated LAGT method outperforms comparable approaches across multiple metrics including accuracy, prediction precision, and model sensitivity, achieving up to a 35.73% improvement. Notably, on the same HNMU1 dataset, the accuracy increased from 85% (reported by Son et al. [1] to 90.91% with this model. Experimental comparisons underscore the superior performance of LAGT over alternative methodologies in similar scenarios.

Metrics

PDF views

122

References

. ACT. (2012). National collegiate retention and persistence to degree rates from http://www.act.org/research/policymakers/pdf/retain_2012.pdf

. Alfy, S. El., Gómez, J.M.& Dani, A. (2019). Exploring the benefits and challenges of learning analytics in higher education institutions: a systematic literature review. Information Discovery and Delivery, 47(1), 25-34.

. Arnold, K. E., & Pistilli, M. D. (2012). Course signals at Purdue: Using learning analytics to increase student success. In Proceedings of the 2nd international conference on learning analytics and knowledge (pp. 267-270). ACM. https://doi.org/10.1145/2330601.2330666.

. Bendangnuksung, E. & Prabu, D. (2018), Student’s Performance Prediction Using Deep Neural Network, International Journal of Applied Engineering Research ISSN 0973-4562 Volume 13, Number 2 (2018) pp. 1171-1176.

. Bienkowski, M., Feng, M. and Means, B. (2012), Enhancing Teaching and Learning through Educational Data Mining and Learning Analytics, Washington D.C.

. Corrigan, O. & Smeaton, A. F. (2017). A Course Agnostic Approach to Predicting Student Success from VLE Log Data Using Recurrent Neural Networks. European Conference on Technology Enhanced Learning, 545–548. Springer.

. Du, X., Yang, J., Hung, J-L. and Shelton, B. (2020). Educational data mining: A systematic review of research and emerging trends. Information Discovery and Delivery, 48(4), 225-236.

. Dutt, A., Ismail, M.A. & Herawan, T. (2017). A systematic review on educational data mining. IEEE Access, 5, 15991-16005.

. Fei, M. and Yeung, D-Y. (2015). Temporal Models for Predicting Student Dropout in Massive Open Online Courses. 2015 IEEE International Conference on Data Mining Workshop (ICDMW), 256-263.

. Hussain, S., Muhsion, Z. F., Salal, Y. K., Theodorou, P., Kurtoglu, F., and Hazarika, G. (2019), Prediction Model on Student Performance based on Internal Assessment using Deep Learning, International Journal of Emerging Technologies in Learning (iJET).

. Iatrellis, O., Savvas, I.K., Fitsilis, P. & Gerogiannis, V.C. (2021), A two-phase machine learning approach for predicting student outcomes. Educ Inf Technol 26, 69-88 (2021).

. Khalil, M., & Ebner, M. (2015). Learning analytics: Principles and constraints. EdMedia: World Conference on Educational Media and Technology, 1789-1799. Association for the Advancement of Computing in Education (AACE).

. T. N. Kipf and M. Welling (2017), Semi-supervised classification with graph convolutional networks. In: 5th international conference on learning representations, ICLR 2017. Conference Track Proceedings, Toulon.

. Fok, W. W. T., He, Y. S., Yeung, H. H. A., Law, K. Y., Cheung, K., Ai, Y., & Ho, P. (2018). Prediction model for students’ future development by deep learning and tensorflow artificial intelligence engine. 2018 4th International Conference on Information Management (ICIM).

. Krizhevsky, A., Sutskever, I., and Hinton, G. E. (2012a). ImageNet Classification with Deep Convolutional Neural Networks. Curran Associates, Inc.

. Mubarak, A.A., Cao, H. and Hezam, I.M. (2022), Modeling student’s performance using graph convolutional networks. Complex Intell. Syst. 8, 2183–2201 (2022).

. N.T.K. Son, C.C. Tho, B.T.T. Huong, P.T. Anh, N. Q. Tri, (2021), The application of machine learning in education science research, VNU JOURNAL of SCIENCE, Vol. 37, No. 4 (2021) 19-26.

. N.T.K. Son, N.V. Bien, N. H. Quynh, C.C. Tho (2022). Machine learning based admission data processing for early forecasting students’ learning outcomes, International Journal of Data Warehousing and Mining, 18(1), 1-14.

. N.T.K. Son, N.T. Thong, N.H. Quynh, H.V. Long (2024). Implementation neutrosophy in deep learning models for prediction student outcomes: An application in Hanoi Metropolitan University, SOCO, DOI: 10.1007/s00500-023-09625-4.

. Siemens, G., & Long, P. (2011). Penetrating the fog: Analytics in learning and education. EDUCAUSE Review, 46(5), 31-40.

. Thai-Nghe, N., Horváth, T., & Schmidt-Thieme, L. (2011), Factorization models for forecasting student performance, EDM 2011 - Proceedings of the 4th International Conference on Educational Data Mining, (January).

. Tinto, V. (2006). Research and practice of student retention: What next? Journal of College Student Retention: Research, Theory & Practice, 8(1), 1-19.

. Waheed, H., Hassan, S., Aljohani, N. R., Hardman, J., Nawaz, R. (2019). Predicting Academic Performance of Students from VLE Big Data using Deep Learning Models, Computers in Human Behavior (2019), https://doi.org/10.1016/j.chb.2019.106189

. Xu, X., Wang, J., Peng, H., & Wu, R. (2019). Prediction of academic performance associated with internet usage behaviors using machine learning algorithms. Computers in Human Behavior, 98, 166–173.

. B. K. Yousafzai, S.A. Khan, T.Rahman, I. Khan, I.Ullah, A.U. Rehman, M. Baz, H.H. Hamam, O. Cheikhrouhou (2021), Student-Performulator: Student Academic Performance Using Hybrid Deep Neural Network, Sustainability, https://doi.org/10.3390/su13179775

Early prediction Students' Graduation Rank Using LAGT: Enhancing Accuracy with GCN and Transformer on Small Datasets

Authors

DOI:

Keywords:

Abstract

Metrics

References

Downloads

Published

How to Cite

Issue

Section

License