CLST: Cold-Start Mitigation in Knowledge Tracing by Aligning a Generative Language Model as a Students’ Knowledge Tracer

Main

Sidebar

Published December 5, 2025
Heeseok Jung Jaesang Yoo Yohaan Yoon Yeonju Jang

Abstract

Generative large language models (LLMs) are widely utilized in education to assist students to learn and instructors to teach. In addition, generative LLMs are used to support personalized learning by recommending learning content in intelligent tutoring systems (ITSs). Nonetheless, there are few studies utilizing generative LLMs in the field of knowledge tracing (KT), which is a key component of ITSs. KT, which uses students' problem-solving histories to estimate their current levels of knowledge, is regarded as a key technology for personalized learning. Nevertheless, most existing KT models are characterized by their development with an ID-based paradigm, which results in a low performance in cold-start scenarios. These limitations can be mitigated by leveraging the vast quantities of external knowledge possessed by generative LLMs. In this study, we propose cold-start mitigation in knowledge tracing by aligning a generative language model as a students’ knowledge tracer (CLST) as a framework that utilizes a generative LLM as a knowledge tracer. Upon collecting data from math, social studies, and science subjects, we framed the KT task as a natural language processing task, wherein problem-solving data are expressed in natural language, and fine-tuned the generative LLM using the formatted KT dataset. Subsequently, we evaluated the performance of the CLST in situations of data scarcity using various baseline models for comparison. The results indicate that the CLST significantly enhanced performance with a dataset of fewer than 100 students in terms of prediction, reliability, and cross-domain generalization.

How to Cite

CLST: Cold-Start Mitigation in Knowledge Tracing by Aligning a Generative Language Model as a Students’ Knowledge Tracer. (2025). Journal of Educational Data Mining, 17(2), 86-117. https://doi.org/10.5281/
Abstract 111 | PDF Downloads 67 HTML Downloads 33

Details

Keywords

intelligent tutoring system, knowledge tracing, personalized learning, large language models applications

References
Abdelghani, R., Wang, Y.H., Yuan, X., Wang, T., Lucas, P., Sauzéon, H., and Oudeyer, P.Y. 2024. GPT-3-Driven Pedagogical Agents to Train Children's Curious Question-Asking Skills. International Journal of Artificial Intelligence in Education 34, 2, 483-518.

Abdelrahman, G., and Wang, Q. 2019. Knowledge Tracing with Sequential Key-Value Memory Networks. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 175-184.

Abdelrahman, G., Wang, Q., and Nunes, B. 2023. Knowledge Tracing: A Survey. ACM Computing Surveys 55, 11, 1-37.

Abu-Rasheed, H., Weber, C., and Fathi, M. 2024. Knowledge Graphs as Context Sources for LLM-Based Explanations of Learning Recommendations. In 2024 IEEE Global Engineering Education Conference (EDUCON), 1-5.

Ai, F., Chen, Y., Guo, Y., Zhao, Y., Wang, Z., Fu, G., and Wang, G. 2019. Concept-Aware Deep Knowledge Tracing and Exercise Recommendation in an Online Learning System. Proceedings of the 12th International Conference on Educational Data Mining, 240-245.

Bulut, O., and Yildirim-Erbasli, S.N. 2022. Automatic Story and Item Generation for Reading Comprehension Assessments with Transformers. International Journal of Assessment Tools in Education 9, Special Issue, 72-87.

Chen, P., Lu, Y., Zheng, V.W., and Pian, Y. 2018. Prerequisite-Driven Deep Knowledge Tracing. In 2018 IEEE International Conference on Data Mining (ICDM), 39-48.

Cheng, S., Liu, Q., Chen, E., Zhang, K., Huang, Z., Yin, Y., Huang, X., and Su, Y. 2022. AdaptKT: A Domain Adaptable Method for Knowledge Tracing. In Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining, 123-131.

Colpo, M.P., Primo, T.T., and de Aguiar, M.S. 2024. Lessons Learned from the Student Dropout Patterns on COVID-19 Pandemic: An Analysis Supported by Machine Learning. British Journal of Educational Technology 55, 2, 560-585.

Corbett, A.T., and Anderson, J.R. 1994. Knowledge Tracing: Modeling the Acquisition of Procedural Knowledge. User Modeling and User-Adapted Interaction 4, 253-278.

Dai, W., Tsai, Y.S., Lin, J., Aldino, A., Jin, H., Li, T., Gašević, D., and Chen, G. 2024. Assessing the Proficiency of Large Language Models in Automatic Feedback Generation: An Evaluation Study. Computers and Education: Artificial Intelligence 7, 100299.

Dai, W., Lin, J., Jin, H., Li, T., Tsai, Y.S., Gašević, D., and Chen, G. 2023. Can Large Language Models Provide Feedback to Students? A Case Study on ChatGPT. In 2023 IEEE International Conference on Advanced Learning Technologies (ICALT), 323-325.

Delianidi, M., and Diamantaras, K. 2023. KT-Bi-GRU: Student Performance Prediction with a Bi-Directional Recurrent Knowledge Tracing Neural Network. Journal of Educational Data Mining 15, 2, 1-21.

Do Viet, T., and Markov, K. 2023. Using Large Language Models for Bug Localization and Fixing. In 2023 12th International Conference on Awareness Science and Technology (iCAST), 192-197.

Doughty, J., Wan, Z., Bompelli, A., Qayum, J., Wang, T., Zhang, J., Zheng, Y., Doyle, A., Sridhar, P., Agarwal, A., Bogart, C., Keylor, E., Kultur, C., Savelka, J., and Sakr, M. 2024. A Comparative Study of AI-Generated (GPT-4) and Human-Crafted MCQs in Programming Education. In Proceedings of the 26th Australasian Computing Education Conference, 114-123.

Emerson, A., Min, W., Azevedo, R., and Lester, J. 2023. Early Prediction of Student Knowledge in Game-Based Learning with Distributed Representations of Assessment Questions. British Journal of Educational Technology 54, 1, 40-57.

Fan, Y., Jiang, F., Li, P., and Li, H. 2023. GrammarGPT: Exploring Open-Source LLMs for Native Chinese Grammatical Error Correction with Supervised Fine-Tuning. In CCF International Conference on Natural Language Processing and Chinese Computing, 69-80.

Feng, M., Heffernan, N., and Koedinger, K. 2009. Addressing the Assessment Challenge with an Online System that Tutors as It Assesses. User Modeling and User-Adapted Interaction 19, 243-266.

Gervet, T., Koedinger, K., Schneider, J., and Mitchell, T. 2020. When Is Deep Learning the Best Approach to Knowledge Tracing? Journal of Educational Data Mining 12, 3, 31-54.

Ghosh, A., Heffernan, N., and Lan, A.S. 2020. Context-Aware Attentive Knowledge Tracing. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2330-2339.

Hu, E.J., Shen, Y., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., and Chen, W. 2022. LoRA: Low-Rank Adaptation of Large Language Models. ICLR 1, 2, 3.

Idrissi, N., Zellou, A., Hourrane, O., Bakkoury, Z., and Benlahmar, E.H. 2019. Addressing Cold Start Challenges in Recommender Systems: Towards a New Hybrid Approach. In 2019 International Conference on Smart Applications, Communications and Networking (SmartNets), 1-6.

Jiang, A.Q., Sablayrolles, A., Mensch, A., Bamford, C., Chaplot, D.S., Casas, D.D.L., Bressand, F., Lengyel, G., Lample, G., Saulnier, L., Lavaud, L.R., Lachaux, M., Stock, P., Scao, T.L., Lavril, T., Wang, T., Lacroix, T., and Sayed, W.E. 2023. Mistral 7b. arXiv preprint arXiv:2310.06825.

Jin, M., Wen, Q., Liang, Y., Zhang, C., Xue, S., Wang, X., Zhang, J., Wang, Y., Chen, H., Li, X., Pan, S., Tseng, V.S., Zheng, Y., Chen, L., and Xiong, H. 2023. Large Models for Time Series and Spatio-Temporal Data: A Survey and Outlook. arXiv preprint arXiv:2310.10196.

Jung, H., Yoo, J., Yoon, Y., and Jang, Y. 2023. Language Proficiency Enhanced Knowledge Tracing. In International Conference on Intelligent Tutoring Systems, 3-15.

Kim, S., Kim, W., Jung, H., and Kim, H. 2021. DiKT: Dichotomous Knowledge Tracing. In Intelligent Tutoring Systems: 17th International Conference, ITS 2021, Virtual Event, June 7-11, 2021, Proceedings 17, 41-51.

Kuo, B.C., Chang, F.T., and Bai, Z.E. 2023. Leveraging LLMs for Adaptive Testing and Learning in Taiwan Adaptive Learning Platform (TALP). In LLM@AIED, 101-110.

Lee, W., Chun, J., Lee, Y., Park, K., and Park, S. 2022. Contrastive Learning for Knowledge Tracing. In Proceedings of the ACM Web Conference 2022, 2330-2338.

Li, L., Zhang, Y., Liu, D., and Chen, L. 2023. Large Language Models for Generative Recommendation: A Survey and Visionary Discussions. arXiv preprint arXiv:2309.01157.

Liu, Y., Yang, Y., Chen, X., Shen, J., Zhang, H., and Yu, Y. 2020. Improving Knowledge Tracing via Pre-training Question Embeddings. arXiv preprint arXiv:2012.05031.

Liu, Z., Liu, Q., Chen, J., Huang, S., Tang, J., and Luo, W. 2022. pyKT: A Python Library to Benchmark Deep Learning Based Knowledge Tracing Models. Advances in Neural Information Processing Systems 35, 18542-18555.

Lu, Y., Chen, P., Pian, Y., and Zheng, V.W. 2022. CMKT: Concept Map Driven Knowledge Tracing. IEEE Transactions on Learning Technologies 15, 4, 467-480.

Malik, A., Wu, M., Vasavada, V., Song, J., Coots, M., Mitchell, J., Goodman, N., and Piech, C. 2019. Generative Grading: Near Human-Level Accuracy for Automated Feedback on Richly Structured Problems. arXiv preprint arXiv:1905.09916.

Nakagawa, H., Iwasawa, Y., and Matsuo, Y. 2019. Graph-Based Knowledge Tracing: Modeling Student Proficiency Using Graph Neural Network. In IEEE/WIC/ACM International Conference on Web Intelligence, 156-163.

Neshaei, S.P., Davis, R.L., Hazimeh, A., Lazarevski, B., Dillenbourg, P., and Käser, T. 2024. Towards Modeling Learner Performance with Large Language Models. arXiv preprint arXiv:2403.14661.

Ni, L., Wang, S., Zhang, Z., Li, X., Zheng, X., Denny, P., and Liu, J. 2024. Enhancing Student Performance Prediction on Learnersourced Questions with SGNN-LLM Synergy. In Proceedings of the AAAI Conference on Artificial Intelligence 38, 21, 23232-23240.

Pandey, S., and Karypis, G. 2019. A Self-Attentive Model for Knowledge Tracing. arXiv preprint arXiv:1907.06837.

Pardos, Z.A., and Bhandari, S. 2023. Learning Gain Differences Between ChatGPT and Human Tutor Generated Algebra Hints. arXiv preprint arXiv:2302.06871.

Pardos, Z.A., and Bhandari, S. 2024. ChatGPT-Generated Help Produces Learning Gains Equivalent to Human Tutor-Authored Help on Mathematics Skills. PLoS ONE 19, 5, e0304013.

Pavlik, P.I., Jr., Cen, H., and Koedinger, K.R. 2009. Performance Factors Analysis–A New Alternative to Knowledge Tracing. In Artificial Intelligence in Education, 531-538. IOS Press.

Peng, C., Yang, X., Chen, A., Smith, K.E., PourNejatian, N., Costa, A.B., Martin, C., Flores, M.G., Zhang, Y., Magoc, T., Lipori, G., Mitchell, D.A., Ospina, N.S., Ahmed, M.M., Hogan, W.R., Shenkman, E.A., Guo, Y., Bian, J., and Wu, Y. 2023. A Study of Generative Large Language Model for Medical Research and Healthcare. NPJ Digital Medicine 6, 1, 210.

Piech, C., Bassen, J., Huang, J., Ganguli, S., Sahami, M., Guibas, L.J., and Sohl-Dickstein, J. 2015. Deep Knowledge Tracing. Advances in Neural Information Processing Systems 28.

Sahebi, S., and Cohen, W.W. 2011. Community-Based Recommendations: A Solution to the Cold Start Problem. In Workshop on Recommender Systems and the Social Web, RSWEB.

Sethi, R., and Mehrotra, M. 2021. Cold Start in Recommender Systems—A Survey from Domain Perspective. In Intelligent Data Communication Technologies and Internet of Things: Proceedings of ICICI 2020, 223-232. Springer Singapore.

Shen, S., Liu, Q., Chen, E., Huang, Z., Huang, W., Yin, Y., Su, Y., and Wang, S. 2021. Learning Process-Consistent Knowledge Tracing. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, 1452-1460.

Shen, S., Liu, Q., Huang, Z., Zheng, Y., Yin, M., Wang, M., and Chen, E. 2024. A Survey of Knowledge Tracing: Models, Variants, and Applications. IEEE Transactions on Learning Technologies 17, 1898-1919.

Sridhar, P., Doyle, A., Agarwal, A., Bogart, C., Savelka, J., and Sakr, M. 2023. Harnessing LLMs in Curricular Design: Using GPT-4 to Support Authoring of Learning Objectives. arXiv preprint arXiv:2306.17459.

Stamper, J., Niculescu-Mizil, A., Ritter, S., Gordon, G.J., and Koedinger, K.R. 2010. Challenge Data Set from KDD Cup 2010 Educational Data Mining Challenge.

Sun, J., Wei, M., Feng, J., Yu, F., Li, Q., and Zou, R. 2024. Progressive Knowledge Tracing: Modeling Learning Process from Abstract to Concrete. Expert Systems with Applications 238, 122280.

Tey, F.J., Wu, T.Y., Lin, C.L., and Chen, J.L. 2021. Accuracy Improvements for Cold-Start Recommendation Problem Using Indirect Relations in Social Networks. Journal of Big Data 8, 1-18.

Van Der Linden, W.J., and Hambleton, R.K. 1997. Item Response Theory: Brief History, Common Models, and Extensions. In Handbook of Modern Item Response Theory, 1-28. Springer New York.

Vie, J.J., and Kashima, H. 2019. Knowledge Tracing Machines: Factorization Machines for Knowledge Tracing. In Proceedings of the AAAI Conference on Artificial Intelligence 33, 1, 750-757.

Wang, C., Ma, W., Zhang, M., Lv, C., Wan, F., Lin, H., Tang, T., Liu, Y., and Ma, S. 2021. Temporal Cross-Effects in Knowledge Tracing. In Proceedings of the 14th ACM International Conference on Web Search and Data Mining, 517-525.

Wang, Q., and Mousavi, A. 2023. Which Log Variables Significantly Predict Academic Achievement? A Systematic Review and Meta-Analysis. British Journal of Educational Technology 54, 1, 142-191.

Wang, Z., Lamb, A., Saveliev, E., Cameron, P., Zaykov, Y., Hernández-Lobato, J.M., Turner, R.E., Baraniuk, R.G., Barton, C., Jones, S.P., Woodhead, S., and Zhang, C. 2020. Instructions and Guide for Diagnostic Questions: The NeurIPS 2020 Education Challenge. arXiv preprint arXiv:2007.12061.

Wei, J., Bosma, M., Zhao, V.Y., Guu, K., Yu, A.W., Lester, B., Du, N., Dai, A.M., and Le, Q.V. 2021. Finetuned Language Models Are Zero-Shot Learners. arXiv preprint arXiv:2109.01652.

Weng, L.T., Xu, Y., Li, Y., and Nayak, R. 2008. Exploiting Item Taxonomy for Solving Cold-Start Problem in Recommendation Making. In 2008 20th IEEE International Conference on Tools with Artificial Intelligence 2, 113-120.

Whitehill, J., and LoCasale-Crouch, J. 2023. Automated Evaluation of Classroom Instructional Support with LLMs and BoWs: Connecting Global Predictions to Specific Feedback. arXiv preprint arXiv:2310.01132.

Wu, L., Zheng, Z., Qiu, Z., Wang, H., Gu, H., Shen, T., Qin, C., Zhu, C., Zhu, H., Liu, Q., Xiong, H., and Chen, E. 2024. A Survey on Large Language Models for Recommendation. World Wide Web 27, 5, 60.

Wu, T., and Ling, Q. 2023. Fusing Hybrid Attentive Network with Self-Supervised Dual-Channel Heterogeneous Graph for Knowledge Tracing. Expert Systems with Applications 225, 120212.

Wu, Z., Huang, L., Huang, Q., Huang, C., and Tang, Y. 2022. SGKT: Session Graph-Based Knowledge Tracing for Student Performance Prediction. Expert Systems with Applications 206, 117681.

Xia, Z., Dong, N., Wu, J., and Ma, C. 2023. Multivariate Knowledge Tracking Based on Graph Neural Network in ASSISTments. IEEE Transactions on Learning Technologies 17, 32-43.

Xu, J., Huang, X., Xiao, T., and Lv, P. 2023. Improving Knowledge Tracing via a Heterogeneous Information Network Enhanced by Student Interactions. Expert Systems with Applications 232, 120853.

Yang, H., Hu, S., Geng, J., Huang, T., Hu, J., Zhang, H., and Zhu, Q. 2024. Heterogeneous Graph-Based Knowledge Tracing with Spatiotemporal Evolution. Expert Systems with Applications 238, 122249.

Yeung, C.K., and Yeung, D.Y. 2018. Addressing Two Problems in Deep Knowledge Tracing via Prediction-Consistent Regularization. In Proceedings of the Fifth Annual ACM Conference on Learning at Scale, 1-10.

Yin, Y., Dai, L., Huang, Z., Shen, S., Wang, F., Liu, Q., Chen, E., and Li, X. 2023. Tracing Knowledge Instead of Patterns: Stable Knowledge Tracing with Diagnostic Transformer. In Proceedings of the ACM Web Conference 2023, 855-864.

Zhang, J., Cambronero, J., Gulwani, S., Le, V., Piskac, R., Soares, G., and Verbruggen, G. 2022. Repairing Bugs in Python Assignments Using Large Language Models. arXiv preprint arXiv:2209.14876.

Zhang, J., Shi, X., King, I., and Yeung, D.Y. 2017. Dynamic Key-Value Memory Networks for Knowledge Tracing. In Proceedings of the 26th International Conference on World Wide Web, 765-774.

Zhang, M., Zhu, X., Zhang, C., Ji, Y., Pan, F., and Yin, C. 2021. Multi-Factors Aware Dual-Attentional Knowledge Tracing. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management, 2588-2597.

Zhang, X., Zhang, X., Yang, C., Yan, H., and Qiu, X. 2023. Does Correction Remain A Problem For Large Language Models? arXiv preprint arXiv:2308.01776.

Zhao, J., Bhatt, S., Thille, C., Gattani, N., and Zimmaro, D. 2020. Cold Start Knowledge Tracing with Attentive Neural Turing Machine. In Proceedings of the Seventh ACM Conference on Learning@Scale, 333-336.

Zhao, S., Fu, H., Gong, M., and Tao, D. 2019. Geometry-Aware Symmetric Domain Adaptation for Monocular Depth Estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 9788-9798.

Zhao, W.X., Zhou, K., Li, J., Tang, T., Wang, X., Hou, Y., Min, Y., Zhang, B., Zhang, J., Dong, Z., Du, Y., Yang, C., Chen, Y., Chen, Z., Jiang, J., Ren, R., Li, Y., Tang, X., Liu, Z., and Wen, J.R. 2023. A Survey of Large Language Models. arXiv preprint arXiv:2303.18223.

Zhou, H., Liu, F., Gu, B., Zou, X., Huang, J., Wu, J., Li, Y., Chen, S.S., Zhou, P., Liu, J., Hua, Y., Mao, C., You, C., Wu, X., Zheng, Y., Clifton, L., Li, Z., Luo, J., and Clifton, D.A. 2023. A Survey of Large Language Models in Medicine: Progress, Application, and Challenge. arXiv preprint arXiv:2311.05112.

Zhou, Z., Ning, M., Wang, Q., Yao, J., Wang, W., Huang, X., and Huang, K. 2023. Learning by Analogy: Diverse Questions Generation in Math Word Problem. In Findings of the Association for Computational Linguistics: ACL 2023, 10892-10908.
Section
Articles