Evaluating Generative AI as a Supportive Analytic Partner in Qualitative Coding of Metacognitive Student Reflections
Abstract
Qualitative data provides rich insight into student and educator thinking but remains difficult to analyse systematically at scale. The rise of generative artificial intelligence (GenAI) introduces new opportunities for pattern recognition and interpretive support, while also raising questions about how such systems can be responsibly embedded in educational research workflows. This study investigates the application of a fine-tuned GenAI model to classify metacognitive elements in student self-reflections and examines the methodological and epistemological implications of this process.

In partnership with a secondary school, a university, and a state education department, the study analysed more than 14,000 student reflection artefacts collected between 2018 and 2023. A total of 4,631 samples were manually coded for four sub-elements of metacognition (Goal Setting, Strategy Choice, Reflection on Learning, and Effort Regulation). These coded samples were then used to train and evaluate a fine-tuned GPT-4o-mini model, which achieved 80.98% classification accuracy. However, our analysis also raises critical questions. While the model could detect the existence of metacognitive constructs, it lacked the pedagogical and contextual grounding to assess their quality or relevance, and it required significant human effort. These findings highlight the need to reconceptualise GenAI not as a replacement for human judgement, but as a supportive analytic partner. We argue that co-design processes between educators, researchers, and developers are essential to ensure AI systems are trustworthy, theoretically grounded, and practically useful. The approach outlined in this study provides a roadmap for extending the use of GenAI to other complex educational constructs, with designs that reflect the needs and values of all system actors.
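To make the reported accuracy figure concrete, the following minimal sketch shows how agreement between human codes and model predictions might be computed. It is an illustration only, not the study's evaluation pipeline: the toy labels are hypothetical, the single-label-per-reflection setup is an assumption, and Cohen's kappa is included as a common chance-corrected complement to raw accuracy rather than as a metric the abstract reports.

```python
from collections import Counter

# The four sub-elements of metacognition named in the abstract.
LABELS = [
    "Goal Setting",
    "Strategy Choice",
    "Reflection on Learning",
    "Effort Regulation",
]


def accuracy(human, model):
    """Share of items where the model code matches the human code."""
    if len(human) != len(model) or not human:
        raise ValueError("label lists must be non-empty and of equal length")
    return sum(h == m for h, m in zip(human, model)) / len(human)


def cohens_kappa(human, model):
    """Chance-corrected agreement between two coders (Cohen's kappa)."""
    n = len(human)
    observed = accuracy(human, model)
    human_counts = Counter(human)
    model_counts = Counter(model)
    # Agreement expected if both coders assigned labels independently
    # at their observed marginal rates.
    expected = sum(
        human_counts[label] * model_counts[label]
        for label in set(human) | set(model)
    ) / (n * n)
    if expected == 1.0:
        return 1.0  # both coders used one identical label throughout
    return (observed - expected) / (1 - expected)


# Hypothetical toy data (assumption): eight reflections, one code each.
human_codes = LABELS + LABELS
model_codes = [
    "Goal Setting", "Strategy Choice", "Reflection on Learning", "Goal Setting",
    "Goal Setting", "Reflection on Learning", "Reflection on Learning",
    "Effort Regulation",
]

print(f"accuracy = {accuracy(human_codes, model_codes):.2f}")      # accuracy = 0.75
print(f"kappa    = {cohens_kappa(human_codes, model_codes):.2f}")  # kappa    = 0.67
```

In this toy case six of eight codes match, giving 75% accuracy, while kappa is lower because some of that agreement would be expected by chance. Neither number says anything about the quality or relevance of the coded reflections, which is the limitation the abstract highlights.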
Keywords: Metacognition, Human-in-the-loop, Artificial Intelligence (AI), Qualitative Research, K-12 education

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
https://orcid.org/0000-0002-2437-6892