Collaboration-Type Identification in Educational Datasets



Published Jun 29, 2014
Andrew Waters Christoph Studer Richard Baraniuk


Identifying collaboration between learners in a course is an important challenge in education for two reasons: First, depending on the courses rules, collaboration can be considered a form of cheating. Second, it helps one to more accurately evaluate each learners competence. While such collaboration identification is already challenging in traditional classroom settings consisting of a small number of learners, the problem is greatly exacerbated in the context of both online courses or massively open online courses (MOOCs) where potentially thousands of learners have little or no contact with the course instructor. In this work, we propose a novel methodology for collaboration-type identification, which both identifies learners who are likely collaborating and also classifies the type of collaboration employed. Under a fully Bayesian setting, we infer the probability of learners succeeding on a series of test items solely based on graded response data. We then use this information to jointly compute the likelihood that two learners were collaborating and what collaboration model (or type) was used. We demonstrate the efficacy of the proposed methods on both synthetic and real-world educational data; for the latter, the proposed methods find strong evidence of collaboration among learners in two non-collaborative take-home exams.

How to Cite

Waters, A., Studer, C., & Baraniuk, R. (2014). Collaboration-Type Identification in Educational Datasets. JEDM | Journal of Educational Data Mining, 6(1), 28-52. Retrieved from
Abstract 466 | PDF Downloads 153


W. E. Becker and C. Johnston. 1999. The Relationship between Multiple Choice and Essay Response Questions in Assessing Economics Understanding. Economic Record 75, 4 (Dec. 1999), 348–357.

Y. Bergner, S. Droschler, G. Kortemeyer, S. Rayyan, D. Seaton, and D. Pritchard. 2012. Model-Based Collaborative Filtering Analysis of Student Response Data: Machine-Learning Item Response Theory. In Proc. 5th Intl. Conf. Educational Data Mining. Chania, Greece, 95–102.

A. C. Butler and H. L. Roediger. 2008. Feedback enhances the positive effects and reduces the negative effects of multiple-choice testing. Memory & Cognition 36, 3 (Apr. 2008), 604–616.

W. W. Chaffin. 1979. Dangers in using the Z Index for detection of cheating on tests. Psychological Reports 45 (Dec. 1979), 776–778.

S. Chib and E. Greenberg. 1998. Analysis of multivariate probit models. Biometrika 85, 2 (June 1998), 347–361.

R. B. Frary. 1993. Statistical Detection of Multiple-Choice Answer Copying: Review and Commentary. Applied Measurement in Education 6, 2 (1993), 153–165.

R. B. Frary, T. N. Tideman, and T. M. Watts. 1977. Indices of Cheating on Multiple-Choice Tests. Journal of Educational Statistics 2, 4 (1977), 235–256.

A. Gelman, C. Robert, N. Chopin, and J. Rousseau. 1995. Bayesian Data Analysis. CRCPress.

T. M. Haladyna, S. M. Downing, and M. C. Rodriguez. 2002. A Review of Multiple-Choice Item-Writing
guidelines for Classroom Assessment. Applied measurement in education 15, 3 (2002), 309–333.

P. D. Hoff. 2009. A First Course in Bayesian Statistical Methods. Springer Verlag.

A. S. Lan, A. E. Waters, C. Studer, and R. G. Baraniuk. 2013. Sparse Factor Analysis for Learning and
Content Analytics. submitted to Journal of Machine Learning Research, (2013).

M. V. Levine and B. R. Donald. 1979. Measuring the Appropriatemess of Multiple-Choice Test Scores.
Journal of Educational Statistics 4, 5 (Winter 1979), 269–290.

H. S. Nwana. 1990. Intelligent tutoring systems: an overview. Artificial Intelligence Review 4, 4 (1990),

L. Pappano. 2012. The Year of the MOOC. The New York Times (Nov. 4 2012).

G. Rasch. 1960. Studies in Mathematical Psychology: I. Probabilistic Models for Some Intelligence and
Attainment Tests. Nielsen & Lydiche.

G. Rasch. 1993. Probabilistic Models for Some Intelligence and Attainment Tests. MESA Press.

M. C. Rodriguez. 1997. The Art & Science of Item Writing: A Meta-Analysis of Multiple-Choice Item
Format Effects. In annual meeting of the American Education Research Association, Chicago, IL.

M. N. Schmidt, O. Winther, and L. K. Hansen. 2009. Bayesian Non-Negative Matrix Factorization. In
Independent Component Analysis and Signal Separation, Vol. 5441. 540–547.

A. E. Waters, C. Studer, and R. G. Baraniuk. 2013. Bayesian Pairwise Collaboration Detection in Educa-
tional Datasets. In Proc. IEEE Global Conf. on Sig. and Info. Proc. (GlobalSIP). Austin, TX.

G. O. Wesolowsky. 2000. Detection Excessive Similarity in Answers on Multiple Choice Exams. Journal of
Applied Statistics 27, 7 (Aug. 2000), 909–921.

P. H. Westfall, W. O. Johnson, and J. M. Utts. 1997. A Bayesian perspective on the Bonferroni adjustment.
Biometrika 84, 2 (1997), 419–427.