Is this model for real? Simulating data to reveal the proximity of a model to reality

Rinat Rosenberg-Kima, Zachary A. Pardos

Research output: Contribution to journalConference articlepeer-review

Abstract

Simulated data plays a central role in Educational Data Mining and in particular in Bayesian Knowledge Tracing (BKT) research. The initial motivation for this paper was to try to answer the question: given two datasets could you tell which of them is real and which of them is simulated? The ability to answer this question may provide an additional indication of the goodness of the model, thus, if it is easy to discern simulated data from real data that could be an indication that the model does not provide an authentic representation of reality, whereas if it is hard to set the real and simulated data apart that might be an indication that the model is indeed authentic. In this paper we will describe analyses of 42 GLOP datasets that were performed in an attempt to address this question. Possible simulated data based metrics as well as additional findings that emerged during this exploration will be discussed.

Original languageEnglish
Pages (from-to)78-87
Number of pages10
JournalCEUR Workshop Proceedings
Volume1432
StatePublished - 2015
EventWorkshops at the 17th International Conference on Artificial Intelligence in Education, AIED-WS 2015 - Madrid, Spain
Duration: 22 Jun 201526 Jun 2015

Keywords

  • Bayesian Knowledge Tracing (BKT)
  • Parameters space
  • Simulated data

ASJC Scopus subject areas

  • General Computer Science

Fingerprint

Dive into the research topics of 'Is this model for real? Simulating data to reveal the proximity of a model to reality'. Together they form a unique fingerprint.

Cite this