<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The Mobile Fact and Concept Textbook System (MoFaCTS) Computational Model and Scheduling System</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Philip I. Pavlik Jr.</string-name>
          <email>ppavlik@memphis.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Luke G. Eglington</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute for Intelligent Systems, University of Memphis</institution>
          ,
          <addr-line>Memphis, TN 38152</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>An intelligent textbook may be defined as an interaction layer between the text and the student, helping the student master the content in the text. The Mobile Fact and Concept Training System (MoFaCTS) is an adaptive instructional system for simple content that has been developed into an interaction layer to mediate textbook instruction and so is being transformed into the Mobile Fact and Concept Textbook System (MoFaCTS). In this paper, we document the several terms of the logistic regression model we use to track performance adaptively. We then examine the contribution of each component of our model when it is fit to 4 semesters of Anatomy and Physiology course practice data. Following this documentation of the model, we explain how it is applied in the MoFaCTS system to schedule performance by targeting practice for each item at an optimal efficiency threshold.</p>
      </abstract>
      <kwd-group>
        <kwd>System</kwd>
        <kwd>intelligent tutoring systems</kwd>
        <kwd>e-learning</kwd>
        <kwd>instructional design</kwd>
        <kwd>cloze</kwd>
        <kwd>models of learning</kwd>
        <kwd>adaptive scheduling</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Many adaptive learning systems (ALS) are inspired by the idea of personalizing the
selection of practice items for a student. However, existing ALS rarely quantify
studentlevel individual differences (e.g., learning rate) or consider efficiency when sequencing
practice, so the potential of adaptive practice selection is rarely achieved in practice.
Since the prominence of behaviorism, there has been an interest in using automated
methods to sequence practice items to help students learn (e.g., [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]). Skinner advocated
the construction of sequences to promote error-free learning through a content domain.
Advocates expanded upon these ideas to produce systems with adaptive branching
depending on student responses (e.g., [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]). The transition to the information processing
approach offered rich opportunities for adaptive practice since by proposing
hypothetical cognitive constructs like memories or skills; it became easier to make
computational models that tracked such constructs (e.g., [
        <xref ref-type="bibr" rid="ref1 ref3 ref4">1, 3, 4</xref>
        ]. This early work was so
influential that one of the largest systems with adaptive sequencing, Carnegie Learning’s
Cognitive Tutor series of products, currently uses a system with many similarities to
earlier models (e.g., [
        <xref ref-type="bibr" rid="ref3 ref5 ref6">3, 5, 6</xref>
        ].
      </p>
      <p>
        We are currently developing and testing an adaptive textbook system to teach
community college students Anatomy and Physiology content (see Fig. 1 below) using cloze
practice. Students have used our system during four semesters as a supplement to course
content. The practice content is currently cloze sentences created with NLP algorithms
and the course textbook [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ]. Over the past two years, we have iteratively refined all
aspects of the system based on practice data, teacher feedback, student survey data, and
system log data (e.g., student performance).
      </p>
      <p>
        Below we describe the motivating literature of our approach, followed by our
iterative process to improve our ALS to integrate these additional student variables. While
we describe this work embedded in our development context, we are explicitly
designing the system as a generic textbook supplement that might be applied to any textbook
to help students practice that content [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>"The __________ __________ is composed predominantly of
neural tissue, but also includes blood vessels and connective tissue."
(fill-in is nervous system)</p>
      <p>
        “Once __________ __________ are secreted, phosphates
filtered into the fluid of the renal tubule buffer them, aided by
ammonia.” (fill-in is hydrogen ions)
Our adaptive practice system is specifically designed for learning materials containing
a set of knowledge components (KCs) that are interdependent and for which multiple
KCs may be required to solve an individual problem. Despite this focus on complex
content, the adaptive sequencing applies to both independent items (e.g., unrelated
vocabulary words) and dependent items within a KC model (e.g., practice questions nested
within a concept). There are two primary novel components of our system that
distinguish our approach. First, our system uses an adaptive practice algorithm, with practice
chosen based on an empirically derived Optimal Efficiency Threshold (OET) [
        <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
        ].
Second, the learner model in our system includes features that account for elapsed time
between practice events and the difficulty of those events. Below we begin by
describing relevant literature concerning adaptive practice algorithms and the existing adaptive
learning systems within which they are used. Most of the relevant literature and existing
adaptive systems involve what is essentially flashcard learning. The key distinctions
among them are whether practice or not practice is scheduled adaptively (the practice
algorithm) and whether decisions are made with the help of a learner model.
Subsequently, we describe relevant literature on individual student and item differences and
why learner models need to account for these differences.
      </p>
      <p>
        The Pimsleur method [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] was an early attempt to leverage spacing effects into
language learning practice. With this method, new vocabulary was introduced and tested
with increasingly wide spacing intervals. Expanding practice intervals can be effective
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], but they may not be effective unless the interval is adaptive to the user's
performance [
        <xref ref-type="bibr" rid="ref12 ref9">9, 12</xref>
        ]. With the Pimsleur method, spacing intervals are increased
regardless of item difficulty or student performance. This heuristic leads to overly difficult
(and inefficient) practice for some items. The Leitner system [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] offered an improved
algorithm that was adaptive to student performance. Put simply, practice schedules for
items practiced according to the Leitner method were increased or decreased according
to whether the student was answering correctly (increase spacing) or incorrectly
(decrease spacing). Decreasing spacing for harder items can improve learning [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] and has
been successfully implemented in adaptive practice algorithms [
        <xref ref-type="bibr" rid="ref15 ref16 ref9">9, 15, 16</xref>
        ].
      </p>
      <p>
        In their simplest forms, ALS do not have a model estimating knowledge; they have
a decision rule dictating when a concept is sufficiently well understood and adaptive
feedback triggered by incorrect answers. For example, the Assistments system
considers content mastered when the student answers questions about that content correctly
three times in a row [
        <xref ref-type="bibr" rid="ref17 ref18">17, 18</xref>
        ]. Relevant feedback and hints are provided if the student
answers incorrectly. While this approach is superior to undifferentiated instruction and
problem sequencing [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ], including a model can further improve practice efficiency. A
particularly prominent model-informed adaptive system is the Cognitive Tutor System.
It uses a Bayesian Knowledge Tracing (BKT) model to trace the learning of
mathematical skills or KCs (knowledge components), adapting practice to drop items from the
practice set when they have been “mastered” according to the BKT model. BKT avoids
inefficiencies that can arise from solely decision-rule-based scheduling (e.g., a student
getting two in a row repeatedly but not progressing due to a 3 in a row requirement).
While BKT accounts for learning and adapts to performance, it and other popular
models (e.g., PFA) typically do not account for other important factors that can predict
knowledge states, such as elapsed time between practice attempts, the increased
predictive utility of recent vs. older attempts, and forgetting.
      </p>
      <p>
        Recently, more advanced adaptive practice algorithms informed by psychological
theories of memory have been tested. The Half-life Regression algorithm [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] and the
difficulty-threshold system introduced by Eglington &amp; Pavlik [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] produce practice
scheduling similar to these methods but further improved upon compared to these
earlier heuristics by scheduling practice according to predictions by a learner model
inspired by psychological theories of spacing and forgetting. Lindsey et al. [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] also
demonstrated how scheduling practice according to a model could provide robust
learning benefits over simpler scheduling algorithms. An important detail of these
approaches is that the decision rule for which item to practice next is informed by prior
theory that items more likely to have been forgotten are more productive items to
practice [
        <xref ref-type="bibr" rid="ref21 ref3">3, 21</xref>
        ]. Thus, more learning gains may be achieved on such trials, but those trials
may also be more time-consuming than efficiency-based approaches [
        <xref ref-type="bibr" rid="ref16 ref9">9, 16</xref>
        ]. We
describe these issues in detail below. Next, we describe our approach, in which we also
use a theoretically driven learner model, and also account for the efficiency of practice
when making pedagogical decisions with those predictions. For our adaptive learning
system (ALS), we chose a logistic regression framework instead due to its greater
simplicity and flexibility caused by the easy combination of multiple factors in the
underlying regression, unlike methods like BKT, where the addition of additional factors
greatly complexifies the optimization and implementation of the model. Using logistic
regression allowed us to add features more easily with forgetting effects and spacing
effects, crucial for modeling declarative memory. This flexibility also allowed us to
include features to account for linguistic features of practice items and student
differences in prior knowledge, learning rate, and motivation.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Logistic Regression Model</title>
      <sec id="sec-2-1">
        <title>Student differences</title>
        <p>
          Prior knowledge varies across students that use adaptive educational technology (e.g.,
[
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]). This variability is part of the motivation for creating adaptive educational
technology in the first place. Individual differences in prior knowledge are important
because they determine the initial challenge of practice, which has ramifications for the
efficacy of the practice and the student's subjective experience. Prior knowledge can
also impact the rate at which the student can encode new information - if they already
have a somewhat coherent mental model of the to-be-learned topic, they may better
learn new related concepts [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]. Variability in student learning rates is an important
topic and has been investigated to some extent already. Lee &amp; Brunskill [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]
demonstrated that practice scheduled by a BKT model and mastery criterion might be more
efficient if the BKT model included individual student parameters. Yudelson,
Koedinger, &amp; Gordon [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ] provided further evidence that accounting for student-level
learning rates could improve model fits, perhaps more than student prior knowledge.
Mis-estimating learning rates can also lead to systematic errors when adaptively
sequencing practice [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ], reducing practice efficiency. If a system consistently
overestimates a students’ learning, that student may be subjected to overly challenging practice
content. This overestimation may have other unintended consequences on the student.
For instance, overly difficult or easy tasks can lead to inattention [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ]. In other words,
a mismatch between the model predictions and the student can reduce a students’
engagement with the task [
          <xref ref-type="bibr" rid="ref27 ref28">27, 28</xref>
          ] and perhaps ultimately increase the probability of
student drop-out [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ]. In sum, accounting for student-level differences is more important
than simply improving model fit. Accounting for this variance can have tangible
beneficial effects on the efficiency of the ALS.
3.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Multi-level item differences</title>
        <p>
          Existing ALS assume KCs vary in difficulty [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ]. This difficulty can be accounted for
by estimating different learning rates across KCs. One advantage of differences
intrinsic to KCs is that these differences frequently generalize across students and improve
prediction accuracy for new students. Additionally, most learning materials (and
representative KCs) are taught repeatedly to many students. In other words, new students are
more common than new content, and thus generalizing information about KCs can
greatly enhance ALS predictive accuracy and usefulness. For example, in our research,
we use cloze items as our primary trial type, providing learners with sentences with
keywords omitted that they must fill in. We plan to eventually have difficulty measures
for these items, treating them as KCs. However, currently, we do not estimate intercepts
for these KCs. Instead, item/KC learning is tracked at three nested levels: the
studentlevel, the fill-in-word (i.e., performance on prior attempts to recall that particular word
across different cloze sentences), and the sentence-level (independent of which
fill-inword for the sentence is chosen).
3.3
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Transitioning to a more comprehensive model</title>
        <p>
          The goal of our model was not to prove cognitive theories but to apply them. Though
more detailed than models like AFM, PFA, or BKT, the learning effects of our model
are simpler than cognitive architectures such as ACT-R. We included the additional
complexity to account for robust learning effects that these simple popular models do
have mechanisms to capture. However, in many cases, we use basic mechanisms from
these models and variants of these models, which we find to combine well using logistic
regression as the inference method. For example, as can be seen below, we use
mechanisms related to PFA variants such as R-PFA [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ], PFA-Decay [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ], and PFA difficulty
[
          <xref ref-type="bibr" rid="ref33">33</xref>
          ], along with memory-based recency similar to ACT-R [
          <xref ref-type="bibr" rid="ref34">34</xref>
          ]. Below we describe this
initial model used to track student learning and performance in Anatomy and
Physiology across four semesters. Following that, we introduce new features that attempt to
account for additional student and item differences. Some of these features are derived
from student self-report data, while others are new tracking features designed to account
for student individual differences in learning rate and prior knowledge.
3.4
        </p>
      </sec>
      <sec id="sec-2-4">
        <title>Features in the model</title>
        <p>
          These features are strongly informed by cognitive theories of learning, although to our
knowledge have not previously been utilized together in a live adaptive learning system.
The model was parameterized for the first semester using a highly detailed cloze
learning experiment with basic statistics sentences (for a description of this dataset, see [
          <xref ref-type="bibr" rid="ref34">34</xref>
          ]).
We refit this model using the first semester Fall data; those parameters were
subsequently used for successive semesters. The choice of model terms was based on the
need to produce a model that would behave consistently (parameters showing high
correlation even though different) in all cases when presented with reasonable data,
Temporal recency. There is a strong effect of elapsed time since a previous practice
on performance [
          <xref ref-type="bibr" rid="ref35">35</xref>
          ]. Skills and memories decay rapidly as a function of time, and the
most important part of this decay is well represented by a power-law function of the
times since last practice. We included recency features for both cloze-level KCs and
the cloze fill-in answers themselves for the present model.
        </p>
        <p>
          Long-term learning. While we might expect long-term forgetting, once recency is
factored in and with the presence of the adaptive factors below, long-term learning is
captured as permanent. Our model's long-term learning features were counts of correct and
incorrect attempts, computed at the level of the cloze fill-in (KC) within-student.
Counts of correctness were weighted by the difficulty of the attempt (operationalized
as the predicted probability of correctness). An additional learning feature for correct
counts was included in which the difficulty weighted counts were squared to represent
the diminishing returns of additional correct attempts (PFA-Difficulty, [
          <xref ref-type="bibr" rid="ref33">33</xref>
          ]). This
approach has been shown to perform similarly to the PFA approach. However, it has
the important implication that there is an optimal level of difficulty, which is more well
supported theoretically than a constant effect of practice difficulty. In other words, the
implied null hypothesis in most models is that difficulty is unimportant to learning,
which contravenes many research findings (e.g., [
          <xref ref-type="bibr" rid="ref16 ref21 ref9">9, 16, 21</xref>
          ]), and we think that is
implausible. Weighting by predicted difficulty means that the contribution of a given
attempt is specific to the particular student because the predictions are based on a model
estimate of the student’s difficulty using that student's unique practice history.
Tracking. As a student learns a concept, their performance should improve. Changes
in performance over time can be used by computing a running average. However, recent
trials are more informative of students’ ability, and thus running averages should weight
recent trials more heavily. Such predictors have been shown to improve model fit (e.g.,
R-PFA, [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ]). We used a similar approach, but instead of using a ratio of correctness
vs. incorrectness, we used the logarithm of the correct/incorrect ratio. This alternative
approach has additional flexibility by not being restricted to a range of [
          <xref ref-type="bibr" rid="ref1">0,1</xref>
          ] and having
the ability to indicate a decrease of performance expectation by using ghost success
AND ghost failures [
          <xref ref-type="bibr" rid="ref34">34</xref>
          ].
        </p>
        <p>
          Syllable hints. Providing hints to help students answer fill-in questions can increase
the probability of success without reducing learning [
          <xref ref-type="bibr" rid="ref36">36</xref>
          ]. Thus providing hints may be
productive in adaptive instructional systems. In order to quantify the correctness
probability increase as a function of syllable hints, the model included separate intercepts
for each level of hint syllable (0, 1, or 2 syllable hints).
        </p>
        <p>Intercepts for each prior semester. Student aptitude, instructional approaches, and
other external factors may change across semesters. To reduce the impact of these
potential differences on the model, separate intercepts were included for each semester to
improve model stability.</p>
        <p>Performance =</p>
        <p>Recent performance of student +
Recent performance of student on the fill-in+
Temporal recency of the sentence/fill-in pair +</p>
        <p>Temporal recency of the fill-in +
Long-term effect of practice difficulty of success +</p>
        <p>Long-term effect of count of failures +
Intercept for each of the 3 hint condition levels +</p>
        <p>Intercept for each prior semester
(1)</p>
        <p>
          In section 4, we look at the quantitative importance of each term to prediction. In
section 5, we explore the qualitative results of the model applied to practice
optimization. Equation 1 fit with R2 = .1093, which is considerably lower than experimental
results, but in line with other implemented systems [
          <xref ref-type="bibr" rid="ref34">34</xref>
          ]. Logistic regression using GLM
showed that all predictors (except for semester intercepts) had coefficient Z-scores
greater than 8. Since the model was fit for N = 93155 trials with only 16 parameters,
there was no concern with overfitting within the context of our collected data, so we do
not show cross-validation results for space reasons (but more complex models would
certainly face this problem). To better understand the parameterization, note that the
first four terms required non-linear decay parameters that were solved for using an R
optimizer that recomputed the feature and the logistic regression model iteratively to
arrive at optimal non-linear parameters [
          <xref ref-type="bibr" rid="ref32 ref34">32, 34</xref>
          ]. The exact values of these parameters
are specific to the data and can be obtained from the authors as needed. It was
interesting to note that recent memory was forgotten much more quickly for the sentence and
fill-in word pair than the recency effect of the fill-in word across sentences, which
persisted longer.
4
5
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Participants and Data Screening</title>
      <p>359 participants were included in the present analysis. Demographics surveyed from a
subset of the participants (N=133) indicated that this population is approximately 87%
female and has a median age of 33 years old (SD = 9.1 years). 63% of respondents
reported being African American. Practice attempts were only included up to the fifth
repetition for each sentence/fill-in pair. Otherwise, all data were included.</p>
    </sec>
    <sec id="sec-4">
      <title>Model Fit</title>
      <p>To illustrate the importance (or lack thereof) for each component of the model, we
computed the subject mean R2 differences for models that excluded 1 variable at a time
from Equation 1 and for a dominance analysis used to compute the avg contribution of
the parameter if used in all possible compositions of the 7 terms (excluding semester
intercepts).</p>
      <p>
        The results in Table 1 show that the importance of each component varies greatly.
For example, the effect of failures on long-term learning is extremely small (slightly
negative coefficient in this case indicates it is likely overfit or not significant). This
term may be removed in future versions, despite some indication it was positive in our
early results. This situation illustrates the complexity of the model since this term is
almost certainly negative because it is multicollinear with all 4 recency terms (2 for
performance and 2 for temporal recency), which means we cannot conclude that
longterm learning is actually near 0 for failures. It is more likely that the recency terms
capture the short-term effects of failure well enough and that the feedback effect of
review may be forgotten quickly. This result agrees well with theories of forgetting that
suggest that successful testing results in durable learning. In contrast, passive study
from a review is forgotten more quickly (e.g. [
        <xref ref-type="bibr" rid="ref37">37</xref>
        ]).
6
      </p>
    </sec>
    <sec id="sec-5">
      <title>Application Model to Pedagogical Optimality</title>
      <p>
        Our project is guided by the assumption that there is an optimal sequence in which
practice items should be practiced for a given learning task. This optimal sequence is
also assumed to be unique to the student and varies dynamically according to their prior
practice history and performance. Historically, this has been achieved by using prior
practice history as an input to algorithms that make pedagogical decisions. For instance,
Smallwood [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] used prior performance (e.g., correctness percentage, learning rate) on
mathematics problems to decide whether students were ready for new (more
challenging) problems, should continue on the current problem type, or perhaps needs to revisit
prior content. Additionally, our system's design assumes that there is an optimal
difficulty at which to practice for a given learning task. In this case, we are operationalizing
difficulty as the probability of correctly answering a test question. The relevance of
difficulty for practice optimality has been researched extensively in psychology (e.g.
[
        <xref ref-type="bibr" rid="ref38">38</xref>
        ]). A general conclusion has been that imposing some difficulty benefits learning
[
        <xref ref-type="bibr" rid="ref39">39</xref>
        ], but researchers disagree about how much difficulty to impose. Some have argued
that imposing greater difficulty provides optimal learning benefits [
        <xref ref-type="bibr" rid="ref3 ref40">3, 40</xref>
        ]. One
justification for this approach is known as Discrepancy Reduction Theory, in which whatever
content is least known should be practiced because that item can provide the largest
potential learning gains. There is truth to this idea; learning gains are higher with more
difficulty [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], although the benefit is not universal (e.g.[
        <xref ref-type="bibr" rid="ref41">41</xref>
        ]). However, a frequently
overlooked variable that consistently correlates with difficulty is practice time. As
difficulty increases, the time to complete the task or recall the information also increases
[
        <xref ref-type="bibr" rid="ref42 ref43">42, 43</xref>
        ]. There are at least two reasons for this consistent finding. First, better-learned
information (in the form of skills or memories) is recalled more quickly [
        <xref ref-type="bibr" rid="ref44 ref45">44, 45</xref>
        ].
Second, as difficulty increases, so does the risk of failure. When failures occur, feedback
is necessary for most practical contexts. This feedback is time-consuming and may not
be necessary if the information is successfully recalled. Thus, feedback time is
primarily associated with failures and can dramatically increase the time cost of practice. Note
that despite costing more time, failures do not necessarily confer more learning [
        <xref ref-type="bibr" rid="ref46">46</xref>
        ]. In
short, imposing more difficulty may become inefficient even if per-trial it appears to
provide more learning (e.g., [
        <xref ref-type="bibr" rid="ref16 ref9">9, 16</xref>
        ]).
      </p>
      <p>
        Considering the efficiency of pedagogical decisions can improve learning outcomes
[
        <xref ref-type="bibr" rid="ref16 ref9">9, 16</xref>
        ]. Rather than schedule practice according to whichever item would provide the
most gain, Pavlik &amp; Anderson [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] instead had students practice whichever item
provided the most gain per second. As a result, students’ practice time was more efficient,
and they completed many more practice trials. Furthermore, students that practiced
according to this algorithm had significantly higher memory retention than alternative
algorithms based on discrepancy reduction theories [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and other scheduling heuristics.
In our A &amp; P project, efficiency (utility) was determined by computing an optimal gain
curve using Equation 2.
(2)
Confirming the efficacy of this method, Eglington and Pavlik [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] simulated student
performance at various difficulties to determine an optimal difficulty empirically.
Simulated student practice was estimated using a logistic regression model parameterized
by fitting an existing dataset. They tested the simulation predictions by having students
learn Japanese-English word pairs in which the same model scheduled practice at
various difficulties. Practicing at relatively low difficulty was found to be most beneficial
for memory retention in contrast with prevailing theories [
        <xref ref-type="bibr" rid="ref20 ref21">20, 21</xref>
        ]. Importantly, they
found that a discrepancy reduction approach (focusing practice on more challenging
items) was both significantly worse than practicing at a lower difficulty (higher
efficiency) and also not significantly better than a non-adaptive control condition. It is
important to note that the benefit of practicing at low difficulty is partially due to the
learning materials - cloze learning trials are typically fast when answered correctly but
relatively slow when incorrect due to the necessity of providing feedback. In short, the
optimal difficulty may vary depending on the learning context.
      </p>
      <p>
        In order to schedule practice efficiently, the time cost for correct vs. incorrect
attempts must be known. Response time for successful recall is well fit by several models
(e.g., [
        <xref ref-type="bibr" rid="ref47 ref48">47, 48</xref>
        ]). Incorrect response times are harder to model, but using median
durations is effective [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. We used prior data from students completing a similar cloze task
to model correct and incorrect response times. Using this response time data, we
computed the optimally efficient threshold (OET) to practice, with our goal to use that
difficulty (operationalized as the correctness probability) to schedule practice within
MoFaCTS. For example, if the OET were determined to be .6, then on each trial, the
system would estimate the correctness probability for each potential cloze item (based
on that students’ practice history) and choose the item closest to .6. To compute the
OET, we needed to compute learning gains as a function of correctness probability
(difficulty) and divide those gains by the estimated time cost of practicing at that
difficulty. We computed learning gain by fitting our logistic regression model, which
estimates the long-term learning from practicing an item and being correct and practicing
and being incorrect. There is a different time cost for each of these potential outcomes,
the estimated trial duration for being correct or incorrect. Together, these computations
give us an efficiency score (or utility) for each value of correctness probability (see
Figure 2). In other words, the learning benefits of being correct or incorrect are linked
to the time it takes for either outcome.
Fig. 3 Shows the fit of the model with 16 parameters characterizing the results across
the four semesters for a split of the data into participants with mean performance greater
than and mean performance less than 50%. We do not know the system's efficacy at
this time since we have not begun efficacy testing. We see an encouraging correlation
between system usage and performance of all other work in the courses, shown in Fig.
4. However, as Fig. 3 suggests, we are falling short of our goal of providing practice
near the OET for participants performing at less than 50%, while participants with
&gt;50% performance are averaging near the 75% OET (which was estimated at 72% in
prior semesters). These results clarify that we have improvements to make if we are to
serve better the approximately half of our population (51%) that produced means less
than 50%. We suspect that the problem may have been caused by our prior model,
suggesting a much higher gain for failures than we found after introducing changes in
the model structure after Fall 2020 (the model in this paper). Perhaps not coincidentally,
Fall 2020 coincided with making the system mandatory, which likely also meant the
bottom 50% mean students may have been less motivated, unlike the extra credit
samples from the prior semesters.
Our system’s scheduling algorithm extends beyond simpler scheduling flashcard
paradigms (e.g., the Leitner method) and models (mastery learning with standard BKT)
using a model that allows the spaced sequencing of content according to rich features of
the student history. Such a practice system may be a useful component of many
iTextbooks due to the importance of declarative and conceptual facts in many academic
domains. In our development research, we use cloze items as our primary trial type, which
entails providing learners with sentences with keywords omitted that they must fill in.
Future work plans to begin including outside of practice factors from our student
surveys to influence our model and reduce the cold start problem. In addition, we are
currently developing new contextual and semantic features to add to the system. The
contextual features include data like the class of the fill-in-response (e.g., content vs.
connector word) and its importance in the content domain. Semantic features are
conceptual connections between sentence items independent of the already tracked fill-in-word
and represent new KCs that should influence the model.
9
      </p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This work was supported by the Institute of Education Sciences (IES; R305A190448).
Any opinions, findings, and conclusions, or recommendations expressed in this
material are those of the authors and do not necessarily reflect the views of IES. This work
was also supported by the Learner Data Institute project (NSF; 1934745) and the
University of Memphis Institute for Intelligent Systems.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Smallwood</surname>
          </string-name>
          , R.D.:
          <article-title>A Decision Structure for Teaching Machines</article-title>
          . MIT Press, Cambridge (
          <year>1962</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Borko</surname>
            ,
            <given-names>H.: A. A.</given-names>
          </string-name>
          <string-name>
            <surname>Lumsdaine</surname>
          </string-name>
          and Robert Glaser (Editors).
          <article-title>Teaching Machines and Programmed Learning: A Source Book</article-title>
          . Washington, D. C.: Department of Audio-Visual
          <string-name>
            <surname>Instruction</surname>
          </string-name>
          ,
          <source>National Education Association</source>
          ,
          <year>1960</year>
          . 724 Pp. Behavioral Science:
          <volume>7</volume>
          ,
          <fpage>479</fpage>
          -
          <lpage>480</lpage>
          . (
          <year>1962</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Atkinson</surname>
          </string-name>
          , R.C.
          <article-title>: Optimizing the Learning of a Second-Language Vocabulary</article-title>
          .
          <source>Journal of experimental psychology: 96</source>
          ,
          <fpage>124</fpage>
          -
          <lpage>129</lpage>
          . (
          <year>1972</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Tatsuoka</surname>
            ,
            <given-names>K.K.</given-names>
          </string-name>
          :
          <article-title>Rule Space: An Approach for Dealing with Misconceptions Based on Item Response Theory</article-title>
          . Journal of Educational Measurement:
          <volume>20</volume>
          ,
          <fpage>345</fpage>
          -
          <lpage>354</lpage>
          . (
          <year>1983</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Skinner</surname>
            ,
            <given-names>B.F.: Teaching</given-names>
          </string-name>
          <string-name>
            <surname>Machines</surname>
          </string-name>
          .
          <source>The Review of Economics</source>
          and Statistics:
          <volume>42</volume>
          ,
          <fpage>189</fpage>
          -
          <lpage>191</lpage>
          . (
          <year>1960</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Corbett</surname>
            ,
            <given-names>A.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Anderson</surname>
            ,
            <given-names>J.R.</given-names>
          </string-name>
          :
          <article-title>Knowledge Tracing: Modeling the Acquisition of Procedural Knowledge. User Modeling</article-title>
          and
          <string-name>
            <surname>User-Adapted Interaction</surname>
          </string-name>
          :
          <volume>4</volume>
          ,
          <fpage>253</fpage>
          -
          <lpage>278</lpage>
          . (
          <year>1994</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Olney</surname>
            ,
            <given-names>A.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pavlik</surname>
            ,
            <given-names>P.I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maass</surname>
            ,
            <given-names>J.K.</given-names>
          </string-name>
          :
          <article-title>Improving Reading Comprehension with Automatically Generated Cloze Item Practice</article-title>
          . In: André,
          <string-name>
            <given-names>E.</given-names>
            ,
            <surname>Baker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Hu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            ,
            <surname>Rodrigo</surname>
          </string-name>
          , M.M.T.,
          <string-name>
            <surname>du Boulay</surname>
          </string-name>
          , B. (eds.):
          <source>Proceedings of Artificial Intelligence in Education: 18th International Conference</source>
          ,
          <volume>262</volume>
          -
          <fpage>273</fpage>
          . Springer International Publishing, Wuhan, China (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>Pavlik</given-names>
            <surname>Jr</surname>
          </string-name>
          .,
          <string-name>
            <given-names>P.I.</given-names>
            ,
            <surname>Olney</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.M.</given-names>
            ,
            <surname>Banker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Eglington</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Yarbro</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.:</surname>
          </string-name>
          <article-title>The Mobile Fact and Concept Textbook System (Mofacts)</article-title>
          .
          <source>21st International Conference on Artificial Intelligence in Education (Aied</source>
          <year>2020</year>
          ) Second Workshop on Intelligent Textbooks,
          <fpage>35</fpage>
          -
          <lpage>49</lpage>
          . In CEUR workshop proceedings (Vol.
          <volume>2674</volume>
          ). (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Eglington</surname>
            ,
            <given-names>L.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pavlik</surname>
            <given-names>Jr</given-names>
          </string-name>
          ,
          <string-name>
            <surname>P.I.</surname>
          </string-name>
          :
          <source>Optimizing Practice Scheduling Requires Quantitative Tracking of Individual Item Performance. npj Science of Learning: 5</source>
          ,
          <fpage>15</fpage>
          . (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Pimsleur</surname>
            ,
            <given-names>P.: A Memory</given-names>
          </string-name>
          <string-name>
            <surname>Schedule. The Modern Language Journal</surname>
          </string-name>
          :
          <volume>51</volume>
          ,
          <fpage>73</fpage>
          -
          <lpage>75</lpage>
          . (
          <year>1967</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Yan</surname>
            ,
            <given-names>V.X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schuetze</surname>
            ,
            <given-names>B.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eglington</surname>
            ,
            <given-names>L.G.</given-names>
          </string-name>
          :
          <article-title>A Review of the Interleaving Effect: Theories and Lessons for Future Research</article-title>
          . PsyArXiv,
          <fpage>1</fpage>
          -
          <lpage>39</lpage>
          . (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Lindsey</surname>
            ,
            <given-names>R.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shroyer</surname>
            ,
            <given-names>J.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pashler</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , Mozer,
          <string-name>
            <surname>M.C.</surname>
          </string-name>
          :
          <article-title>Improving Students' Long-Term Knowledge Retention through Personalized Review</article-title>
          .
          <source>Psychological Science</source>
          ,
          <volume>639</volume>
          -
          <fpage>647</fpage>
          . (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Leitner</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>So Lernt Man Lernen</article-title>
          . Herder, Freiburg im Breisgau, Germany (
          <year>1972</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Metzler-Baddeley</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baddeley</surname>
          </string-name>
          , R.J.: Does Adaptive Training Work? Applied Cognitive Psychology:
          <volume>23</volume>
          ,
          <fpage>254</fpage>
          -
          <lpage>266</lpage>
          . (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Wozniak</surname>
            ,
            <given-names>P.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gorzelanczyk</surname>
            ,
            <given-names>E.J.:</given-names>
          </string-name>
          <article-title>Optimization of Repetition Spacing in the Practice of Learning</article-title>
          .
          <source>Acta Neurobiologiae Experimentalis</source>
          :
          <volume>54</volume>
          ,
          <fpage>59</fpage>
          -
          <lpage>62</lpage>
          . (
          <year>1994</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Pavlik</surname>
            Jr.,
            <given-names>P.I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Anderson</surname>
            ,
            <given-names>J.R.</given-names>
          </string-name>
          :
          <article-title>Using a Model to Compute the Optimal Schedule of Practice</article-title>
          . Journal of Experimental Psychology: Applied:
          <volume>14</volume>
          ,
          <fpage>101</fpage>
          -
          <lpage>117</lpage>
          . (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Heffernan</surname>
            ,
            <given-names>N.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heffernan</surname>
            ,
            <given-names>C.L.</given-names>
          </string-name>
          :
          <article-title>The Assistments Ecosystem: Building a Platform That Brings Scientists and Teachers Together for Minimally Invasive Research on Human Learning and Teaching</article-title>
          .
          <source>International Journal of Artificial Intelligence in Education: 24</source>
          ,
          <fpage>470</fpage>
          -
          <lpage>497</lpage>
          . (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Kelly</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thompson</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heffernan</surname>
            ,
            <given-names>N.: Defining</given-names>
          </string-name>
          <string-name>
            <surname>Mastery: Knowledge Tracing Versus N-Consecutive</surname>
            <given-names>Correct</given-names>
          </string-name>
          <string-name>
            <surname>Responses</surname>
          </string-name>
          .
          <source>8th International Conference on Educational Data Mining</source>
          ,
          <fpage>39</fpage>
          -
          <lpage>46</lpage>
          . (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Murphy</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roschelle</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Feng</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mason</surname>
            ,
            <given-names>C.A.</given-names>
          </string-name>
          :
          <article-title>Investigating Efficacy, Moderators and Mediators for an Online Mathematics Homework Intervention</article-title>
          .
          <source>Journal of Research on Educational Effectiveness: 13</source>
          ,
          <fpage>235</fpage>
          -
          <lpage>270</lpage>
          . (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Settles</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meeder</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>A Trainable Spaced Repetition Model for Language Learning</article-title>
          .
          <source>Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume</source>
          <volume>1</volume>
          :
          <string-name>
            <surname>Long</surname>
            <given-names>Papers)</given-names>
          </string-name>
          ,
          <fpage>1848</fpage>
          -
          <lpage>1858</lpage>
          . Berlin, Germany (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Bjork</surname>
            ,
            <given-names>R.A.</given-names>
          </string-name>
          : In: Metcalfe,
          <string-name>
            <surname>J</surname>
          </string-name>
          . (ed.):
          <source>Metacognition: Knowing About Knowing</source>
          ,
          <fpage>185</fpage>
          -
          <lpage>205</lpage>
          . MIT Press (
          <year>1994</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koedinger</surname>
            ,
            <given-names>K.R.</given-names>
          </string-name>
          :
          <article-title>Towards Reliable and Valid Measurement of Individualized Student Parameters</article-title>
          .
          <source>International Educational Data Mining Society</source>
          . (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>J.I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brunskill</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          :
          <source>The Impact on Individualizing Student Models on Necessary Practice Opportunities. International Educational Data Mining Society</source>
          . (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Yudelson</surname>
            ,
            <given-names>M.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koedinger</surname>
            ,
            <given-names>K.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gordon</surname>
            ,
            <given-names>G.J.</given-names>
          </string-name>
          :
          <article-title>Individualized Bayesian Knowledge Tracing Models</article-title>
          . In: Lane,
          <string-name>
            <given-names>H.C.</given-names>
            ,
            <surname>Yacef</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Mostow</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Pavlik</surname>
          </string-name>
          , P. (eds.):
          <source>Artificial Intelligence in Education</source>
          ,
          <volume>171</volume>
          -
          <fpage>180</fpage>
          . Springer Berlin Heidelberg, Berlin, Heidelberg (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Eglington</surname>
            ,
            <given-names>L.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pavlik</surname>
            <given-names>Jr</given-names>
          </string-name>
          ,
          <string-name>
            <surname>P.I.</surname>
          </string-name>
          :
          <article-title>Systematic Prediction Errors from Individual Differences Are Inevitable and Treatable. (under review)</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Seli</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Risko</surname>
            ,
            <given-names>E.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smilek</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>On the Necessity of Distinguishing between Unintentional and Intentional Mind Wandering</article-title>
          . Psychological Science:
          <volume>27</volume>
          ,
          <fpage>685</fpage>
          -
          <lpage>691</lpage>
          . (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Kurzban</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Duckworth</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kable</surname>
            ,
            <given-names>J.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Myers</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>An Opportunity Cost Model of Subjective Effort and Task Performance</article-title>
          . Behav Brain Sci:
          <volume>36</volume>
          ,
          <fpage>661</fpage>
          -
          <lpage>679</lpage>
          . (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Seli</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Konishi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Risko</surname>
            ,
            <given-names>E.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smilek</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>The Role of Task Difficulty in Theoretical Accounts of Mind Wandering</article-title>
          . Conscious Cogn:
          <volume>65</volume>
          ,
          <fpage>255</fpage>
          -
          <lpage>262</lpage>
          . (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Steyvers</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Benjamin</surname>
            ,
            <given-names>A.S.:</given-names>
          </string-name>
          <article-title>The Joint Contribution of Participation and Performance to Learning Functions: Exploring the Effects of Age in Large-Scale Data Sets</article-title>
          . Behavior Research Methods:
          <volume>51</volume>
          ,
          <fpage>1531</fpage>
          -
          <lpage>1543</lpage>
          . (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Ritter</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Harris</surname>
            ,
            <given-names>T.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nixon</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dickison</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Murray</surname>
            ,
            <given-names>R.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Towle</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Reducing the Knowledge Tracing Space</article-title>
          .
          <source>International Working Group on Educational Data Mining</source>
          . (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <surname>Galyardt</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goldin</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Move Your Lamp Post: Recent Data Reflects Learner Knowledge Better Than Older Data</article-title>
          .
          <source>Journal of Educational Data Mining: 7</source>
          ,
          <fpage>83</fpage>
          -
          <lpage>108</lpage>
          . (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32.
          <string-name>
            <surname>Gong</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Beck</surname>
            ,
            <given-names>J.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heffernan</surname>
          </string-name>
          , N.T.:
          <article-title>How to Construct More Accurate Student Models: Comparing and Optimizing Knowledge Tracing and Performance Factor Analysis</article-title>
          .
          <source>International Journal of Artificial Intelligence in Education: 21</source>
          ,
          <fpage>27</fpage>
          -
          <lpage>46</lpage>
          . (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          33.
          <string-name>
            <surname>Cao</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pavlik</surname>
            <given-names>Jr</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>P.I.</given-names>
            ,
            <surname>Bidelman</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.M.</surname>
          </string-name>
          :
          <article-title>Incorporating Prior Practice Difficulty into Performance Factor Analysis to Model Mandarin Tone Learning</article-title>
          . In: Lynch,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Merceron</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Desmarais</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Nkambou</surname>
          </string-name>
          ,
          <string-name>
            <surname>R</surname>
          </string-name>
          . (eds.):
          <source>Proceedings of the 11th International Conference on Educational Data Mining</source>
          ,
          <fpage>516</fpage>
          -
          <lpage>519</lpage>
          . (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          34.
          <string-name>
            <surname>Pavlik</surname>
            <given-names>Jr</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>P.I.</given-names>
            ,
            <surname>Eglington</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.G.</given-names>
            ,
            <surname>Harrell-Williams</surname>
          </string-name>
          ,
          <string-name>
            <surname>L.M.</surname>
          </string-name>
          :
          <article-title>Logistic Knowledge Tracing: A Constrained Framework for Learner Modeling</article-title>
          . arXiv.org. (
          <year>2021</year>
          , preprint)
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          35.
          <string-name>
            <surname>Wixted</surname>
            ,
            <given-names>J.T.</given-names>
          </string-name>
          :
          <article-title>The Psychology and Neuroscience of Forgetting</article-title>
          . Annual Review of Psychology:
          <volume>55</volume>
          ,
          <fpage>235</fpage>
          -
          <lpage>269</lpage>
          . (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          36.
          <string-name>
            <surname>Fiechter</surname>
            ,
            <given-names>J.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Benjamin</surname>
            ,
            <given-names>A.S.</given-names>
          </string-name>
          :
          <article-title>Techniques for Scaffolding Retrieval Practice: The Costs and Benefits of Adaptive Versus Diminishing Cues</article-title>
          .
          <source>Psychonomic Bulletin &amp; Review: 26</source>
          ,
          <fpage>1666</fpage>
          -
          <lpage>1674</lpage>
          . (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          37.
          <string-name>
            <surname>Thompson</surname>
            ,
            <given-names>C.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wenger</surname>
            ,
            <given-names>S.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bartling</surname>
            ,
            <given-names>C.A.</given-names>
          </string-name>
          :
          <article-title>How Recall Facilitates Subsequent Recall: A Reappraisal</article-title>
          .
          <source>Journal of Experimental Psychology: Human Learning &amp; Memory: 4</source>
          ,
          <fpage>210</fpage>
          -
          <lpage>221</lpage>
          . (
          <year>1978</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          38.
          <string-name>
            <surname>Schmidt</surname>
            ,
            <given-names>R.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bjork</surname>
            ,
            <given-names>R.A.</given-names>
          </string-name>
          :
          <article-title>New Conceptualizations of Practice: Common Principles in Three Paradigms Suggest New Concepts for Training</article-title>
          . Psychological Science:
          <volume>3</volume>
          ,
          <fpage>207</fpage>
          -
          <lpage>217</lpage>
          . (
          <year>1992</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          39.
          <string-name>
            <surname>Koedinger</surname>
            ,
            <given-names>K.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aleven</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Exploring the Assistance Dilemma in Experiments with Cognitive Tutors</article-title>
          . Educational Psychology Review:
          <volume>19</volume>
          ,
          <fpage>239</fpage>
          -
          <lpage>264</lpage>
          . (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          40.
          <string-name>
            <surname>Pashler</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zarow</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Triplett</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Is Temporal Spacing of Tests Helpful Even When It Inflates Error Rates? Journal of Experimental Psychology: Learning, Memory</article-title>
          , and Cognition:
          <volume>29</volume>
          ,
          <fpage>1051</fpage>
          -
          <lpage>1057</lpage>
          . (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          41.
          <string-name>
            <surname>Vaughn</surname>
            ,
            <given-names>K.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kornell</surname>
          </string-name>
          , N.:
          <article-title>How to Activate Students' Natural Desire to Test Themselves</article-title>
          .
          <source>Cognitive Research: Principles and Implications: 4</source>
          ,
          <fpage>35</fpage>
          . (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          42.
          <string-name>
            <surname>Rickard</surname>
          </string-name>
          , T.C.
          <article-title>: Bending the Power Law: A Cmpl Theory of Strategy Shifts and the Automatization of Cognitive Skills</article-title>
          . Journal of Experimental Psychology: General:
          <volume>126</volume>
          ,
          <fpage>288</fpage>
          -
          <lpage>311</lpage>
          . (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          43.
          <string-name>
            <surname>Pyc</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rawson</surname>
            ,
            <given-names>K.A.</given-names>
          </string-name>
          :
          <article-title>Testing the Retrieval Effort Hypothesis: Does Greater Difficulty Correctly Recalling Information Lead to Higher Levels of Memory? Journal of Memory</article-title>
          and Language:
          <volume>60</volume>
          ,
          <fpage>437</fpage>
          -
          <lpage>447</lpage>
          . (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          44.
          <string-name>
            <surname>Peterson</surname>
            ,
            <given-names>L.R.</given-names>
          </string-name>
          :
          <article-title>Paired-Associate Latencies after the Last Error</article-title>
          . Psychonomic Science:
          <volume>2</volume>
          ,
          <fpage>167</fpage>
          -
          <lpage>168</lpage>
          . (
          <year>1965</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          45.
          <string-name>
            <surname>Schlag-Rey</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Groen</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suppes</surname>
            ,
            <given-names>P.:</given-names>
          </string-name>
          <article-title>Latencies on Last Error in Paired-Associate Learning</article-title>
          . Psychonomic Science:
          <volume>2</volume>
          ,
          <fpage>15</fpage>
          -
          <lpage>16</lpage>
          . (
          <year>1965</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          46.
          <string-name>
            <surname>Vaughn</surname>
            ,
            <given-names>K.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hausman</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kornell</surname>
          </string-name>
          , N.:
          <article-title>Retrieval Attempts Enhance Learning Regardless of Time Spent Trying to Retrieve</article-title>
          . Memory:
          <volume>25</volume>
          ,
          <fpage>298</fpage>
          -
          <lpage>316</lpage>
          . (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          47.
          <string-name>
            <surname>Anderson</surname>
            ,
            <given-names>J.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lebiere</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>The Atomic Components of Thought</article-title>
          . Lawrence Erlbaum Associates, Mahwah, NJ (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          48.
          <string-name>
            <surname>Anderson</surname>
            ,
            <given-names>J.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fincham</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Douglass</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>The Role of Examples and Rules in the Acquisition of a Cognitive Skill</article-title>
          .
          <source>Journal of Experimental Psychology: Learning</source>
          , Memory, &amp; Cognition:
          <volume>23</volume>
          ,
          <fpage>932</fpage>
          -
          <lpage>945</lpage>
          . (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>