<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Design of a User-Interpretable Math Quiz Recommender System for Japanese High School Students</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Yiling Dai</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Brendan Flanagan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kyosuke Takami</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hiroaki Ogata</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Academic Center for Computing and Media Studies, Kyoto University</institution>
          ,
          <country country="JP">Japan</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2022</year>
      </pub-date>
      <fpage>21</fpage>
      <lpage>22</lpage>
      <abstract>
        <p>In the context of K-12 math education, identifying the quizzes of the appropriate difficulty is essential to improve the students' understanding of math concepts. In this work, we propose a quiz recommender system which not only considers the difficulty of the quiz but also the expected learning outcome of solving that quiz. To increase the students' motivation of accepting the recommendations, our system provides interpretable information, i.e., the difficulty and expected learning gain of a recommendation, to the students. We conducted a pilot to implement this recommender system in a Japanese high school classroom. Overall, the log data showed a low rate of usage. We summarized challenges in implementing the recommender system in our specific setting, which help direct the future development of the system and its evaluation at a larger scale.</p>
      </abstract>
      <kwd-group>
        <kwd>1 Adaptive learning</kwd>
        <kwd>K-12 math education</kwd>
        <kwd>Quiz recommendation</kwd>
        <kwd>User-interpretable</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        In the context of K-12 math education, identifying the quizzes of the appropriate difficulty is
essential to improve the students’ understanding of math concepts. Previous knowledge tracing works
have focused on estimating the knowledge states of the students and predicting students’ performance
of the learning materials [
        <xref ref-type="bibr" rid="ref2 ref5">2, 5, 7, 8, 11</xref>
        ]. This is based on an assumption that learning happens when
students attempt tasks in the zone of proximal development [13], i.e., the tasks they cannot achieve by
themselves but can achieve with assistance. However, these works did not consider how knowledge
states are improved and what improvement brought by the “proximal” learning materials should be
prioritized. In this work, we propose a quiz recommender system which not only considers the difficulty
of the quiz but also the expected learning outcome of solving that quiz. Among various learning
outcomes, we focus on the average improvement of the understanding of related math concepts. By
doing so, a quiz that helps the student to practice weaker concepts will be prioritized in the
recommendation.
      </p>
      <p>
        Recent adaptive learning systems have attempted to recommend learning materials based on
complex methods such as deep learning methods [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and reinforcement learning methods [12].
However, the mechanism and output of these methods are difficult to interpret, which may lead to the
decrease of students’ beliefs that they are able to do the task and their perceived values of completing
the task, which further decreases their motivation to participate [14]. Being intuitive and simple, our
recommender system is able to provide interpretable information, i.e., the difficulty and expected
learning gain of a recommendation, to the students.
      </p>
      <p>
        Some works proposed educational recommender systems with explanations for different subjects,
different types of learning tasks, and different levels of education [
        <xref ref-type="bibr" rid="ref1">1, 9</xref>
        ]. Consequently, such
recommender systems are context-specific and difficult to be applied in our case. We conducted a pilot
to implement our proposed recommender system in a Japanese high school classroom. By analyzing
the log and questionnaire data, we identified some challenges which help direct the future development
of the system and its evaluation at a larger scale.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. System overview</title>
      <p>(a) Recommender system and other modules.</p>
      <p>Figure 1: An overview of the learning system.
(b) Interface of BookRoll.</p>
      <p>
        As Figure 1a shows, our recommender system is built on an integrated learning system where an
ebook reading module called BookRoll [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] works to register and present the quizzes, and a learning
record store (LRS) module stores the students’ answers of the quizzes. The interface of viewing and
answering the quizzes is shown in Figure 1b. In this work, we only record whether the student correctly
answered the quiz for each attempt. In the recommendation module, we first extract the necessary
concepts for the registered quizzes. Then, we estimate the students’ mastery levels on the concepts
based on their answers on the quizzes, which is further utilized to recommend quizzes that complement
and extend their knowledge. In the following section, we describe the mechanism of the recommender
system in detail.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Proposed recommender system</title>
    </sec>
    <sec id="sec-4">
      <title>3.1. Problem definition</title>
      <p>Suppose we have a student  and | | quizzes. We model the student’s attempts on the quizzes
as   = {( 1,  1), . . . , (  ,   )}, where   is the student’s response to   at step  .  equals 1 if the
answer is correct and 0 otherwise. Our goal is to select and rank a subset of the quizzes that improve
the student’s knowledge acquisition. We denote it as  ( |  ) = ( 1,  2, . . . ,   ), where n
is a predefined number of quizzes to recommend. In the following sections, we describe the procedures
in detail with a running example showed in Table 1.</p>
    </sec>
    <sec id="sec-5">
      <title>3.2. Procedure</title>
      <p>3.2.1. Extracting underlying concepts.</p>
      <p>∠ = 60°  +  = 1.  is the midpoint of  . What is the length of 
such that the length of line segment  is minimum.</p>
      <p>Solving a math quiz requires knowledge of related concepts. For instance, to solve  1 in Table 1, the
students should know and be able to apply the knowledge of proof, law of sines, and etc. Understanding
how students master the underlying concepts of the quizzes help us estimate their knowledge states
more precisely, therefore, provide better remedial strategies.</p>
      <p>Identifying the concepts required to solve the quizzes is not an easy task. Some of the concepts can
be identified in the textual information of the quiz and its standard answer while some cannot. In this
work, we take an initiative attempt to utilize the noun phrases as the concepts. For example, we deem
proof, equality, and triangle as necessary concepts to solve  1. We denote the underlying concepts of
a quiz set as  . For each pair of  and  , 

( ,  ) denotes the relatedness between a quiz and
a concept, which indicates the degree to which the knowledge of a concept is necessary in solving the
quiz. For preprocessing, we use pdftotext2 to extract the plain texts of quizzes from PDF format. Then,
we adopt Janome3 to parse the Japanese terms in the quiz text. Last, we use a classic term weighting
method TFIDF [10] to compute</p>
      <p>( ,  ), which is implemented in scikit-learn4.</p>
    </sec>
    <sec id="sec-6">
      <title>3.2.2. Estimating mastery level on concepts.</title>
      <p>( |  ) = the correctness rate of  in a student’s attempts</p>
      <p>As illustrated in Figure 2, we compute the mastery level on each concept by looking at how the
student answered the quizzes which require the knowledge of this concept. Suppose the student had
attempted three quizzes  1,  2, and  3, all of which require the knowledge of  1. However, the student
failed on  2 at the first attempt and it is not counted in the computation of mastery level on  1. Note
that for the unseen quizzes, their requirement on the concepts is not considered in the computation. In
other words, the mastery level only reflects the student’s understanding of the concepts in the context
of quizzes s/he had attempted.
where:
   = the set of quizzes in a student’s attempts
 ( ,1) = the correct attempts on 
 ( ) = all the correct attempts on</p>
    </sec>
    <sec id="sec-7">
      <title>3.2.3. Estimating quiz difficulty and expected learning gain.</title>
      <p>To improve the acquisition of knowledge, the student needs to practice on quizzes that are of
appropriate difficulty and provide learning gain at the same time. Therefore, we propose two criteria—
quiz difficulty and expected learning gain— to decide which quizzes to recommend. Quiz difficulty
reflects the probability that the student will give a wrong answer. As shown in Equation (2) and Figure
3, it is inferred from the student’s mastery level on the concepts required in this quiz. Taking  1 for
example, the students’ mastery level on  1,  2, and  4 indicates how possible s/he is able to solve  1.
By subtracting it from 1, we obtain the probability that the student will fail to solve  1, i.e., the difficulty
of  1. Note that for the unseen concepts, we set the mastery level as 0 to simplify the computation.
(2)
(3)</p>
      <p>( , )</p>
      <p>The expected learning gain of a quiz at step  is the average mastery level update on the concepts if
the student successfully solves the quiz at the next step  + 1, as computed in Equation (3).
_
_
( |  ) =
∑ ∈  (
_
( |  ∪( ,1))−
_</p>
      <p>( |  ))
|  |
where   is the set of concepts required in  . As illustrated in Figure 3, when computing the expected
learning gain of  2, we first update the student’s mastery level on concepts assuming s/he successfully
solved  2. Then the total improvement on the mastery level is normalized by the number of concepts
required in  2.</p>
    </sec>
    <sec id="sec-8">
      <title>3.2.4. Recommending quizzes.</title>
      <p>We recommend quizzes based on the estimated difficulty and expected learning gain. Since both of
quiz difficulty and expected learning gain are computed from students’ mastery level of the concepts,
they are intertwined with each other. Intuitively, a more difficult quiz may provide more learning gain
if the student successfully solved it. Meanwhile, it requires the students to have more tolerance of
uncertainty and to seek extra assistance in solving the quiz. Therefore, we adopt a recommending policy
that the quiz is neither too difficult nor too easy to the students, yet provides learning gain as much as
possible. As a result, the quiz to be recommended is not necessarily the one with the highest expected
learning gain.</p>
      <p>To do so, we select the quizzes in a proper range of difficulty and then rank them based on the
expected learning gain in descendant order. Given a predefined number  of quizzes to recommend, the
recommended set of quizzes and their order is  ( |  ) = ( 1, . . . ,   ) such that for any
1 ≤  ≤  ,  ≤  _ (  |  ) ≤  , and for any 1 ≤  ≤  ≤  ,
 _ _ (  |  ) ≥  _ _ (  |  ) , where  and  are the
thresholds of difficulty. In the example of Figure 3, given the estimated difficulty and expected learning
gain, we recommend  2 if the acceptable difficulty is no greater than 0.3.
3.3.</p>
    </sec>
    <sec id="sec-9">
      <title>Strengths and limitations</title>
      <p>Compared with previous works that recommend quizzes (or exercises in other contexts), our
proposed recommender system has the following strengths:
 Our recommendations are user-interpretable.</p>
      <p>
        As described in Section 3.2, every step in the recommender system is intuitive and simple, which
makes it easy to provide explanations for the recommended quizzes. For example, we recommend
 2 to the student since s/he does not fully understand  3 and by attempting  2 s/he may deepen the
understanding of  1 and  3. This type of information is difficult to extract in deep knowledge tracing
methods [
        <xref ref-type="bibr" rid="ref5 ref6">5–8</xref>
        ] since the quizzes and knowledge states are usually embedded as vectors and
processed in complex computation.
 Our model takes the future learning gain into consideration.
      </p>
      <p>In a knowledge tracing model, the main goal is to model and trace the changes of knowledge states
when students attempt quizzes. As a result, these models output the probability that a student can
solve a quiz successfully while they do not consider whether and how the knowledge state improves
by solving the quiz. In contrast, our model recommends quizzes taking the expected learning gain
into account, which is an effort to optimize the learning outcomes.
 Our model does not rely on the data of other students.</p>
      <p>Unlike data-driven methods which rely on a large dataset of student attempts on the quizzes, our
model is content-based and is able to recommend quizzes merely using the data of one student. This
is important when implemented in a real classroom setting that usually does not have sufficient
student data on the quizzes.
 Our model is flexible and easy to modify.</p>
      <p>
        Except for the two criteria we use in this work, our model is flexible to accept other criteria and
modifications. For example, we can recommend a quiz which improves the understanding of the
weakest concept instead of the overall understanding of the concepts. Or as in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], we can set a target
of concepts as a learning goal, then the quizzes supporting the understanding of these concepts will
be prioritized.
      </p>
      <p>Being intuitive and simple brings limitations too. For example, we do not consider the students’
ability to apply their previous knowledge when solving a new quiz. Besides, we simply assume the
students know nothing about unseen quizzes or concepts, which is not the case in real situations since
the unseen concepts may be related to the known concepts. As described in Section 3.2.1, we focused
on explicit noun phrases as the necessary concepts for solving a quiz. To obtain more underlying
concepts, methods such as topic models can be integrated in our recommendation framework. However,
interpreting the latent vectors could be another challenge. To address these limitations, considering the
relationships between concepts and introducing parameters that model more complex learning
behaviors are future works.</p>
    </sec>
    <sec id="sec-10">
      <title>4. Pilot in high school classroom</title>
      <p>We conducted a pilot of the proposed recommender system in a real classroom setting as preparation
for a well-designed comparative experiment. In the pilot, one class of students of a Japanese high school
was invited to use the recommender system to solve quizzes during summer vacation from July 20th to
August 23rd, 2021. The teacher had assigned 54 quizzes and asked the students to finish the quizzes
and check the answers by themselves. Note that the students were highly recommended to solve the
quizzes and report their answers in the learning system. However, they were not required to do so since
they also can choose to solve quizzes in paper-based textbooks. In the BookRoll system, the students
can access the quizzes from a book directory, a list of quiz assignment, or from the recommender system
tab. While the students started to attempt the quizzes in the system, we recommended the assigned
quizzes accordingly. We set the acceptable quiz difficulty in the range of [0.1, 0.6]. Figure 4 shows the
interface of the recommender system where the estimated mastery level and the reasons why a quiz is
recommended were shown to the students. During the summer vacation, the log data of quiz answers
and recommendation clicks were collected5. After the summer vacation, we conducted a questionnaire
survey on the students’ perceptions of the recommender system.
4.1.</p>
    </sec>
    <sec id="sec-11">
      <title>Students’ reactions</title>
      <p>Table 2 shows the statistics of the system usage during the pilot period. 57.9% of the students
accessed the learning system and only two students ever clicked the recommended quizzes. To further
investigate the reason of the usage of the recommender system, we analyzed the questionnaire survey.
The questionnaire contains 42 questions regarding the students’ perceptions of the recommender system
and attitudes towards math learning. Among the questions, 8 are descriptive and 34 are 5-likert scale
questions, which were found to have good reliability (  = 0.836 ). 30 students answered the
questionnaire and 3 incomplete answers were excluded in the following analysis. We separated the
students into two groups based on whether they reported that they ever used the recommender system.
The students whose answers were equal to or greater than 3 are treated as self-reported users ( = 6),
while the rest of the students are treated as self-reported non-users ( = 21). Note that the following
analysis is based on an assumption that we “trust” the self-reported results. We then conducted 2-tailed
t-test on the answers of the other questions between these two groups. Table 3 lists the questions that
had a significant difference ( &lt; 0.05) between the two groups. We found that the self-reported users
5 The log data of quiz answers during August 12th and August 18th was not recorded due to a system bug.
demonstrated more positive attitudes towards the usefulness of the recommender system, more trust
towards the explanations and the recommender system, and more motivation to do math quizzes.
However, we are aware of the limitations such as the small sample size and the low reliability of
selfreported results.
because of the recommender</p>
      <p>The system pushed me to do math 3.67
4.2.</p>
    </sec>
    <sec id="sec-12">
      <title>Challenges</title>
      <p>system in our specific classroom setting:</p>
      <p>Based on the results of the pilot, we found the following challenges to implement a recommender

</p>
      <p>The inertia of the students to do quizzes in paper-based methods.</p>
      <p>Doing math quizzes and self-checking the answers require a large visual space to write down the
answer and then compare it with the standard answer. The current e-book reading system and the
device could not fully support this usage, which leads to extra work for the students to report their
answers in the system after solving them in the paper version. As some of the students reported in
the questionnaire, it was annoying to find the same quiz in digital-version and the paper-version. To
encourage more usage in the digital version, we need to improve the convenience of the e-book
reading system so that the students can easily switch between both versions.</p>
      <p>The compatibility with the standard teaching schedule.</p>
      <p>As a request from the teacher, we limited the recommendations to an assigned set of quizzes in the
pilot. This reduces the meaning to use the recommender system as the students had to finish all the
quizzes at the end. However, there does exist a concern that if the recommended quizzes for each
student are distributed over diverse topics, the teacher may fail to keep a standard teaching schedule
sd
and to give feedback instantly. In the future, we need to balance the standard teaching schedule and
the personalized learning.
 The definition of the usage of the recommender system.</p>
      <p>
        In our recorded log data, only two students clicked the recommended quizzes. However, 6 students
reported they ever used the recommender system. One of the two students who clicked the
recommended quizzes reported s/he never used the recommender system. Obviously, there is a gap
between our and the students’ perceptions of “usage”. Besides, the students may provide unreal
answers both intentionally and unintentionally. As Fredricks et.al [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] suggest, engagement is a
multifaceted construct that relates to behavior, emotion, and cognition. In future work, we plan to
record more students’ engagement with the system in a more objective manner.
      </p>
    </sec>
    <sec id="sec-13">
      <title>5. Conclusions and future works</title>
      <p>In this work, we proposed a quiz recommender system which not only considers the difficulty of the
quiz but also the expected learning outcome of solving that quiz. To increase the students’ motivation
of accepting the recommendations, our system provided interpretable information, i.e., the difficulty
and expected learning gain of a recommendation, to the students. Our system has advantages over
existing methods as a) the recommendations are user-interpretable, b) the future learning gain is
considered, c) it does not rely on large sets of student data, and d) it is flexible to modifications.</p>
      <p>We also conducted a pilot to implement this recommender system in a Japanese high school
classroom. The log data demonstrated a low usage of the system during the pilot. However, we did find
some positive signals between the attitudes toward the system and the self-reported usage of it from the
limited questionnaire data. More importantly, we identified some challenges such as the inconvenience
of the current e-book reading system, the incompatibility with the standard teaching schedule, and the
gap between the intended usage of the system and the students’ perceptions of it. In future work, we
plan to redesign the evaluation experiment with more clear instructions and more thorough and
objective ways to record students’ engagement and learning performance improvement.</p>
    </sec>
    <sec id="sec-14">
      <title>6. Acknowledgements</title>
    </sec>
    <sec id="sec-15">
      <title>7. References</title>
      <p>This work was partly supported by JSPS Grant-in-Aid for Scientific Research (B) 20H01722, JSPS
Grant-in-Aid for Scientific Research (Exploratory) 21K19824, JSPS Grant-in-Aid for Scientific
Research (S) 16H06304 and NEDO JPNP20006 and JPNP18013.
[7] Qi Liu, Zhenya Huang, Yu Yin, Enhong Chen, Hui Xiong, Yu Su, and Guoping Hu. 2021. EKT:
Exercise-Aware Knowledge Tracing for Student Performance Prediction. IEEE Transactions on
Knowledge &amp; Data Engineering 33, 01 (2021), 100–115.
[8] Hiromi Nakagawa, Yusuke Iwasawa, and Yutaka Matsuo. 2019. Graph-Based Knowledge
Tracing: Modeling Student Proficiency Using Graph Neural Network. In Proceedings of 2019
IEEE/WIC/ACM International Conference on Web Intelligence (WI). 156–163.
[9] Behnam Rahdari, Peter Brusilovsky, Khushboo Thaker, and Jordan Barria-Pineda. 2020. Using
Knowledge Graph for Explainable Recommendation of External Content in Electronic Textbooks.
In Proceedings of the Second International Workshop on Intelligent Textbooks 2020 co-located
with 21st International Conference on Artificial Intelligence in Education (AIED 2020), Vol. 2674.
50–61.
[10] Gerard Salton and Christopher Buckley. 1988. Term-Weighting Approaches in Automatic Text</p>
      <p>Retrieval. Information Processing &amp; Management 24, 5 (1988), 513–523.
[11] Xia Sun, Xu Zhao, Bo Li, Yuan Ma, Richard Sutcliffe, and Jun Feng. 2021. Dynamic Key-Value
Memory Networks With Rich Features for Knowledge Tracing. IEEE Transactions on Cybernetics
(2021), 1–7.
[12] Xueying Tang, Yunxiao Chen, Xiaoou Li, Jingchen Liu, and Zhiliang Ying. 2019. A
Reinforcement Learning Approach to Personalized Learning Recommendation Systems. Brit. J.</p>
      <p>Math. Statist. Psych. 72, 1 (2019), 108–135.
[13] Lev S. Vygotsky. 1978. Interaction between Learning and Development. 79–91.
[14] Allan Wigfield and Jacquelynne S. Eccles. 2000. Expectancy-Value Theory of Achievement
Motivation. Contemporary Educational Psychology 25, 1 (2000), 68–81.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Jordan</given-names>
            <surname>Barria-Pineda</surname>
          </string-name>
          , Kamil Akhuseyinoglu, Stefan Želem-Ćelap,
          <string-name>
            <given-names>Peter</given-names>
            <surname>Brusilovsky</surname>
          </string-name>
          , Aleksandra Klasnja Milicevic, and
          <string-name>
            <given-names>Mirjana</given-names>
            <surname>Ivanovic</surname>
          </string-name>
          .
          <year>2021</year>
          .
          <article-title>Explainable Recommendations in a Personalized Programming Practice System</article-title>
          .
          <source>In Artificial Intelligence in Education</source>
          .
          <volume>64</volume>
          -
          <fpage>76</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Albert</surname>
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Corbett and John R. Anderson</surname>
          </string-name>
          .
          <year>1994</year>
          .
          <article-title>Knowledge Tracing: Modeling the Acquisition of Procedural Knowledge. User Modeling and User-</article-title>
          <source>Adapted Interaction 4</source>
          ,
          <issue>4</issue>
          (
          <year>1994</year>
          ),
          <fpage>253</fpage>
          -
          <lpage>278</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Brendan</given-names>
            <surname>Flanagan</surname>
          </string-name>
          and
          <string-name>
            <given-names>Hiroaki</given-names>
            <surname>Ogata</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Learning Analytics Platform in Higher Education in Japan</article-title>
          .
          <source>Knowledge Management &amp; E-Learning</source>
          <volume>10</volume>
          ,
          <issue>4</issue>
          (
          <year>2018</year>
          ),
          <fpage>469</fpage>
          -
          <lpage>484</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Jennifer</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Fredricks</surname>
          </string-name>
          ,
          <string-name>
            <surname>Phyllis C. Blumenfeld</surname>
          </string-name>
          , and Alison H. Paris.
          <year>2004</year>
          . School Engagement:
          <article-title>Potential of the Concept, State of the Evidence</article-title>
          .
          <source>Review of Educational Research</source>
          <volume>74</volume>
          ,
          <issue>1</issue>
          (
          <year>2004</year>
          ),
          <fpage>59</fpage>
          -
          <lpage>109</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Tao</given-names>
            <surname>Huang</surname>
          </string-name>
          , Mengyi Liang, Huali Yang,
          <string-name>
            <given-names>Zhi</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Tao</given-names>
            <surname>Yu</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Shengze</given-names>
            <surname>Hu</surname>
          </string-name>
          .
          <year>2021</year>
          .
          <article-title>Context-Aware Knowledge Tracing Integrated With the Exercise Representation and Association in Mathematics</article-title>
          .
          <source>In Proceedings of the 14th International Conference on Educational Data Mining.</source>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Zhenya</given-names>
            <surname>Huang</surname>
          </string-name>
          , Qi Liu, Chengxiang Zhai, Yu Yin, Enhong Chen,
          <string-name>
            <given-names>Weibo</given-names>
            <surname>Gao</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Guoping</given-names>
            <surname>Hu</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Exploring Multi-Objective Exercise Recommendations in Online Education Systems</article-title>
          .
          <source>In Proceedings of the 28th ACM International Conference on Information and Knowledge Management</source>
          .
          <fpage>1261</fpage>
          -
          <lpage>1270</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>