<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>SARLR: Self-adaptive Recommendation of Learning Resources</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Liping Liu</string-name>
          <email>liuliping@nlsde.buaa.edu.cn</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Wenjun Wu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jiankun Huang</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>State Key Lab of Software Development Environment Department of Computer Science and Engineering, Beihang University</institution>
          ,
          <addr-line>Beijing</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
      </contrib-group>
      <fpage>151</fpage>
      <lpage>158</lpage>
      <abstract>
        <p>Personalized recommendation is important for online students to select rich learning resources and make their own learning schedules. We propose SARLR, a new self-adaptive recommendation algorithm of online learning resources. The SARLR algorithm integrates an IRT-based learning cognitive model named T-BMIRT into the recommendation framework and is able to adaptively adjust learning path recommendations based on dynamic of individual learning process. The experimental results show that the SARLR algorithm outperforms the existing recommendation algorithms.</p>
      </abstract>
      <kwd-group>
        <kwd>Online Education</kwd>
        <kwd>Learning Recommendation</kwd>
        <kwd>ITS</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        With the growing prevalence of online education, students have access to all kinds of
electronic learning resources, including electronic books, exercises and learning videos.
Given the diversity of students’ background, learning styles and knowledge levels, it is
essential to have personalized recommendation tools to facilitate students in choosing
their own learning paths to satisfy their individual needs [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Previous studies have
introduced personalized learning recommendation algorithms following the two major
approaches including rule-based recommendation and data-driven recommendation.
      </p>
      <p>
        Most Intelligent Tutor Systems (ITS) such as [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], primarily adopt the rule-based
approach to design their recommendation algorithms, which requires domain experts to
evaluate learning scenarios for different kinds of students and define extensive
recommendation rules accordingly. Apparently, such a labor-intensive approach can only be
applied in specific learning domains. For modern online educational systems, designers
often take the data-driven approach by utilizing collaborative filtering methods to
implement learning recommendation algorithms. These data-driven recommendation
algorithms [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] attempt to identify suitable learning resources for students by comparing
similarity among students and learning objects.
      </p>
      <p>Although the data-driven recommendation approach is more scalable and general
than the rule-based approach, current proposed solutions have common problems in
achieving highly adaptive recommendation towards students’ latent learning state.
They often focus on either searching for similar learning resources based on content or
identifying similar student groups based on their learning behaviors. The recommended
learning objects or paths fail to consider the impact of difficulty of learning objects and
dynamic change in students’ learning states.</p>
      <p>In this paper, we propose a novel learning recommendation algorithm named
SARLR, which attempts to integrate an IRT-based learning cognitive model into the
recommendation framework and to adaptively adjust learning path recommendations
based on dynamics of individual learning process. Specifically, we introduce a
temporal, multidimensional IRT-based model named as T-BMIRT, which can accurately
infer student proficiency of multiple latent skills and difficulties of exercise
assessments. In addition, the T-BMIRT model incorporates the parameter of video learning,
which can describe the improvement in student skills after their interactions with video
lectures. Based on the T-BMIRT model, the SARLR algorithm can comprehensively
analyze every student’s skill progress at each learning step and recommend to them a
personalized learning path with the matching online video lectures and homework
problems.</p>
      <p>The contributions of this paper are the two-fold. First, we introduce the T-BMIRT
model, to estimate students’ latent skill levels and difficulties of learning resources for
recommendation. Second, we propose the SARLR algorithm by integrating the
TBMIRT model in the adaptive recommendation process of learning resources. The
experimental results confirm that the SARLR outperforms regular recommendation
algorithms. Lastly, we present an evaluation strategy for recommendation algorithms in
terms of rationality and effectiveness.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Data-driven learning recommendation algorithms often utilize common
recommendation methods widely adopted in the e-Commence area, including Collaborative
Filtering (CF) and Latent Factor Model (LFM). CF can be further divided into UCF
(Userbased Collaborative Filtering) and ICF (Item-based Collaborative Filtering). The core
idea of LFM is to connect users and items through latent features [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        EduRank [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] is a collaborative filtering based method for personalization in
e-learning. It can generate a difficulty ranking of questions for a target student by aggregating
the ranking of similar students. Although this method is able to rank the available
exercise questions based on their difficulties for similar students, it doesn’t integrate
cognitive learning models in its framework for estimating the ability of individual students.
Thus, it can’t generate the matching learning paths for students based on their state of
latent skills.
      </p>
      <p>
        The most related work to our research in previous studies is the Latent Skill
Embedding (LSE) model [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], which also presents a probabilistic model of students and lessons.
Although the LSE model provides a good foundation for designing a recommendation
framework for personalized learning, the paper [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] doesn’t propose a detailed
recommendation algorithm. Our T-BMIRT model is more fine-grained than the LSE model
because it defines a video learning parameter to capture student progress through their
interaction with video lectures. Moreover, we present the SARLR algorithm that
utilizes the T-BMIRT model to identify similar students for a target student and
recommend their learning paths according to the dynamic state of the target student’s latent
skills. We also extend the recommendation evaluation criteria expected gain by
incorporating two more metrics including relevance accuracy and difficulty accuracy. These
new metrics can support more comprehensive performance evaluation for learning
recommendation algorithms.
      </p>
      <p>
        Recently, reinforcement learning has been explored in personalized study planning
in ITS [
        <xref ref-type="bibr" rid="ref7 ref8 ref9">7-9</xref>
        ]. Most of them have not evaluated their approaches in real online learning
scenarios and compared their performance to existing problem selection strategies used
in current systems. Moreover, calculating an optimal personalized learning path in a
POMPD is often time-consuming and even becomes intractable as the dimensions of
the knowledge state and strategy spaces increase. Therefore, our SARLR algorithm
adopts the collaborative filter based approach and we plan to investigate the possibility
of utilizing reinforcement learning in our framework in future work.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>SELF-ADAPTIVE RECOMMENDATION</title>
      <p>
        define the model based on IRT, T-IRT and MIRT model [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. In a two-parameter IRT
model, the probability of the student  correctly answering the question  is given by:
      </p>
      <p>
        ,  (  +τ|  )=    , 2 (  +τ)
next moment is only relevant to his current ability value.

Where   is the question discrimination, 
is the question difficulty,   is the
student’s ability value. The Temporal IRT (T-IRT) model [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] extends the original IRT
and MIRT model by modeling a student’s latent skills over time as a Wiener process,
where   + −   ~ (  ,  2 ). The model indicates the ability value of the student at the
      </p>
      <p>
        The T-IRT model only considers interactions between students and assessments,
ignoring their interactions with learning videos. However, we believe that the students'
ability can be significantly improved after completing a learning video. Therefore, in
[
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], we introduce a new model T-BMIRT by incorporating learning video parameters
to describe the impact of students’ interaction with learning videos. The major
equations are defined in Eq (2):
 ( ⃗ , +τ| ⃗ , , ⃗
 , )=  ⃗⃗ , +⃗ , , 2 ( ⃗ , +τ), ⃗
 , =
 
   ∙  ⃗ ∙
1+
      </p>
      <p>1
(−(⃗⃗ , ∙ℎ⃗⃗
‖ℎ⃗⃗ ‖
−‖ℎ⃗⃗ ‖))
(1)
(2)
Where ⃗ , represents knowledge that student  gains from the video  , ⃗⃗ ⁡represents

knowledge of the video  , ℎ⃗ is the prerequisites of video⁡ ,    is the duration in which
student  watches video  and   is the total length of the video  . In Eq (2), both
student ability and learning video requirements have been expanded from one-dimensional
to multidimensional. We utilize the vector projection method to determine whether the
relevant abilities of the student exceed the relevant skill requirements of the video
lectures.</p>
      <p>
        The T-BMIRT model enables us to infer every student’s current ability  , video
knowledge  and video skill requirements ℎ through the student’s responses of
assessment questions. The detailed model fitting process of the T-BMIRT can be found in
[
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. An approximation technique makes it possible to train the T-BMIRT in an online
way. As a result, the T-BMIRT can be effectively used in the framework of the SARLR
algorithm to estimate the parameters of learning resources and students’ ability levels.
3.2
      </p>
      <sec id="sec-3-1">
        <title>Similar Students Search and Learning Path Extraction</title>
        <p>SARLR Phase 1 describes the process of searching similar students and extracting a
suitable learning path for a target student. At Step 1, the algorithm identifies the
students MS with the similar skill levels to the target student   through k-nearest neighbor
search method over the k-dimension tree (kd-tree) structure and k-nearest neighbor
search method. At Step 2-4, the algorithm selects the best student  
∈ 
highest ability level at the moment when they complete learning specific knowledge
with the
units. At Step 5, the algorithm extracts the learning path  of   to the target student⁡  .
SARLR Phase 1: Search and Extraction
INPUT:</p>
        <p>Set of students  = { 1,  2, … ,   }, target student   ∈ 
Matrix of abilities  = [  , ], where   , is the ability value of student s at time t
Set of learning resources  = { 1,  2, … ,   }</p>
        <p>The time in this paper is the index of learning resources with the student just completed learning.
OUTPUT: learning path 
1: search for similar students MS, where   ∈  and    , 0 is similar to    , 0
2: for each   ∈  do
3: find   =  (
4: end for
5: extract the learning path  = (  1,   2, …    )of  
6: return</p>
        <p>(   ,   −    , 0)), where    is the time of   completing learning
3.3</p>
      </sec>
      <sec id="sec-3-2">
        <title>Adaptive Adjustment</title>
        <p>(3)
Because each individual student has his/her inherent learning style, even when he
follows the recommended learning path generated in SARLR phase 1, the learning
outcome may not be as good as expected by the recommendation algorithm. In order to
deal with this problem, we set up the two conditions in Eq (3) to initiate the Adaptive
Re-planning phase, which is defined in SARLR Phase 2.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>EXPERIMENTS</title>
      <p>We selected two datasets to perform our experiments, the public “Assistments”,
including 224,076 interactions, 860 students, 1,427 assessments and 106 skills, and a blended
learning data from our learning analysis platform including 14,037,146 learning
behavior data from 140 schools and 9 online educational companies.
4.1</p>
      <sec id="sec-4-1">
        <title>Experiments for T-BMIRT</title>
        <p>We divided each data set into two parts, one part only contains single skill assessments,
and the other part contains multiple skills assessments. The IRT, T-IRT are single skill
models, and the MIRT and T-BMIRT are multiple skills models. The dimensions for
models are related to the numbers of knowledge components. The values in Table 1 are
average results of the cross-validation. It shows that T-BMIRT outperforms the other
models on each dataset, especially on the multidimensional dataset.
Where   ∈  is the learning resources in a recommended path,  is the length of the
is the knowledge components which   is learning in the current chapter,
function similarity() calculates the adjusted cosine similarity of the two vectors in the
parentheses. The relevance accuracy</p>
        <p>ability. The difficulty accuracy</p>
        <p>of the recommended learning resources for the target student   are matched with his
is set to evaluate whether the difficulties of the</p>
        <p>is used to evaluate whether the difficulties
recommended learning resources for the target student can match his current ability
levels.</p>
        <p>
          We selected the blending data to do this experiments. Table 2 shows the average of
the 10-fold cross-validation results. It can be seen that the UCF and ICF have a similar
effect, but the UCF works better on the relevance accuracy, while the ICF is better at
the difficulty accuracy. The LFM performs better than the first two algorithms in terms
of both indicators. The SARLR algorithm performs best among all these algorithms.
their ability levels. We calculated “expected gain”  =  (   ′(⁡) − )(  ) by using PCA and
K-means method to further split the students of the same group into two parts based on
their learning paths [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. One part is the students whose learning paths are strictly
recommended, denoted as⁡ ′ , and the other part is the students whose learning path are
randomly selected, denoted as⁡ .  (  ′⁡)and  (  )and indicate that the students’
average score in the last online assessment. We sorted the six groups of the students
ascendingly based on their ability levels: group 1 has the lowest skill level, group 2 has a
higher skill level than group 1, and group 6 has the highest.
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>CONCLUSIONS</title>
      <p>We developed a self-adaptive recommendation algorithm of learning resources
(SARLR) to personalize students’ learning path. It contains the T-BMIRT, a temporal
blended multidimensional IRT model, which performs well on the prediction task of
multi-dimensional skills assessments, especially when the study process contains
learning video interactions. Based on the T-BMIRT model, the SARLR algorithm adopts a
reasonable recommendation strategy and establishes conditions to adaptively adjust
recommendations towards the dynamic needs of the students. In addition, we extend
the evaluation criteria for personalized learning recommendation in term of rationality
and effectiveness. Experimental results prove that the SARLR algorithm outperforms
the other recommendation algorithms based on CF and LFM.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Akbulut</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cardak</surname>
            ,
            <given-names>C. S.</given-names>
          </string-name>
          :
          <article-title>Adaptive educational hypermedia accommodating learning styles: A content analysis of publications from 2000 to 2011</article-title>
          . Computers &amp; Education.
          <volume>58</volume>
          (
          <issue>2</issue>
          ),
          <fpage>835</fpage>
          -
          <lpage>842</lpage>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Vesin</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ivanović</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>KlašNja-MilićEvić</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Budimac</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>Protus 2.0: Ontology-based semantic recommendation in programming tutoring system</article-title>
          .
          <source>Expert Systems with Applications</source>
          .
          <volume>39</volume>
          (
          <issue>15</issue>
          ),
          <fpage>12229</fpage>
          -
          <lpage>12246</lpage>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Zhang, G.:
          <article-title>A fuzzy tree matching-based personalized e-learning recommender system</article-title>
          .
          <source>IEEE Transactions on Fuzzy Systems</source>
          .
          <volume>23</volume>
          (
          <issue>6</issue>
          ),
          <fpage>2412</fpage>
          -
          <lpage>2426</lpage>
          (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Jenatton</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roux</surname>
            ,
            <given-names>N. L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bordes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Obozinski</surname>
            ,
            <given-names>G. R.:</given-names>
          </string-name>
          <article-title>A latent factor model for highly multi-relational data</article-title>
          .
          <source>In: Proceedings of the 25th International Conference on Neural Information Processing Systems</source>
          , pp.
          <fpage>3167</fpage>
          -
          <lpage>3175</lpage>
          . ACM,
          <string-name>
            <surname>California</surname>
          </string-name>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Segal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Katzir</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gal</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shani</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shapira</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Edurank: A collaborative filtering approach to personalization in e-learning</article-title>
          .
          <source>In: Proceedings of the 7th International Conference on Educational Data Mining</source>
          , pp.
          <fpage>68</fpage>
          -
          <lpage>75</lpage>
          . EDM, London (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Reddy</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Labutov</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Joachims</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Latent skill embedding for personalized lesson sequence recommendation</article-title>
          .
          <source>arXiv preprint arXiv:1602.07029</source>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Theocharous</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Beckwith</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Butko</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Philipose</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Tractable POMDP Planning Algorithms for Optimal Teaching in SPAIS</article-title>
          . In: Workshop on Plan Activity, and
          <article-title>Intent Recognition (PAIR), IJCAI (</article-title>
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Folsom-Kovarik</surname>
            ,
            <given-names>J. T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sukthankar</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schatz</surname>
            ,
            <given-names>S. L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nicholson</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          :
          <article-title>Scalable POMDPs for Diagnosis and Planning in Intelligent Tutoring Systems</article-title>
          . In: AAAI Fall Symposium:
          <article-title>Proactive Assistant Agents</article-title>
          . AAAI Press, Virginia (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Brunskill</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Russell</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Partially observable sequential decision making for problem selection in an intelligent tutoring system</article-title>
          .
          <source>In Educational Data Mining (EDM)</source>
          , pp.
          <fpage>327</fpage>
          -
          <lpage>328</lpage>
          (
          <year>2011</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Reckase</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Multidimensional item response theory</article-title>
          . Springer, New York (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Ekanadham</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karklin</surname>
          </string-name>
          , Y.:
          <article-title>T-skirt: Online estimation of student proficiency in an adaptive learning system</article-title>
          .
          <source>arXiv preprint arXiv:1702.04282</source>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
          </string-name>
          , W.: T-BMIRT:
          <article-title>Estimating representations of student knowledge and educational components in online education</article-title>
          .
          <source>In: 2017 IEEE International Conference on Big Data</source>
          , pp.
          <fpage>1301</fpage>
          -
          <lpage>1306</lpage>
          . IEEE Press, Massachusetts (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>