<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Context-Aware Factorization for Personalized Student's Task Recommendation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Nguyen Thai-Nghe</string-name>
          <email>nguyen@ismll.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tomas Horvath</string-name>
          <email>horvath@ismll.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lars Schmidt-Thieme</string-name>
          <email>schmidt-thieme@ismll.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Information Systems and Machine Learning Lab, University of Hildesheim Marienburger Platz 22</institution>
          ,
          <addr-line>31141 Hildesheim</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Collaborative filtering - one of the recommendation techniques - has been applied for e-learning recently. This technique makes an assumption that each user rates for an item once. However, in educational environment, each student may perform a task (problem) several times. Thus, applying original collaborative filtering for student's task recommendation may produce unsatisfied results. We propose using context-aware models to utilize all interactions (performances) of the given student-task pairs. This approach can be applied not only for personalized learning environment (e.g., recommending tasks to students) but also for predicting student performance. Evaluation results show that the proposed approach works better than the none-context method, which only uses one recent performance.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Recommender systems have been applied for e-learning task recently [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. One of
the techniques, for instance, is collaborative filtering, e.g. k-nearest neighbors (k-NN)
or matrix factorization, which takes into account just the last rating of users, i.e. it
assumes that a user rates an item once. However, in educational environment, for
example, recommending tasks (or problems or exercises) to students, this assumption
might not hold since each student can perform the task several times. Furthermore,
recommender system for educational purposes is a complex and challenging research
direction since the preferred learning activities of students might pedagogically not be
the most adequate and recommendations in e-learning should be guided by
educational objectives, and not only by the user's preferences [
        <xref ref-type="bibr" rid="ref3 ref4 ref5">3-5</xref>
        ].
      </p>
      <p>
        On the other hand, recommendation techniques have also been applied for
predicting student performance recently [
        <xref ref-type="bibr" rid="ref2 ref6">2, 6</xref>
        ]. Concretely, [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] proposed a temporal
collaborative filtering approach to automatically predict the correctness of students'
problem solving in an intelligent math tutoring system. This approach utilized
multiple interactions for a student-problem pair by using k-NN method; [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] proposed
using matrix and tensor factorization to take into account the “slip” and “guess” latent
factors as well as the temporal effect in predicting student performance.
      </p>
      <p>
        Previous work [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] pointed out that an approach which uses student performance
prediction for the recommendation of e-learning tasks could tackle the above
mentioned problems since we can recommend the tasks to the students based on their
performance but not on their preferences. Using this approach, one can recommend
similar tasks (exercises) to students and can determine which tasks are notoriously
difficult for a given student. For example, there is a large bank of exercises where
students lose a lot of time solving problems which are too easy or too hard for them.
When a system is able to predict students' performance, it could recommend more
appropriate exercises for them. Thus, we could filter out the tasks with predicted high
performance / confidence since these tasks are too easy, or filter out the tasks with
predicted low performance (too hard) or both, depending on the goals of the
elearning system [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        This work proposes using context-aware models for student's task
recommendation which utilize multiple interactions (performances) of a given
student-task pair. This approach can be applied not only for predicting student
performance as in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] but also for personalized task recommendation to students.
Here, we have not focused on building a real system, but on how to model the
student's task recommendation using context-aware approach [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
    </sec>
    <sec id="sec-2">
      <title>2 Data sets and Methods</title>
      <p>In this section we first introduce the data sets. We then present the method without
taking into account the context (considered as a baseline) and the proposed
contextaware methods.</p>
      <sec id="sec-2-1">
        <title>2.1 Data sets</title>
        <p>Two data sets are collected from the KDD Challenge 2010
(pslcdatashop.web.cmu.edu/KDDCup), which will be called “Algebra” and “Bridge”
for short. We aggregated these data sets to get four attributes: student ID (s), problem
ID (i), problem view (v) which tracks how many times the student has interacted with
the problem, and performance p (p ∈ [0..1]) which is an average of successful
solutions (averaging from “correct first attempt” attribute).</p>
        <p>
          As described in the literature [
          <xref ref-type="bibr" rid="ref2 ref8">8, 2</xref>
          ], these data sets can be mapped to
user-itemrating in recommender systems. In this case, students become users and problems
become items which are presented in a matrix (s, i) as in Figure 1a. In this work, the
context (“problem view” - v) is taken into account, thus, each data set is presented in a
three-mode tensor (s, i, v) as illustrated in Figure 1c.
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2 Baseline (Without Using Context)</title>
        <p>
          Traditional collaborative filtering has an assumption that each user rates for each item
once, which means that only the last rating is used. Similarly, in this work, the last
performance p of a student-problem pair (s, i) is used (which ignores the multiple
interactions between students and problems) and finally, a matrix factorization model
is applied. The following paragraph briefly summarizes the matrix factorization
method (please see the article [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] for more details).
Matrix factorization is the task of approximating a matrix X by the product of two
smaller matrices W and H, i.e. X ~ WHT [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. In the context of recommender systems
the matrix X is the partially observed ratings matrix, W ∈ ℜS×K is a matrix where each
row s is a vector containing K latent factors describing the student s and H ∈ ℜI×K is a
matrix where each row i is a vector containing K latent factors describing the problem
i. Let wsk and hik be the elements of W and H, respectively, then the performance
given by a student s to a problem i is predicted by:
        </p>
        <p>K
pˆsi = ∑ wsk hik = (WH T )s,i (1)</p>
        <p>
          k=1
where W and H are model parameters which can be obtained by an optimization
process using either stochastic gradient descent or Alternating Least Squares [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]
given a criterion such as Root Mean Squared Error (RMSE) or Mean Absolute Error
(MAE).
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3 Context-Aware Methods</title>
        <p>
          We make use of two context-aware methods: “Pre-filtering” and “Contextual
Modeling” [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] (in this work we use matrix and tensor factorization approach instead
of heuristic-based and model-based approaches as in [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]).
        </p>
        <p>Pre-filtering (PF): As its name, this method requires pre-processing on the data sets.
To do this, the performance p is aggregated (averaged) along the context v. Thus, the
three-mode tensor (s, i, v) now becomes the matrix as illustrated in Figure 1b.</p>
        <p>After the pre-filtering step, we apply the matrix factorization method to factorize
on student-problem pairs (s, i) as described in section 2.2.</p>
        <p>
          Contextual Modeling (CM): In this method, the context v is preserved, thus, we have
to deal with the three-mode tensor. Given a tensor Z of size S × I × V, where the first
and the second mode describe the student and the problem as in previous sections; the
third mode describes the context (problem view - v) with size V. Then Z can be
written as a sum of rank-1 tensors, using CANDECOM-PARAFAC [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]:
        </p>
        <p>K
Z = ∑λ w οhk οqk</p>
        <p>k k
k =1</p>
        <p>
          K
pˆsiv = ∑λk wsk hik qvk
k =1
(2)
(3)
where ° is the outer product, λk is a vector of scalar values, and each vector wk ∈ ℜS,
hk ∈ ℜI , and qk ∈ ℜV describes the latent factors of student, problem, and context,
respectively. With this approach, the performance of student s for problem i at context
v (problem view) is predicted by:
“Student bias/effect” and “problem bias/effect”: As shown in the literature [
          <xref ref-type="bibr" rid="ref11 ref2 ref8">11, 8,
2</xref>
          ], the prediction result can be improved if one incorporates the biased terms to the
model. In educational setting, those biased terms are “student bias/effect” which
models how good/clever a student is (i.e. how likely is the student to perform a
problem correctly), and “problem bias/effect” which models how difficult/easy the
problem is (i.e. how likely is the problem in general to be performed correctly) [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
        </p>
        <p>
          With these biases, the performance p in the pre-filtering method becomes
K
pˆsi = μ + bs + bi + ∑ wsk hik (4)
k =1
and the performance p in the contextual modeling method (equation 3) becomes
K
pˆsiv = μ + bs + bi + ∑λk wsk hik qvk (5)
k =1
where μ is global average, bs is student bias, and bi is problem bias (how to obtain
these values is already described the article [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]).
        </p>
        <p>After the prediction phase, we can filter out the tasks with predicted high
performance since these tasks are too easy, or filter out the tasks with predicted low
performance (too hard) or both, depending on the goals of the e-learning system.
Thus, the appropriate tasks can be delivered to students.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3 Experiments</title>
      <p>We describe the experimental setting and then we present the comparison results.</p>
      <sec id="sec-3-1">
        <title>3.1 Experimental setting</title>
        <p>We use just the first 5,000 problems in both Algebra and Bridge data sets. We use
3fold cross-validation and paired t-test with significance level 0.05 for all experiments.
We do hyper parameter search to determine the best hyper parameters for all methods.
The Matlab Tensor Toolbox is used for experimenting (csmr.ca.sandia.gov/~tgkolda/
TensorToolbox).</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2 Experimental results</title>
        <p>Moreover, the MAE improvements in the prediction models implicitly mean that
the system can recommend the “right” tasks (exercises) to the students, and thus, we
can help them reducing their time and effort in solving the tasks by filtering the ones
that are too easy or too hard for them. Using these context-aware models, we can
generate the performance for a given student-task pair, so the remaining works are
wrapping around with an interface to deliver the recommendations. However, this
work is out of the scope of this paper, and is leaved for future work.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4 Conclusion</title>
      <p>We proposed using context-aware models to utilize all performances (interactions) of
the given student-task pairs. We have shown that these methods can improve the
prediction results compared to the none-context method, which only uses the last
performance. This approach can apply not only for personalized recommending the
tasks to students but also for predicting student performance.</p>
      <p>
        It is well-known that factorization methods outperform the k-NNs collaborative
filtering [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. However, the comparison of the context-aware factorization methods
with the temporal collaborative filtering (using k-NNs as in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]) is leaved for future
work.
      </p>
      <p>Acknowledgments The first author was funded by the TRIG project of Cantho
University, Vietnam. Tomas Horvath is also supported by the grant VEGA 1/0131/09.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Manouselis</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Drachsler</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vuorikari</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hummel</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koper</surname>
          </string-name>
          , R.:
          <article-title>Rec. syst. In technology enhanced learning</article-title>
          . In: Kantor,
          <string-name>
            <given-names>P.B.</given-names>
            ,
            <surname>Ricci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Rokach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Shapira</surname>
          </string-name>
          ,
          <string-name>
            <surname>B.</surname>
          </string-name>
          <source>(eds.) 1st Recommender Systems Handbook</source>
          , Springer-Berlin (
          <year>2010</year>
          )
          <fpage>1</fpage>
          -
          <lpage>29</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Thai-Nghe</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Drumond</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horvath</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krohn-Grimberghe</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nanopoulos</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>SchmidtThieme</surname>
          </string-name>
          , L.:
          <article-title>Factorization techniques for predicting student performance</article-title>
          . In
          <string-name>
            <surname>Santos</surname>
            ,
            <given-names>O.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boticario</surname>
          </string-name>
          , J.G., eds.:
          <article-title>Educational Recommender Systems and Technologies: Practices and Challenges (In press)</article-title>
          .
          <source>IGI Global</source>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Drachsler</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hummel</surname>
            ,
            <given-names>H.G.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koper</surname>
          </string-name>
          , R.:
          <article-title>Identifying the goal, user model and conditions of recommender systems for formal and informal learning</article-title>
          .
          <source>Journal of Digital Information</source>
          <volume>10</volume>
          (
          <issue>2</issue>
          ) (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Santos</surname>
            ,
            <given-names>O.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boticario</surname>
            ,
            <given-names>J.G.</given-names>
          </string-name>
          :
          <article-title>Modeling recommendations for the educational domain</article-title>
          .
          <source>Elsevier's Procedia Computer Science</source>
          <volume>1</volume>
          (
          <issue>2</issue>
          ) (
          <year>2010</year>
          )
          <fpage>2793</fpage>
          -
          <lpage>2800</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Tang</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McCalla</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Beyond learners interest: Personalized paper recommendation based on their pedagogical features for an e-learning system</article-title>
          .
          <source>In: PRICAI 2004: Trends in Artificial Intelligence</source>
          . (
          <year>2004</year>
          )
          <fpage>301</fpage>
          -
          <lpage>310</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Cetintas</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Si</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xin</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hord</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Predicting correctness of problem solving in ITS with a temporal collaborative _ltering approach</article-title>
          .
          <source>In: International Conference on Intelligent Tutoring Systems</source>
          . (
          <year>2010</year>
          )
          <fpage>15</fpage>
          -
          <lpage>24</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Adomavicius</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tuzhilin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Context-aware recommender systems</article-title>
          . In Ricci, F.,
          <string-name>
            <surname>Rokach</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shapira</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kantor</surname>
          </string-name>
          , P.B., eds.
          <source>: Recommender Systems Handbook</source>
          . Springer (
          <year>2011</year>
          )
          <fpage>217</fpage>
          -
          <lpage>253</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Thai-Nghe</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Drumond</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krohn-Grimberghe</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schmidt-Thieme</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Recommender system for predicting student performance</article-title>
          .
          <source>Elsevier's Procedia Computer Science</source>
          <volume>1</volume>
          (
          <issue>2</issue>
          ) (
          <year>2010</year>
          )
          <fpage>2811</fpage>
          -
          <lpage>2819</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Koren</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bell</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Volinsky</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Matrix factorization techniques for recommender systems</article-title>
          .
          <source>IEEE Computer Society Press 42(8)</source>
          (
          <year>2009</year>
          )
          <fpage>30</fpage>
          -
          <lpage>37</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Kolda</surname>
            ,
            <given-names>T.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bader</surname>
            ,
            <given-names>B.W.</given-names>
          </string-name>
          :
          <article-title>Tensor decompositions and applications</article-title>
          .
          <source>SIAM Review</source>
          <volume>51</volume>
          (
          <issue>3</issue>
          ) (
          <year>September 2009</year>
          )
          <fpage>455</fpage>
          -
          <lpage>500</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Toscher</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jahrer</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>Collaborative filtering applied to educational data mining</article-title>
          .
          <source>KDD Cup</source>
          <year>2010</year>
          :
          <article-title>Improving Cognitive Models with Educational Data Mining (</article-title>
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Koren</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Factor in the neighbors: Scalable and accurate collaborative filtering</article-title>
          .
          <source>ACM Transactions on Knowledge Discovery from Data</source>
          <volume>4</volume>
          (
          <issue>1</issue>
          ) (
          <year>2010</year>
          )
          <fpage>1</fpage>
          -
          <lpage>24</lpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>