<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Predicting student performance based on Lecture Materials data using Neural Network Models</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sukrit Leelaluk</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tsubasa Minematsu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yuta Taniguchi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fumiya Okubo</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Atsushi Shimada</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Graduate School of Information Science and Electrical Engineering, Kyushu University</institution>
          ,
          <country country="JP">Japan</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2022</year>
      </pub-date>
      <fpage>21</fpage>
      <lpage>22</lpage>
      <abstract>
        <p>Student Performance Prediction is essential for learning analysis of the students' learning behavior to discovering at-risk students for the early invention to support students. This study transforms the students' reading behavior into a two-dimensional matrix input based on each lecture material's reading behavior. The matrix input will be updated by accumulating the value for each week for performance prediction week by week. The multilayer perceptron neural network is employed to receive the matrix input and give feedback as a student's criteria consist of at-risk or no-risk students. This study considers the accuracy of a model considering between on contents information and weekly information. We also investigate the switching of learning materials' order, the feature importance of the reading operation on an event stream, and the difference in reading behavior between at-risk and no-risk students. These can help the instructors for an early invention to support at-risk students.</p>
      </abstract>
      <kwd-group>
        <kwd>1 Learning Analytics</kwd>
        <kwd>Performance prediction</kwd>
        <kwd>Neural Networks</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Performance prediction is education mining data by collecting students' learning activity logs and
analysis. Then, we receive output to analyze students' learning behavior. The purpose of performance
prediction is to increase students' academic success or discover at-risk students early and invent them
against drop-out or failure [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ].
      </p>
      <p>
        Recently, Machine Learning (ML) and Deep Learning (DL) have been used for performance
prediction in previous research. This is because ML and DL can learn and recognize the data pattern
with a vast volume and predict the outcome with better accuracy [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Many pieces of research are
proposing to predict students' performance early by using students' activities logs based on each week
to create a time sequence. However, collecting the data from the education systems each week maybe
find the break weeks or the same courses on each semester have a period difference. These problems
will have the possibility to make accuracy drop. Furthermore, instructors can adjust the teaching styles
of the courses every time. Therefore, using the past activities data as training data to predict the students'
performance with different formats at the current time makes the prediction performance drop possible.
This situation is similar with using a single-course predictive model to predict another course [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>From the reason above, this study decided to transform the data to be based on lecture materials for
students' performance prediction since the lecture materials are a fundamental object used to hold the
lecture. In addition, this paper relies on the neural network (NN) to derive the high-performance
classification model for predicting students' reading behavior. This can support instructors to identify
at-risk students during the period of courses and invent students early.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Works</title>
      <p>The prediction of students' performance becomes the challenge of learning analytics. The previous
literature was recently analyzed to present different techniques to predict students' performance, such
as statistical methods, machine learning, or deep learning.</p>
      <p>
        Moises et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] used ML and multilayer perceptron NN (MLP) to analyze the data on the
massive learning management system (LMS) on 10%, 25%, 33%, and 50% of the course length for
detect at-risk, fail and excellent students. Okubo et al. [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ] studied a method to predict students' final
grades on multiple courses using a recurrent neural network (RNN) to analyze the time-series data.
Murata et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] used the knowledge distillation RNN model to compress the original model to become
a student model for early detection of at-risk students. Finally, Conjin et al. [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] predict students' final
grades and the predictor variables of 17 blended courses using standard and logistic regression on each
week from the Moodle LMS data.
      </p>
      <p>From these previous works of literature, we found they predicted students' performance by
considering the event stream of reading behavior and other factors based on conventional weekly. In
contrast to previous work, we present a prediction by considering the reading behavior on each learning
material that instructors used in the lecture. This study will transform reading behavior data into the
matrix input that can explain each student's reading information.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Datasets and Preparation 3.1.</title>
    </sec>
    <sec id="sec-4">
      <title>Data Collection</title>
      <p>
        The clickstream data was received from the International Conference on Learning Analytics &amp;
Knowledge 2022 Data Challenge [
        <xref ref-type="bibr" rid="ref10 ref9">9, 10</xref>
        ]. Dataset consists of the two types of lectures (A and B) from
onsite classes in 2019 and online classes in 2020 in the same institution. The summary of the detail of
the experimental data shown in Table 1.
3.2.
      </p>
    </sec>
    <sec id="sec-5">
      <title>Features</title>
      <p>
        The features are received from the operation on the event stream that concludes the total operations and
reading time computed by using the openLA library [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. The features are used in experiments shown
in Appendix A.
3.3.
      </p>
    </sec>
    <sec id="sec-6">
      <title>Data Preprocessing</title>
      <p>
        In this study, the raw data are transformed into two-dimensional matrices for inserted into the model
[
        <xref ref-type="bibr" rid="ref12 ref13">12, 13</xref>
        ]. The matrix consists of m rows with the size of the number of lecture materials (C), and n
columns with the size of the number of features (F) is displayed in figure 1. Where, its element Xmn
represents the value of the clickstream on each action and lecture material.
      </p>
      <p>The contents information matrix can input the feature in order of learning materials based on the
previous data. Then, this matrix can accumulate values of features depending on students' reading
behavior independent of time series. This can protect instructors from switching and not using content
for a lecture. So, we can use the previous data to predict students' performance that differs from the
model depending on time. Since instructors do not use a learning material to lecture students on that
week, the value of the data has a significant change that may cause prediction performance to change.</p>
      <p>From the observation data in table 1, we found that the amount of the contents is different. We thus
made the matrix for each course is adjusted to have rows and columns equivalent using the matrix size
of a course with most lecture materials on the same lecture-type as the main. If the missing value is
found, this will be imputed by the average of each feature's value of students on the same course. All
value on the matrix is normalized by robust standardization for rescaling features using statistics that
are robust to outliers.
3.4.</p>
    </sec>
    <sec id="sec-7">
      <title>Model</title>
      <p>This study creates MLP prediction models for each type of lecture is trained by the learning behavior
in all lecture weeks. The MLP will consist of two hidden layers with the activation function are ReLu.
The output layer had two units for classification between no-risk and at-risk students with the activation
function is softmax. The optimizer function is Adam for the loss. The matrix will be adjusted to become
the vector of size m x n by listing m rows next to each other and fetching them into the same model.</p>
      <p>The experiment will predict the students' performance every week during the course period. The matrix
will be updated by accumulating data week by week until the course's last week that is shown in figure
2.
3.5.</p>
    </sec>
    <sec id="sec-8">
      <title>Evaluation Criteria</title>
      <p>
        The evaluation criteria will class the student's performance as a binary classification. The class will
separate students who receive grades A, B, and C into the no-risk group and students who receive grades
D and F into the at-risk group [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. The matrix's performance is evaluated using the accuracy. Table 2
shows the distribution of students' final grades in each course.
      </p>
    </sec>
    <sec id="sec-9">
      <title>4. Experimental &amp; Results 4.1.</title>
    </sec>
    <sec id="sec-10">
      <title>The efficiency of the model considering content information</title>
      <p>
        This experiment presents the model's prediction result that considers weekly behavior with content
information (Weekly w/ Content) in contrast with weekly behavior without content information
(Weekly w/o Content). Many researchers predicted students' performance using the data based on
conventional weekly in previous. For instance, Okubo et al. [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ] predicted students' final grades using
RNN for each week.
      </p>
      <p>The weekly prediction data is collected from the reading behavior data from the starting of a
lecture of the current week until the starting of next week's lecture every week. Then, this is input into
the prediction model as a vector size 1 row with n columns that uses features like a content information
model. We evaluate the proposed method where A-2019 and B-2019 courses are used as the training
data, and A-2020 and B-2020 courses are used as the test data.</p>
      <p>From observing the correspondence of lecture materials on both lecture types from the raw data,
an instructor adjusted the teaching styles of the courses by changing the number of contents. We found
that A-2019 and B-2019 courses have ten and one content canceled for usage on the lecture in 2020,
respectively. Then, A-2020 and B-2020 courses have two and one new content used in the lecture,
respectively. Therefore, the matrix input for training and test set is adjusted to have the row and column
equivalent by using the matrix size of a course with most lecture materials on the same lecture-type as
the main. The missing value thus will be imputed by the mean of each feature's value on each student.
However, an A-type lecture on the 2019 course has 21 lecture materials, including 12 main contents
and 9 summarized from main contents. Then, an instructor left the main content only and added new
content in 2020. In addition, many students chose to read the main contents more than summarized
contents significantly on students' reading behavior on A-2019 lectures. We thus transform the training
matrix of A-2019 courses to have the same size as the test matrix of the A-2020 course.</p>
      <p>The result found that the weekly w/o content model is better than the weekly w/ content model
on the lecture A-type. However, these are similar to the lecture B-type. Then, we found that both
prediction models had high accuracy on a lecture B-type than a lecture A-type shown in figure 3.</p>
      <p>When we observe the result prediction on each model, we found that the weekly w/o content
model can classify the final grade A and B students as no-risk students better than the weekly w/ content
model on both types of lectures. However, the identity of the final grade C students is very well on both
models in a B-type lecture more than an A-type lecture. The detection of the at-risk students of both
models on A-type lecture is better than B-type lecture every week. This is because the training data of
at-risk students on A-type lecture (16 students) is more than B-type lecture (4 students). Therefore, that
affects the model better to analyze the pattern of at-risk students on A-type lecture. The pattern of
reading behavior of both no-risk and at-risk students will be shown in section 4.2.</p>
      <p>Figure 4a: The graphs show the switching of learning materials' order of a weekly w/ content model
on (a) A-2020 and (b) B-2020 courses.</p>
      <p>Figure 4b: The graphs show the switching of weekly order of a weekly w/o content model
on (a) A-2020 and (b) B-2020 courses.</p>
      <p>Furthermore, we simulated the example situation if instructors change the learning materials'
order to make information change in an actual class in Figures 4a and 4b. We conducted these
experiments by switching the order of learning materials in the weekly w/ content and the weekly
information on weekly w/o content on test data every week and inputting the same model on previous
experiments.</p>
      <p>We found the switch of the contents on a weekly w/ content model made an accuracy drop on
some weeks on a course period, especially on an A-type lecture. The reason is that a weekly w/ content
) 18
(%16
ce 14
n
tra 12
o
p 10
m
e 8
i
r
tau 6
fe 4
e
itv 2
a
le 0
R
model considers each content’s reading behavior as the main. Thus, switching the order materials makes
the values of features change. In addition, instructors and students maybe give importance to each
lecture equivalent each year. This made the separation of each content on each row on the matrix have
new data that a weekly w/ content model has ever been trained to cause an accuracy drop (Figure 4a).</p>
      <p>On the other hand, the switch of weekly information on a weekly w/o content model made an
accuracy change on some weeks. However, this model can give an accuracy better than the weekly w/
content model. We found that the weekly w/o content model considers the operation count and reading
time on each week as central. Let us consider students' behavior, such as reading time. We can find that
the no-risk group takes time to read regularly. However, the at-risk group did not spend so much time
reading that caused a weekly w/o content model to identify a class of students better than a weekly w/
content model (Figure 4b).
4.2.</p>
    </sec>
    <sec id="sec-11">
      <title>Analysis of the feature important</title>
      <p>
        In the previous experiment, we conducted the matrix and vector input using features explained in section
3.2. We thus try to reduce the matrix size by exploring the feature importance by using the Random
Forest classifier. This method allows us to know the insight into informative features for classification
[
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
      </p>
      <p>Features</p>
      <p>This experiment was conducted using the vector data from the weekly w/o content to explore the
feature importance. We found the 'NEXT,' 'PREV,' 'READING_TIME_MIN,' and 'TOTAL_ACTION'
are gave the most feature importance score as a percentage on both types of lecture. These features are
the most helpful operation for reading learning materials. Furthermore, this will analyze students'
reading behavior on each final grade by using a heatmap in figure 6. The heatmap of reading behavior
was computed by the average of operations from the students on each final grade. The rich yellow color
means that a feature has the most action and the purple color means that a feature has the poor action.
The no-risk students had the most operation on an event stream, which means no-risk students spent
much attention and time for reading learning materials on many pages. On the other hand, the at-risk
students did not focus on reading lecture materials as no-risk students, which caused these students got
to fail.</p>
      <p>We thus try to adjust the matrix on weekly w/ content model by reducing the matrix size by using
only the high score features composed of 'NEXT,' 'PREV,' 'READING_TIME_MIN,' and
'TOTAL_ACTION' compared with the current features. We found that the reduction of matrix size
raised prediction accuracy on an A-type course due to the essential features, making the model more
easily recognize the difference between at-risk and no-risk students. However, a B-type course did not
have any difference when using the high score features for prediction since the data of at-risk students
was few. (Figure 7)</p>
    </sec>
    <sec id="sec-12">
      <title>5. Discussion</title>
      <p>From the result of the experiment, we found that predicting students' performance by considering
learning content data can classification students who have a trend to fail or at-risk students if we have
much reading behavior pattern of at-risk data sufficient. In addition, we used a heatmap to investigate
the reading behavior on each week between no-risk students and at-risk students for a survey on how it
affected the model performance. After the 3rd and 4th week of lecture, we found that no-risk students
had a few actions for reading learning materials. On the other hand, at-risk students can continue reading
and reviewing many learning materials until the end of the course (Figure 8). This causes a model to
separate the reading behavior between at-risk and no-risk students.</p>
      <p>
        The reading behavior of at-risk students has a minor operation on the reading of contents materials,
which means these students spent a little time reading. We thus plot graphs of the average reading time
of all students those criteria by final grade on figure 9. Again, we found that the students in the no-risk
group spent much time reading more than at-risk students significantly, according to the result on a
heatmap. Let us consider the reading behavior with the prediction result based on the content
information model. The instructors can invent students who can fail on half of the course period, such
as 3rd or 4th week on the quarter course. Considering the early timing for interventions with past related
literature, such as the author from [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], the model can also invent at-risk students around half of
the course.
      </p>
    </sec>
    <sec id="sec-13">
      <title>6. Conclusion &amp; Future Works</title>
      <p>This study employed the multilayer perceptron neural network to classify students' performance as
atrisk and no-risk students for support instructors for early interventions. The experimental data were
collected from the two different onsite and online lectures in different years. The model for identifying
at-risk and no-risk students is based on the clickstream event on each learning material and transformed
into a two-dimensional matrix. The features are generated by using the operation of students' reading
behavior. We then contrast the model's accuracy with the model considering on weekly information
model. From the result, a weekly w/ content model can give accuracy similar to a weekly w/o model.
A weekly w/ content model model is a model that considers students' reading behavior depending on
the content that is independents of time. However, a weekly w/ content switch made an accuracy drop
since students may not focus on each content equivalent. Whereas the weekly switch information on
the weekly w/o contents model can give a better accuracy since this model focuses on the total reading
behavior each week.</p>
      <p>Next, we try to reduce the size of the matrix by using the random forest to investigate the feature
importance. We found that both types of lectures' high score operation features were the same as a
result. These features are the most necessary action for reading learning materials on the system. Finally,
we found the behavior of at-risk students gave little attention to studying and reviewing the learning
materials on features on the matrix that can inverstigate on the heatmap after half of period so the
teachers should invent to protect these students from failing before half of the course period.</p>
      <p>
        In the future task, we plan to explore how to find an attention weight of each content on the same
type of lecture due to instructors' changing lecture style and students' reading behavior. The purpose is
to improve the weekly w/ contest model because this can still perform well and explain almost all
students' reading behavior well. Then, we want to optimize the model to have higher accuracy by using
other deep learning models, such as the attention model [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], and extract new features from students'
reading behavior.
      </p>
    </sec>
    <sec id="sec-14">
      <title>Acknowledgements</title>
      <p>This work was supported by JST SPRING Grant Number JPMJSP2136, JST AIP Grant Number
JPMJCR19U1, and JSPS KAKENHI Grand Number JP18H04125, Japan.</p>
    </sec>
    <sec id="sec-15">
      <title>Appendix A. Features related to reading behavior</title>
      <p>The following table presents the variables describing the reading operation collected from the students'
reading behavior on the e-book system. The description is referred to the dataset received from LAK22
Data Challenge.</p>
      <sec id="sec-15-1">
        <title>Feature F1 F2 F3</title>
        <p>F4
F5
F6
F7
F8
F9
F10
F11
F12
F13
F14
F15</p>
      </sec>
      <sec id="sec-15-2">
        <title>Operation Name</title>
      </sec>
      <sec id="sec-15-3">
        <title>OPEN</title>
      </sec>
      <sec id="sec-15-4">
        <title>CLOSE</title>
      </sec>
      <sec id="sec-15-5">
        <title>NEXT</title>
      </sec>
      <sec id="sec-15-6">
        <title>PREV</title>
      </sec>
      <sec id="sec-15-7">
        <title>PAGE_JUMP</title>
      </sec>
      <sec id="sec-15-8">
        <title>ADD BOOKMARK</title>
      </sec>
      <sec id="sec-15-9">
        <title>BOOKMARK_JUMP</title>
      </sec>
      <sec id="sec-15-10">
        <title>ADD MARKER</title>
      </sec>
      <sec id="sec-15-11">
        <title>ADD MEMO</title>
      </sec>
      <sec id="sec-15-12">
        <title>CHANGE MEMO</title>
      </sec>
      <sec id="sec-15-13">
        <title>DELETE BOOKMARK</title>
      </sec>
      <sec id="sec-15-14">
        <title>DELETE MARKER</title>
      </sec>
      <sec id="sec-15-15">
        <title>DELETE_MEMO</title>
      </sec>
      <sec id="sec-15-16">
        <title>SEARCH</title>
      </sec>
      <sec id="sec-15-17">
        <title>SEARCH_JUMP</title>
      </sec>
      <sec id="sec-15-18">
        <title>GETIT</title>
      </sec>
      <sec id="sec-15-19">
        <title>NOTGETIT</title>
      </sec>
      <sec id="sec-15-20">
        <title>TOTAL_ACTION</title>
      </sec>
      <sec id="sec-15-21">
        <title>READING_TIME_MIN</title>
      </sec>
      <sec id="sec-15-22">
        <title>Description</title>
      </sec>
      <sec id="sec-15-23">
        <title>The amount of opened the e-book</title>
      </sec>
      <sec id="sec-15-24">
        <title>The amount of closed the e-book</title>
      </sec>
      <sec id="sec-15-25">
        <title>The amount of went to next page</title>
      </sec>
      <sec id="sec-15-26">
        <title>The amount of backed to previous page</title>
      </sec>
      <sec id="sec-15-27">
        <title>The amount of jumped to a particular page</title>
      </sec>
      <sec id="sec-15-28">
        <title>The amount of added a bookmark</title>
      </sec>
      <sec id="sec-15-29">
        <title>The amount of jumped to a particular page from the bookmark</title>
      </sec>
      <sec id="sec-15-30">
        <title>The amount of added a marker</title>
      </sec>
      <sec id="sec-15-31">
        <title>The amount of added a memo</title>
      </sec>
      <sec id="sec-15-32">
        <title>The amount of edited an existing memo</title>
      </sec>
      <sec id="sec-15-33">
        <title>The amount of deleted a bookmark</title>
      </sec>
      <sec id="sec-15-34">
        <title>The amount of deleted a marker</title>
      </sec>
      <sec id="sec-15-35">
        <title>The amount of deleted a memo</title>
      </sec>
      <sec id="sec-15-36">
        <title>The amount of searched for something within the e-book</title>
      </sec>
      <sec id="sec-15-37">
        <title>The amount of jumped to a particular page from the search results</title>
      </sec>
      <sec id="sec-15-38">
        <title>The amount of clicked the "GETIT" button in the e-book</title>
      </sec>
      <sec id="sec-15-39">
        <title>The amount of clicked the "NOTGETIT" button in the e-book</title>
      </sec>
      <sec id="sec-15-40">
        <title>The total of operation on the e-book</title>
      </sec>
      <sec id="sec-15-41">
        <title>The reading time of each lecture materials (minutes)</title>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Baradwaj</surname>
            ,
            <given-names>B. K.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Pal</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>Mining educational data to analyze students' performance</article-title>
          .
          <source>arXiv preprint arXiv:1201</source>
          .
          <fpage>3417</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Marbouti</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Diefes-Dux</surname>
            ,
            <given-names>H.A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Madhavan</surname>
            ,
            <given-names>K.P.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Models for early prediction of at-risk students in a course using standards-based grading</article-title>
          .
          <source>Comput. Educ.</source>
          ,
          <volume>103</volume>
          ,
          <fpage>1</fpage>
          -
          <lpage>15</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Albreiki</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zaki</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Alashwal</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          (
          <year>2021</year>
          ).
          <article-title>A Systematic Literature Review of Student' Performance Prediction Using Machine Learning Techniques</article-title>
          .
          <source>Education Sciences</source>
          ,
          <volume>11</volume>
          (
          <issue>9</issue>
          ), 552. doi:
          <volume>10</volume>
          .3390/educsci11090552
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>González</surname>
            ,
            <given-names>M.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruíz</surname>
            ,
            <given-names>M.D.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Ortin</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          (
          <year>2021</year>
          ).
          <article-title>Massive LMS log data analysis for the early prediction of course-agnostic student performance</article-title>
          .
          <source>Comput. Educ.</source>
          ,
          <volume>163</volume>
          ,
          <fpage>104108</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Okubo</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yamashita</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shimada</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Konomi</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Students' performance prediction using data of multiple courses by recurrent neural network</article-title>
          . In A. F. Mohd
          <string-name>
            <surname>Ayub</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Mitrovic</surname>
          </string-name>
          ,
          <string-name>
            <surname>J-C. Yang</surname>
            ,
            <given-names>S. L.</given-names>
          </string-name>
          <string-name>
            <surname>Wong</surname>
          </string-name>
          , &amp; W. Chen (Eds.),
          <source>Proceedings of the 25th International Conference on Computers in Education, ICCE 2017 - Main Conference Proceedings</source>
          (pp.
          <fpage>439</fpage>
          -
          <lpage>444</lpage>
          ).
          <source>(Proceedings of the 25th International Conference on Computers in Education, ICCE 2017 - Main Conference Proceedings). Asia-Pacific Society for Computers in Education.</source>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Okubo</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yamashita</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shimada</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taniguchi</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Shin</surname>
            'ichi,
            <given-names>K.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>On the prediction of students' quiz score by recurrent neural network</article-title>
          .
          <source>CEUR Workshop Proceedings</source>
          ,
          <volume>2163</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Murata</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Minematsu</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shimada</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2021</year>
          ).
          <article-title>Early Detection of At-risk Students based on Knowledge Distillation RNN Models</article-title>
          .
          <source>Educational Data Mining</source>
          <year>2021</year>
          (EDM
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>López-Zambrano</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lara</surname>
            ,
            <given-names>J. A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Romero</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (
          <year>2020</year>
          ).
          <article-title>Towards Portability of Models for Predicting Students' Final Performance in University Courses Starting from Moodle Logs</article-title>
          .
          <source>Applied Sciences</source>
          ,
          <volume>10</volume>
          (
          <issue>1</issue>
          ), 354. doi:
          <volume>10</volume>
          .3390/app10010354
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Flanagan</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ogata</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Integration of Learning Analytics Research and Production Systems While Protecting Privacy</article-title>
          .
          <source>In International Conference on Computers in Education (ICCE2017)</source>
          (pp.
          <fpage>333</fpage>
          -
          <lpage>338</lpage>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Ogata</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mohri</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Okubo</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shimada</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yamada</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Hirokawa</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Learning analytics for e-book-based educational big data in higher education</article-title>
          .
          <source>In Smart Sensors at the IoT Frontier</source>
          (pp.
          <fpage>327</fpage>
          -
          <lpage>350</lpage>
          ). Springer, Cham.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Murata</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Minematsu</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shimada</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2020</year>
          ).
          <article-title>OpenLA: Library for Efficient E-book Log Analysis and Accelerating Learning Analytics</article-title>
          . In International Conference on Computer in Education (ICCE
          <year>2020</year>
          ),
          <fpage>301</fpage>
          -
          <lpage>306</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Daniusis</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vaitkus</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>Neural Network with Matrix Inputs</article-title>
          . Informatica, Lith. Acad. Sci..
          <volume>19</volume>
          .
          <fpage>477</fpage>
          -
          <lpage>486</lpage>
          .
          <fpage>10</fpage>
          .15388/Informatica.
          <year>2008</year>
          .
          <volume>225</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Dalal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Imtiaz</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ebrahim</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2021</year>
          ).
          <article-title>Deep learning based network traffic matrix prediction</article-title>
          .
          <source>International Journal of Intelligent Networks</source>
          ,
          <volume>2</volume>
          ,
          <fpage>46</fpage>
          -
          <lpage>56</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Altaf</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soomro</surname>
            ,
            <given-names>W.J.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Rawi</surname>
            ,
            <given-names>M.I.</given-names>
          </string-name>
          (
          <year>2019</year>
          ).
          <article-title>Student Performance Prediction using Multi-Layers Artificial Neural Networks: A Case Study on Educational Data Mining</article-title>
          .
          <source>Proceedings of the 2019 3rd International Conference on Information System and Data Mining.</source>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Vaswani</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shazeer</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parmar</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Uszkoreit</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jones</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gomez</surname>
            ,
            <given-names>A. N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaiser</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Polosukhin</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Attention is all you need</article-title>
          .
          <source>In Advances in neural information processing systems</source>
          (pp.
          <fpage>5998</fpage>
          -
          <lpage>6008</lpage>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Conijn</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Snijders</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kleingeld</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Matzat</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Predicting Student Performance from LMS Data: A Comparison of 17 Blended Courses Using Moodle LMS</article-title>
          .
          <source>IEEE Transactions on Learning Technologies</source>
          ,
          <volume>10</volume>
          ,
          <fpage>17</fpage>
          -
          <lpage>29</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>