<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Automated Matching of ITS Problems with Textbook Content</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Je rey Matayoshi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Christopher Lechuga</string-name>
          <email>christopher.lechugag@aleks.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>McGraw Hill ALEKS</institution>
          ,
          <addr-line>Irvine, CA</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of California</institution>
          ,
          <addr-line>Irvine, Irvine, CA</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>As intelligent tutoring systems (ITSs) continue to grow in popularity, many classrooms employ a blended learning model that combines these systems with more traditional resources, such as static (i.e., non-intelligent) textbooks. Typically in such cases, a large amount of manual work is required to match the textbook content with the content covered by the ITS. While resource intensive, this matching is important as it allows the ITS and textbook to work together in a coherent and e ective manner. Furthermore, matching the content in such a way lets the textbook take advantage of the adaptivity and sophistication of the ITS; in e ect, this infuses the textbook with some measure of intelligence and adaptivity of its own. Given the importance of this matching process, in this work we investigate ways in which this work ow can be automated. To that end, we leverage natural language processing and machine learning techniques to build classi cation models for matching the textbook and ITS content. We then evaluate the potential performance of these models both as a fully automated system, as well as being used in combination with a human expert.</p>
      </abstract>
      <kwd-group>
        <kwd>Intelligent tutoring system Adaptive content Textbook Natural language processing Semi-supervised learning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>With the growth of intelligent tutoring systems (ITSs) and other digital
learning materials, many classrooms operate under a blended learning model. In these
classrooms, adaptive online resources can be used in combination with more
traditional materials, such as static textbooks. While nding ways in which these
various resources can work together is important, in many instances this work is
labor and resource intensive. One such example occurs in the ALEKS adaptive
learning system (www.aleks.com), in which instructors are given the option of
integrating a traditional textbook into the online ALEKS course. The goal of
this integration is to give a matching of the ALEKS topics with the content
contained in the textbook; this matching allows the instructor to tailor the course to
Copyright c 2020 for this paper by its authors. Use permitted under Creative
Commons License Attribution 4.0 International (CC BY 4.0).
coincide with the content in the book, hopefully leading to a more seamless and
consistent learning experience for the student. Additionally, another important
bene t of this approach is that it allows a traditional, static textbook to take
advantage of the adaptive and intelligent components of the ITS. For example,
the ALEKS system keeps track of the topics that a student knows or doesn't
know, as well as the topics that the student is most ready to learn. Using the
matchings of these topics to the textbook content, an instructor can then see
what parts of the textbook the student has already mastered, along with the
parts in which there are gaps in knowledge. In essence, by matching the
textbook content with the ITS topics, the course sequencing and adaptivity of the
system can be used to guide the student through a more personalized path in
the textbook, thereby endowing a basic textbook with several of the advantages
of a more sophisticated intelligent textbook.</p>
      <p>While there are clear bene ts to this procedure, the matching of the ITS
topics with the textbook content is a laborious task performed by experts who,
in addition to needing to be well-acquainted with the subject matter of the
textbook, must also be familiar with the speci c characteristics and peculiarities
of the topics. Furthermore, since the matching must be repeated for each unique
textbook and course pairing, this process requires a large amount of manual
work. Thus, the goal of this study is to investigate techniques by which this
matching of textbook content and ITS topics can be automated.</p>
      <p>
        The problem discussed in this manuscript has many similarities to works such
as [
        <xref ref-type="bibr" rid="ref10 ref13 ref9">9,10,13</xref>
        ], which are concerned with the automated extraction of knowledge
components; in the case of [
        <xref ref-type="bibr" rid="ref10 ref9">9,10</xref>
        ], this extraction is performed on problems from
ASSISTments, while in [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] the knowledge components are derived from the
content of an online textbook. Other previous studies handling similar problems
include work such as [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], where an approach is outlined for identifying the core
concepts in a textbook section (as opposed to prerequisite concepts which are
not the main focus of the section); [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], which attempts to nd similar pairs of
exercises from within a large set of problems; and [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], which links similar
content across textbooks. Our speci c approach to solving the problem at hand has
perhaps the most overlap with the techniques used in [
        <xref ref-type="bibr" rid="ref10 ref9">9,10</xref>
        ]. That is, we leverage
and apply several techniques from machine learning and natural language
processing to build classi cation models. Additionally, similar to the approach in
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], we also take advantage of the available unlabeled data by experimenting with
semi-supervised versions of these models. Once we have developed our models,
we then apply them to our speci c problem of matching the ITS topics with the
appropriate textbook content. While the ultimate goal is for the trained
classi er to operate in a completely automated fashion without any human expert
oversight, we also evaluate the possible bene ts of using a hybrid approach; in
the latter case, a human expert is still a necessary component of the process,
but the hope is that the classi er can assist the expert and reduce the amount
of manual work required.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Background and Experimental Setup</title>
      <p>A course in ALEKS consists of a set of topics that covers a domain of
knowledge. A topic is a problem type that covers a discrete unit of knowledge within
this course. Each topic is composed of many example problems that are
carefully chosen to be equal in di culty and cover the same content. An example
problem from an ALEKS topic titled \Solving a rational equation that simpli es
to linear: Denominator x" is shown in Figure 1. This is one of the rst topics
covering rational equations that a student encounters in an ALEKS course. For
comparison, in Figure 2 an example problem from a more advanced rational
equation topic is shown. While this latter topic is more di cult, both topics are
typically found in the same textbook chapter and section.</p>
      <p>The most commonly used courses cover content from math and chemistry,
with additional courses available for other subject areas such as statistics and
accounting. A course might contain over a thousand topics in total, from which
an instructor is free to choose a speci c subset of topics; while in rare cases the
instructor chooses to use all the available topics, a more typical subset consists
of a few hundred. The instructor also has the option of integrating the content
of a textbook with this set of topics. When this is done the topics are matched
to chapters (and possibly sections) within the textbook. In order to obtain this
matching, subject matter experts familiarize themselves with the textbook, and
they then use this knowledge to determine the area of the textbook that best
matches each topic. This procedure is completed for every available topic in the
course, which allows a textbook to be matched with any possible subset of topics
that an instructor chooses to use.</p>
      <p>The goal of our study is to evaluate an attempt to automate this matching
of ITS topics with textbook content. In doing so, we treat this as a classi cation
problem, in which the chapters or sections of the textbook are the class labels.
As this automated procedure is designed to emulate the process used by a human
expert, our training data are extracted from the textbooks in the following way.
Each section in the textbook generates one data point in our training data set;
the features (as discussed in more detail shortly) are extracted from the textbook
content in that section, while the ground truth label is either the chapter number
or the section number. The unlabeled data set then consists of the ALEKS topics.
After the model is trained, it is used to assign labels to these topics, and we can
then compare these labels to those assigned manually by human experts.</p>
      <p>For our experiments we use textbooks from three di erent college level math
courses that are common in the United States: beginning algebra, intermediate
algebra, and college algebra. For each of these textbooks, we rely on the matching
of textbook content and topics that are currently used in the ALEKS system.
Each of these matchings has been created by a single human expert using the
following procedure. The expert rst familiarizes themselves with each individual
topic by reading the question and explanation, and also by looking at several of
the example problems contained in the topic. The expert then browses through
the book and matches the topic to a speci c chapter (and, in some cases, a
speci c section). From each course we choose one or two textbooks to use as
our validation set; the books in this validation set are then used to perform all
of our feature engineering and model selection. Once we are satis ed with the
features, models, and hyperparameters, we then apply these to one additional
book from each math course as our test evaluation. Note that this evaluation on
the test textbooks is di erent from the conventional machine learning work ow,
in that we are not applying previously trained models to the books in our test set.
Rather, we build new models on each of the books used for our test evaluation,
using the best choice of features and hyperparameters from the validation books,
and we then apply these models to classify the topics in the course. The reason
we use this procedure is that it exactly follows the work ow that would be used
in an implementation of the model within the ALEKS system. That is, in such
Fig. 2. Screen capture of the question and explanation for an ALEKS topic titled
\Solving a rational equation that simpli es to quadratic: Proportional form, advanced."
In comparison to the topic shown in Figure 1, this is an advanced problem that requires
the student to solve a complicated rational equation. However, both topics are typically
found in the same textbook chapter and section.
an implementation there is nothing that prevents the training of a new model for
each textbook; as such, using this testing procedure gives a realistic evaluation of
the performance of the model. Additionally, a key characteristic of this procedure
is that the features of the topics are available to the model during training.</p>
      <p>
        Each of the textbooks is in a digital format, with each textbook section
contained in a separate HTML le. Similarly, each ALEKS topic also has an
associated HTML le, which contains the text of the question, the title of the
topic, and an explanation which contains a worked out solution to the problem.
As we rely heavily on techniques from natural language processing (NLP) to
build our machine learning models, we process the HTML les using the following
procedure. We rst remove the HTML tags and extract the content from each of
the les using the Beautiful Soup [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] Python library. Next, we preprocess the
raw text by using the the Gensim [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] Python library to perform several standard
NLP operations, such as removing stop words, punctuation, and numerals, and
stemming each word to its root form.
      </p>
      <p>Once we have processed the text, we build our set of features using an n-gram
model. When building the n-gram model, we restrict the vocabulary to a
predetermined set of words that have speci c mathematical meanings. For example,
words such as add, degree, and triangle are kept, while other less informative
words (for our purposes) are removed. This procedure reduces the size of our
set of features, and on our validation set it also gave a slight boost in
performance. From this vocabulary we then extract our n-grams, experimenting with
various sequence sizes on our validation set of books. In all all cases we apply
term frequency-inverse document frequency (tf-idf) transformations before
feeding the features to the machine learning models. When building the vocabulary
and features for each textbook model, we also include the text from the
questions, titles, and explanations of the topics; as we mentioned previously, in an
actual application of this model, such information would be available during the
model building process.</p>
      <p>
        The textbooks we consider in our study typically have somewhere between 50
and 80 di erent sections, and each of these sections generates one (and only one)
entry in our labeled set of training data. Thus, because of the limited amount of
labeled data, we also experiment with semi-supervised learning models, in which
the topics are used to generate the unlabeled portion of the data set.
Semisupervised learning models lie between supervised and unsupervised learning,
and such models are unique in that they are able to take advantage of both
labeled and unlabeled data [
        <xref ref-type="bibr" rid="ref1 ref16">1,16</xref>
        ]. This can be very useful in situations, such
as ours, where assigning accurate labels to data takes a considerable amount
of manual (human) e ort. In such a case, adding the extra unlabeled data can
possibly give a large increase in the accuracy of the model [
        <xref ref-type="bibr" rid="ref7 ref8">7,8</xref>
        ].
      </p>
      <p>There are two main goals of semi-supervised learning. The goal of inductive
semi-supervised learning is to learn a predictor function that can be applied to
unseen data that is not contained in the training set; this is closely aligned with
the motivations of standard supervised learning algorithms. On the other hand,
transductive semi-supervised learning aims to learn a predictor function that
can be applied to the unlabeled data in the training set ; by this description,
transductive learning has overlap with clustering and other unsupervised
learning techniques that are not necessarily concerned with generalizing to future
data. When training our models, we treat the problem of classifying the topics
as one of transductive learning, where we use our small set of labeled data (i.e.,
the textbook content) to build a classi er that can help us assign labels to the
topics.</p>
      <p>
        Finally, we should also mention that, in the spirit of previous works such
as [
        <xref ref-type="bibr" rid="ref14 ref6">6,14</xref>
        ], we experimented with using similarity metrics to nd the textbook
content that best matches each ITS topic. For example, in one attempt we used
the doc2vec [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] model to generate vector space embeddings for each section of a
textbook, as well as for each topic. We then used the cosine similarity measure
to nd the best match between the topics and textbook content. However, the
results were less accurate in comparison to the classi cation models.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Experimental Results</title>
      <p>
        Most likely due to the small amount of training data available for each textbook,
the best performance on our validation set came from the simplest models we
tried, namely, nave Bayes and logistic regression classi ers. In comparison, more
advanced model architectures, such as neural networks and random forests, were
not as e ective. Thus, on our test textbooks we restrict our evaluations to the
nave Bayes and logistic regression models; additionally, we also include the
semi-supervised version of nave Bayes described in Section 5.3.1 of [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ],1 as it
also performed well on our validation set. For our n-gram models we use both
unigrams (n = 1) and bigrams (n = 2), the combination of which had the
strongest performance on our validation set. Adding larger values of n gave
slightly worse performance; as before, over tting on the small amount of training
data seems to be the likely reason for this drop in e ectiveness.
      </p>
      <p>To evaluate the performance of our classi ers, we use the probability
estimates from the models to extract the most likely chapter match for each topic,
and we then compare these chapter matchings to those from a human expert. To
nd the most likely chapter, as determined by the model, we use the following
procedure. Each textbook section is assigned a unique label in the training set.
We apply the models trained on these labels to the ITS topics and nd the
textbook section with the highest probability estimate; this section is then mapped
to its corresponding chapter to get the most likely chapter match for the topic.
Somewhat counterintuitively, training the classi ers with the section labels, as
opposed to using the chapter labels, gave stronger performance on our validation
textbooks; thus, for this reason we use this (indirect) approach to nd the best
chapter match for each topic.</p>
      <p>As a rst evaluation, we look at how well the model classi cations agree
with the expert classi cations. In Table 1 we report Cohen's kappa and
accu1 A Python module implementing this model is available at https://github.com/
jmatayoshi/semi-supervised-naive-bayes.
racy statistics for each of the books in our test set. As shown there, the nave
Bayes classi ers outperform the logistic regression models on all three books.
Additionally, while the di erence varies slightly across the textbooks, the
semisupervised nave Bayes model consistently outperforms the fully supervised
version; the stronger performance of the semi-supervised model is consistent with
what we observed on our validation textbooks.</p>
      <p>Overall, the results in Table 1 are encouraging, as the Cohen's kappa scores
for the semi-supervised nave Bayes models range from 0.722 to 0.791. To get a
relative measure of the performance of the classi ers, one of the authors, who
wasn't involved in the original matching of the topics, performed a new
matching on a random sample of 50 topics from the intermediate algebra textbook
(unfortunately, resources were not available to do a more comprehensive
comparison). For these 50 topics, 47 labels from this new matching are equal to the
original chapter labels, resulting in a Cohen's kappa score of 0.930. Thus, while
this comparison is only on a sample of 50 topics (out of the complete set of 802
in the intermediate algebra course), it is evidence that the classi er models are
currently not quite as accurate as human experts.</p>
      <p>Our next two evaluations analyze the hypothetical impact of the models if
they were to be used in combination with a human expert. That is, rather than
fully relying on the models to make all of the classi cations, in these evaluations
we assume the models are assisting a human expert. We envision a couple of
di erent scenarios in which this could be done. Our rst approach assumes that,
for a given textbook, the classi er would be used to determine the matchings
for which it is most con dent; the human expert could then focus their energy
on the classi cations that are more di cult for the model to determine. To
evaluate this procedure, we want a measure of how often the most con dent
classi cations from the model agree with the actual chapter classi cations. To
that end, we use the following procedure. For a given textbook, we rst nd the
probability of the most likely class for each topic, using the estimates from the
semi-supervised nave Bayes model; based on these probabilities, the topics are
then put in descending order. Next, for each positive integer n, where n ranges
from 1 to the number of topics in the course, we compute the Cohen's kappa
score for the rst n topics in the ordered sequence (i.e., the n topics for which
the classi er is most con dent). For example, employing this procedure with a
value of n = 100, we are computing Cohen's kappa for the 100 topics with the
highest probability estimates.</p>
      <p>1.0
0.8
a
p
ap0.6
k
s
'
n
e
oh0.4
C
0.2
0.0</p>
      <p>Beginning algebra
Intermediate algebra</p>
      <p>College algebra
0.0
0.2</p>
      <p>0.4 0.6
Proportion of topics
0.8
1.0
Fig. 3. Plots of the Cohen's kappa scores for the three test textbooks. For each
textbook, the topics are put in descending order based on the probability estimates from
the semi-supervised nave Bayes model, and Cohen's kappa is then computed at each
point in the sequence. For all of the test textbooks, the plot shows that using the
highest 50% of the probabilities returns a Cohen's kappa value of at least 0.9.</p>
      <p>The results from this procedure are shown in Figure 3, where we can see the
evolution of the Cohen's kappa scores. For example, using the 20% of the topics
in each course for which the classi er is most con dent (i.e., 0.2 on the x-axis),
the Cohen's kappa values are all at 0.95 or above; then, using the 50% of the
topics in each course with the highest probability estimates, the Cohen's kappa
values are all at 0.9 or above. Based on these results, one seemingly reasonable
scenario would be to let the classi er perform the matchings for, say, the 50%
of the topics for which it is most con dent, and the human expert then handles
the rest.</p>
      <p>Our next evaluation assumes that the human expert is involved throughout
the entire process, but that they are being guided by the predictions of the model.
In this scenario, we envision the expert using the recommendations made by the
model to make the matching process more e cient. Speci cally, for each topic
we assume that the expert is given a list of the chapters in descending order
based on the model probability estimates. The job of the expert is to then start
with the chapter with the highest probability and check if it indeed matches
the topic; if it doesn't, the expert moves on and checks the chapter with the
next highest probability, and so on. Thus, in this scenario the model would be
useful if, in most cases, the expert would only need to check a small number of
chapters.</p>
      <p>To that end, in Table 2 we show how often the human expert classi cation is
contained within the n most likely chapters, according to the class probabilities
from the semi-supervised nave Bayes models. That is, for n = 1 we simply
show how often the chapter from the human expert agrees with the chapter that
the classi er gives the highest probability estimate. Next, for n = 2, we nd the
section with the highest probability that is from a di erent chapter, and we then
report how often the human expert chapter label agrees with either of these two
chapters. The process then continues for the larger n values. From the results in
Table 2 we can see that, the vast majority of the time, the human expert label
is contained within the three or four most likely chapters.</p>
      <p>Beginning Intermediate College</p>
      <p>algebra algebra algebra
In this work we built and evaluated models for automatically associating topics
in an ITS system to the most appropriate content from a textbook. We did this
by leveraging NLP techniques and machine learning classi ers, and we analyzed
the results from both supervised and semi-supervised models. When attempting
to match the topics with the appropriate textbook chapters, the results from
applying the models showed fairly strong agreement with the human expert
decisions, as the Cohen's kappa values ranged from roughly 0.72 to 0.79.</p>
      <p>While the overall performance of the machine learning models is encouraging,
the argument could be made that the current versions are not quite accurate
enough for a fully automated implementation. However, the results from our
analyses seemed to indicate that they could be useful in a hybrid application.
For example, in all three of our test textbooks, using the 50% of topics from
each course with the most con dent predictions (i.e., the highest probability
estimates) returns Cohen's kappa scores of 0.9 or above. Thus, based on this
strong agreement with the human expert labels, a possible procedure would be
to use the matchings for which the classi er is most con dent, while the human
expert then concentrates on matching the remaining topics.</p>
      <p>Although this current work has focused on the matching of the topics with
the textbook chapters, being able to identify a speci c section in the book, while
more challenging, would also be useful. One complication with matching topics
to speci c sections is that the problem becomes slightly more ambiguous; that
is, while it is more-or-less straightforward for the human expert to identify a
chapter that is a good t for the topic, this is not necessarily the case with the
sections. There are many examples where a topic might seem to t equally well
in two or three di erent sections; at the other extreme, it may also be possible
that no section appears to match the topic.</p>
      <p>Another challenging application would be to match the ALEKS topics with
the various mathematics education standards that are used for K-12 education
in the United States. As with the matching of textbook content, matching the
ITS topics to these education standards would enable instructors to leverage the
information in the ITS. For example, based on the topics the student knows,
the instructor could see what education standards they have mastered and what
they still need to learn. However, similar to the matching of topics to textbook
sections, this problem also introduces extra complexities, as these education
standards can be very speci c and narrow in scope.</p>
      <p>
        Fortunately, there are additional modi cations to the models that could
lead to large performance gains and make the aforementioned problems more
tractable. First, the current models are entirely text-based; among other things,
mathematical expressions are completely ignored by the models. Thus, a natural
next step would be to incorporate equations and expressions into the features
of the model, which in many cases could allow for very speci c information into
the type of material that is being covered. Recent approaches for extracting
information from mathematical formulae, such as those used in [
        <xref ref-type="bibr" rid="ref15 ref3">3,15</xref>
        ], are highly
relevant and worth investigating in the context of our current problem.
      </p>
      <p>Second, the ALEKS system contains detailed data on the relationships
between the topics in a course. For example, the system has its own clustering of
topics into related groups, and it seems reasonable that topics within the same
group are more likely to appear in the same area of a textbook (or in related
education standards). Additionally, the system has information on the prerequisite
relationships between topics. Since it is unlikely that a particular topic would
appear after another topic that it is a prerequisite for, this information could be
useful in re ning the matchings and improving their accuracy.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Chapelle</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          , Scholkopf,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Zien</surname>
          </string-name>
          ,
          <string-name>
            <surname>A</surname>
          </string-name>
          . (eds.):
          <article-title>Semi-Supervised Learning</article-title>
          . MIT Press (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Labutov</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brusilovsky</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Semi-supervised techniques for mining learning outcomes and prerequisites</article-title>
          .
          <source>In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</source>
          . pp.
          <volume>907</volume>
          {
          <issue>915</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Lan</surname>
            ,
            <given-names>A.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vats</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Waters</surname>
            ,
            <given-names>A.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baraniuk</surname>
          </string-name>
          , R.G.:
          <article-title>Mathematical language processing: Automatic grading and feedback for open response mathematical questions</article-title>
          .
          <source>In: Proceedings of the Second</source>
          (
          <year>2015</year>
          ) ACM Conference on Learning @ Scale. pp.
          <volume>167</volume>
          {
          <issue>176</issue>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Le</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Distributed representations of sentences and documents</article-title>
          .
          <source>In: International Conference on Machine Learning</source>
          . pp.
          <volume>1188</volume>
          {
          <issue>1196</issue>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Su</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Finding similar exercises in online education systems</article-title>
          .
          <source>In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery &amp; Data Mining</source>
          . pp.
          <year>1821</year>
          {
          <year>1830</year>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Meng</surname>
            , R., Han,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brusilovsky</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Knowledge-based content linking for online textbooks</article-title>
          .
          <source>In: 2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)</source>
          . pp.
          <volume>18</volume>
          {
          <fpage>25</fpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7. Mitchell, T.:
          <article-title>The role of unlabeled data in supervised learning</article-title>
          .
          <source>In: Proceedings of the Sixth International Colloquium on Cognitive Science</source>
          . pp.
          <volume>103</volume>
          {
          <issue>111</issue>
          (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Nigam</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McCallum</surname>
            ,
            <given-names>A.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thrun</surname>
          </string-name>
          , S., Mitchell, T.:
          <article-title>Text classi cation from labeled and unlabeled documents using EM</article-title>
          .
          <source>Machine Learning</source>
          <volume>39</volume>
          (
          <issue>2-3</issue>
          ),
          <volume>103</volume>
          {
          <fpage>134</fpage>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Pardos</surname>
            ,
            <given-names>Z.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dadu</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Imputing KCs with representations of problem content and context</article-title>
          .
          <source>In: Proceedings of the 25th Conference on User Modeling, Adaptation and Personalization</source>
          . pp.
          <volume>148</volume>
          {
          <issue>155</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Patikorn</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deisadze</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grande</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            <given-names>ernan</given-names>
          </string-name>
          , N.:
          <article-title>Generalizability of methods for imputing mathematical skills needed to solve problems from texts</article-title>
          .
          <source>In: International Conference on Arti cial Intelligence in Education</source>
          . pp.
          <volume>396</volume>
          {
          <fpage>405</fpage>
          . Springer (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Rehurek</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sojka</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Software Framework for Topic Modelling with Large Corpora</article-title>
          .
          <source>In: Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks</source>
          . pp.
          <volume>45</volume>
          {
          <fpage>50</fpage>
          . ELRA, Valletta, Malta (May
          <year>2010</year>
          ), http://is.muni.cz/ publication/884893/en
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Richardson</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Beautiful soup documentation</article-title>
          .
          <source>April</source>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Thaker</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brusilovsky</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Student modeling with automatic knowledge component extraction for adaptive textbooks</article-title>
          .
          <source>In: Proceedings of the First Workshop on Intelligent Textbooks, International Conference on Arti cial Intelligence in Education</source>
          . pp.
          <volume>95</volume>
          {
          <issue>102</issue>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Thaker</surname>
            ,
            <given-names>K.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brusilovsky</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Concept enhanced content representation for linking educational resources</article-title>
          .
          <source>In: 2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)</source>
          . pp.
          <volume>413</volume>
          {
          <fpage>420</fpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Youssef</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>B.R.</given-names>
          </string-name>
          :
          <article-title>Deep learning for math knowledge processing</article-title>
          .
          <source>In: International Conference on Intelligent Computer Mathematics</source>
          . pp.
          <volume>271</volume>
          {
          <fpage>286</fpage>
          . Springer (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goldberg</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Introduction to Semi-Supervised Learning</article-title>
          . Morgan &amp;
          <string-name>
            <surname>Claypool</surname>
          </string-name>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>