<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>PRET: Prerequisite-Enriched Terminology. A Case Study on Educational Texts</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Chiara Alzetta</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Forsina Koceva</string-name>
          <email>frosina.kocevag@edu.unige.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Samuele Passalacqua</string-name>
          <email>samuele.passalacqua@dibris.unige.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ilaria Torre</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giovanni Adorni</string-name>
          <email>adornig@unige.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>DIBRIS, University of Genoa</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>English. In this paper we present PRET, a gold dataset annotated for prerequisite relations between educational concepts extracted from a computer science textbook, and we describe the language and domain independent approach for the creation of the resource. Additionally, we have created an annotation tool to support, validate and analyze the annotation.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Educational Concept Maps (ECM) are acyclic
graphs which formally represent a domain’s
knowledge and make explicit the pedagogical
dependency relations between concepts
        <xref ref-type="bibr" rid="ref1 ref3">(Adorni and
Koceva, 2016)</xref>
        . A concept, in an ECM, is an
atomic piece of knowledge of the subject domain.
From a pedagogical point of view, the most
important dependency relation between concepts is
the prerequisite relation, that explicits which
concepts a student has to learn before moving to the
next. Several approaches have been proposed to
extract prerequisite relations from various
educational sources
        <xref ref-type="bibr" rid="ref14 ref17 ref18 ref2 ref22 ref24 ref25">(Vuong et al., 2011; Yang et al.,
2015; Gordon et al., 2016; Wang et al., 2016;
Liang et al., 2017; Liang et al., 2018; Adorni et
al., 2018)</xref>
        . Textbooks in particular are a valuable
resource for this task since they are designed to
support the learning process respecting the
prerequisite relation.
      </p>
      <p>
        In the literature, the evaluation of the extracted
prerequisite relations is usually performed through
comparison with a gold standard produced by
human subjects that annotate relations between
concepts (see, among the others,
        <xref ref-type="bibr" rid="ref10 ref16 ref21 ref23">(Talukdar and
Cohen, 2012; Liang et al., 2015; Fabbri et al., 2018)</xref>
        ).
However, most of the evaluations lack a systematic
approach or simply lack the details that allow them
to be repeated. In this paper, we present our
experience in building PRET (Prerequisite-Enriched
Terminology), a gold dataset annotated with the
prerequisite relation between pairs of concepts.
The issues emerged with PRET led us to define
a methodology and a tool for manual prerequisite
annotation. The goal of the tool is to support the
creation of gold datasets for validating automatic
extraction of prerequisites. Both the PRET dataset
and the tool are available online1.
      </p>
      <p>PRET was constructed in two main steps: first
we exploited computational linguistics methods
to extract relevant terms from a textbook2, then
we asked humans to manually identify and
annotate the prerequisite relations between educational
concepts. Since the terminology creation step was
extensively described in Adorni et al. (2018), this
paper mainly focuses on the annotation phase.</p>
      <p>
        The annotation task consists in making explicit
the prerequisite relations between two distinct
concepts if the relation is somehow inferable from
the text in question. We represent a concept as a
domain–specific term denoting domain entities
expressed by either single nominal terms (e.g.
internet, network, software) or complex nominal
structures with modifiers (e.g. malicious software,
trojan horse, HyperText Document). Figure 1 shows
1http://teldh.dibris.unige.it/pret
2For the annotation we used chapter 4 of the computer
science textbook “Computer Science: An Overview”
        <xref ref-type="bibr" rid="ref12 ref16 ref25 ref6">(Brookshear and Brylow, 2015)</xref>
        .
a sample of the ECM resulting from PRET.
According to PRET dataset, an example of
prerequisite relation is network is a prerequisite of internet,
since a student has to know network before
learning internet.
      </p>
      <p>The paper is organized as follows. The
related work pertaining to the proposed method is
discussed in Section 2. Section 3 describes the
methodology used for the creation of the PRET
dataset and Section 4 presents the characteristics
of the obtained gold dataset and the agreement
computed for each pair of annotators together with
other statistics about the data. Section 5 describes
the main features of the annotation tool we
designed. Section 6 concludes the paper.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Automatic prerequisite identification is a task that
gained growing interest in recent years, especially
among scholars interested in automatic synthesis
of study plans
        <xref ref-type="bibr" rid="ref12 ref25 ref3 ref4">(Gasparetti et al., 2015; Yang et al.,
2015; Agrawal et al., 2016; Alsaad et al., 2018)</xref>
        .
When applying automatic prerequisite extraction
methods, a baseline for evaluation is needed.
Despite being time consuming, creating manually
annotated datasets is more effective and produces
gold resources, which are still rare.
      </p>
      <p>To the best of our knowledge, Talukdar and
Cohen (2012) is the only case where crowd–sourcing
is employed for annotation: they infer
prerequisite relationship between concepts by exploiting
hyper-links in Wikipedia pages and use
crowdsourcing to validate those relations in order to have
a gold training dataset for a classifier.</p>
      <p>
        More frequently the annotation of prerequisite
relations is performed by domain experts
        <xref ref-type="bibr" rid="ref10 ref16 ref18 ref23">(Liang et
al., 2015; Liang et al., 2018; Fabbri et al., 2018)</xref>
        or
by students with a certain competence on the
doma
        <xref ref-type="bibr" rid="ref13">in (Wang et al., 2015</xref>
        ; Pan et al., 2017). When
annotation is performed by non–experts,
agreement usually results very low, so an expert can
be consulted
        <xref ref-type="bibr" rid="ref14 ref7">(Chaplot et al., 2016; Gordon et al.,
2016)</xref>
        . Regardless of the annotation methodology,
we observe that in the mentioned related works
prerequisite relation properties (i.e. irreflexivity,
anti–symmetry, etc.) are rarely taken into account
in the annotation instructions for annotators. For
example, the fact that a concept cannot be
annotated as prerequisite of itself is usually left
unspecified.
      </p>
      <p>To support the annotation of prerequisites
between pairs of concepts, Gordon et al. (2016)
developed an interface showing, for each concept of
the domain, the list of relevant terms and
documents. Although this can be of some support for
the annotation providing certain useful
information, it cannot be considered an annotation tool
itself. According to our knowledge, a tool
specifically designed for prerequisite structure
annotation which also features agreement metrics is still
missing.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Annotation Methodology</title>
      <p>In Section 4 we will describe the PRET dataset,
while here we present the annotation methodology
that we used to build PRET and that we refined on
the basis of such experience.</p>
      <p>
        Concept identification. Our methodology for
prerequisite annotation requires that concepts are
extracted from educational materials, that we
broadly define Document (D), and provided to
annotators. Although we are conscious that a
concept, as mental structure, might entail multiple
terms, we simplify the problem of concept
identification assuming that each relevant term of D
represents a concept
        <xref ref-type="bibr" rid="ref19">(Novak and Can˜as, 2006)</xref>
        .
Thus, our list of concepts is a terminology (T) of
domain–specific terms (either single or complex
nominal structures) ordered according to the first
appearance of the terms of T in D and where each
concept corresponds to a single term.
      </p>
      <p>
        For the task of prerequisite annotation, it does
not matter if concepts are extracted
automatically, manually or semi–automatically. To build
PRET, we extracted concepts automatically. To
identify our terminology T, we relied on
TextTo-Knowledge (T2K2)
        <xref ref-type="bibr" rid="ref9">(Dell’Orletta et al., 2014)</xref>
        ,
a software platform developed at the Institute
of Computational Linguistics A. Zampolli of the
CNR in Pisa. T2K2 exploits Natural Language
Processing, statistical text analysis and machine
learning to extract and organize the domain
knowledge from a linguistically annotated text.
      </p>
      <p>We applied T2K2 to a text of 20,378 tokens
distributed over 751 sentences. 185 terms were
recognized as concepts of the domain (around 20% of
the total number of nouns in the corpus). As
expected, the extracted terminology contained both
single nominal structures, such as computer,
network and software, and complex nominal
structures with modifiers, like hypertext transfer
protocol, world wide web and hypertext markup
language. The set of concepts did not go through any
post–processing phase.</p>
      <p>
        Annotators selection. The role of annotators is
fundamental in order to obtain a gold dataset that
represents the pedagogical relations expressed in
the educational material. Consequently, the choice
of annotators is crucial. As mentioned above, in
the literature annotators are often domain experts
        <xref ref-type="bibr" rid="ref10 ref16 ref18 ref23">(Liang et al., 2015; Liang et al., 2018; Fabbri
et al., 2018)</xref>
        or students with some knowledge
        <xref ref-type="bibr" rid="ref13">in
that domain (Wang et al., 2015</xref>
        ; Pan et al., 2017).
Based on our experience with different types of
annotators, we suggest that annotators should have
enough knowledge to understand the content of
the educational material. Otherwise, the
annotation can be distorted by wrong comprehension
of the relations between concepts. On the other
hand, experts should not rely on their background
knowledge to identify relations, since the goal of
the annotation is to capture the knowledge
embodied in the educational resource. To build PRET we
recruited 6 annotators among professors and PhD
students working in fields related to computer
science, but eventually 2 of them revealed not to have
enough knowledge for the task.
      </p>
      <p>Annotation task. A prerequisite relation
between two concepts A and B is defined as a
dependency relation which represents what a learner
must know/study (concept A), before approaching
concept B. Thus, by definition, the prerequisite
relation has the following properties: i) asymmetry:
if concept A is a prerequisite of concept B, the
opposite cannot be true (e.g. network is prerequisite
of internet, so internet cannot be prerequisite of
network); ii) irreflexivity: a concept cannot be
prerequisite of itself; iii) transitiveness: if concept A
is a prerequisite of concept B, and concept B of
concept C, then concept A is also a prerequisite of
concept C (e.g. browser is prerequisite of HTTP,
HTTP is prerequisite of WWW, hence browser is
prerequisite of WWW according to the transitive
property).</p>
      <p>To keep the annotation as uniform as possible,
we provided the annotators with suggestions on
how to perform the task together with the book
chapter and the terminology extracted from it.
Considering the material supplied, we asked
annotators to trust the text considering only pairs of
distinct concepts of T and annotating the existence
of a prerequisite relation between the two concepts
only if derivable from D. In our method,
annotators should read the text and, for each new concept
(i.e. never mentioned in the previous lines),
identify all its prerequisites, but, if no prerequisite can
be identified, they should not enter any annotation.
We also wanted pedagogical relation properties to
be preserved, so we asked to respect the
irreflexive property not annotating self–prerequisites and
to avoid adding transitive relations. Considering
the topology of an ECM, we also asked
annotators not to enter cycles in the annotation because
they represent conceptually wrong relations. To
better understand this point, consider the ECM in
Figure 1: having a prerequisite relation between
computer and network and between network and
internet, entering a relation where internet is
prerequisite of computer would create a cycle (loop).</p>
      <p>The output of the annotation of each
annotator is an enriched terminology: a set of concepts
paired and enhanced with the prerequisite relation.
The enriched terminology can be used to create
an ECM where each concept is a node and the
edges are prerequisite relations identified by
humans (see Figure 1).</p>
      <sec id="sec-3-1">
        <title>Annotation validation. Human annotators are</title>
        <p>not immune from making mistakes and violating
the supplied recommendations. The tool we
propose addresses this issue by introducing controls
to prevent the annotators from making errors (e.g.
cycles, reflexive relations, symmetric relations).
In the next section we will describe the approach
we used to identify some mistakes by using graph
analysis algorithms.</p>
        <p>
          Annotators agreement evaluation. Our
experience and the literature
          <xref ref-type="bibr" rid="ref10">(Fabbri et al., 2018)</xref>
          show
that human judgments about prerequisite
identification can vary considerably, even when strict
guidelines are provided. This can depend on
several factors, including the subjectivity of
annotators and the type and complexity of D. Evaluating
the annotators’ agreement can be useful to assess
if the gold dataset is to be trusted or further
annotators are required. Section 4 will describe the
measures we used to evaluate annotators’
agreement in PRET.
        </p>
        <p>The final combination of the enriched
terminologies produced by each annotator is a
necessary step to build a gold dataset but, due to space
constraints, below we will only present our
approach, while a survey on combination metrics is
out of the scope of this paper.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>The PRET Dataset</title>
      <p>The PRET gold dataset consists of 34,225
concept pairs obtained by all possible combinations of
the elements in the concepts set (excluding self–
prerequisites). Pairs vary with respect to the
relation weight, computed for each pair by dividing
the number of annotators that annotated the pair by
the total number of annotators. Only 1.54% (526)
of the pairs has a relation weight higher than 0 (i.e.
it was annotated as prerequisite by at least one
annotator). Details about the distribution of
prerequisite relations and respective weights are reported
in Table 1.</p>
      <p>55.70% (293) of the prerequisite pairs was
identified by only one annotator, meaning that it is hard
for humans to agree on what a prerequisite is. We
further investigate this aspect in section 4.1.</p>
      <p>The analysis of the dataset carried out before
applying validation checks highlighted some
critical issues: some transitive relations were
explicitly annotated and some cycles were erroneously
added in the dataset, violating the instructions.
While cycles are due to distraction, transitive
relations are hard to recognize per se, especially when
broad terms are involved (e.g. computer, software,
machine).</p>
      <p>In order to study how these issues impact the
dataset, each annotation was validated against
cycles and transitive relations obtaining 5 dataset
variations, in addition to the original annotation.
The validation was conducted on the ECM derived
from the enriched terminology of each annotator
using a graph analysis algorithm. We operated on
cycles and transitive relations. In some variations,
the latter were added if the pair of concepts in the
ECM is connected by a path shorter than a certain
threshold, defined by considering the ECM
diameter, while cycles were either preserved or removed
depending on the variation we wanted to obtain.</p>
      <p>Eventually, we obtained the following
annotation variations: no cycles (removing
cycles), cycles and transitive (preserving cycles
and adding transitive relations), cycles and non–
transitive (preserving cycles and keeping only
direct links), no cycles and transitive (removing
cycles and adding transitivity) and no cycles and
non–transitive (removing both cycles and
transitivity).
4.1</p>
      <sec id="sec-4-1">
        <title>Annotators Agreement in PRET</title>
        <p>
          Following Artstein and Poesio (2008), we
computed the agreement between multiple annotators
using Fleiss’ k
          <xref ref-type="bibr" rid="ref11">(Fleiss, 1971)</xref>
          and between pairs
of annotators using Cohen’s k
          <xref ref-type="bibr" rid="ref8">(Cohen, 1960)</xref>
          .
Using the scale defined by Landis and Koch (1977),
Fleiss’ k values show fair agreement, suggesting
that prerequisite annotation is difficult. Similar
tasks obtained comparable or lower values,
confirming our hypothesis: Gordon et al. (2016)
measured the agreement as Pearson Correlation
obtaining 36%, while Fabbri et al. (2018) and
Chaplot et al. (2016) obtained respectively 30% and
19% of Fleiss’ k.
        </p>
        <p>
          Compared to the other variations, removing
cycles and adding transitive relations showed the
highest improvement on the agreement, also for
pairs of annotators (Table 2). Our results
suggest that different competence level entails
different annotations and values of agreement,
confirming previous results
          <xref ref-type="bibr" rid="ref14">(Gordon et al., 2016)</xref>
          :
lower agreement can be observed when annotator
4 (quasi–expert) is involved, possibly due to the
lower competence level if compared to the other
annotators. Annotator 4 is also the one who
considered the highest number of transitive relations,
producing a more connected ECM: it is likely that
when the competence in the domain is lower, a
person tends to consider a higher number of
prerequisites for each concept. On the other hand,
annotators with more experience show even
moderate (pairs A1-A3 and A2-A3) or substantial
agree+1.44
+8.35
-6.96
+1.70
+7.12
+1.40
+0.36
ment (pair A2-A3 for the variation). Adding
transitive relations and removing cycles generally
improves the agreement values also when we
consider pairs: we notice an increase of 8.35 points
for A1-A2. The only exception is observed for the
pair A1-A3, which experienced a decrease of
almost 7 points. The cause is though to be the
number of transitive relations considered by annotator
3, which is around one third of the transitive
relations annotated by annotator 1: the validation
creates more distance between the two annotations
reducing the agreement.
        </p>
        <p>As a support for the annotation, the experts used
a n n matrix of the terminology T where they
entered a binary value in the intersection between
two concepts to indicate the presence of a
prerequisite relation. We believe that our results are
partially influenced by the instrument we used to
perform the annotation: a large matrix structure
is likely to cause distraction errors and does not
perform validation checks during the annotation.
Based on this experience and the encountered
issues, we developed an annotation tool able to
support and validate the annotation. It will be
described in the next section.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Annotation and Analysis Tool</title>
      <p>We provide a language and domain independent
prototype tool which aims on the one hand to
support and validate the annotation process and on
the other hand to perform annotation analysis. All
its main features have been designed taking into
account real problems encountered while
building PRET. Thus, this tool is highly valuable for
annotators because specifically addresses
annotators’ needs and, at the same time, avoids possible
annotation biases. In particular, the tool has three
main functionalities: annotation support,
annotation representation and analysis of the results.</p>
      <p>To support the annotation, the user is provided
with the terminology T as a list L of concepts
ordered by their first occurrence in the text. This is
done in order to give the annotator an overview of
the context in which the concept occurs. We
observed that the textual context plays a crucial role
in deciding which concepts are prerequisites of the
one under observation, so for each term we show
the list of other terms with visual indication of the
progress in the text. Additionally, as said before,
the tool validates the map resulting from the
annotation against the existence of symmetric relations,
transitivity and cycles.</p>
      <p>Once the annotation is completed, the user can
choose to generate different types of visualization
of her/his annotation. The goal of this
functionality is to provide information visualization and data
summarization for analyzing and exploring the
result of the annotation. We provide the following
different views: Matrix (ordered by concept
frequency, clusters, temporal, occurrence or
alphabetic order), Arc Diagram, Graph and Clusters.
Furthermore, the Data Synthesis task provides the
number of concepts, number of relations, number
and list of disconnected nodes and transitive
relations.</p>
      <p>Lastly, the tool computes the agreement
between relations inserted by all annotators who took
part in the task (see Section 4.1) and provides
visualization of the final dataset, which results as
a combination of all users’ annotation. This
feature also outputs a Data Synthesis that provides the
number of relations of every annotator, number of
transitive relations and the direction of conflicting
relations between annotators.</p>
      <p>The demo version of the tool is available online
at the URL provided in the Introduction.
6</p>
    </sec>
    <sec id="sec-6">
      <title>Conclusion and Future Work</title>
      <p>In this paper, we described PRET, a gold dataset
manually annotated for prerequisite relations
between pairs of concepts; moreover we presented
the methodology we adopted and a tool to support
prerequisite annotation. The case study, even
limited as for the number of annotators and the
educational material, was a reasonably good training
ground to set the basis to define a methodology
for prerequisite annotation and to identify the
major issues related to this task. Moreover, the
analysis of the annotation provided insights for
automatic identification of concepts and prerequisites,
that will be investigated in future work.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Giovanni</given-names>
            <surname>Adorni</surname>
          </string-name>
          and
          <string-name>
            <given-names>Frosina</given-names>
            <surname>Koceva</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Educational concept maps for personalized learning path generation</article-title>
          .
          <source>In Conference of the Italian Association for Artificial Intelligence</source>
          , pages
          <fpage>135</fpage>
          -
          <lpage>148</lpage>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Giovanni</given-names>
            <surname>Adorni</surname>
          </string-name>
          , Felice Dell'Orletta, Frosina Koceva, Ilaria Torre, and
          <string-name>
            <given-names>Giulia</given-names>
            <surname>Venturi</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Extracting dependency relations from digital learning content</article-title>
          .
          <source>In Italian Research Conference on Digital Libraries</source>
          , pages
          <fpage>114</fpage>
          -
          <lpage>119</lpage>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Rakesh</given-names>
            <surname>Agrawal</surname>
          </string-name>
          , Behzad Golshan, and
          <string-name>
            <given-names>Evangelos</given-names>
            <surname>Papalexakis</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Toward data-driven design of educational courses: a feasibility study</article-title>
          .
          <source>Journal of Educational Data Mining</source>
          ,
          <volume>8</volume>
          (
          <issue>1</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>21</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>Fareedah</given-names>
            <surname>Alsaad</surname>
          </string-name>
          , Assma Boughoula, Chase Geigle, Hari Sundaram, and
          <string-name>
            <given-names>Chengxiang</given-names>
            <surname>Zhai</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Mining MOOC lecture transcripts to construct concept dependency graphs</article-title>
          .
          <source>In Proceedings of the 11th International Conference on Educational Data Mining, EDM</source>
          <year>2018</year>
          ,
          <article-title>Buffalo</article-title>
          ,
          <string-name>
            <surname>NY</surname>
          </string-name>
          , USA, July
          <volume>15</volume>
          -
          <issue>18</issue>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>Ron</given-names>
            <surname>Artstein</surname>
          </string-name>
          and
          <string-name>
            <given-names>Massimo</given-names>
            <surname>Poesio</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>Inter-coder agreement for computational linguistics</article-title>
          .
          <source>Computational Linguistics</source>
          ,
          <volume>34</volume>
          (
          <issue>4</issue>
          ):
          <fpage>555</fpage>
          -
          <lpage>596</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Glenn</given-names>
            <surname>Brookshear</surname>
          </string-name>
          and
          <string-name>
            <given-names>Dennis</given-names>
            <surname>Brylow</surname>
          </string-name>
          ,
          <year>2015</year>
          . Computer Science:
          <article-title>An Overview, Global Edition, chapter 4 Networking and the Internet</article-title>
          .
          <source>Pearson Education Limited.</source>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Devendra</given-names>
            <surname>Singh</surname>
          </string-name>
          <string-name>
            <surname>Chaplot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Yiming</given-names>
            <surname>Yang</surname>
          </string-name>
          , Jaime G. Carbonell, and
          <string-name>
            <surname>Kenneth</surname>
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Koedinger</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Datadriven automated induction of prerequisite structure graphs</article-title>
          .
          <source>In Proceedings of the 9th International Conference on Educational Data Mining, EDM</source>
          <year>2016</year>
          , Raleigh, North Carolina, USA, June 29 - July 2,
          <year>2016</year>
          , pages
          <fpage>318</fpage>
          -
          <lpage>323</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Jacob</given-names>
            <surname>Cohen</surname>
          </string-name>
          .
          <year>1960</year>
          .
          <article-title>A coefficient of agreement for nominal scales</article-title>
          .
          <source>Educational and psychological measurement</source>
          ,
          <volume>20</volume>
          (
          <issue>1</issue>
          ):
          <fpage>37</fpage>
          -
          <lpage>46</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>Felice</given-names>
            <surname>Dell'Orletta</surname>
          </string-name>
          , Giulia Venturi, Andrea Cimino, and
          <string-name>
            <given-names>Simonetta</given-names>
            <surname>Montemagni</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>T2k2: a system for automatically extracting and organizing knowledge from texts</article-title>
          .
          <source>In Proceedings of the Ninth International Conference on Language Resources</source>
          and
          <article-title>Evaluation (LREC-</article-title>
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Alexander R Fabbri</surname>
            ,
            <given-names>Irene</given-names>
          </string-name>
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>Prawat</given-names>
          </string-name>
          <string-name>
            <surname>Trairatvorakul</surname>
          </string-name>
          , Yijiao He, Wei Tai Ting, Robert Tung, Caitlin Westerfield, and
          <string-name>
            <surname>Dragomir R Radev</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Tutorialbank: A manually-collected corpus for prerequisite chains, survey extraction and resource recommendation</article-title>
          .
          <source>In ACL.</source>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Joseph L Fleiss</surname>
          </string-name>
          .
          <year>1971</year>
          .
          <article-title>Measuring nominal scale agreement among many raters</article-title>
          .
          <source>Psychological bulletin</source>
          ,
          <volume>76</volume>
          (
          <issue>5</issue>
          ):
          <fpage>378</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Fabio</given-names>
            <surname>Gasparetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Carla</given-names>
            <surname>Limongelli</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Filippo</given-names>
            <surname>Sciarrone</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Exploiting wikipedia for discovering prerequisite relationships among learning objects.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <source>In 2015 International Conference on Information Technology Based Higher Education and Training</source>
          ,
          <string-name>
            <surname>ITHET</surname>
          </string-name>
          <year>2015</year>
          , Lisbon, Portugal, June 11-13,
          <year>2015</year>
          , pages
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Jonathan</given-names>
            <surname>Gordon</surname>
          </string-name>
          , Linhong Zhu, Aram Galstyan, Prem Natarajan, and
          <string-name>
            <given-names>Gully</given-names>
            <surname>Burns</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Modeling concept dependencies in a scientific corpus</article-title>
          .
          <source>In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume</source>
          <volume>1</volume>
          :
          <string-name>
            <surname>Long</surname>
            <given-names>Papers)</given-names>
          </string-name>
          , volume
          <volume>1</volume>
          , pages
          <fpage>866</fpage>
          -
          <lpage>875</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>J. Richard</surname>
            Landis and
            <given-names>Gary G.</given-names>
          </string-name>
          <string-name>
            <surname>Koch</surname>
          </string-name>
          .
          <year>1977</year>
          .
          <article-title>The measurement of observer agreement for categorical data</article-title>
          .
          <source>Biometrics</source>
          ,
          <volume>33</volume>
          (
          <issue>1</issue>
          ):
          <fpage>159</fpage>
          -
          <lpage>174</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <given-names>Chen</given-names>
            <surname>Liang</surname>
          </string-name>
          , Zhaohui Wu, Wenyi Huang, and
          <string-name>
            <given-names>C Lee</given-names>
            <surname>Giles</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Measuring prerequisite relations among concepts</article-title>
          .
          <source>In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing</source>
          , pages
          <fpage>1668</fpage>
          -
          <lpage>1674</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <given-names>Chen</given-names>
            <surname>Liang</surname>
          </string-name>
          , Jianbo Ye, Zhaohui Wu, Bart Pursel, and
          <string-name>
            <given-names>C Lee</given-names>
            <surname>Giles</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Recovering concept prerequisite relations from university course dependencies</article-title>
          .
          <source>In AAAI</source>
          , pages
          <fpage>4786</fpage>
          -
          <lpage>4791</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <given-names>Chen</given-names>
            <surname>Liang</surname>
          </string-name>
          , Jianbo Ye, Shuting Wang,
          <string-name>
            <surname>Bart Pursel</surname>
            , and
            <given-names>C Lee</given-names>
          </string-name>
          <string-name>
            <surname>Giles</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Investigating active learning for concept prerequisite learning</article-title>
          .
          <source>Proc. EAAI</source>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <given-names>Joseph D.</given-names>
            <surname>Novak</surname>
          </string-name>
          and Alberto J. Can˜as.
          <year>2006</year>
          .
          <article-title>The theory underlying concept maps and how to construct and use them</article-title>
          .
          <source>research report 2006-01 Rev</source>
          <year>2008</year>
          -
          <volume>01</volume>
          ,
          <article-title>Florida Institute for Human and Machine Cognition</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <given-names>Liangming</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Chengjiang</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Juanzi</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and Jie</given-names>
            <surname>Tang</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Prerequisite relation learning for concepts in moocs</article-title>
          .
          <source>In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume</source>
          <volume>1</volume>
          :
          <string-name>
            <surname>Long</surname>
            <given-names>Papers)</given-names>
          </string-name>
          , volume
          <volume>1</volume>
          , pages
          <fpage>1447</fpage>
          -
          <lpage>1456</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <given-names>Partha</given-names>
            <surname>Pratim Talukdar</surname>
          </string-name>
          and William W Cohen.
          <year>2012</year>
          .
          <article-title>Crowdsourced comprehension: predicting prerequisite structure in wikipedia</article-title>
          .
          <source>In Proceedings of the Seventh Workshop on Building Educational Applications Using NLP</source>
          , pages
          <fpage>307</fpage>
          -
          <lpage>315</lpage>
          . Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <given-names>Annalies</given-names>
            <surname>Vuong</surname>
          </string-name>
          , Tristan Nixon, and
          <string-name>
            <given-names>Brendon</given-names>
            <surname>Towle</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>A method for finding prerequisites within a curriculum</article-title>
          .
          <source>In Proceedings of the 4th International Conference on Educational Data Mining, Eindhoven, The Netherlands, July 6-8</source>
          ,
          <year>2011</year>
          , pages
          <fpage>211</fpage>
          -
          <lpage>216</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <given-names>Shuting</given-names>
            <surname>Wang</surname>
          </string-name>
          , Chen Liang, Zhaohui Wu,
          <string-name>
            <given-names>Kyle</given-names>
            <surname>Williams</surname>
          </string-name>
          , Bart Pursel, Benjamin Brautigam, Sherwyn Saul,
          <string-name>
            <given-names>Hannah</given-names>
            <surname>Williams</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Kyle</given-names>
            <surname>Bowen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C Lee</given-names>
            <surname>Giles</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Concept hierarchy extraction from textbooks</article-title>
          .
          <source>In Proceedings of the 2015 ACM Symposium on Document Engineering</source>
          , pages
          <fpage>147</fpage>
          -
          <lpage>156</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <given-names>Shuting</given-names>
            <surname>Wang</surname>
          </string-name>
          , Alexander Ororbia, Zhaohui Wu,
          <string-name>
            <given-names>Kyle</given-names>
            <surname>Williams</surname>
          </string-name>
          , Chen Liang, Bart Pursel, and
          <string-name>
            <given-names>C Lee</given-names>
            <surname>Giles</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Using prerequisites to extract concept maps fromtextbooks</article-title>
          .
          <source>In Proceedings of the 25th acm international on conference on information and knowledge management</source>
          , pages
          <fpage>317</fpage>
          -
          <lpage>326</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <given-names>Yiming</given-names>
            <surname>Yang</surname>
          </string-name>
          , Hanxiao Liu, Jaime Carbonell, and Wanli Ma.
          <year>2015</year>
          .
          <article-title>Concept graph learning from educational data</article-title>
          .
          <source>In Proceedings of the Eighth ACM International Conference on Web Search and Data Mining</source>
          , pages
          <fpage>159</fpage>
          -
          <lpage>168</lpage>
          . ACM.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>