<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Short Paper: Assessing Procedural Knowledge in Open- ended Questions through Semantic Web Ontologies</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Eric Snow</string-name>
          <email>eric.snow@umoncton.ca</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Chadia Moghrabi</string-name>
          <email>chadia.moghrabi@umoncton.ca</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Philippe Fournier-Viger</string-name>
          <email>philippe.fournier-viger@umoncton.ca</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Département d'informatique, Université de Moncton</institution>
          ,
          <addr-line>Moncton</addr-line>
          ,
          <country country="CA">Canada</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper presents a novel approach for automatically grading students' answers to open-ended questions. It is inspired by the OeLE method, which uses ontologies and Semantic Web technologies to represent course material. The main difference in our approach is that we add a new category of concepts, named functional concepts, which allow specifying an ordering relation between concepts. This modification allows assessing procedural knowledge in students' answers by grading the ordering of these concepts. We present an example for grading answers in a course about computer algorithms, and report the corresponding results.</p>
      </abstract>
      <kwd-group>
        <kwd>E-Learning</kwd>
        <kwd>Computer-Assisted Assessment (CAA)</kwd>
        <kwd>Ontology</kwd>
        <kwd>Semantic Web</kwd>
        <kwd>Procedural Knowledge</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Assessing the students’ learning in an e-learning environment often relies on multiple
choice or fill-in-the-blank questions, which only trigger the lowest level (Knowledge)
of Bloom’s taxonomy [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] of knowledge acquisition. As we shall see in Section 2,
several attempts have been made to incorporate open-ended questions in online
assessment, which would possibly trigger the higher levels of Bloom’s taxonomy
(Synthesis and Evaluation) in the students’ learning.
      </p>
      <p>
        However, grading open-ended questions by hand can be time-consuming. To build
an e-learning environment that can automatically grade free-text answers, a variety of
techniques have been used, such as Information Extraction (IE) [
        <xref ref-type="bibr" rid="ref4 ref5">4-5</xref>
        ], Natural
Language Processing (NLP) [
        <xref ref-type="bibr" rid="ref10 ref11 ref6 ref7 ref8 ref9">6-11</xref>
        ], or statistical techniques [
        <xref ref-type="bibr" rid="ref13 ref14 ref15">13-15</xref>
        ].
      </p>
      <p>
        Our approach resembles that of the OeLE system [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. This system also uses NLP to
assess the level of understanding of the students. Course material is represented in an
ontology and encoded in the Web Ontology Language (OWL). The use of Semantic
Web technologies allows the sharing and reusing of course ontologies, thus
potentially reducing the time spent designing the ontologies. This allows for a deeper
understanding of the text than more superficial statistical techniques. Automatic assessment
is much faster, and hopefully done more objectively, than manual scoring. The OeLE
system has been used in two online courses, Design and Evaluation of Didactic
Media, and Multimedia Systems and Graphical Interaction.
      </p>
      <p>OeLE successfully assesses the semantic content of the students’ answers if the
answers contain static expressions of facts about didactic media or multimedia systems.
However, when applying it to the assessment of a computer algorithms course, we
observed that the ordering of the elements in students’ answers is not taken into
account. It is crucial that this ordering be considered because to describe how an
algorithm works, certain concepts should be stated in a specific order. In this paper, we
address this challenge by proposing a new approach in which we introduce the idea of
functional concepts. The course ontology then incorporates ordering information
about a subset of these functional concepts. The assessment process is modified to
take into account the ordering of these concepts in the students’ answers and adjust
their grade accordingly. The novelty of our work is in applying a hybrid approach
combining the OeLE system with functional concepts to assess students’ answers in
domains using highly procedural knowledge.</p>
      <p>Section 2 of this paper is a review of other automatic free-text assessment systems.
We only focus here on short-answer assessment systems where reference texts are
tailored to the course material, although some other systems have also been developed
for essay scoring, where more general texts about a topic are used for training.
Section 3 presents the general methodology, followed by our preliminary results in
Section 4. We conclude the paper in Section 5 with some future work which we are
investigating.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        This section presents previous and ongoing research in automatic short-answer
assessment. A good review of many of these systems can be found in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Although
these systems do not take advantage of Semantic Web ontologies, they contain
nonetheless functionalities and techniques useful to our system.
      </p>
      <p>
        Some systems compare students’ answers to the ideal answer supplied by the
teacher. For instance, Automated Text Marker [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] uses a pattern-matching technique.
It has been tested in courses on Prolog programming, psychology and biology.
Automark [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] uses IE techniques to grade students’ answers by comparing them to a
mark scheme template provided by the teacher. It achieved 94.7% agreement with
human grading for six of the seven science-related questions asked on a test exam.
      </p>
      <p>
        Some systems require teachers to provide training sets of marked student answers.
For example, Auto-marking [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] uses NLP and pattern-matching techniques to
compare students’ answers to a training set of marked student answers. This system
obtained 88% of exact agreement with human grading in a biology course. Bayesian
Essay Test Scoring System (BETSY) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] uses naive Bayes classifiers to search for
specific features in students’ answers. In a biology course, it achieved up to 80%
accuracy. CarmelTC [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] uses syntactic analysis and naive Bayes classifiers to analyze
essay answers. On an experiment with 126 physics essays, it obtained 90% precision
and 80% recall. The Paperless-School Marking Engine (PS-ME) [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] is commercially
available and requires a training set of marked answers. The system uses NLP to
grade the students’ answers in addition to implementing Bloom’s taxonomy
heuristics. However, the exact implementation is not disclosed. C-rater [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] uses a set of
marked training essays to determine the students’ answers grade using NLP. In a
large-scale experiment of 170,000 answers to reading comprehension and algebra
questions, it achieved 85% accuracy. In [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], a combination of NLP and Support
Vector Machines is used to classify answers into two classes (above/below 6 points out of
10). It obtains an average of 65% precision rate (the only reported metric).
      </p>
      <p>
        The MultiNet Working Bench system [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] uses a graphical tool to represent the
students’ knowledge visually. It compares the semantic network extracted from the
student answer to that submitted by the teacher. Verified parts of the network are
displayed in green, while wrong or unverified parts (not supported by logic inference)
are displayed in red.
      </p>
      <p>
        Other systems rely on Latent Semantic Analysis (LSA). For example, Research
Methods Tutor [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] uses LSA to compare the students’ answers to a set of expected
answers. If the student answers incorrectly, the system guides the student into
obtaining the right answer. The Willow system [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] requires several unmarked reference
answers for each question. It also uses LSA to evaluate students’ answers written in
English or Spanish. In a computer science course, it achieved on average 0.54
correlation with the teacher’s grading. A system currently in use at the University of
Witwatersrand [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] uses LSA and clustering techniques. It achieves between 0.80 and 0.95
correlation with the teacher’s grading.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Methodology</title>
      <p>
        In this section, we briefly present the work on OeLE [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and how we have adapted it
and expanded on it in our system. Our focus has been on grading students’ answers to
questions in a computer algorithms course taught in French.
3.1
      </p>
    </sec>
    <sec id="sec-4">
      <title>Natural Language Processing</title>
      <p>
        For each of the online exam’s questions in OeLE [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], the ideal answer provided by the
teacher and the students’ answers are processed similarly. The GATE software
performs most of the NLP tasks, and the Protégé software is used to build the course
ontology and encode it in OWL. While OeLE is written in Java and uses the Jena
framework to process the encoded ontology, our system is done in PHP and we
developed our own ontology-processing code. It is important to note that OeLE and our
system use OWL for knowledge representation, but do not utilize its inference
services. In this paper, we use the same terminology as [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. We refer to OWL classes as
concepts, to object properties as relations, and to data properties as attributes. Also,
entity is used as a generic term for concept, relation, or attribute, while property is
used for relation or attribute.
      </p>
      <p>The NLP consists of three phases: Preparation, Search, and Set in a context. The
Preparation phase consists of spell-checking, sentence detection, tokenization and
POS tagging. In the Search phase, the linguistic expressions are detected and matched
against the course ontology. Finally, the Set in a context phase associates the
attributes and values to their respective concept, and also identifies which concepts
participate in a relation.</p>
      <p>In OeLE, the texts are annotated semiautomatically, meaning that the teacher only
needs to manually annotate the fragments unknown to the system or incorrectly
tagged. In our system, the natural language processing is done manually for the
moment, as GATE does not sufficiently support French (out-of-the-box) for our
purposes. Performing automatic French annotation is planned as a future work.</p>
      <p>
        As an example, we use an actual question from a computer algorithms course given
at our university: “Describe Depth-First Search (DFS)”. Table 1 shows the annotation
set (at the end of the NLP phase) of the partial student’s answer: “Depth-First Search
(DFS) is an exhaustive algorithm that explores a graph...” The ideal answer supplied
by the teacher is similarly annotated; however, for every annotated entity, a numerical
value ought to be supplied specifying the relative importance of that entity within the
question.
The grading stage consists of calculating the semantic distance between the
annotation sets (obtained in Section 3.1) of each student’s answer and that of the teacher’s
ideal answer, with respect to the course ontology. Because of space limitations, we
cannot give detailed calculations for the example. The reader is advised to see the full
explanation in the original publication [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], or an easy-to-follow example in [16].
      </p>
      <p>
        The formulas used in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] for calculating the semantic distances are given below. In
every function, teacher-provided constants allow for certain elements to be weighted
more or less heavily according to their importance. The best combination of these
constants is problem-dependent and should be discovered empirically. The “linguistic
distance” between the textual representation of the entities in the student and teacher’s
answer is also taken into account. All functions return values in the [
        <xref ref-type="bibr" rid="ref1">0,1</xref>
        ] interval.
Concept similarity. To calculate the concept similarity (CS) between concepts
, the following function is used:
(
)
(
)
(
)
(
)
      </p>
      <p>The constants
elements. Also,
,
indicate the relative importance of the corresponding
and .
and
(1)</p>
      <p>The concept proximity (CP) is calculated using the taxonomy formed in the
ontology by the class hierarchy defined in OWL. Note that the &lt;is-a&gt; relation is explicitly
added to the course ontology (with the class as domain and the subclass as image)
where rdfs:subClassOf is used:
&lt;owl:Class rdf:about="DepthFirstSearch"&gt;</p>
      <p>&lt;rdfs:subClassOf rdf:resource="Algorithm"/&gt;
&lt;/owl:Class&gt;</p>
      <p>If the concepts and have no taxonomic parent (other than the root), this value
is zero, otherwise it is defined as such:
where | ( )| is the number of concepts separating and through the
shortest common path through the taxonomic tree, and | | is the total number
of concepts in the ontology. A shorter path thus indicates a stronger similarity
between the two concepts.</p>
      <p>The properties similarity (PS) calculates the similarity between the set of properties
associated with and . The properties of a concept c are the union of the set of
attributes that have c as domain, and the set of relations that have c as domain or
image.</p>
      <p>Lastly, ( ) uses the Levenshtein distance between the string representation
of concepts and , written below, and is defined as follows:
Attribute similarity. The attribute similarity between two attributes
concepts is calculated by a similar function:
and
of two
(
(
Relation similarity. The relation similarity between two relations
lated in a similar manner:</p>
      <p>Global evaluation. In order to accomplish the evaluation of a question, each of the
concepts of the student’s answer is associated with the closest concept of the ideal
answer, given that each concept can only be used once. The similarity between each
pair of concepts is then calculated and is multiplied by the relative numerical value of
the concept in the ideal answer. The similarity is then added to the final grade. The
same process is repeated for relations and attributes.
3.3</p>
    </sec>
    <sec id="sec-5">
      <title>Procedural Knowledge Grading</title>
      <p>
        Our system uses the same grading algorithm as OeLE [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. The students’ answers are
compared to the teacher’s ideal answer. The grades are calculated based on the most
similar entity in the expected answer. In OeLE, the order of the entities is not factored
in the grade and any permutation of the linguistic expressions of the student’s answer
yields the same grade.
      </p>
      <p>However, this is not appropriate for assessing procedural knowledge in our system.
If the above method is applied to evaluate text describing procedural knowledge such
as algorithms-related answers, the grade calculation ought to take into account the
relative order of a subset of concepts expressing procedural knowledge.
Functional concepts. In order to address this issue, we propose to add functional
concepts to the course ontology. A functional concept represents a global procedure, a
sequence of sub-procedures or individual steps to accomplish a given task.</p>
      <p>Let us consider the following example algorithm, DepthFirstSearch, given in
pseudocode:
procedure DepthFirstSearch</p>
      <p>VisitRoot
VisitFirstChildNode</p>
      <p>VisitOtherSiblings
end
procedure VisitRoot [...]
procedure VisitFirstChildNode [...]
procedure VisitOtherSiblings [...]</p>
      <p>For every procedure or sub-procedure, we create a corresponding functional
concept: DepthFirstSearch, VisitRoot, VisitFirstChildNode, and VisitOtherSiblings. The
last three sub-procedures could in turn be further decomposed.</p>
      <p>The functional concepts allow for a high-level description of the algorithm and
mask implementation details, which would be difficult to express in the ontology
using relations or attributes. Further decomposition of VisitRoot into individual steps
could be stated in any of the following ways:
DepthFirstSearch &lt;visits&gt; Root [using relation &lt;visits&gt;]
VisitRoot &lt;visits&gt; Root [same relation with a more specific concept]
Root.visited=true [the value of the attribute &lt;visited&gt; becomes true]
Representing functional concepts in OWL. Relationships between functions are
defined as meta-functions in [17]. These meta-functions are implemented in our
system as relations between two functional concepts. In this example, two instances of
the &lt;is-preceded-by&gt; relation are needed. One instance is needed between
VisitFirstChildNode and VisitRoot, because the root has to be visited first, and another
between VisitOtherSiblings and VisitFirstChildNode, because the first child node
should be visited first. Similarly, three instances of the &lt;is-achieved-by&gt; relation are
used between VisitRoot and each of the remaining functional concepts.</p>
      <p>The same idea is found in [18], where the relation preceded_by is defined similarly
to &lt;is-preceded-by&gt; and can be used to order any pair of classes P and P1. In other
words, P preceded_by P1 is defined as “Every P is such that there is some earlier P1”.
This relation is defined as transitive, and is neither symmetric, reflexive nor
antisymmetric.</p>
      <p>In [19], an irreflexive and transitive relation precedes is used when “the sequence
of the related events is of utmost importance for the correct interpretation”. This paper
also defines the inverse relation follows.</p>
      <p>Similarly, the working draft: “Time Ontology in OWL” [20] of the World Wide
Web Consortium (W3C) states that: “There is a before relation on temporal entities,
which gives directionality to time. If a temporal entity T1 is before another temporal
entity T2, then the end of T1 is before the beginning of T2.” This relation is part of the
time namespace.</p>
      <p>In our implementation, the functional concepts and the &lt;is-preceded-by&gt; relation
are defined as such in OWL:
&lt;owl:Class rdf:about="FunctionalConcept"/&gt;
&lt;owl:Class rdf:about="DepthFirstSearch"&gt;</p>
      <p>&lt;rdfs:subClassOf rdf:resource="FunctionalConcept"/&gt;
&lt;/owl:Class&gt;
&lt;owl:Class rdf:about="VisitRoot"&gt;</p>
      <p>&lt;rdfs:subClassOf rdf:resource="DepthFirstSearch"/&gt;
&lt;/owl:Class&gt;
&lt;owl:Class rdf:about="VisitFirstChildNode"&gt;</p>
      <p>&lt;rdfs:subClassOf rdf:resource="DepthFirstSearch"/&gt;
&lt;/owl:Class&gt;
&lt;owl:Class rdf:about="VisitOtherSiblings"&gt;</p>
      <p>&lt;rdfs:subClassOf rdf:resource="DepthFirstSearch"/&gt;
&lt;/owl:Class&gt;
&lt;owl:ObjectProperty rdf:about="IsPrecededBy"/&gt;</p>
      <p>Note that the &lt;is-achieved-by&gt; relation is implied by the class hierarchy rooted at
the concept FunctionalConcept, just as the &lt;is-a&gt; relation is implied by the class
hierarchy in OeLE.</p>
      <p>For every algorithm, a separate (meta) ontology lists the required orderings specific
to that algorithm. Although there exists many algorithms for graph exploration, we
only need to define the functional concepts once in the course ontology, and their
ordering can then be declared in a separate ontology. For instance, the
BreadthFirstSearch algorithm can be defined with the same functional concepts as above,
only ordered differently.</p>
      <p>For DepthFirstSearch, the meta-ontology is as follows:
VisitFirstChildNode &lt;is-preceded-by&gt; VisitRoot
VisitOtherSiblings &lt;is-preceded-by&gt; VisitFirstChildNode</p>
      <p>Note that the following relation is also inferred by the transitive property:
VisitOtherSiblings &lt;is-preceded-by&gt; VisitRoot
Grading with functional concepts. In our approach, the question evaluation process
remains mostly unchanged. No special treatment is given to the functional concept
class hierarchy rooted at the concept FunctionalConcept, even though its implied
relation is &lt;is-achieved-by&gt;, rather than the &lt;is-a&gt; relation implied for the other
concepts. This takes into account function nesting and composition, while allowing
calculating the proximity of the functional concepts.</p>
      <p>However, the global evaluation of a student answer R takes into account the
algorithm-specific orderings of the meta-ontology. The new evaluation function is given
below:
(
)</p>
      <p>
        The final grade (FG) for the student answer R is proportional to the global
evaluation of the answer, , obtained from Section 3.2. Here, is a constant in the
interval [
        <xref ref-type="bibr" rid="ref1">0,1</xref>
        ] allowing the teacher to adjust the relative importance of the correct
ordering of concepts in the global evaluation. The ordering factor of the answer, ,
is defined as follows:
(7)
(8)
|
|
where represents the number of functional concepts having the right
ordering in the student answer R, and | | the number of functional concepts
orderings in the meta-ontology.
      </p>
      <p>It should be noted that if functional concepts in the student’s answer are ordered
with the opposite relation (that is, &lt;is-followed-by&gt;), the evaluation algorithm inverts
the relation between the functional concepts.</p>
      <p>Also, the individual student grades are affected by the number of defined
orderings. If there are only a few orderings, as demonstrated below, students are strongly
penalized for every mistake. This is also the case with the concept proximity defined
in Formula 2, where the number of concepts in the ontology affects students' grades.
However, we can assume that the course ontology is fixed during evaluation, and that
the students' grades are therefore affected similarly (in a linear fashion).
4</p>
    </sec>
    <sec id="sec-6">
      <title>Working Example and Results</title>
      <p>Using Depth-First Search as an example, we can quantify the effect of the new
evaluation function on a student’s answer. To simplify, we omit the conceptual grading of
the answer and concentrate on the functional grading. Since the same entities are
present in both the student and teacher’s answers, the conceptual grade is 100%. The
ideal functional answer could be as follows: “Depth-First Search first visits the root
[of a graph], then [recursively] visits its first child node before visiting its other
siblings.” Table 2 shows the produced functional concepts.
Any permutation of this ideal answer taken as input by the original approach would
yield a grade of 100%. Now, consider the following student’s answer: “Depth-First
Search visits the root [of a graph], then [recursively] visits its first child node after
visiting its other siblings.” Here, “after” inverts the ordering of the two last concepts
(highlighted in bold below), yielding the following answer:
VisitFirstChildNode &lt;is-preceded-by&gt; VisitOtherSiblings
However, these two student orderings are correct:
VisitFirstChildNode &lt;is-preceded-by&gt; VisitRoot [inferred]
VisitOtherSiblings &lt;is-preceded-by&gt; VisitRoot</p>
      <p>As stated above, the conceptual grading of this answer, as performed by OeLE, is
100%. By using the new evaluation function (Formula 7), the final grade (FG)
becomes:
(
)
(9)
where the global evaluation (GE) is 100%, the ordering factor (DF) is 66.67%, and
the constant is given a value of 1.0. Considering that the ideal answer to this
algorithm contains only three orderings for pairs of functional concepts (one is inferred)
and that a third is out of order, this low grade seems acceptable, or at least a
reasonable improvement over the former grade of 100% that would have been attributed had
we only used the conceptual grading system.
5</p>
    </sec>
    <sec id="sec-7">
      <title>Conclusion and Future Work</title>
      <p>The work presented in this paper adapts the OeLE system to include procedural
knowledge. The example was taken from an algorithms course given at Université de
Moncton. This approach could be used in other domains where procedural knowledge
is central to processing the text. For example, [18] and [19] apply similar methods to
biomedical ontologies.</p>
      <p>The approach put forth in this paper introduces functional concepts to represent
procedural knowledge in ontologies. The class hierarchy of functional concepts is
considered as a series of instances of the relation &lt;is-achieved-by&gt; instead of &lt;is-a&gt;.
For every computer algorithm (or procedure, for other domains), a series of instances
of the relation &lt;is-preceded-by&gt; specify an ordering for pairs of functional concepts.</p>
      <p>In this paper, the texts were annotated manually. We are considering annotating the
French texts semiautomatically as future work. The detection of the orderings
(detecting keywords such as “first”, “before”, “after” in the example of Section 4) could also
be performed automatically.</p>
      <p>In the case where the student answer uses the opposite ordering relation
(&lt;isfollowed-by&gt;), the relation between the functional concepts is inverted prior to
evaluation. Some more complex answers could require more inversions, for example if the
student wrote “X and Y should be done after Z”.</p>
      <p>Future work could also consider flow control structures, such as loops or branches,
although the textual representation of those structures without proper indentation or
braces could be ambiguous. For example, the VisitOtherSiblings functional concept
can be decomposed into the following loop: (for every other sibling, VisitNode).</p>
      <p>Another idea that could be explored would be to add the notion of recursive
procedures, such as Depth-First Search. VisitFirstChildNode and (every VisitNode of)
VisitOtherSiblings should include recursive calls. As an ideal answer, the teacher could
give either: DFS.isRecursive=true, or VisitFirstChildNode.isRecursive=true and
VisitOtherSiblings.isRecursive=true. Depending on the ideal answer given and their own
answer, students could be unjustly penalized.
16. Fernández-Breis, J.T., Valencia-García, R., Cañavate- Cañavate, D., Vivancos-Vicente,
P.J., Castellanos-Nieves, D. OeLE: Applying ontologies to support the evaluation of open
questions-based tests. In: Proceedings of the KCAP’05 WORKSHOP. SW-EL’05:
Aplications of Semantic Web Technologies for E-Learning (in conjunction with 3rd
International Conference on Knowledge Capture (KCAP’05)), Banff, Canada (2005)
17. Aroyo, L., Dicheva, D.: Courseware authoring tasks ontology. In: Proceedings of the</p>
      <p>International Conference on Computers in Education, pp. 1319-1320. (2002)
18. Smith, B., Ceusters W., Klagges, B., Köhler, J., et al.: Relations in biomedical ontologies.</p>
      <p>Genome Biology 6(R46) (2005)
19. Schulz, S., Markó, K., Suntisrivaraporn, B. Formal representation of complex SNOMED</p>
      <p>CT expressions. BMC Medical Informatics and Decision Making 8(1) (2008)
20. World Wide Web Consortium (W3C),
http://www.w3.org/TR/2006/WD-owl-time20060927/, last accessed 2012-11-21.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bloom</surname>
            ,
            <given-names>B.S.:</given-names>
          </string-name>
          <article-title>Taxonomy of Educational Objectives, Handbook 1: The Cognitive Domain</article-title>
          .
          <article-title>David McKay Co Inc</article-title>
          ., New York (
          <year>1956</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Castellanos-Nieves</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fernández-Breis</surname>
            ,
            <given-names>J.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valencia-García</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martínez-Béjar</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Iniesta-Moreno</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Semantic web technologies for supporting learning assessment</article-title>
          .
          <source>Information Sciences</source>
          <volume>181</volume>
          (
          <issue>9</issue>
          ),
          <fpage>1517</fpage>
          -
          <lpage>1537</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Pérez-Marín</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pascual-Nieto</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodríguez</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Computer-assisted assessment of freetext answers</article-title>
          .
          <source>The Knowledge Engineering Review</source>
          <volume>24</volume>
          (
          <issue>4</issue>
          ),
          <fpage>353</fpage>
          -
          <lpage>374</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Callear</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jerrams-Smith</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soh</surname>
          </string-name>
          , V.:
          <article-title>CAA of short non-MCQ answers</article-title>
          .
          <source>In: Proceedings of the 5th International CAA Conference</source>
          , Loughborough, UK (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Jordan</surname>
          </string-name>
          , S., Mitchell, T.:
          <article-title>e-Assessment for learning? The potential of short-answer free-text questions with tailored feedback</article-title>
          .
          <source>British Journal of Educational Technology</source>
          <volume>40</volume>
          (
          <issue>2</issue>
          ),
          <fpage>371</fpage>
          -
          <lpage>385</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Sukkarieh</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pulman</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Raikes</surname>
          </string-name>
          , N.:
          <article-title>Auto-marking: using computational linguistics to score short, free text responses</article-title>
          .
          <source>In: Proceedings of the 29th IAEA Conference</source>
          , Philadelphia, USA (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Rudner</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Liang</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Automated essay scoring using Bayes' theorem</article-title>
          .
          <source>In: Proceedings of the Annual Meeting of the National Council on Measurement in Education</source>
          , New Orleans, LA (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Rosé</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roque</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bhembe</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , VanLehn, K.:
          <article-title>A hybrid text classification approach for analysis of student essays</article-title>
          .
          <source>In: Proceedings of the HLT-NAACL Workshop on Educational Applications of NLP</source>
          , Edmonton, Canada (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Mason</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grove-Stephenson</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Automated free text marking with paperless school</article-title>
          .
          <source>In: Proceedings of the 6th International CAA Conference</source>
          , Loughborough, UK (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Burstein</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leacock</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Swartz</surname>
          </string-name>
          , R.:
          <article-title>Automated evaluation of essays and short answers</article-title>
          .
          <source>In: Proceedings of the 5th International CAA Conference</source>
          , Loughborough, UK (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Hou</surname>
          </string-name>
          , W.-J.,
          <string-name>
            <surname>Tsao</surname>
            ,
            <given-names>J.-H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
          </string-name>
          , S.-Y.,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Automatic Assessment of Students' Free-Text Answers with Support Vector Machines</article-title>
          .
          <source>LNCS 6096</source>
          ,
          <fpage>235</fpage>
          -
          <lpage>243</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Lutticke</surname>
          </string-name>
          , R.:
          <article-title>Graphic and NLP Based Assessment of Knowledge about Semantic Networks</article-title>
          .
          <source>In: Proceedings of the Artificial Intelligence in Education conference</source>
          , IOS Press (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Wiemer-Hastings</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Allbritton</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Arnott</surname>
          </string-name>
          , E.:
          <article-title>RMT: A dialog-based research methods tutor with or without a head</article-title>
          .
          <source>In: Proceedings of the 7th International Conference on Intelligent Tutoring Systems</source>
          , Springer-Verlag, Berlin (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Pérez-Marín</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alfonseca</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodríguez</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pascual-Nieto</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Willow: Automatic and adaptive assessment of students free-text answers</article-title>
          .
          <source>In: Proceedings of the 22nd International Conference of the Spanish Society for the Natural Language Processing (SEPLN)</source>
          , Zaragoza, Spain (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Klein</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kyrilov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tokman</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Automated Assessment of Short Free-Text Responses in Computer Science using Latent Semantic Analysis</article-title>
          .
          <source>In: ITiCSE '11 Proceedings of the 16th annual joint conference on Innovation and technology in computer science education</source>
          , New York, USA, pp.
          <fpage>158</fpage>
          -
          <lpage>162</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>