<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>OWL Reasoner Evaluation (ORE) Workshop 2013 Results: Short Report</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Rafael S. Gon¸calves</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Samantha Bail</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ernesto Jimenez-Ruiz</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nicolas Matentzoglu</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bijan Parsia</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Birte Glimm</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yevgeny Kazakov</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, University of Oxford</institution>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Institut fu ̈r Ku ̈nstliche Intelligenz, Ulm University</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>School of Computer Science, The University of Manchester</institution>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The OWL reasoner evaluation (ORE) workshop brings together reasoner developers and ontology engineers in order to discuss and evaluate the performance and robustness of modern reasoners on OWL ontologies. In addition to paper submissions, the workshop featured a live and offline reasoner competition where standard reasoning tasks were tested: classification, consistency, and concept satisfiability. The reasoner competition is performed on several large corpora of reallife OWL ontologies obtained from the web, as well as user-submitted ontologies which were found to be challenging for reasoners. Overall there were 14 reasoner submissions for the competition, some of which dedicated to certain subsets or profiles of OWL 2, and implementing different algorithms and optimisations. In this report, we give an overview of the competition methodology and present a summary of its results, divided into the respective categories based on OWL 2 profiles and test corpora.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>The OWL Reasoner Evaluation Workshop (ORE) aims at being an international
venue for the annual systematic evaluation of reasoners for (subsets of) the Web
Ontology Language OWL [9,3] and bringing together both users and developers
of such reasoners. The first ORE workshop was organized in 2012 as a satellite
event4 of the IJCAR conference [10], and started as an initiative in the context of
the SEALS (Semantic Evaluation At Large Scale) project [29]. In 2013 the ORE
workshop was organized together with the Description Logic (DL) workshop.</p>
      <p>This report summarizes the results of the ORE 2013 reasoner competition.
All test data, results, and further information about the competition are available
online: http://ore2013.cs.manchester.ac.uk.</p>
      <p>The remainder of the report is organized as follows. In Section 2, we present
the methodology of the competition. Section 3 provides a brief description of
each participating OWL reasoner. The results of the offline and live competitions
are shown in Sections 4 and 5, respectively. In Section 6 we present the results
for the user-submitted ontologies. Finally, Section 7 provides a summary of the
competition results.
4 http://www.cs.ox.ac.uk/isg/conferences/ORE2012/</p>
    </sec>
    <sec id="sec-2">
      <title>Methodology</title>
      <p>We start by describing the reasoning tasks considered in the competition,
followed by the presentation of the benchmark framework created for ORE 2013.
Subsequently, we report on the hardware and ontology corpora used.
2.1</p>
      <sec id="sec-2-1">
        <title>Reasoning tasks</title>
        <p>The competition was based on three standard reasoning tasks, namely
classification, consistency checking, and concept satisfiability checking. The call for
submissions also included query answering, but there were no reasoners
submitted for this task.
2.1.1 Ontology classification The classification task was chosen as the most
complex of the three tasks. Given an ontology, the reasoners were asked to
return an ontology file (parseable by the OWL API) in OWL functional syntax,
containing a set of SubClassOf axioms of the form α := A ⊑ B, for named
concepts A, B ∈ sig(O), where O |= α according to the following specifications:
1. Non-tautology:
– A ∈ sig(O) ∪ ⊤
– B ∈ sig(O) ∪ ⊥
– A 6= B
2. Directness: there exists no named concept C ∈ sig(O) s.t. O |= A ⊑ C and</p>
        <p>C ⊑ B, where C is not equivalent to A, B, or ⊥.
3. Conciseness: if O |= A ≡ ⊥, the only axiom with A on the left-hand side is</p>
        <p>A ⊑ ⊥ .
4. Consistency: if the given ontology is inconsistent, the only output is the
axiom ⊤ ⊑ ⊥ .
5. Non-strictness: if O |= A ≡ B, output A ⊑ B and B ⊑ A.</p>
        <p>These specifications were selected in order to obtain a set of SubClassOf
axioms that would represent all subsumptions between named classes, while
omitting irrelevant information.
2.1.2 Ontology consistency For this task, the reasoner was asked to test
the consistency of the ontology (i.e. whether O |= ⊤ ⊑ ⊥ ), and return ‘true’ or
‘false’, respectively.
2.1.3 Concept satisfiability This task was performed by randomly selecting
ten concepts from each ontology in the respective corpus, giving precedence to
unsatisfiable concepts where possible. The reasoner was then asked to test the
satisfiability of the concept, i.e. whether O |= A ≡ ⊥ for a named concept A,
and return ‘true’ or ‘false’, respectively.
2.2</p>
        <p>Benchmark framework
2.2.1 Implementation The aim of the benchmarking framework is to work
with as many different reasoner configurations as possible, without the need to
interfere with reasoner internals. We therefore asked the system developers to
write a simple executable wrapper for their reasoner which would accept input
arguments (ontology file name, reasoning task, output directory, concept name)
and output results according to our specification (a valid OWL file with the class
hierarchy, ‘true’/‘false’ for the consistency and satisfiability tasks, as well as the
time taken for the task, or a separate file with an error trace).</p>
        <p>The time measured is the wall-clock time (in milliseconds) elapsed from the
moment preceding reasoner creation (e.g. before the call to ReasonerFactory.
createReasoner(ontology) in the OWL API [8] where the ontology has already
been parsed into an OWL object) to the completion of the given task, i.e. it
includes the loading and possibly pre-processing time required by the reasoner,
but excludes time taken for file I/O. While measuring CPU time would be more
accurate, it comes with added complexity for concurrent implementations – for
instance, in Java, one would have to aggregate the run times of each thread. The
reasoners are also asked to enforce a five minute timeout, that is, if the measured
time exceeds 5 minutes then the reasoner should stop the ongoing operation, and
terminate itself. Failure to do so will trigger a kill command sent to the running
process after another minute in order to give enough time for the process to
terminate; i.e. the hard timeout is six minutes.</p>
        <p>While one might argue that leaving the reporting of operation times to the
reasoners may be error-prone, we believe that letting reasoner developers
themselves handle the input and output of their system, as well as the time
measurement, is the most straightforward way to include as many systems as possible;
regardless of their implementation programming language, whether they use the
OWL API, employ concurrent implementations, and so on. The large number of
reasoners that was submitted to the competition shows that writing this simple
wrapper script lowered the barrier for participation, and despite some difficulties
with non-standard output, most reasoners adhered to the specifications closely
enough for us to analyse their outputs.</p>
        <p>Additionally, it is clear that reasoners which do not implement the five minute
timeout, but rather rely on the kill signal after the six minute timeout sent by
the benchmark framework, could potentially gain a slight advantage through
this additional minute. However, not only is the number of successfully
completed tasks between the five and six minute marks negligible, but also we have
automatically induced a timeout for those reasoners that exceeded a runtime of
five minutes for some input.
2.2.2 Correctness check The reasoner output was checked for correctness
by a majority vote, i.e. the result returned by the most reasoners was considered
to be correct.5 Since the ontologies in the test corpus were ‘real-life’ ontologies,
5 Unless most reasoners return an empty OWL file, in which case the majority vote is
taken based on those reasoners which output a non-empty file.
this was the most straightforward way to automatically determine correctness
without manual inspection or artificially generating test cases.</p>
        <p>
          In the case of the consistency and satisfiability challenges the output was
a simple unambiguous ‘true’ or ‘false’, so any incorrect results were unlikely
to be caused by erroneous output from a sound reasoner; however, for ontology
classification, the reasoners output an ontology file containing OWL SubClassOf
axioms, which may lead to errors if the systems did not exactly follow the above
specifications on which types of axioms to include or exclude. For the purpose
of verifying correctness of the output we rely on an ontology diff to determine
whether two given results are logically equivalent [6]. The diff is tuned to ignore
(
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) tautological axioms of the type A ⊑ ⊤ for any named concept A, (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ) axioms of
the form ⊥ ⊑ B or A ⊑ B, where A, B are named concepts and A is unsatisfiable,
and (
          <xref ref-type="bibr" rid="ref3">3</xref>
          ) if two result files are not equivalent due to OWL EquivalentClassOf
axioms, these axioms are ignored.6
2.2.3 Success and failure In the end, the outcome of a reasoning task on
an ontology was either ‘success’ or ‘fail’. A reasoner would pass the test (‘solve
the problem’) successfully if it met the following three criteria:
– Process the ontology without throwing an error (e.g. parsing error, out of
memory, unsupported OWL feature, etc.).
– Return a result within the allocated timeout.
        </p>
        <p>– Return the correct result (based on the majority vote).</p>
        <p>Likewise, a reasoner would fail a task if it did one of the following:
– Throw an error and abort the reasoning task.
– Return no result within the allocated time.
– Return an incorrect result (based on the majority vote).</p>
        <p>Note that these criteria mean that a reasoner could successfully solve a task
while being unsound or incomplete, or without completing the reasoning task
within the allocated time. For example, for the classification task, if the reasoner
has already found all required entailed atomic subsumptions without performing
all possible entailment checks within the five minute time frame, it can simply
output this ‘intermediate’ result before terminating the process. Since the
correctness check is performed on whatever the reasoner returns within the timeout,
the resulting output would be considered to be correct, despite the fact that the
reasoner has not fully completed the task.</p>
        <p>Likewise, a reasoner which does not support certain OWL 2 features, such as
datatypes, might find (if there are any to find ) the required atomic subsumptions
via some other ‘route’ if there are several reasons why the entailment holds. In
other words, if there exist multiple justifications (minimal entailing subsets) for
a subsumption of which at least one only contains supported features, then the
reasoner will still be able to find the subsumption without having to process the
6 While the presence of equivalences in some result should not a problem when
compared to a result with these equivalences in subsumption form, reasoners tend not to
produce the latter because they are non-strict subsumptions, so we allowed
equivalences and tuned our diff to ignore them where applicable.
unsupported feature. This is an issue we are planning to address with the next
iteration of the benchmark framework by modifying ontologies (i.e. ‘breaking’
their justifications) in order to specifically test certain OWL 2 features.
2.3</p>
      </sec>
      <sec id="sec-2-2">
        <title>Hardware</title>
        <p>The experiments were run on a cluster of identical computers (one reasoner per
computer) that were made available to us by Konstantin Korovin of the iProver
project7 at The University of Manchester, supported by the Royal Society grant
RG080491. Each computer had the following configuration:
– Intel Xeon QuadCore CPU @2.33GHz
– 12GB RAM (8GB assigned to the process)
– Running the Fedora 12 operating system
– Java version 1.6
2.4</p>
        <p>Test corpora
2.4.1 Main test corpus For each of the OWL 2 profiles [16] used in the
competition (OWL 2 EL, RL, and DL ontologies which were not in any of the
sub-profiles) we gathered a test set of up to 200 ontologies. The pool of ontologies
we sampled from was composed of three corpora: (i) the NCBO BioPortal8
corpus [17], (ii) the Oxford Ontology Library9, and (iii) the corpus of ontologies
from the Manchester Ontology Repository10 [13]. The corpora were filtered to
only include OWL 2 ontologies which had at least 100 axioms and 10 named
concepts. Note that the sample ontology pool, composed of 2499 ontologies,
does contain some degree of duplication due to the intersection of BioPortal, the
Manchester OWL Repository, and the Oxford Ontology Library.</p>
        <p>The ontologies were then binned by profile, i.e. one bin for each of the
following: OWL 2 EL ontologies, OWL 2 RL, and OWL 2 DL. Regarding the latter,
we chose to include here those ontologies that do not fall into any of the
subprofiles (i.e. OWL 2 EL, RL, or QL) in order to ensure that features outside
the sub-profiles were tested. For each of these profile bins, a stratified random
sample was drawn to obtain a set of 200 ontologies:
– 50 small ontologies (between 100 and 499 logical axioms)
– 100 medium sized ontologies (between 500 and 4,999 logical axioms)
– 50 large ontologies (5,000 and more logical axioms)</p>
        <p>Note that these thresholds and weightings were chosen based on the
distribution of ontology sizes we have found in several ontology corpora which follow
(roughly) a normal distribution, with a large number of medium-sized ontologies
and fewer small and large ontologies. While it would have been possible to select
7 http://www.cs.man.ac.uk/~korovink/iprover/
8 http://bioportal.bioontology.org/
9 http://www.cs.ox.ac.uk/isg/ontologies/
10 http://owl.cs.manchester.ac.uk/owlcorpus
exclusively medium-sized and large ontologies, we also expected some small
ontologies to be fairly complex for the reasoners, which is why they were included
in the test corpus.</p>
        <p>In addition to the ontologies from BioPortal, the Oxford Library, the
Manchester Repository, and user-submitted ontologies, the May 2013 version of the
National Cancer Institute (NCI) Thesaurus (NCIt) [5], and the January 2011
version of the Systematized Nomenclature of Medicine (SNOMED) Clinical Terms
(SNOMED CT) [21] were also added to the corpus, respectively to the DL and
EL profile bins.</p>
        <p>The experiments were run on the OWL functional syntax serialisations of
the selected ontologies, except for one reasoner (Konclude) which currently only
supports OWL/XML syntax. A number of ontologies serialised into functional
syntax (55 across all the sets) turned out to be broken (they were correctly loaded
and serialised, but the serialisation could not be parsed back by the OWL API),
possibly due to problems with the respective serialiser in the OWL API (version
3.4.4). These were replaced by random selections for their respective bin. The
same occurred for 12 ontologies serialised into OWL/XML.</p>
        <p>The entire sampling process was performed twice in order to create two
complete test sets: Set A for the offline competition, and Set B for the live
competition. Note that some ontologies occurred in both Set A and B: 40 ontologies
occurred in both Set A and B for the DL category, Set A and B were fully
identical for the EL category, and 29 ontologies were shared between Set A and B
in the RL category.
2.4.2 User-submitted ontologies In the call for submissions to the ORE
2013 workshop, we also included a call for ‘hard’ ontologies and potential
reasoner benchmark suites. Several groups of ontology and reasoner developers
submitted their ontologies, which were either newly developed OWL ontologies or
modifications of existing ones. These included:
– C. M. Keet, A. Lawrynowicz, C. d’Amato, M. Hilario: the Data Mining
OPtimization Ontology (DMOP) [12], a complex ontology with around 3,000
logical axioms in the SROIQ(D) description logic which makes uses of all
OWL 2 DL features.
– M. Samwald: Genomic CDS [20], an ALCQ ontology containing around 4,000
logical axioms, which involves a high number of qualified number restrictions
of the type ‘exactly 2’.
– V. Chaudhri, M. Wessel: Bio KB 101 [2], a set of OWL approximations of
the first-order logic representation of a biology textbook, which consists of
432 different approximations containing various OWL 2 features. Only 72
of these files were in the OWL 2 DL profile and thus used for the reasoner
evaluation.
– W. Song, B. Spencer, W. Du: three ontology variants:
• FMA-FNL, a variant of the FMA (Foundational Model of Anatomy)
ontology [19], a large and highly cyclic ALCOI(D) ontology with over
120,000 locial axioms.
• GALEN-FNL, a highly cyclic ALCHOI(D) variant of the well-known
Galen ontology [18], which contains around 37,000 logical axioms and
951 object properties.
• GALEN-Heart: a highly cyclic ALCHOI(D) ontology containing a
module extracted from the Galen ontology with over 10,000 logical axioms.
– S. Croset: Functional Therapeutic Chemical Classification System (FTC)11,
a large ontology with nearly 300,000 logical axioms in the OWL 2 EL profile.</p>
        <p>As mentioned above, some of the user-submitted ontologies (all except Bio
KB and DMOP) were added to the set used in the competition. Additionally, we
also performed a separate benchmark on all of the user-submitted ontologies.
3
3.1</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Participating reasoners</title>
      <sec id="sec-3-1">
        <title>OWL 2 DL reasoners</title>
        <p>Chainsaw [28] is a ‘metareasoner’ which first computes modules for an
ontology, then delegates the processing of those modules to an existing OWL 2
DL reasoner, e.g. FaCT++ in the current implementation.</p>
        <p>FaCT++ [27] is a tableaux reasoner written in C++ which supports the full</p>
        <p>OWL 2 DL profile.</p>
        <p>HermiT [4] is a Java-based OWL 2 DL reasoner implementing a hypertableau
calculus.</p>
        <p>JFact is a Java implementation of the FaCT++ reasoner with extended datatype
support.12
Konclude is a C++ reasoner supporting the full OWL 2 DL profile except
datatypes. It uses an optimised tableau algorithm which also supports
parallelised processing of non-deterministic branches and the parallelisation of
higher-level reasoning tasks, e.g. satisfiability and subsumption tests.13
MORe [1] is Java-based modular reasoner which integrates a fully-fledged (and
slower) reasoner with a profile specific (and more efficient) reasoner. In the
competition, MORe has integrated both HermiT and Pellet [24] as OWL 2
DL reasoners and ELK as the OWL 2 EL profile specific reasoner.</p>
        <p>Treasoner [7] is a Java reasoner which implements a standard tableau
algorithm for SHIQ.</p>
        <p>TrOWL [26] is an approximative OWL 2 DL reasoner. In particular, TrOWL
utilises a semantic approximation to transform OWL 2 DL ontologies into
OWL 2 QL for conjunctive query answering and a syntactic approximation
from OWL 2 DL to OWL 2 EL for TBox and ABox reasoning.</p>
        <p>WSClassifier [25] is a Java reasoner for the ALCHOI(D) fragment of OWL
2 DL, using a hybrid of the consequence based reasoner ConDOR [23] and
hypertableau reasoner HermiT.
11 https://www.ebi.ac.uk/chembl/ftc/
12 http://sourceforge.net/projects/jfact/
13 http://www.derivo.de/en/produkte/konclude/
ELepHant [22] is a highly optimised consequence-based EL+ reasoner
written in C, which is aimed at platforms with limited memory and computing
capabilities (e.g. embedded systems).</p>
        <p>ELK [11] is a consequence-based Java reasoner which utilises multiple cores/processors
by parallelising multiple threads.
jcel [14] uses a completion-based algorithm, which is a generalization of CEL’s
algorithm. It is a Java reasoner which supports ontologies in EL+.</p>
        <p>SnoRocket [15] is a Java reasoner developed for the efficient classification of the
SNOMED CT ontology. It implements a multi-threaded saturation algorithm
similar to that of ELK, thus support concurrent classification.
BaseVISor is a Java-based forward-chaining inference engine which supports</p>
        <p>OWL 2 RL and XML Schema Datatypes.14</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Results – Offline competition</title>
      <sec id="sec-4-1">
        <title>OWL 2 DL results</title>
        <p>Nine reasoners entered the OWL 2 DL category, although not all of them
competed in the three reasoning tasks. MORe participated with both HermiT and
Pellet as the internal DL reasoner. The results for the classification, consistency,
and satisfiability tasks are shown in Figure 1.</p>
        <p>In the classification task, HermiT performed best in terms of robustness with
147 out of 204 ontologies that were correctly processed within the timeout (at
12.3s per ontology), whereas MORe-Pellet achieved the smallest mean time (2.8s
per ontology) for the 141 ontologies it processed correctly.</p>
        <p>In the consistency task, Konclude processed the highest number of ontologies
correctly (186 out of 204), while also performing fastest on average with 1.7s per
ontology; Konclude was also twice as fast as the second faster reasoner (HermiT).</p>
        <p>Finally, for the DL satisfiability task, Konclude also processed the highest
number of concepts correctly (1,929 out of 2,040) within the given timeout, while
coming second after Chainsaw (1.3s) in terms of speed, with a mean time of 1.8s
per ontology.
4.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>OWL 2 EL results</title>
        <p>In addition to the EL-specific reasoners, all OWL 2 DL reasoners also
participated in the EL category; the results for all participating reasoners on the three
reasoning tasks in the EL profile are shown in Figure 2. In both the
classification and consistency categories, ELK performed extremely well both in terms
14 http://vistology.com/basevisor/basevisor.html</p>
        <sec id="sec-4-2-1">
          <title>Classification OWL 2 DL Ontologies Robustness Avg. time (s)</title>
          <p>)
4
0
2
f
o 200
t
u
o
(
r
e
b
m
u
N</p>
          <p>0
)
4
0
2
r
e
b
m
u
N</p>
          <p>0
)
0
4
0
2
f
o
t
u
o
(
m
e
l
b
o
r
p
m
u
N
2000
1500
1000
500</p>
          <p>0
)
s
(
2.7 2.9 2.3 2.6</p>
          <p>3.2
r
e
b
m
u
N</p>
          <p>0
)
0
0
2
f 200
o
t
u
o
(
r
e
b
m
u
N
)
0
0
0
2
f
o
t
u
o
(
m
e
l
b
o
r
p
m
u
N
2000
1500
1000
500
0
1.0
)
s
(
A
)
s
(
A
)
s
(
of robustness (185 and 200 out of 200 correctly processed ontologies) as well
as average speed (0.9s for classification, 0.5s for consistency checking). Perhaps
surprisingly, MORe with both HermiT and Pellet performed worse than ELK on
robustness, as we expected its combination of ELK with a DL reasoner to handle
more ontologies than the stand-alone version of ELK which does not support the
full OWL 2 EL profile. However, it is possible that the DL reasoners in MORe
got in fact ‘held up’ by those parts of the ontologies that the stand-alone ELK
simply ignored, which may have caused a this slightly worse result.</p>
          <p>Two of the EL-specific reasoners SnoRocket and ELepHant both performed
comparatively fast on those ontologies they did successfully process, but failed
to process a large number of ontologies (44 and 62, respectively). The remaining
EL reasoner, jcel, was slower than most other reasoners, while also failing to
process 38 of the 200 ontologies in the given time.</p>
          <p>Finally, for the satisfiability checking task in the EL category, Chainsaw
processed the highest number of concepts (all 2,000) correctly while also being
second fastest with an average of 1s per concept. MORe with both Pellet and
HermiT also completed all 2,000 concepts within the given timeouts, while ELK
performed fastest on those 1,990 concepts it did process.
Only one profile-specific reasoner (BaseVISor) competed in the OWL 2 RL
category. Figure 3 shows the results for the three challenges in the RL profile
category. Out of the eleven competing reasoners, BaseVISor failed on a significantly
large number of ontologies in the classification challenge and processed only 34
of the 197 ontologies correctly. 17 of these failures were due to parsing errors,
ten were caused by timeouts that did not return any results, and the remaining
failures were due to incorrect results (according to our correctness check). The
winning reasoner here was TReasoner, which—despite being the second-slowest
reasoner in the group—correctly classified 181 of the 197 ontologies, while most
other reasoners correctly processed between 151 and 157 ontologies.</p>
          <p>In the consistency checking task, Konclude correctly processed all 197
ontologies, while also performing significantly faster than the other reasoners. Finally,
the RL satisfiability category was won by both MORe versions, which correctly
processed all 1,970 concepts at an average speed of 0.7s per concept.
5</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Results – Live competition</title>
      <p>The live competition was performed using only the classification task in the
OWL 2 DL and EL categories, since this is the task supported by most reasoners.
The setup was slightly modified from that of the live competition: rather than
running the reasoners until they had processed all ontologies in the corpus, we
set a strict timeout of one hour for the EL classification task and two hours
for the DL classification task, and measured how many ontologies the reasoners
would successfully classify in the given time (applying the same five/six minute
)
7
9
1
fo 200
t
u
o
(
s
m 150
e
l
b
o
r
p
d 100
e
v
l
o
s
f
ro 50
e
b
m
u
N</p>
      <p>0
)
7
9
1
f 200
o
t
u
o
(
s
m 150
e
l
b
o
r
p
d 100
e
v
l
o
s
f
ro 50
e
b
m
u
N</p>
      <p>0
chainsaw
baseviso</p>
      <p>r</p>
      <sec id="sec-5-1">
        <title>Robustness</title>
        <p>Avg. time (s)
190</p>
        <p>189
Fig. 3: Results (robustness/average time) for the OWL 2 RL category.
timeout per ontology as in the offline competition). As mentioned above, the
live competition was performed on Set B, which was entirely different for the
DL category, but nearly identical (due to the small number of available EL
ontologies) to Set A in the EL category. That is, we expected the results for the
DL category to differ from the offline competition, while the results for the EL
competition would be largely identical.</p>
        <p>The live competition was held on the second day of the Description Logic
2013 workshop, allowing workshop participants to place bets on the reasoner
performance, while the current status for each reasoner (number of attempted
and number of successfully classified ontologies) was shown and continuously
updated on a screen.
Due to the use of the different test corpus (Set B) in the live competition, we
expected a slightly different outcome from the offline competition. And indeed,
the winning reasoner (in terms of number of correctly processed ontologies) was
WSClassifier, which had shown an average performance in the offline
competition. WSClassifier processed 153 out of the 221 ontologies in the test corpus,
with an average time of 5.1s per ontology, while the reasoner in second place was
Konclude, with 141 ontologies and an average time of 1.4s per ontology. Figure 4
shows an overview of the number of processed ontologies and classification times
in the DL category.</p>
      </sec>
      <sec id="sec-5-2">
        <title>Classification OWL 2 EL Ontologies (live)</title>
        <p>0.5 0.1
elk more-mheorrme-ithpeerllmetitwsclasksoinficelrutrdoewl snorocjckeelt fact elephajfnatct treasocnhearinsaw
The number of processed ontologies and mean classifications for all reasoners
participating in the EL live competition can be found in Figure 5. Perhaps
unsurprisingly, in the EL classification challenge the results were very similar
to the offline challenge, with ELK classifying 196 out of the 200 ontologies at
an average speed of 0.5s per ontology. Again, ELepHant was clearly the fastest
reasoner with less than 0.1 seconds per ontology, but it also failed on 56 of the
200 ontologies.
6</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Results – User-submitted ontologies</title>
      <p>As with the live competition, the results for the user-submitted ontologies
presented here are limited to the classification task, as we consider this to be the
most relevant (TBox) reasoning task which is supported by all reasoners in the
competition. Note that due to the high number of timeouts and errors on some
of these ontologies, the correctness of the successful reasoners could not be
determined.
6.1</p>
      <sec id="sec-6-1">
        <title>OWL 2 DL classification results</title>
        <p>In total, 66 user-submitted ontologies fell into the OWL 2 DL profile, which
included the three modified versions of FMA and GALEN discussed above, two
versions of the Genomic CDS knowledge base (CDS and CDS-demo), 58 different
variants of the Bio KB 101 ontology (of which four were considered to be the
most challenging by the ontology developers), and three of the DMOP ontologies.</p>
        <p>Except for Chainsaw and TReasoner, all reasoners could successfully classify
54 of the Bio KB ontologies within the five minute timeout, while none of the
reasoners processed any of the four ‘hard’ Bio KB ontologies within the timeout.</p>
        <p>The only reasoners that could process both Genomic CDS ontologies were
TrOWL and WSClassifier (at an average time of approximately 3 and 8 seconds),
while FaCT++ also managed to classify the complete Genomic CDS in 100
seconds. Interestingly, HermiT was the only reasoner to report that the Genomic
CDS ontology was inconsistent.</p>
        <p>TrOWL and WSClassifier were also the only reasoners to classify the FMA
and GALEN modifications within the timeout (perhaps unsurprisingly, since
WSClassifier was tuned to work with these ontologies), while both Chainsaw
and FaCT++ successfully processed the two GALEN versions, and MORe-Pellet
processed the GALEN-FNL version in 14 seconds. For the remaining ontologies,
all reasoners except FaCT++ and TrOWL reported datatype errors.</p>
        <p>At an average of 0.17 seconds per processed ontology, Konclude was clearly
fastest, while most other reasoners also managed average times of less than five
seconds for the ontologies they processed correctly.
There were 19 user-submitted OWL 2 EL ontologies, 18 of which were variants
of the Bio KB 101 ontology, and the FTC knowledge base. Neither Chainsaw
nor ELepHant could process any of the Bio KB ontologies within the five minute
timeout, while ELK reported a parsing error. The remaining reasoners, except
Snorocket, processed all 18 Bio KB ontologies correctly within the timeout, with
Konclude being fastest at 0.1 seconds per ontology.</p>
        <p>ELK, Konclude, and WSClassifier all successfully processed the FTC KB,
with ELK clearly being fastest at five seconds (it did, however, ignore three
ObjectPropertyRange axioms which are outside the OWL 2 EL fragment supported
by ELK), and the other two reasoners taking between 20 and 30 seconds. The
remaining reasoners either timed out or reported an error for this ontology.
All 18 ontologies in the OWL 2 RL profile were variants of Bio KB 101.
BaseVISor failed to parse the input on all files, while Chainsaw timed out on 15 of the
ontologies. The remaining reasoners all classified the ontologies correctly within
the five minute timeout, with Konclude processing the ontologies at an average
of 0.15 seconds.
7</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Summary</title>
      <p>In this report we presented an overview of the methodology and results of the
ORE reasoner competition for the different categories, OWL 2 profiles, and test
corpora. There were a total of 14 OWL reasoners submitted for participation
in the competition, which made it all the more successful. Out of these, 5 were
profile specific reasoners (4 OWL 2 EL and 1 OWL 2 RL) while 9 were OWL 2
DL reasoners or supported a large fragment of SROIQ(D) not included within
the OWL 2 EL, RL or QL profiles. The reasoners were evaluated with a random
sample of ontologies from known repositories, on three standard reasoning tasks:
classification, consistency checking, and concept satisfiability. In the competition
we gave preference to how robust the systems were, that is, the number of tests
correctly passed within the given timeout, rather than reasoning times alone.
The top 3 reasoners for each category are listed below:</p>
      <sec id="sec-7-1">
        <title>OWL 2 DL Ontologies</title>
        <p>
          – Classification: (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) HermiT (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ) MORe-HermiT/MORe-Pellet (
          <xref ref-type="bibr" rid="ref3">3</xref>
          ) Konclude
– Consistency: (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) Konclude (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ) FaCT++ (
          <xref ref-type="bibr" rid="ref3">3</xref>
          ) Chainsaw
– Satisfiability: (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) Konclude (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ) MORe-Pellet/MORe-HermiT (
          <xref ref-type="bibr" rid="ref3">3</xref>
          ) TReasoner
– Classification (live): (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) WSClassifier (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ) Konclude (
          <xref ref-type="bibr" rid="ref3">3</xref>
          ) TReasoner
        </p>
      </sec>
      <sec id="sec-7-2">
        <title>OWL 2 EL Ontologies</title>
        <p>
          – Classification: (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) ELK (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ) WSClassifier (
          <xref ref-type="bibr" rid="ref3">3</xref>
          ) MORe-HermiT
– Consistency: (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) ELK (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ) HermiT (
          <xref ref-type="bibr" rid="ref3">3</xref>
          ) Konclude
– Satisfiability: (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) Chainsaw (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ) MORe-Pellet (
          <xref ref-type="bibr" rid="ref3">3</xref>
          ) TrOWL
– Classification (live): (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) ELK (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ) MORe-HermiT/MORe-Pellet (
          <xref ref-type="bibr" rid="ref3">3</xref>
          ) HermiT
        </p>
      </sec>
      <sec id="sec-7-3">
        <title>OWL 2 DL Ontologies</title>
        <p>
          – Classification: (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) TReasoner (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ) Konclude (
          <xref ref-type="bibr" rid="ref3">3</xref>
          ) TrOWL
– Consistency: (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) Konclude (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ) HermiT (
          <xref ref-type="bibr" rid="ref3">3</xref>
          ) Chainsaw
– Satisfiability: (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) MORe-HermiT/MORe-Pellet (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ) Konclude (
          <xref ref-type="bibr" rid="ref3">3</xref>
          ) HermiT
        </p>
        <p>Additionally, the MORe and ELepHant reasoners were also given a special
recognition prize. MORe was selected as the best newcomer reasoner since it
consistently performed well in terms of time and robustness. The ELepHant
reasoner, although it struggled with a high number of errors, was incredibly fast
for the ontologies that it was able to classify correctly, and so was awarded a
special mention. We look forward to seeing the evolution of these novel reasoners.</p>
        <p>Regarding the user-submitted ontologies, it is interesting to see that most
reasoners could either process all or none of the Bio KB ontologies. When they
did process them, the classification times were fairly uniform. The results for the
GALEN and FMA modifications, which were specifically developed for testing
with WSClassifier, confirmed the robustness of the reasoner on these ontologies;
however, the other two reasoners which could process the GALEN modifications
(Chainsaw and FaCT++) were significantly faster within the timeout. Our
experiments on the Genomic CDS ontologies confirmed the reports of the
ontology developer [20] who found that out of the now ‘mainstream’ reasoners, only
TrOWL could process the ontology in reasonable time, while HermiT (falsely)
reported an inconsistency error. While we have seen that WSClassifier could also
process the ontology, the correctness of the classification result is unclear, since
WSClassifier does not support qualified number restrictions which are heavily
used in Genomic CDS.</p>
        <p>Finally, we have only carried out our benchmark with a fixed timeout of five
minutes in the main offline and live competitions, which may have been too short
for some of these ontologies, e.g. the four ‘challenging’ Bio KB ontologies could
not be processed by any of the reasoners Thus, we are planning to re-run these
tests with longer timeouts in the near future.</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgements</title>
      <p>We thank Konstantin Korovin (supported by the Royal Society grant RG080491)
at the University of Manchester who kindly provided us with the PC cluster for
the competition. We also thank the developers of the submitted reasoners and
ontologies for their invaluable effort. We also gratefully acknowledge the support of
the ORE workshop sponsor: B2i Healthcare (https://www.b2international.
com/). Ernesto Jimenez-Ruiz was supported by the Seventh Framework Program
(FP7) of the European Commission under Grant Agreement 318338, ‘Optique’,
and the EPSRC projects Score!, ExODA and MaSI3.
12. Keet, C.M., Lawrynowicz, A., d’Amato, C., Hilario, M.: Modeling issues and
choices in the Data Mining OPtimization Ontology. In: OWL: Experiences and
Directions (2013)
13. Matentzoglu, N., Bail, S., Parsia, B.: A corpus of owl dl ontologies. In: 26th
International Workshop on Description Logics (DL). pp. 829–841 (2013)
14. Mendez, J.: jcel: A Modular Rule-based Reasoner. In: 1st OWL Reasoner
Evaluation Workshop (ORE) (2012)
15. Metke Jimenez, A., John Lawley, M.: Snorocket 2.0: Concrete Domains and
Concurrent Classification. In: 2nd OWL Reasoner Evaluation Workshop (ORE) (2013)
16. Motik, B., Cuenca Grau, B., Horrocks, I., Wu, Z., Fokoue, A., Lutz, C.: OWL 2
web ontology language profiles. W3C Recommendation (2009)
17. Noy, N.F., Shah, N.H., Whetzel, P.L., Dai, B., Dorf, M., Griffith, N., Jonquet, C.,
Rubin, D.L., Storey, M.A.D., Chute, C.G., Musen, M.A.: Bioportal: ontologies and
integrated data resources at the click of a mouse. Nucleic Acids Research
37(WebServer-Issue), 170–173 (2009)
18. Rector, A.L., Rogers, J.E., Zanstra, P.E., Van Der Haring, E., Openg:
OpenGALEN: open source medical terminology and tools. AMIA Annu Symp Proc
(2003)
19. Rosse, C., Mejino Jr., J.: A reference ontology for biomedical informatics: the</p>
      <p>
        Foundational Model of Anatomy. J. Biomed. Informatics 36(
        <xref ref-type="bibr" rid="ref6">6</xref>
        ), 478–500 (2003)
20. Samwald, M.: Genomic CDS: an example of a complex ontology for
pharmacogenetics and clinical decision support. In: 2nd OWL Reasoner Evaluation Workshop
(ORE) (2013)
21. Schulz, S., Cornet, R., Spackman, K.A.: Consolidating SNOMED CT’s ontological
commitment. Applied Ontology 6(
        <xref ref-type="bibr" rid="ref1">1</xref>
        ), 1–11 (2011)
22. Sertkaya, B.: The ELepHant Reasoner System Description. In: 2nd OWL Reasoner
      </p>
      <p>Evaluation Workshop (ORE) (2013)
23. Simancik, F., Kazakov, Y., Horrocks, I.: Consequence-Based Reasoning beyond
Horn Ontologies. In: 22nd International Joint Conference on Artificial Intelligence
(IJCAI). pp. 1093–1098 (2011)
24. Sirin, E., Parsia, B., Cuenca Grau, B., Kalyanpur, A., Katz, Y.: Pellet: A practical</p>
      <p>
        OWL DL reasoner. J. of Web Semantics 5(
        <xref ref-type="bibr" rid="ref2">2</xref>
        ), 51–53 (2007)
25. Song, W., Spencer, B., Du, W.: A Transformation Approach for Classifying
ALCHI(D) Ontologies with a Consequence-based ALCH Reasoner. In: 2nd OWL
Reasoner Evaluation Workshop (ORE) (2013)
26. Thomas, E., Pan, J.Z., Ren, Y.: TrOWL: Tractable OWL 2 Reasoning
Infrastructure. In: 7th Extended Semantic Web Conference (ESWC). pp. 431–435 (2010)
27. Tsarkov, D., Horrocks, I.: FaCT++ Description Logic Reasoner: System
Description. In: Third International Joint Conference on Automated Reasoning (IJCAR).
pp. 292–297 (2006)
28. Tsarkov, D., Palmisano, I.: Chainsaw: a Metareasoner for Large Ontologies. In: 1st
      </p>
      <p>OWL Reasoner Evaluation Workshop (ORE) (2012)
29. Wrigley, S.N., Garcia-Castro, R., Nixon, L.J.B.: Semantic Evaluation At Large
Scale (SEALS). In: The 21st World Wide Web Conf., WWW (Companion Volume).
pp. 299–302 (2012), http://www.seals-project.eu</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Armas</given-names>
            <surname>Romero</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Cuenca Grau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Horrocks</surname>
          </string-name>
          ,
          <string-name>
            <surname>I.</surname>
          </string-name>
          ,
          <article-title>Jim´enez-</article-title>
          <string-name>
            <surname>Ruiz</surname>
          </string-name>
          , E.:
          <article-title>MORe: a Modular OWL Reasoner for Ontology Classification</article-title>
          .
          <source>In: OWL Reasoning Evaluation Workshop (ORE)</source>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Chaudhri</surname>
            ,
            <given-names>V.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wessel</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heymans</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>KB Bio 101: A Challenge for OWL Reasoners</article-title>
          .
          <source>In: 2nd OWL Reasoner Evaluation Workshop (ORE)</source>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Cuenca</given-names>
            <surname>Grau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Horrocks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Motik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Parsia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Patel-Schneider</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.F.</given-names>
            ,
            <surname>Sattler</surname>
          </string-name>
          ,
          <string-name>
            <surname>U.</surname>
          </string-name>
          :
          <article-title>OWL 2: The next step for OWL</article-title>
          .
          <source>J. Web Sem</source>
          .
          <volume>6</volume>
          (
          <issue>4</issue>
          ),
          <fpage>309</fpage>
          -
          <lpage>322</lpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Glimm</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motik</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shearer</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stoilos</surname>
          </string-name>
          , G.:
          <article-title>A novel approach to ontology classification</article-title>
          .
          <source>J. of Web Semantics</source>
          <volume>10</volume>
          (
          <issue>1</issue>
          ) (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Golbeck</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fragoso</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hartel</surname>
            ,
            <given-names>F.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hendler</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oberthaler</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>The National Cancer Institute's Th´esaurus and Ontology</article-title>
          .
          <source>J. Web Sem</source>
          .
          <volume>1</volume>
          (
          <issue>1</issue>
          ),
          <fpage>75</fpage>
          -
          <lpage>80</lpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6. Gonc¸alves, R.S.,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Categorising logical differences between OWL ontologies</article-title>
          .
          <source>In: ACM Conference on Information and Knowledge Management (CIKM)</source>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Grigoryev</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ivashko</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>TReasoner: System Description</article-title>
          .
          <source>In: 2nd OWL Reasoner Evaluation Workshop (ORE)</source>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Horridge</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bechhofer</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>The OWL API: A java api for owl ontologies</article-title>
          .
          <source>Semantic Web</source>
          <volume>2</volume>
          (
          <issue>1</issue>
          ),
          <fpage>11</fpage>
          -
          <lpage>21</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Patel-Schneider</surname>
            ,
            <given-names>P.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>van Harmelen</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <string-name>
            <surname>From</surname>
            <given-names>SHIQ</given-names>
          </string-name>
          and
          <article-title>RDF to OWL: the making of a web ontology language</article-title>
          .
          <source>J. Web Sem</source>
          .
          <volume>1</volume>
          (
          <issue>1</issue>
          ),
          <fpage>7</fpage>
          -
          <lpage>26</lpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yatskevich</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <article-title>Jim´enez-</article-title>
          <string-name>
            <surname>Ruiz</surname>
          </string-name>
          , E. (eds.):
          <source>Proceedings of the 1st International Workshop on OWL Reasoner Evaluation (ORE)</source>
          ,
          <source>CEUR Workshop Proceedings</source>
          , vol.
          <volume>858</volume>
          .
          <string-name>
            <surname>CEUR-WS.org</surname>
          </string-name>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Kazakov</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          , Kr¨otzsch,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Simancik</surname>
          </string-name>
          ,
          <string-name>
            <surname>F.</surname>
          </string-name>
          :
          <article-title>Concurrent classification of EL ontologies</article-title>
          .
          <source>In: International Semantic Web Conference (ISWC)</source>
          . pp.
          <fpage>305</fpage>
          -
          <lpage>320</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>