<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The SZS Ontologies for Automated Reasoning Software</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Geoff Sutcliffe</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Rudnicki P</institution>
          ,
          <addr-line>Sutcliffe G., Konev B., Schmidt R., Schulz S.</addr-line>
          <institution>(eds.); Proceedings of the Combined KEAPPA - IWIL Workshops</institution>
          ,
          <addr-line>pp. 38-49</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Miami</institution>
        </aff>
      </contrib-group>
      <fpage>38</fpage>
      <lpage>49</lpage>
      <abstract>
        <p>This paper describes the SZS ontologies that provide status values for precisely describing what is known or has been established about logical data. The ontology values are useful for describing existing logical data, and for automated reasoning software to describe their input and output. Standards for presenting the ontology values are also provided. The real use of automated reasoning software - automated theorem proving (ATP) systems and other tools - is not as standalone software that a user invokes directly, but rather as embedded components of more complex reasoning systems. For one example, NASA's certifiable program synthesis system [6] embeds the SSCPA ATP system harness [20], the ATP systems E [13], SPASS [24], Vampire [12], and the GDV derivation verifier [17]. For another example, SRI's BioDeducta system [14] embeds the ATP system SNARK [15], and the BioBike integrated knowledge base and biocomputing platform [10]. In this embedded context automated reasoning software is typically treated as a black box with known processing capabilities. In order to use the software, the host system must know how to invoke the software, how to pass data into the software, and how to accept data produced by the software. The data passed in to and out from automated reasoning software typically consists of logical data, e.g., formulae, derivations, interpretations, etc., and status values that describe what is known or has been established about the logical data, e.g., the nature of the formulae, their theoremhood or satisfiability, a reason why the software could not process the data, etc. For software that works with first-order logic, the de facto standard for expressing logical data is the TPTP language [19] (and it is expected that this will soon extend to higher-order logic [4]). The SZS ontologies that are linked to the TPTP are used by some automated reasoning software to express the status values. This paper describes the SZS ontologies and their use by automated reasoning software. The status information output by current automated reasoning software varies widely in quantity, quality, and meaning. At the low end of the scale, for example, an ATP system might report only an assurance that the input problem's conjecture is a theorem of the axioms (the wonderful “yes” output). In some cases the claimed status is misleading, e.g., when a clause normal form refutation based ATP system claims that a first-order input problem consisting of axioms and a conjecture is “unsatisfiable”, it typically means that the conjecture is a theorem of the axioms. At the high end of the scale, for example, a tool such as Infinox might report that a set of formulae does not have a finite model, or, for another example, a set of formulae might be tagged as representing a Herbrand interpretation. In order to seamlessly embed automated reasoning software in more complex reasoning systems, it is necessary to correctly and precisely specify status values for the input and output data. The SZS ontologies provide fine grained ontologies of status values that are suitable for this task. The SZS success ontology provides status values to describe what is known or has been successfully established about the relationship between the axioms and conjecture in logical data. It is described in Section 2. The SZS no-success ontology provides status values to describe why a success ontology value has not been established. It is described in Section 3. The SZS dataform ontology provides status values</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>to describe the nature of logical data. It is described in Section 4. All status values are expressed as
“OneWord”, and also have a three letter mnemonic. In addition to the ontologies themselves, standards
for presenting status values have been specified. These are described in Section 5.
2</p>
      <p>
        The SZS Success Ontology
The SZS success ontology was inspired by work done to establish communication protocols for systems
on the MathWeb Software Bus [
        <xref ref-type="bibr" rid="ref2 ref25">2, 25</xref>
        ]. The ontology assumes that the logical data is a 2-tuple of the
form hAx;Ci, where Ax is a set (conjunction) of axioms and C is a conjecture formula. This is a common
standard usage of ATP systems. If the input is not of the form hAx;Ci, it is treated as a conjecture
formula (even if it is a “set of axioms” from the user view point, e.g., a set of formulae all with the
TPTP role axiom), and the 2-tuple is hT RU E;Ci. The success ontology values are based on the possible
relationships between the sets of models of Ax and C. The ontology values can also be interpreted in
terms of the formula F Ax ) C. For example, the status Theorem means that the set of models of Ax
is a (not necessarily strict) subset of the set of models of C, i.e., every model of Ax is a model of C. In
this case F is valid.
      </p>
      <p>Figure 1 shows the success ontology (many of the “OneWord” status values are abbreviated in the
figure - see the list below for the official full “OneWord”s). The lines in the ontology can be followed
up the hierarchy as isa links, e.g., an ETH isa EQV isa (SAT and a THM). Figure 2 shows the
relationships between the model sets for some of the success ontology values. The outer grey ring contains all
interpretations, the long dashed black ring contains the models of Ax, and the short dashed black ring
contains the models of C.</p>
      <p>SatPres</p>
      <p>SAP
from
CSA
to
CUP</p>
      <p>UnsPres</p>
      <p>UNP
EquiSat</p>
      <p>ESA
Satisfiable</p>
      <p>SAT</p>
      <p>Theorem</p>
      <p>THM
FinitelySat</p>
      <p>FSA
Equivalent TautC WeakerC</p>
      <p>EQV TAC WEC</p>
      <p>Success</p>
      <p>SUC
NoConsequence</p>
      <p>NOC
ContraAxioms</p>
      <p>CAX
from</p>
      <p>SAT
EquiCntrSat</p>
      <p>ECS
to</p>
      <p>UNP
CntrSatPres CntrUnsPres</p>
      <p>CSP CUP
CounterThm</p>
      <p>CTH</p>
      <p>CounterSat</p>
      <p>CSA
FinitelyUns</p>
      <p>FUN
WCtrConc UnsConc CntrEquiv</p>
      <p>WCC UNC CEQ
Equiv
Thm
ETH</p>
      <p>Taut- Weaker
ology TautConc
TAU WTC</p>
      <p>Weaker SatConc</p>
      <p>Thm ContraAx
WTH SCA</p>
      <p>SatCtrConc Weaker Weaker
UnsatContraAx CtrThm UnsConc isfiable</p>
      <p>SCC WCT WUC UNS</p>
      <p>Equiv
CntrThm</p>
      <p>ECT
TautConc WConc
ContraAx ContraAx</p>
      <p>TCA WCA</p>
      <p>UnsConc
ContraAx</p>
      <p>UCA</p>
      <p>The meanings of the success ontology values are as follows. Associated with each status value are
some possible dataforms that might be provided to justify the ontology value for given logical data - see
Section 4.
Interpretations
Models of Ax</p>
      <p>Models of C
Satisfiable NoConsequence CounterSat</p>
      <p>SAT NOC CSA
EquivThm</p>
      <p>ETH</p>
      <p>Tautology</p>
      <p>TAU</p>
      <p>WeakerTautConc</p>
      <p>WTC</p>
      <p>WeakerThm</p>
      <p>WTH
TautConc
ContraAx</p>
      <p>TCA</p>
      <p>WConc
ContraAx</p>
      <p>WCA</p>
      <p>UnsConc
ContraAx</p>
      <p>UCA</p>
      <p>WeakerUnsConc</p>
      <p>WUC</p>
      <p>Unsatisfiable</p>
      <p>UNS
WeakerTheorem (WTH): Some interpretations are models of Ax, all models of Ax are models of
C, some models of C are not models of Ax, and some interpretations are not models of C. See
Theorem and Satisfiable.</p>
      <p>ContradictoryAxioms (CAX): No interpretations are models of Ax. F is valid, and anything is a
theorem of Ax. Possible dataforms are Refutations of Ax.</p>
      <p>SatisfiableConclusionContradictoryAxioms (SCA): No interpretations are models of Ax, and some
interpretations are models of C. See ContradictoryAxioms.</p>
      <p>TautologousConclusionContradictoryAxioms (TCA): No interpretations are models of Ax, and all
interpretations are models of C. See TautologousConclusion and
SatisfiableConclusionContradictoryAxioms.</p>
      <p>WeakerConclusionContradictoryAxioms (WCA): No interpretations are models of Ax, and some,
but not all, interpretations are models of C. See SatisfiableConclusionContradictoryAxioms and
SatisfiableCounterConclusionContradictoryAxioms.</p>
      <p>CounterUnsatisfiabilityPreserving (CUP): If there does not exist a model of Ax then there does not
exist a model of :C, i.e., if Ax is unsatisfiable then :C is unsatisfiable.</p>
      <p>CounterSatisfiabilityPreserving (CSP): If there exists a model of Ax then there exists a model of
:C, i.e., if Ax is satisfiable then :C is satisfiable.</p>
      <p>EquiCounterSatisfiable (ECS): There exists a model of Ax iff there exists a model of :C, i.e., Ax
is (un)satisfiable iff :C is (un)satisfiable.</p>
      <p>CounterSatisfiable (CSA): Some interpretations are models of Ax, and some models of Ax are
models of :C. F is not valid, :F is satisfiable, and C is not a theorem of Ax. Possible dataforms
are Models of Ax ^ :C.</p>
      <p>CounterTheorem (CTH): All models of Ax are models of :C. F is not valid, and :C is a theorem
of Ax. Possible dataforms are Proofs of :C from Ax.</p>
      <p>CounterEquivalent (CEQ): Some interpretations are models of Ax, all models of Ax are models
of :C, and all models of :C are models of Ax. F is not valid, and :C is a theorem of Ax. All
interpretations are models of Ax xor of C. Possible dataforms are Proofs of :C from Ax and of Ax
from :C.</p>
      <p>UnsatisfiableConclusion (UNC): Some interpretations are models of Ax, and all interpretations are
models of :C (i.e., no interpretations are models of C). F is not valid, and :C is a tautology.
Possible dataforms are Proofs of :C.</p>
      <p>WeakerCounterConclusion (WCC): Some interpretations are models of Ax, and all models of Ax
are models of :C, and some models of :C are not models of Ax. See CounterTheorem and
CounterSatisfiable.</p>
      <p>EquivalentCounterTheorem (ECT): Some, but not all, interpretations are models of Ax, all models
of Ax are models of :C, and all models of :C are models of Ax. See CounterEquivalent.
FinitelyUnsatisfiable (FUN): All finite interpretations are finite models of Ax, and all finite
interpretations are finite models of :C (i.e., no finite interpretations are finite models of C).
Unsatisfiable (UNS): All interpretations are models of Ax, and all interpretations are models of :C.
(i.e., no interpretations are models of C). F is unsatisfiable, :F is valid, and :C is a tautology.
Possible dataforms are Proofs of Ax and of C, and Refutations of F .</p>
      <p>WeakerUnsatisfiableConclusion (WUC): Some, but not all, interpretations are models of Ax, and
all interpretations are models of :C. See Unsatisfiable and WeakerCounterConclusion.
WeakerCounterTheorem (WCT): Some interpretations are models of Ax, all models of Ax are
models of :C, some models of :C are not models of Ax, and some interpretations are not models
of :C. See CounterSatisfiable.
SatisfiableCounterConclusionContradictoryAxioms (SCC): No interpretations are models of Ax,
and some interpretations are models of :C. See ContradictoryAxioms.</p>
      <p>UnsatisfiableConclusionContradictoryAxioms (UCA): No interpretations are models of Ax, and all
interpretations are models of :C (i.e., no interpretations are models of C). See
UnsatisfiableConclusion and SatisfiableCounterConclusionContradictoryAxioms.</p>
      <p>NoConsequence (NOC): Some interpretations are models of Ax, some models of Ax are models of
C, and some models of Ax are models of :C. F is not valid, F is satisfiable, :F is not valid, :F
is satisfiable, and C is not a theorem of Ax. Possible dataforms are pairs of models, one Model of
Ax ^ C and one Model of Ax ^ :C.</p>
      <p>The success ontology is very fine grained, and has more status values than are commonly used by
automated reasoning software, by ATP systems in particular. A suitable subset for practical uses of ATP
systems is as follows:</p>
      <p>FOF problems with a conjecture - report Theorem or CounterSatisfiable.</p>
      <p>FOF problems without a conjecture - report Satisfiable or Unsatisfiable.</p>
      <p>CNF problems - report Satisfiable or Unsatisfiable.
2.1</p>
      <p>Validation of the Success Ontology
Two steps have been taken towards formal validation of the success ontology. The first step was the
enumeration of the possible relationships between the models of Ax and C (some of which are illustrated
in Figure 2). This provided a basis for the ontology values, and a basis for the isa links. The second
step1 was to axiomatize the ontology and prove relevant properties. (The axiomatization implemented
covers the “positive” part of the ontology regarding Ax and C, and just two commonly used values from
the “negative” part regarding Ax and:C. It is expected that the results obtained will extend without
difficulty to the full ontology.) The axiomatization encodes the relationship between the models of
Ax and C for each ontology value, and, from that, relationships between the ontology values can be
proven. Additionally, a finite model of the axioms was found, demonstrating the consistency of the
axiomatization and hence the ontology.</p>
      <p>The axiomatization is in first-order logic. As example, the axioms that describe the ESA, THM, and
ETH values are given in Figure 3. Four relationships between pairs of ontology values were defined and
axiomatized:
a isa b , meaning that if hAx;Ci has the status a then it also has the status b . For example, WTH
isa THM.
a nota b , meaning that if hAx;Ci has the status a then it does not necessarily have the status b .
For example, THM nota SAT (because SAT does not hold for the case of contradictory Ax).
a nevera b , meaning that if hAx;Ci has the status a then it cannot have the status b . For example,
SAT nevera CAX.</p>
      <p>a xora b , meaning that every hAx;Ci has the status a xor b . For example, THM xora CSA.</p>
      <p>Additionally, axioms that deal with properties of formulae and models were provided. The
relationships and properties axioms are given in Figure 3.</p>
      <p>
        The axiomatization was shown to be consistent by generating a finite model using Paradox [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Some
general properties of the relationships were proved using an ATP system (see below for a discussion of
the ATP system used), e.g., that isa is a transitive relation, and that if a isa b and a nota g then b nota
g. Next the relationships between all pairs of ontology values were investigated, using the ATP system
1Thanks to the reviewer of this paper whose comments instigated this step.
fof(esa,axiom,(
! [Ax,C] :
( ( ? [I1] : model(I1,Ax)
&lt;=&gt; ? [I2] : model(I2,C) )
&lt;=&gt; status(Ax,C,esa) ) )).
to prove the relationships from the axioms. The isa relationship was tested first, as if a pair of ontology
values has the isa relationship they cannot have any of the other three relationships. For those pairs that
were not proved to have the isa relationship, the nevera relationship was tested next. For those pairs that
were proved to have the nevera relationship the xora relationship was tested, and for the other pairs the
nota relationship was tested. Proving a nota relationship requires establishing the existence of formulae
and models that deny the relationship. In the axiomatization five examples are provided, the tautology,
satisfiable, contradiction, non thm spt and sat non taut pair axioms above.
      </p>
      <p>The results of the testing are shown in Table 1, where the vertical axis value has the shown
relationship to the horizontal axis value. The isa relationship is denoted by ), nota by :, nevera by , and xora
by . Sixty-eight pairs were proved to have the isa relationship, 89 to have the nota relationship, 179 to
have the nevera (and not the xora) relationship, 4 to have the xora relationship, and for the remaining two
pairs no relationship could be proved. The latter are the cases that WEC nota WTC and WEC nota TAC,
which require exhibition of an hAx;Ci pair that has the WEC property but in which C is not a tautology.
This could be done explicitly, along the lines of the sat non taut pair axiom above, but that seemed
like cheating. Some of the nota relationships may also be nevera, but could not be proved so.</p>
      <p>
        As mentioned above, automated theorem proving was used to prove the relationships between the
ontology values. At first, proofs were attempted using monolithic ATP systems such as EP, SPASS, and
Vampire. The success rate was low, because the axiomatization forms a large theory - see [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Therefore
the SRASS system [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] was used, and it was highly successful in identifying the necessary axioms for
proving each conjecture, and subsequently obtaining either a proof using EP or an assurance of a proof
using iProver [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. In addition to SRASS, the MANSEX [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] and IDV [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] tools were used during the
initial development of the axiomatization, to find the most obvious relationships and to analyze proofs.
:
:
:
:
:
      </p>
      <p>UNP
SAP
ESA
SAT
THM
EQV
TAC
WEC
ETH
TAU
WTC
WTH
CAX
SCA
TCA
WCA
CSA
UNS
NOC
:
:
:
:
:
:
:
) )
) ) )
: ) : :
) ) ) )
) ) ) )
) ) ) )
) ) ) )
) ) ) )
) ) ) )
) ) ) )
: ) :
)
)
)
: : :
)
)
) ) ) )
:
:
:
:
)
)
)
)
)
)
)
)
)
)
)
:
:
:
:
:
:
:
)
) )</p>
      <p>)
:
:
:
:
:
:
:
:
:
:
:
:
:
)
)
:
:
:
:
:
:
:
:
:
:
:
:
:
:
:
:
:
:
:
:
:
:
:
:
:
:
:
:
:
:
)
)
)
:
:
:
)
)
:
:
:
:
:
:
:
:
:
:
:
:
)
)
UNP SAP ESA SAT THM EQV TAC WEC ETH TAU WTC WTH CAX SCA TCA WCA CSA UNS NOC
All automated reasoning and proof processing was done on a computer with a Intel Xeon 2.80GHz CPU
and 3GB memory, running the Linux 2.6 operating system, and with a 60s CPU time limit per proof
attempt (on the entire SRASS process).</p>
      <p>The formal analysis has had beneficial effects. Three new ontology values were added, three errors in
the definitions of the ontology values were exposed and corrected, four incorrect isa links in the ontology
were found and removed, and several unnoticed isa relationships were revealed and added. The isa links
in Figure 1 correspond to those in Table 1.
3</p>
      <p>
        The SZS No-Success Ontology
While it is always hoped that automated reasoning software will successfully process the logical data, and
hence establish a success ontology value, in reality this often does not happen, for a variety of reasons. In
order to understand and make productive use of a lack of success, e.g., [
        <xref ref-type="bibr" rid="ref11 ref8">11, 8</xref>
        ], it is necessary to precisely
specify the reason for and nature of the lack of success. The SZS no-success ontology provides suitable
status values for describing the reasons. Note that no-success is not the same as failure: failure means
that the software has completed its attempt to process the logical data and could not establish a success
ontology value. In contrast, no-success might be because the software is still running, or that it has not
yet even started processing the logical data. Figure 4 shows the no-success ontology.
      </p>
      <p>The meanings of the no-success ontology values are as follows:</p>
      <p>NoSuccess (NOS): The logical data has not been processed successfully (yet).</p>
      <p>Open (OPN): A success value has never been established.</p>
      <p>Unknown (UNK): Success value unknown, and no assumption has been made.
Open</p>
      <p>OPN
TypeError</p>
      <p>TYE</p>
      <p>Stopped</p>
      <p>STP
Forced</p>
      <p>FOR
Assumed (ASS(U ,S)): The success ontology value S has been assumed because the actual value
is unknown for the no-success ontology reason U . U is taken from the subontology starting at
Unknown in the no-success ontology.</p>
      <p>Stopped (STP): Software attempted to process the data, and stopped without a success status.
Error (ERR): Software stopped due to an error.</p>
      <p>OSError (OSE): Software stopped due to an operating system error.</p>
      <p>InputError (INE): Software stopped due to an input error.</p>
      <p>SyntaxError (SYE): Software stopped due to an input syntax error.</p>
      <p>SemanticError (SEE): Software stopped due to an input semantic error.</p>
      <p>TypeError (TYE): Software stopped due to an input type error (for typed logical data).
Forced (FOR): Software was forced to stop by an external force.</p>
      <p>User (USR): Software was forced to stop by the user.</p>
      <p>ResourceOut (RSO): Software stopped because some resource ran out.</p>
      <p>Timeout (TMO): Software stopped because the CPU time limit ran out.</p>
      <p>MemoryOut (MMO): Software stopped because the memory limit ran out.</p>
      <p>GaveUp (GUP): Software gave up of its own accord.</p>
      <p>Incomplete (INC): Software gave up because it’s incomplete.</p>
      <p>Inappropriate (IAP): Software gave up because it cannot process this type of data.</p>
      <p>InProgress (INP): Software is still running.</p>
      <p>NotTried (NTT): Software has not tried to process the data.</p>
      <p>NotTriedYet (NTY): Software has not tried to process the data yet, but might in the future.</p>
      <p>The no-success ontology is very fine grained, and has more status values than are commonly used by
automated reasoning software. A suitable subset for practical uses is as follows:</p>
      <p>The software stopped due to CPU limit - report Timeout.</p>
      <p>The software gave up due to incompleteness - report GaveUp.</p>
      <p>The software stopped due to an error - report Error.</p>
      <p>Any other cases - report Unknown.</p>
      <p>The SZS Dataform Ontology
The success status values describe what is known or has been established about the relationship between
the axioms and conjecture in logical data, but do not describe the form of logical data. The dataform
ontology provides suitable values for describing the form of logical data. The dataform ontology values
are commonly used to describe data provided to justify a success ontology value, e.g., if an ATP system
reports the success ontology value Theorem it might output a proof to justify that. Figure 5 shows the
dataform ontology.</p>
      <p>Proof</p>
      <p>Prf
Derivation</p>
      <p>Der</p>
      <p>Refutation</p>
      <p>Ref</p>
      <p>Solution</p>
      <p>Sol
Interpretation</p>
      <p>Int
Model
Mod</p>
      <p>LogicalData</p>
      <p>LDa</p>
      <p>NotSoln</p>
      <p>NSo</p>
      <p>None</p>
      <p>Non
ListOfFormulae</p>
      <p>Lof</p>
      <p>IncompletePrf Assurance</p>
      <p>IPr Ass
ListOfTHF</p>
      <p>Lth</p>
      <p>ListOfFOF</p>
      <p>Lfo</p>
      <p>ListOfCNF IncompleteRef</p>
      <p>Lcn IRf
CNFRefutation DomainMap HerbrandModel Sat'nModel</p>
      <p>CRf Dom HMo SMo
IncompleteCNFRef</p>
      <p>ICf
FiniteModel InfiniteModel</p>
      <p>FMo IMo
IncompleteCNFRefutation (ICf): A CNF refutation with parts missing.</p>
      <p>Assurance (Ass): Only an assurance of the success ontology value.</p>
      <p>None (Non): Nothing.</p>
      <p>The dataform ontology is very fine grained, and has more status values than are commonly used by
automated reasoning software, by ATP systems in particular. A suitable subset for practical uses of ATP
systems is as follows:</p>
      <p>A generic proof - report Proof.</p>
      <p>A refutation - report Refutation.</p>
      <p>A CNF refutation - report CNFRefutation.</p>
      <p>A generic model - report Model.</p>
      <p>A finite model - report FiniteModel.</p>
      <p>A Herbrand model - report HerbrandModel.</p>
      <p>A saturation model - report SaturationModel.
or
or
5</p>
      <p>The SZS Presentation Standards
The SZS ontologies provide status values that precisely describe what is known or has been established
about logical data. In order to make the use of the values easy in more complex reasoning systems, it
is necessary to specify precisely how the values should be presented. This makes it easy for harness
software to prepare input data and examine output data that contains ontology values, e.g., in practice, to
grep the output from automated reasoning software for lines that provide status values.</p>
      <p>Success and no-success ontology values should be presented in lines of the form
% SZS status ontology value for logical data identifier
(The leading ’%’ makes the line into a TPTP language comment.) For example
% SZS status Unsatisfiable for SYN075+1
% SZS status GaveUp for SYN075+1</p>
      <p>A success or no-success ontology value should be presented as early as possible, at least before any
data output to justify the value. The justifying data should be delimited by lines of the form
% SZS output start dataform ontology value for logical data identifier
and</p>
      <p>% SZS output end dataform ontology value for logical data identifier
For example
% SZS output start CNFRefutation for SYN075-1</p>
      <p>output data
% SZS output end CNFRefutation for SYN075-1
All “SZS” lines can optionally have software specific information appended, separated by a :, i.e.,
% SZS status ontology value for logical data identifier : software specific information
% SZS output start dataform ontology value for logical data identifier : software specific info
% SZS output end dataform ontology value for logical data identifier : software specific info
For example
% SZS status GaveUp for SYN075+1 : Could not complete CNF conversion
% SZS output end CNFRefutation for SYN075-1 : Completed in CNF conversion</p>
    </sec>
    <sec id="sec-2">
      <title>Conclusion</title>
      <p>
        This paper has presented the SZS ontologies of status values that are suitable for expressing precisely
what is known or has been established about logical data. The ontologies can be used for existing logical
data, e.g., they are used for the status of problems in the TPTP problem library [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] and solutions in the
TSTP solution library [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], and can be used by automated reasoning software to describe their input and
output. Already several ATP systems, e.g., Darwin [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], E, Metis [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], Paradox [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], use the SZS ontologies
and the presentation standards, and this contributes to simplifying their embedding into more complex
reasoning systems.
      </p>
      <p>
        In addition to its use for reporting the overall status of a hAx;Ci 2-tuple, the SZS success ontology is
used to report the status of individual inference steps in TPTP format derivations [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. This is done in
the “useful information” field of an inference record of an inferred formula. For example, in
cnf(58,plain,
( ~ hates(agatha,esk2_1(butler)) ),
inference(spm,[status(thm)],[51,48])).
the status is Theorem (recorded as a lowercase acronym value thm), which indicates that the formulae is
a theorem of it’s two parent formulae 51 and 48. The Theorem status is most common in derivations, but
the SAP and ESA status values are also used quite often, e.g., for the formulae inferred by Skolemization
and splitting steps. These status values can be used for semantic verification of the derivations, as is done
by the GDV derivation verifier [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
      </p>
      <p>While the SZS ontologies are in use and have matured to some extent, it is not claimed that they are
comprehensive and perfect. Developers and users of automated reasoning software are invited to provide
feedback that might lead to improvements and increased usage. Already some users are working on
success ontology values for results from computer algebra and other computational reasoning systems.
In related work, SZS standards for returning answers from question-and-answer systems have been
proposed.2 It is hoped that over time, with increased usage, the ontologies will become battle hardened, and
will be a core standard for automated reasoning.
2See http://www.tptp.org/TPTP/Proposals/AnswerExtraction.html</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <source>[1] Proceedings of the CADE-21 Workshop on Empirically Successful Automated Reasoning in Large Theories, number 257 in CEUR Workshop Proceedings</source>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Armando</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kohlhase</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Ranise</surname>
          </string-name>
          .
          <article-title>Communication Protocols for Mathematical Services based on KQML and OMRS</article-title>
          . In M. Kerber and M. Kohlhase, editors,
          <source>Proceedings of the Calculemus Symposium</source>
          <year>2000</year>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>P.</given-names>
            <surname>Baumgartner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Fuchs</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Tinelli</surname>
          </string-name>
          .
          <article-title>Darwin - A Theorem Prover for the Model Evolution Calculus</article-title>
          . In G. Sutcliffe,
          <string-name>
            <given-names>S.</given-names>
            <surname>Schulz</surname>
          </string-name>
          , and T. Tammet, editors,
          <source>Proceedings of the Workshop on Empirically Successful First Order Reasoning, 2nd International Joint Conference on Automated Reasoning</source>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>C.</given-names>
            <surname>Benzmu</surname>
          </string-name>
          ¨ller,
          <string-name>
            <given-names>F.</given-names>
            <surname>Rabe</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Sutcliffe. THF0 - The Core TPTP</surname>
          </string-name>
          <article-title>Language for Classical Higher-Order Logic</article-title>
          . In P. Baumgartner,
          <string-name>
            <given-names>A.</given-names>
            <surname>Armando</surname>
          </string-name>
          , and D. Gilles, editors,
          <source>Proceedings of the 4th International Joint Conference on Automated Reasoning, Lecture Notes in Artificial Intelligence, page Accepted</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>K.</given-names>
            <surname>Claessen</surname>
          </string-name>
          and
          <string-name>
            <given-names>N.</given-names>
            <surname>Sorensson</surname>
          </string-name>
          .
          <article-title>New Techniques that Improve MACE-style Finite Model Finding</article-title>
          . In P. Baumgartner and C. Fermueller, editors,
          <source>Proceedings of the CADE-19 Workshop:</source>
          Model Computation - Principles, Algorithms, Applications,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>E.</given-names>
            <surname>Denney</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Fischer</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Schumann</surname>
          </string-name>
          .
          <article-title>Using Automated Theorem Provers to Certify Auto-generated Aerospace Software</article-title>
          . In M. Rusinowitch and D. Basin, editors,
          <source>Proceedings of the 2nd International Joint Conference on Automated Reasoning, number 3097 in Lecture Notes in Artificial Intelligence</source>
          , pages
          <fpage>198</fpage>
          -
          <lpage>212</lpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J.</given-names>
            <surname>Hurd</surname>
          </string-name>
          .
          <article-title>First-Order Proof Tactics in Higher-Order Logic Theorem Provers</article-title>
          . In M. Archer,
          <string-name>
            <given-names>B. Di</given-names>
            <surname>Vito</surname>
          </string-name>
          , and C. Munoz, editors,
          <source>Proceedings of the 1st International Workshop on Design and Application</source>
          of Strategies/- Tactics in Higher Order Logics,
          <string-name>
            <surname>number</surname>
            <given-names>NASA</given-names>
          </string-name>
          /CP-2003
          <source>-212448 in NASA Technical Reports</source>
          , pages
          <fpage>56</fpage>
          -
          <lpage>68</lpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A.</given-names>
            <surname>Ireland</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Bundy</surname>
          </string-name>
          .
          <article-title>Productive use of Failure in Inductive Proof</article-title>
          .
          <source>Journal of Automated Reasoning</source>
          ,
          <volume>16</volume>
          (
          <issue>1-2</issue>
          ):
          <fpage>79</fpage>
          -
          <lpage>111</lpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>K.</given-names>
            <surname>Korovin.</surname>
          </string-name>
          iProver
          <article-title>- An Instantiation-Based Theorem Prover for First-order Logic (System Description)</article-title>
          . In P. Baumgartner,
          <string-name>
            <given-names>A.</given-names>
            <surname>Armando</surname>
          </string-name>
          , and D. Gilles, editors,
          <source>Proceedings of the 4th International Joint Conference on Automated Reasoning, number 5195 in Lecture Notes in Artificial Intelligence</source>
          , pages
          <fpage>292</fpage>
          -
          <lpage>298</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>P.</given-names>
            <surname>Massar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Travers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Elhai</surname>
          </string-name>
          , and
          <string-name>
            <surname>J. Shrager.</surname>
          </string-name>
          <article-title>BioLingua: A Programmable Knowledge Environment for Biologists</article-title>
          .
          <source>Bioinformatics</source>
          ,
          <volume>21</volume>
          (
          <issue>2</issue>
          ):
          <fpage>199</fpage>
          -
          <lpage>207</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>R.</given-names>
            <surname>Monroy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bundy</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Ireland</surname>
          </string-name>
          .
          <article-title>Proof Plans for the Correction of False Conjectures</article-title>
          . In F. Pfenning, editor,
          <source>Proceedings of the 5th International Conference on Logic for Programming</source>
          ,
          <source>Artificial Intelligence, and Reasoning, number 822 in Lecture Notes in Artificial Intelligence</source>
          , pages
          <fpage>178</fpage>
          -
          <lpage>189</lpage>
          . Springer-Verlag,
          <year>1994</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>A.</given-names>
            <surname>Riazanov</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Voronkov</surname>
          </string-name>
          .
          <article-title>The Design and Implementation of Vampire</article-title>
          .
          <source>AI Communications</source>
          ,
          <volume>15</volume>
          (
          <issue>2- 3</issue>
          ):
          <fpage>91</fpage>
          -
          <lpage>110</lpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>S.</given-names>
            <surname>Schulz. E: A Brainiac Theorem</surname>
          </string-name>
          <article-title>Prover</article-title>
          .
          <source>AI Communications</source>
          ,
          <volume>15</volume>
          (
          <issue>2-3</issue>
          ):
          <fpage>111</fpage>
          -
          <lpage>126</lpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>J.</given-names>
            <surname>Shrager</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Waldinger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Stickel</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Massar</surname>
          </string-name>
          . Deductive Biocomputing.
          <source>PLoS ONE</source>
          ,
          <volume>2</volume>
          (
          <issue>4</issue>
          ),
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>M.E. Stickel.</surname>
          </string-name>
          <article-title>SNARK - SRI's New Automated Reasoning Kit</article-title>
          . http://www.ai.sri.com/ stickel/snark.html.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>G.</given-names>
            <surname>Sutcliffe. The TSTP Solution</surname>
          </string-name>
          <article-title>Library</article-title>
          . http://www.TPTP.org/TSTP.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>G.</given-names>
            <surname>Sutcliffe. Semantic</surname>
          </string-name>
          Derivation Verification.
          <source>International Journal on Artificial Intelligence Tools</source>
          ,
          <volume>15</volume>
          (
          <issue>6</issue>
          ):
          <fpage>1053</fpage>
          -
          <lpage>1070</lpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>G.</given-names>
            <surname>Sutcliffe</surname>
          </string-name>
          and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Puzis. SRASS -</surname>
          </string-name>
          <article-title>a Semantic Relevance Axiom Selection System</article-title>
          . In F. Pfenning, editor,
          <source>Proceedings of the 21st International Conference on Automated Deduction, number 4603 in Lecture Notes in Artificial Intelligence</source>
          , pages
          <fpage>295</fpage>
          -
          <lpage>310</lpage>
          . Springer-Verlag,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>G.</given-names>
            <surname>Sutcliffe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Schulz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Claessen</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A. Van</given-names>
            <surname>Gelder</surname>
          </string-name>
          .
          <article-title>Using the TPTP Language for Writing Derivations and Finite Interpretations</article-title>
          . In U. Furbach and N. Shankar, editors,
          <source>Proceedings of the 3rd International Joint Conference on Automated Reasoning, number 4130 in Lecture Notes in Artificial Intelligence</source>
          , pages
          <fpage>67</fpage>
          -
          <lpage>81</lpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>G.</given-names>
            <surname>Sutcliffe</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Seyfang</surname>
          </string-name>
          .
          <article-title>Smart Selective Competition Parallelism ATP</article-title>
          . In A. Kumar and I. Russell, editors,
          <source>Proceedings of the 12th International FLAIRS Conference</source>
          , pages
          <fpage>341</fpage>
          -
          <lpage>345</lpage>
          . AAAI Press,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>G.</given-names>
            <surname>Sutcliffe</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.B.</given-names>
            <surname>Suttner</surname>
          </string-name>
          .
          <source>The TPTP Problem Library: CNF Release v1.2.1. Journal of Automated Reasoning</source>
          ,
          <volume>21</volume>
          (
          <issue>2</issue>
          ):
          <fpage>177</fpage>
          -
          <lpage>203</lpage>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>G.</given-names>
            <surname>Sutcliffe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Yerikalapudi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Trac</surname>
          </string-name>
          .
          <source>Multiple Answer Extraction for Question Answering with Automated Theorem Proving Systems. Rejected from the 15th International Conference on Logic for Programming Artificial Intelligence and Reasoning</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>S.</given-names>
            <surname>Trac</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Puzis</surname>
          </string-name>
          , and
          <string-name>
            <surname>G. Sutcliffe.</surname>
          </string-name>
          <article-title>An Interactive Derivation Viewer</article-title>
          . In S. Autexier and C. Benzmu¨ller, editors,
          <source>Proceedings of the 7th Workshop on User Interfaces for Theorem Provers, 3rd International Joint Conference on Automated Reasoning</source>
          , volume
          <volume>174</volume>
          of Electronic Notes in Theoretical Computer Science, pages
          <fpage>109</fpage>
          -
          <lpage>123</lpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>C.</given-names>
            <surname>Weidenbach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Hillenbrand</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Rusev</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Topic</surname>
          </string-name>
          .
          <source>SPASS Version 3</source>
          .0. In F. Pfenning, editor,
          <source>Proceedings of the 21st International Conference on Automated Deduction, number 4603 in Lecture Notes in Artificial Intelligence</source>
          , pages
          <fpage>514</fpage>
          -
          <lpage>520</lpage>
          . Springer-Verlag,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>J.</given-names>
            <surname>Zimmer</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Kohlhase</surname>
          </string-name>
          .
          <article-title>System Description: The MathWeb Software Bus for Distributed Mathematical Reasoning</article-title>
          . In A. Voronkov, editor,
          <source>Proceedings of the 18th International Conference on Automated Deduction, number 2392 in Lecture Notes in Artificial Intelligence</source>
          , pages
          <fpage>139</fpage>
          -
          <lpage>143</lpage>
          . Springer-Verlag,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>