1 Introduction

The SZS Ontologies for Automated Reasoning Software

Geoff Sutcliffe

0 1 0 Rudnicki P , Sutcliffe G., Konev B., Schmidt R., Schulz S. (eds.); Proceedings of the Combined KEAPPA - IWIL Workshops , pp. 38-49 1 University of Miami

38 49

This paper describes the SZS ontologies that provide status values for precisely describing what is known or has been established about logical data. The ontology values are useful for describing existing logical data, and for automated reasoning software to describe their input and output. Standards for presenting the ontology values are also provided. The real use of automated reasoning software - automated theorem proving (ATP) systems and other tools - is not as standalone software that a user invokes directly, but rather as embedded components of more complex reasoning systems. For one example, NASA's certifiable program synthesis system [6] embeds the SSCPA ATP system harness [20], the ATP systems E [13], SPASS [24], Vampire [12], and the GDV derivation verifier [17]. For another example, SRI's BioDeducta system [14] embeds the ATP system SNARK [15], and the BioBike integrated knowledge base and biocomputing platform [10]. In this embedded context automated reasoning software is typically treated as a black box with known processing capabilities. In order to use the software, the host system must know how to invoke the software, how to pass data into the software, and how to accept data produced by the software. The data passed in to and out from automated reasoning software typically consists of logical data, e.g., formulae, derivations, interpretations, etc., and status values that describe what is known or has been established about the logical data, e.g., the nature of the formulae, their theoremhood or satisfiability, a reason why the software could not process the data, etc. For software that works with first-order logic, the de facto standard for expressing logical data is the TPTP language [19] (and it is expected that this will soon extend to higher-order logic [4]). The SZS ontologies that are linked to the TPTP are used by some automated reasoning software to express the status values. This paper describes the SZS ontologies and their use by automated reasoning software. The status information output by current automated reasoning software varies widely in quantity, quality, and meaning. At the low end of the scale, for example, an ATP system might report only an assurance that the input problem's conjecture is a theorem of the axioms (the wonderful “yes” output). In some cases the claimed status is misleading, e.g., when a clause normal form refutation based ATP system claims that a first-order input problem consisting of axioms and a conjecture is “unsatisfiable”, it typically means that the conjecture is a theorem of the axioms. At the high end of the scale, for example, a tool such as Infinox might report that a set of formulae does not have a finite model, or, for another example, a set of formulae might be tagged as representing a Herbrand interpretation. In order to seamlessly embed automated reasoning software in more complex reasoning systems, it is necessary to correctly and precisely specify status values for the input and output data. The SZS ontologies provide fine grained ontologies of status values that are suitable for this task. The SZS success ontology provides status values to describe what is known or has been successfully established about the relationship between the axioms and conjecture in logical data. It is described in Section 2. The SZS no-success ontology provides status values to describe why a success ontology value has not been established. It is described in Section 3. The SZS dataform ontology provides status values

1 Introduction

to describe the nature of logical data. It is described in Section 4. All status values are expressed as “OneWord”, and also have a three letter mnemonic. In addition to the ontologies themselves, standards for presenting status values have been specified. These are described in Section 5. 2

The SZS Success Ontology The SZS success ontology was inspired by work done to establish communication protocols for systems on the MathWeb Software Bus [ 2, 25 ]. The ontology assumes that the logical data is a 2-tuple of the form hAx;Ci, where Ax is a set (conjunction) of axioms and C is a conjecture formula. This is a common standard usage of ATP systems. If the input is not of the form hAx;Ci, it is treated as a conjecture formula (even if it is a “set of axioms” from the user view point, e.g., a set of formulae all with the TPTP role axiom), and the 2-tuple is hT RU E;Ci. The success ontology values are based on the possible relationships between the sets of models of Ax and C. The ontology values can also be interpreted in terms of the formula F Ax ) C. For example, the status Theorem means that the set of models of Ax is a (not necessarily strict) subset of the set of models of C, i.e., every model of Ax is a model of C. In this case F is valid.

Figure 1 shows the success ontology (many of the “OneWord” status values are abbreviated in the figure - see the list below for the official full “OneWord”s). The lines in the ontology can be followed up the hierarchy as isa links, e.g., an ETH isa EQV isa (SAT and a THM). Figure 2 shows the relationships between the model sets for some of the success ontology values. The outer grey ring contains all interpretations, the long dashed black ring contains the models of Ax, and the short dashed black ring contains the models of C.

SatPres

SAP from CSA to CUP

UnsPres

UNP EquiSat

ESA Satisfiable

SAT

Theorem

THM FinitelySat

FSA Equivalent TautC WeakerC

EQV TAC WEC

Success

SUC NoConsequence

NOC ContraAxioms

CAX from

SAT EquiCntrSat

ECS to

UNP CntrSatPres CntrUnsPres

CSP CUP CounterThm

CTH

CounterSat

CSA FinitelyUns

FUN WCtrConc UnsConc CntrEquiv

WCC UNC CEQ Equiv Thm ETH

Taut- Weaker ology TautConc TAU WTC

Weaker SatConc

Thm ContraAx WTH SCA

SatCtrConc Weaker Weaker UnsatContraAx CtrThm UnsConc isfiable

SCC WCT WUC UNS

Equiv CntrThm

ECT TautConc WConc ContraAx ContraAx

TCA WCA

UnsConc ContraAx

UCA

The meanings of the success ontology values are as follows. Associated with each status value are some possible dataforms that might be provided to justify the ontology value for given logical data - see Section 4. Interpretations Models of Ax

Models of C Satisfiable NoConsequence CounterSat

SAT NOC CSA EquivThm

ETH

Tautology

TAU

WeakerTautConc

WTC

WeakerThm

WTH TautConc ContraAx

TCA

WConc ContraAx

WCA

UnsConc ContraAx

UCA

WeakerUnsConc

WUC

Unsatisfiable

UNS WeakerTheorem (WTH): Some interpretations are models of Ax, all models of Ax are models of C, some models of C are not models of Ax, and some interpretations are not models of C. See Theorem and Satisfiable.

ContradictoryAxioms (CAX): No interpretations are models of Ax. F is valid, and anything is a theorem of Ax. Possible dataforms are Refutations of Ax.

SatisfiableConclusionContradictoryAxioms (SCA): No interpretations are models of Ax, and some interpretations are models of C. See ContradictoryAxioms.

TautologousConclusionContradictoryAxioms (TCA): No interpretations are models of Ax, and all interpretations are models of C. See TautologousConclusion and SatisfiableConclusionContradictoryAxioms.

WeakerConclusionContradictoryAxioms (WCA): No interpretations are models of Ax, and some, but not all, interpretations are models of C. See SatisfiableConclusionContradictoryAxioms and SatisfiableCounterConclusionContradictoryAxioms.

CounterUnsatisfiabilityPreserving (CUP): If there does not exist a model of Ax then there does not exist a model of :C, i.e., if Ax is unsatisfiable then :C is unsatisfiable.

CounterSatisfiabilityPreserving (CSP): If there exists a model of Ax then there exists a model of :C, i.e., if Ax is satisfiable then :C is satisfiable.

EquiCounterSatisfiable (ECS): There exists a model of Ax iff there exists a model of :C, i.e., Ax is (un)satisfiable iff :C is (un)satisfiable.

CounterSatisfiable (CSA): Some interpretations are models of Ax, and some models of Ax are models of :C. F is not valid, :F is satisfiable, and C is not a theorem of Ax. Possible dataforms are Models of Ax ^ :C.

CounterTheorem (CTH): All models of Ax are models of :C. F is not valid, and :C is a theorem of Ax. Possible dataforms are Proofs of :C from Ax.

CounterEquivalent (CEQ): Some interpretations are models of Ax, all models of Ax are models of :C, and all models of :C are models of Ax. F is not valid, and :C is a theorem of Ax. All interpretations are models of Ax xor of C. Possible dataforms are Proofs of :C from Ax and of Ax from :C.

UnsatisfiableConclusion (UNC): Some interpretations are models of Ax, and all interpretations are models of :C (i.e., no interpretations are models of C). F is not valid, and :C is a tautology. Possible dataforms are Proofs of :C.

WeakerCounterConclusion (WCC): Some interpretations are models of Ax, and all models of Ax are models of :C, and some models of :C are not models of Ax. See CounterTheorem and CounterSatisfiable.

EquivalentCounterTheorem (ECT): Some, but not all, interpretations are models of Ax, all models of Ax are models of :C, and all models of :C are models of Ax. See CounterEquivalent. FinitelyUnsatisfiable (FUN): All finite interpretations are finite models of Ax, and all finite interpretations are finite models of :C (i.e., no finite interpretations are finite models of C). Unsatisfiable (UNS): All interpretations are models of Ax, and all interpretations are models of :C. (i.e., no interpretations are models of C). F is unsatisfiable, :F is valid, and :C is a tautology. Possible dataforms are Proofs of Ax and of C, and Refutations of F .

WeakerUnsatisfiableConclusion (WUC): Some, but not all, interpretations are models of Ax, and all interpretations are models of :C. See Unsatisfiable and WeakerCounterConclusion. WeakerCounterTheorem (WCT): Some interpretations are models of Ax, all models of Ax are models of :C, some models of :C are not models of Ax, and some interpretations are not models of :C. See CounterSatisfiable. SatisfiableCounterConclusionContradictoryAxioms (SCC): No interpretations are models of Ax, and some interpretations are models of :C. See ContradictoryAxioms.

UnsatisfiableConclusionContradictoryAxioms (UCA): No interpretations are models of Ax, and all interpretations are models of :C (i.e., no interpretations are models of C). See UnsatisfiableConclusion and SatisfiableCounterConclusionContradictoryAxioms.

NoConsequence (NOC): Some interpretations are models of Ax, some models of Ax are models of C, and some models of Ax are models of :C. F is not valid, F is satisfiable, :F is not valid, :F is satisfiable, and C is not a theorem of Ax. Possible dataforms are pairs of models, one Model of Ax ^ C and one Model of Ax ^ :C.

The success ontology is very fine grained, and has more status values than are commonly used by automated reasoning software, by ATP systems in particular. A suitable subset for practical uses of ATP systems is as follows:

FOF problems with a conjecture - report Theorem or CounterSatisfiable.

FOF problems without a conjecture - report Satisfiable or Unsatisfiable.

CNF problems - report Satisfiable or Unsatisfiable. 2.1

Validation of the Success Ontology Two steps have been taken towards formal validation of the success ontology. The first step was the enumeration of the possible relationships between the models of Ax and C (some of which are illustrated in Figure 2). This provided a basis for the ontology values, and a basis for the isa links. The second step1 was to axiomatize the ontology and prove relevant properties. (The axiomatization implemented covers the “positive” part of the ontology regarding Ax and C, and just two commonly used values from the “negative” part regarding Ax and:C. It is expected that the results obtained will extend without difficulty to the full ontology.) The axiomatization encodes the relationship between the models of Ax and C for each ontology value, and, from that, relationships between the ontology values can be proven. Additionally, a finite model of the axioms was found, demonstrating the consistency of the axiomatization and hence the ontology.

The axiomatization is in first-order logic. As example, the axioms that describe the ESA, THM, and ETH values are given in Figure 3. Four relationships between pairs of ontology values were defined and axiomatized: a isa b , meaning that if hAx;Ci has the status a then it also has the status b . For example, WTH isa THM. a nota b , meaning that if hAx;Ci has the status a then it does not necessarily have the status b . For example, THM nota SAT (because SAT does not hold for the case of contradictory Ax). a nevera b , meaning that if hAx;Ci has the status a then it cannot have the status b . For example, SAT nevera CAX.

a xora b , meaning that every hAx;Ci has the status a xor b . For example, THM xora CSA.

Additionally, axioms that deal with properties of formulae and models were provided. The relationships and properties axioms are given in Figure 3.

The axiomatization was shown to be consistent by generating a finite model using Paradox [ 5 ]. Some general properties of the relationships were proved using an ATP system (see below for a discussion of the ATP system used), e.g., that isa is a transitive relation, and that if a isa b and a nota g then b nota g. Next the relationships between all pairs of ontology values were investigated, using the ATP system 1Thanks to the reviewer of this paper whose comments instigated this step. fof(esa,axiom,( ! [Ax,C] : ( ( ? [I1] : model(I1,Ax) <=> ? [I2] : model(I2,C) ) <=> status(Ax,C,esa) ) )). to prove the relationships from the axioms. The isa relationship was tested first, as if a pair of ontology values has the isa relationship they cannot have any of the other three relationships. For those pairs that were not proved to have the isa relationship, the nevera relationship was tested next. For those pairs that were proved to have the nevera relationship the xora relationship was tested, and for the other pairs the nota relationship was tested. Proving a nota relationship requires establishing the existence of formulae and models that deny the relationship. In the axiomatization five examples are provided, the tautology, satisfiable, contradiction, non thm spt and sat non taut pair axioms above.

The results of the testing are shown in Table 1, where the vertical axis value has the shown relationship to the horizontal axis value. The isa relationship is denoted by ), nota by :, nevera by , and xora by . Sixty-eight pairs were proved to have the isa relationship, 89 to have the nota relationship, 179 to have the nevera (and not the xora) relationship, 4 to have the xora relationship, and for the remaining two pairs no relationship could be proved. The latter are the cases that WEC nota WTC and WEC nota TAC, which require exhibition of an hAx;Ci pair that has the WEC property but in which C is not a tautology. This could be done explicitly, along the lines of the sat non taut pair axiom above, but that seemed like cheating. Some of the nota relationships may also be nevera, but could not be proved so.

As mentioned above, automated theorem proving was used to prove the relationships between the ontology values. At first, proofs were attempted using monolithic ATP systems such as EP, SPASS, and Vampire. The success rate was low, because the axiomatization forms a large theory - see [ 1 ]. Therefore the SRASS system [ 18 ] was used, and it was highly successful in identifying the necessary axioms for proving each conjecture, and subsequently obtaining either a proof using EP or an assurance of a proof using iProver [ 9 ]. In addition to SRASS, the MANSEX [ 22 ] and IDV [ 23 ] tools were used during the initial development of the axiomatization, to find the most obvious relationships and to analyze proofs. : : : : :

UNP SAP ESA SAT THM EQV TAC WEC ETH TAU WTC WTH CAX SCA TCA WCA CSA UNS NOC : : : : : : : ) ) ) ) ) : ) : : ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) : ) : ) ) ) : : : ) ) ) ) ) ) : : : : ) ) ) ) ) ) ) ) ) ) ) : : : : : : : ) ) )

) : : : : : : : : : : : : : ) ) : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : ) ) ) : : : ) ) : : : : : : : : : : : : ) ) UNP SAP ESA SAT THM EQV TAC WEC ETH TAU WTC WTH CAX SCA TCA WCA CSA UNS NOC All automated reasoning and proof processing was done on a computer with a Intel Xeon 2.80GHz CPU and 3GB memory, running the Linux 2.6 operating system, and with a 60s CPU time limit per proof attempt (on the entire SRASS process).

The formal analysis has had beneficial effects. Three new ontology values were added, three errors in the definitions of the ontology values were exposed and corrected, four incorrect isa links in the ontology were found and removed, and several unnoticed isa relationships were revealed and added. The isa links in Figure 1 correspond to those in Table 1. 3

The SZS No-Success Ontology While it is always hoped that automated reasoning software will successfully process the logical data, and hence establish a success ontology value, in reality this often does not happen, for a variety of reasons. In order to understand and make productive use of a lack of success, e.g., [ 11, 8 ], it is necessary to precisely specify the reason for and nature of the lack of success. The SZS no-success ontology provides suitable status values for describing the reasons. Note that no-success is not the same as failure: failure means that the software has completed its attempt to process the logical data and could not establish a success ontology value. In contrast, no-success might be because the software is still running, or that it has not yet even started processing the logical data. Figure 4 shows the no-success ontology.

The meanings of the no-success ontology values are as follows:

NoSuccess (NOS): The logical data has not been processed successfully (yet).

Open (OPN): A success value has never been established.

Unknown (UNK): Success value unknown, and no assumption has been made. Open

OPN TypeError

TYE

Stopped

STP Forced

FOR Assumed (ASS(U ,S)): The success ontology value S has been assumed because the actual value is unknown for the no-success ontology reason U . U is taken from the subontology starting at Unknown in the no-success ontology.

Stopped (STP): Software attempted to process the data, and stopped without a success status. Error (ERR): Software stopped due to an error.

OSError (OSE): Software stopped due to an operating system error.

InputError (INE): Software stopped due to an input error.

SyntaxError (SYE): Software stopped due to an input syntax error.

SemanticError (SEE): Software stopped due to an input semantic error.

TypeError (TYE): Software stopped due to an input type error (for typed logical data). Forced (FOR): Software was forced to stop by an external force.

User (USR): Software was forced to stop by the user.

ResourceOut (RSO): Software stopped because some resource ran out.

Timeout (TMO): Software stopped because the CPU time limit ran out.

MemoryOut (MMO): Software stopped because the memory limit ran out.

GaveUp (GUP): Software gave up of its own accord.

Incomplete (INC): Software gave up because it’s incomplete.

Inappropriate (IAP): Software gave up because it cannot process this type of data.

InProgress (INP): Software is still running.

NotTried (NTT): Software has not tried to process the data.

NotTriedYet (NTY): Software has not tried to process the data yet, but might in the future.

The no-success ontology is very fine grained, and has more status values than are commonly used by automated reasoning software. A suitable subset for practical uses is as follows:

The software stopped due to CPU limit - report Timeout.

The software gave up due to incompleteness - report GaveUp.

The software stopped due to an error - report Error.

Any other cases - report Unknown.

The SZS Dataform Ontology The success status values describe what is known or has been established about the relationship between the axioms and conjecture in logical data, but do not describe the form of logical data. The dataform ontology provides suitable values for describing the form of logical data. The dataform ontology values are commonly used to describe data provided to justify a success ontology value, e.g., if an ATP system reports the success ontology value Theorem it might output a proof to justify that. Figure 5 shows the dataform ontology.

Proof

Prf Derivation

Der

Refutation

Ref

Solution

Sol Interpretation

Int Model Mod

LogicalData

LDa

NotSoln

NSo

None

Non ListOfFormulae

Lof

IncompletePrf Assurance

IPr Ass ListOfTHF

Lth

ListOfFOF

Lfo

ListOfCNF IncompleteRef

Lcn IRf CNFRefutation DomainMap HerbrandModel Sat'nModel

CRf Dom HMo SMo IncompleteCNFRef

ICf FiniteModel InfiniteModel

FMo IMo IncompleteCNFRefutation (ICf): A CNF refutation with parts missing.

Assurance (Ass): Only an assurance of the success ontology value.

None (Non): Nothing.

The dataform ontology is very fine grained, and has more status values than are commonly used by automated reasoning software, by ATP systems in particular. A suitable subset for practical uses of ATP systems is as follows:

A generic proof - report Proof.

A refutation - report Refutation.

A CNF refutation - report CNFRefutation.

A generic model - report Model.

A finite model - report FiniteModel.

A Herbrand model - report HerbrandModel.

A saturation model - report SaturationModel. or or 5

The SZS Presentation Standards The SZS ontologies provide status values that precisely describe what is known or has been established about logical data. In order to make the use of the values easy in more complex reasoning systems, it is necessary to specify precisely how the values should be presented. This makes it easy for harness software to prepare input data and examine output data that contains ontology values, e.g., in practice, to grep the output from automated reasoning software for lines that provide status values.

Success and no-success ontology values should be presented in lines of the form % SZS status ontology value for logical data identifier (The leading ’%’ makes the line into a TPTP language comment.) For example % SZS status Unsatisfiable for SYN075+1 % SZS status GaveUp for SYN075+1

A success or no-success ontology value should be presented as early as possible, at least before any data output to justify the value. The justifying data should be delimited by lines of the form % SZS output start dataform ontology value for logical data identifier and

% SZS output end dataform ontology value for logical data identifier For example % SZS output start CNFRefutation for SYN075-1

output data % SZS output end CNFRefutation for SYN075-1 All “SZS” lines can optionally have software specific information appended, separated by a :, i.e., % SZS status ontology value for logical data identifier : software specific information % SZS output start dataform ontology value for logical data identifier : software specific info % SZS output end dataform ontology value for logical data identifier : software specific info For example % SZS status GaveUp for SYN075+1 : Could not complete CNF conversion % SZS output end CNFRefutation for SYN075-1 : Completed in CNF conversion

Conclusion

This paper has presented the SZS ontologies of status values that are suitable for expressing precisely what is known or has been established about logical data. The ontologies can be used for existing logical data, e.g., they are used for the status of problems in the TPTP problem library [ 21 ] and solutions in the TSTP solution library [ 16 ], and can be used by automated reasoning software to describe their input and output. Already several ATP systems, e.g., Darwin [ 3 ], E, Metis [ 7 ], Paradox [ 5 ], use the SZS ontologies and the presentation standards, and this contributes to simplifying their embedding into more complex reasoning systems.

In addition to its use for reporting the overall status of a hAx;Ci 2-tuple, the SZS success ontology is used to report the status of individual inference steps in TPTP format derivations [ 19 ]. This is done in the “useful information” field of an inference record of an inferred formula. For example, in cnf(58,plain, ( ~ hates(agatha,esk2_1(butler)) ), inference(spm,[status(thm)],[51,48])). the status is Theorem (recorded as a lowercase acronym value thm), which indicates that the formulae is a theorem of it’s two parent formulae 51 and 48. The Theorem status is most common in derivations, but the SAP and ESA status values are also used quite often, e.g., for the formulae inferred by Skolemization and splitting steps. These status values can be used for semantic verification of the derivations, as is done by the GDV derivation verifier [ 17 ].

While the SZS ontologies are in use and have matured to some extent, it is not claimed that they are comprehensive and perfect. Developers and users of automated reasoning software are invited to provide feedback that might lead to improvements and increased usage. Already some users are working on success ontology values for results from computer algebra and other computational reasoning systems. In related work, SZS standards for returning answers from question-and-answer systems have been proposed.2 It is hoped that over time, with increased usage, the ontologies will become battle hardened, and will be a core standard for automated reasoning. 2See http://www.tptp.org/TPTP/Proposals/AnswerExtraction.html

[1] Proceedings of the CADE-21 Workshop on Empirically Successful Automated Reasoning in Large Theories, number 257 in CEUR Workshop Proceedings , 2007 .

[2]

Armando ,

Kohlhase , and

Ranise . Communication Protocols for Mathematical Services based on KQML and OMRS . In M. Kerber and M. Kohlhase, editors, Proceedings of the Calculemus Symposium 2000 , 2000 .

[3]

Baumgartner ,

Fuchs , and

Tinelli . Darwin - A Theorem Prover for the Model Evolution Calculus . In G. Sutcliffe,

Schulz , and T. Tammet, editors, Proceedings of the Workshop on Empirically Successful First Order Reasoning, 2nd International Joint Conference on Automated Reasoning , 2004 .

[4]

Benzmu ¨ller,

Rabe , and

Sutcliffe. THF0 - The Core TPTP Language for Classical Higher-Order Logic . In P. Baumgartner,

Armando , and D. Gilles, editors, Proceedings of the 4th International Joint Conference on Automated Reasoning, Lecture Notes in Artificial Intelligence, page Accepted , 2008 .

[5]

Claessen and

Sorensson . New Techniques that Improve MACE-style Finite Model Finding . In P. Baumgartner and C. Fermueller, editors, Proceedings of the CADE-19 Workshop: Model Computation - Principles, Algorithms, Applications, 2003 .

[6]

Denney ,

Fischer , and

Schumann . Using Automated Theorem Provers to Certify Auto-generated Aerospace Software . In M. Rusinowitch and D. Basin, editors, Proceedings of the 2nd International Joint Conference on Automated Reasoning, number 3097 in Lecture Notes in Artificial Intelligence , pages 198 - 212 , 2004 .

[7]

Hurd . First-Order Proof Tactics in Higher-Order Logic Theorem Provers . In M. Archer,

B. Di

Vito , and C. Munoz, editors, Proceedings of the 1st International Workshop on Design and Application of Strategies/- Tactics in Higher Order Logics, number

NASA

/CP-2003 -212448 in NASA Technical Reports , pages 56 - 68 , 2003 .

[8]

Ireland and

Bundy . Productive use of Failure in Inductive Proof . Journal of Automated Reasoning , 16 ( 1-2 ): 79 - 111 , 1996 .

[9]

Korovin. iProver - An Instantiation-Based Theorem Prover for First-order Logic (System Description) . In P. Baumgartner,

Armando , and D. Gilles, editors, Proceedings of the 4th International Joint Conference on Automated Reasoning, number 5195 in Lecture Notes in Artificial Intelligence , pages 292 - 298 , 2008 .

[10]

Massar ,

Travers ,

Elhai , and J. Shrager. BioLingua: A Programmable Knowledge Environment for Biologists . Bioinformatics , 21 ( 2 ): 199 - 207 , 2005 .

[11]

Monroy ,

Bundy , and

Ireland . Proof Plans for the Correction of False Conjectures . In F. Pfenning, editor, Proceedings of the 5th International Conference on Logic for Programming , Artificial Intelligence, and Reasoning, number 822 in Lecture Notes in Artificial Intelligence , pages 178 - 189 . Springer-Verlag, 1994 .

[12]

Riazanov and

Voronkov . The Design and Implementation of Vampire . AI Communications , 15 ( 2- 3 ): 91 - 110 , 2002 .

[13]

Schulz. E: A Brainiac Theorem Prover . AI Communications , 15 ( 2-3 ): 111 - 126 , 2002 .

[14]

Shrager ,

Waldinger ,

Stickel , and

Massar . Deductive Biocomputing. PLoS ONE , 2 ( 4 ), 2007 .

[15] M.E. Stickel. SNARK - SRI's New Automated Reasoning Kit . http://www.ai.sri.com/ stickel/snark.html.

[16]

Sutcliffe. The TSTP Solution Library . http://www.TPTP.org/TSTP.

[17]

Sutcliffe. Semantic Derivation Verification. International Journal on Artificial Intelligence Tools , 15 ( 6 ): 1053 - 1070 , 2006 .

[18]

Sutcliffe and

Puzis. SRASS - a Semantic Relevance Axiom Selection System . In F. Pfenning, editor, Proceedings of the 21st International Conference on Automated Deduction, number 4603 in Lecture Notes in Artificial Intelligence , pages 295 - 310 . Springer-Verlag, 2007 .

[19]

Sutcliffe ,

Schulz ,

Claessen , and

A. Van

Gelder . Using the TPTP Language for Writing Derivations and Finite Interpretations . In U. Furbach and N. Shankar, editors, Proceedings of the 3rd International Joint Conference on Automated Reasoning, number 4130 in Lecture Notes in Artificial Intelligence , pages 67 - 81 , 2006 .

[20]

Sutcliffe and

Seyfang . Smart Selective Competition Parallelism ATP . In A. Kumar and I. Russell, editors, Proceedings of the 12th International FLAIRS Conference , pages 341 - 345 . AAAI Press, 1999 .

[21]

Sutcliffe and

C.B.

Suttner . The TPTP Problem Library: CNF Release v1.2.1. Journal of Automated Reasoning , 21 ( 2 ): 177 - 203 , 1998 .

[22]

Sutcliffe ,

Yerikalapudi , and

Trac . Multiple Answer Extraction for Question Answering with Automated Theorem Proving Systems. Rejected from the 15th International Conference on Logic for Programming Artificial Intelligence and Reasoning , 2008 .

[23]

Trac ,

Puzis , and G. Sutcliffe. An Interactive Derivation Viewer . In S. Autexier and C. Benzmu¨ller, editors, Proceedings of the 7th Workshop on User Interfaces for Theorem Provers, 3rd International Joint Conference on Automated Reasoning , volume 174 of Electronic Notes in Theoretical Computer Science, pages 109 - 123 , 2006 .

[24]

Weidenbach ,

Schmidt ,

Hillenbrand ,

Rusev , and

Topic . SPASS Version 3 .0. In F. Pfenning, editor, Proceedings of the 21st International Conference on Automated Deduction, number 4603 in Lecture Notes in Artificial Intelligence , pages 514 - 520 . Springer-Verlag, 2007 .

[25]

Zimmer and

Kohlhase . System Description: The MathWeb Software Bus for Distributed Mathematical Reasoning . In A. Voronkov, editor, Proceedings of the 18th International Conference on Automated Deduction, number 2392 in Lecture Notes in Artificial Intelligence , pages 139 - 143 . Springer-Verlag, 2002 .