1. Introduction

SMT

First-Order Instantiation using Discriminating Terms

Chad E. Brown

Mikoláš Janota

0 0 Czech Technical University in Prague, Czech Institute of Informatics , Robotics and Cybernetics, Jugoslávských partyzánů 1580/3, 160 00 Prague 6, Dejvice , Czech Republic

2021

19 18 19

This paper proposes a technique to limit the number of possible terms to be considered in quantifier instantiation. One of the major hurdles that SMT solvers face when dealing with quantifiers is that there are simply too many terms to instantiate with. So even if the right set of terms is available to the solver, meaning they appear in the formula, the solver might not have enough resources to come upon the right combination. This motivates the technique presented in this paper, which instantiates only by a certain type of terms, called discriminating terms. The paper introduces a class of formulas, where the proposed technique has a considerable impact.

eol>SMT quantifiers instantiation

1. Introduction

Quantifiers represent one of the major challenges for contemporary SMT solvers and since typically they lead to undecidability or extreme computational complexity, they are likely to remain a challenge for times to come.

Most commonly, the general techniques for dealing with quantifiers gradually instantiate the quantified part of the formula with ground terms until the resulting ground formula becomes unsatisfiable. The terms to be used in instantiations may be chosen either by syntactic properties (E-matching [ 1 ]) or semantic properties (e.g. model-based quantifier instantiation [ 2 ]). Interestingly, it has been shown that these techniques do not always pay of and simple enumeration of terms gives better results in some cases [ 3 ].

This is where this paper comes in. We propose a technique to limit the set of terms to enumerate. Roughly speaking, in the context of first order logic with equality, a term is labeled as discriminating if it participates in a disequality.

We have modified the enumeration instantiation algorithm in CVC4 [ 4 ] so that only discriminating terms are considered. We further construct a family of formulas where this approach demonstrably helps.

2. A Class of Problems

Kaminski and Smolka [ 5 ] consider an identity that holds over the booleans: ( ( ())) = (). Here is a boolean and is a unary function on booleans. It is easy to informally see why this identity holds by considering all four possible interpretations of . Obviously the identity also holds if the domain of interest has only one element. Hence one can make an easy first-order problem by including an axiom

∀. = ∨ = ∨ = stating there are at most two elements and making the conclusion ( ( ())) = (). A formal proof would proceed by equational reasoning after instantiating the quantifiers using the subterms of the conjecture: , (), ( ()) and ( ( ())). This problem can be made slightly more dificult by including unary functions 0, . . . , − 1 and writing the conclusion as ( ( (0(· · · − 1() · · · )))) = (0(· · · − 1() · · · )). The modified problem has + 4 subterms instead of only 4. The subterms of the form (· · · − 1() · · · ) with > 0 are red herrings. The four subterms (0(· · · − 1() · · · )) with ∈ {0, 1, 2, 3} are suficient to use as instantiations to complete the proof.

The problems can also be made more dificult in a diferent way. Following a proof from [ 6 ] we will argue that for each natural number > 0 there are natural numbers 1 > 0 and 2 ≥ 0 such that 1+2 () = 2 () if the domain of interest has at most elements. If = 2, we can take 1 = 2 and 2 = 1 as above. More generally, we choose 2 as − 1 and 1 is the least common multiple (lcm) of the sequence 2, . . . , . The reason behind these choices are explored in the following subsection.

To construct the family of formulae in question, define atmost to be the first-order formula ∀0 · · · .

⋁︁ 0≤ <≤ = and let kam be the first-order formula

atmost ∧ lcm({2,...,})+− 1(0(· · · − 1() · · · )) ̸= − 1(0(· · · − 1() · · · )). 2.1. Unsatisfiability of kam Following the proof (and terminology) of Theorem 7 in [ 6 ] we show that for an interpretation of size at most the following identity holds:

lcm({2,...,})+− 1() = − 1()

The intuition for the equality is as follows. The sequence , (), . . . , (), . . . must eventually repeat. Both sides of the equation will be in the part that repeats and the length of the repeating part will be a number at most . Since this length divides lcm({2, . . . , }) we will be able to conclude

lcm({2,...,})( − 1()) = − 1() Let us now expand this idea more carefully.

Consider an arbitrary interpretation of the function and the constant assuming that the universe has at most elements. For the purpose of this subsection, we abuse notation by writing (· · · ) for the value of under such interpretation and write for the value of under the interpretation. By the pigeonhole principle there must exist 1 and 2 such that 0 ≤ 2 < 1 ≤ such that 1 () = 2 (). Let 1 and 2 be the least numbers with this property. Following [ 6 ] we call 1 the size and 2 the prefix . We also call the positive number 1 − 2 the lasso and say () is in the lasso if ≥ 2. The sequence can be written as follows:

lcm({2,...,})( − 1()) = − 1() as desired.

3. Quasidiscriminating Terms

The tableau calculus from [ 7 ] restricts first-order quantifier instantiation to so-called discriminating terms, i.e., terms that occur on one side of a disequation on the branch. A consequence of the completeness proof for the tableau calculus is a refinement of Herbrand’s theorem indicating that (in the presence of the tableau calculus rules or analogous rules) instantiating with members of the universe of discriminating terms is suficient to lead to an inconsistency, if the branch is inconsistent.

The change of setting from the tableau calculus of [ 7 ] to SMT means discriminating terms are not always suficient. As a simple example we consider kam20. Assume we have clause normalized so that there is one quantified formula

∀. = ∨ = ∨ = and one disequation ( ( ())) ̸= (). There are only two discriminating terms: 3() and (). Instantiating , and with these two terms will always lead to at least two of , and being the same term so that the resulting disjunction will always have a literal that is trivial by reflexivity. In the calculus of [ 7 ] there is a decomposition rule that would add ( ()) ̸= to the branch since 3() ̸= () is on the branch. This new disequation means there are now four discriminating terms. As discussed above, these four terms are suficient to use as instantiations to derive a contradiction.

One option would be to extend CVC4 to behave in ways that simulate the additional rules of [ 7 ]. In the example above, this would mean when the current propositional model sets the literal 3() = () to false, CVC4 could mimic the decomposition rule by adding a propositional discriminating enumeration z3 default 0 100 200

300 instances 400 500 clause corresponding to 3() = () ∨ 2() ̸= . Further tableau rules that would need to have a similar counterpart are the mating and confrontation rules.

We have chosen a simpler, heuristic approach without attempting to maintain completeness. However, a heuristic that restricts to discriminating terms without simulating the tableau rules would be far too restrictive. An intermediate heuristic is to restrict to terms that would be discriminating if the decomposition rule were included. We call these quasidiscriminating terms.

As a technical definition, we say a pair (, ) is a discriminating pair if the literal = is assigned false by the propositional model. We recursively define quasidiscriminating pairs (, ) as follows: Every discriminating pair is a quasidiscriminating pair. If ( (1, . . . , ), (1, . . . , )) is a quasidiscriminating pair, then (, ) is a quasidiscriminating pair for each ∈ {1, . . . , } where and are not the same term. A term is quasidiscriminating if there is some such that (, ) or (, ) is a quasidiscriminating pair.

We have modified the enumeration instantiation algorithm in CVC4 [ 4 ] so that only quasidiscriminating terms are considered.

4. Results

The problems used for evaluation are the formulas kam as defined above. The parameters were chosen as follows. The parameter ranges between 4..10 and the value of ranges between 0..99 for ∈ 4..8 and it ranges between 0..9 for ∈ 9..10. Recall that the parameter represents the domain size and the number of the “dummy” terms .

For the comparison we considered the default version of CVC4, CVC4 run only in the enumeration mode, and Z3. Figure 1 shows a cactus plot for the experiment results under 5-minute timeout (300s). Table 1 breaks down the number of solved instances by the domain size, i.e. by the parameter . domain 4 5 6 7 8 9 10 Total (520)

Our strategy using discriminating terms solved all the considered benchmarks very quickly except for the largest ones. CVC4’s enumerative is also performing quite well but the time starts to increase much more quickly and eventually it times out on the largest problems. The default version of CVC4 performs rather poorly; our explanation for this is that the conflict-based instantiation [ 8 ] is taking up too much time because of the deep terms. The results for Z3 are surprising because it can successfully solve the largest instances but tends to fail on the smaller ones in a somewhat nonuniform fashion. We have also tried running Vampire [ 9 ] but that has timed out on all the considered problems.

5. Summary and Future Work

This paper proposes a way to restrict possible candidates term for quantifier instantiation by looking at syntactic properties of the given formula. In particular, we consider only terms that participate in a disequality. We construct a family of formulas where this technique has demonstrably the best results.

The presented techniques opens a number of avenues for future work. Since the presented technique disregards theories, a natural generalization would be to include theory-specific predicates, other than just disequality. For instance, in the context of arithmetic strict comparisons (<) imply disequality and therefore could be used in a similar fashion. While the technique is clearly performing well on the constructed family of formulas, as of now we don’t have any dividends that is helpful on general formulas. We conjecture that this might happen in problems with many function nesting but also, careful integration with other techniques will be needed. A natural next step to take would be to run the modified CVC4 over the problem sets in the SMT-LIB to obtain data for how often the quasi-discriminating terms technique is helpful and how often it is not.

The family of formulas proposed here is interesting on its own, if only because they are easy to understand and yet present a challenge to existing SMT solvers and first-order automated theorem provers. In general, it is unclear whether it is better to instantiate with deeper terms or with more shallow terms. In the provided family, the outermost terms are actually the right ones and the inner ones are “red herrings.” But one can envision scenarios where the opposite is true. So the question is, how to distinguish scenarios like these. The results were supported by the Ministry of Education, Youth and Sports within the dedicated program ERC CZ under the project POSTMAN no. LL1902. This scientific article is part of the RICAIP project that has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 857306.

[1]

Detlefs , G. Nelson,

J. B.

Saxe , Simplify: a theorem prover for program checking , J. ACM 52 ( 2005 ) 365 - 473 . doi: 10 .1145/1066100.1066102.

[2]

Ge , L. M. de Moura, Complete instantiation for quantified formulas in satisfiability modulo theories , in: Computer Aided Verification, 21st International Conference, CAV, 2009 , pp. 306 - 320 . doi: 10 .1007/978-3- 642 -02658-4\_ 25 .

[3]

Reynolds ,

Barbosa ,

Fontaine , Revisiting enumerative instantiation, in: Tools and Algorithms for the Construction and Analysis of Systems , volume 10806 , 2018 , pp. 112 - 131 . doi: 10 .1007/978-3- 319 -89963-3\_7.

[4]

C. W.

Barrett ,

C. L.

Conway ,

Deters ,

Hadarean ,

Jovanovic ,

King ,

Reynolds , C. Tinelli, CVC4, in: G. Gopalakrishnan, S. Qadeer (Eds.), Computer Aided Verification - 23rd International Conference, CAV, volume 6806 , Springer, 2011 , pp. 171 - 177 . doi: 10 . 1007/978-3- 642 -22110-1\_ 14 .

[5]

Kaminski ,

Smolka , A finite axiomatization of propositional type theory in pure lambda calculus, in: Reasoning in Simple Type Theory: Festschrift in Honor of Peter B . Andrews on His 70th Birthday , College Publications, 2008 , pp. 243 - 258 .

[6]

M. P.

Bonacina ,

C. A.

Lynch , L. de Moura, On deciding satisfiability by theorem proving with speculative inferences , Journal of Automated Reasoning 47 ( 2011 ) 161 - 189 . doi: 10 . 1007/s10817-010-9213-y.

[7]

C. E.

Brown , G. Smolka, Analytic tableaux for simple type theory and its first-order fragment , Logical Methods in Computer Science 6 ( 2010 ). doi: 10 .2168/LMCS-6( 2 :3) 2010 .

[8]

Reynolds ,

Tinelli , L. M. de Moura, Finding conflicting instances of quantified formulas in SMT, in: Formal Methods in Computer-Aided Design , FMCAD 2014 , Lausanne, Switzerland, October 21-24 , 2014 , IEEE, 2014 , pp. 195 - 202 . doi: 10 .1109/FMCAD. 2014 . 6987613 .

[9]

Kovács ,

Voronkov , First-Order Theorem Proving and Vampire, in: International Conference on Computer Aided Verification, volume 8044 , 2013 , pp. 1 - 35 . doi: 10 .1007/ 978-3- 642 -39799- 8 _ 1 .