Observing the Truth: Diagrams, Sets and Free
                       Rides

                                    Gem Stapleton

                                University of Brighton,
                                     Brighton, UK
                             g.e.stapleton@brighton.ac.uk

    There are many different notations that can define relationships between sets,
some of which are diagrammatic and others symbolic. Even when a notation
is selected, there are choices to be made between semantically equivalent, yet
syntactically different, statements. Syntactic choices include variations in both
abstract syntax and concrete syntax where graphical and topological properties
can differ. Whilst it is clearly important to understand the relative benefits of
choices in all senses, the focus here is on the choice of notation and, within that,
abstract syntax choices1 .


                            Fig. 1: Illustrating free rides.


    This importance of notation choice has been explored previously, through
the study of free rides [4]. To illustrate, suppose we are given the set-theory
statements P ∩ Q = ∅, R ∩ Q = ∅ and S ⊆ Q. We can translate these three
statements into the Euler diagram in figure 1. As a consequence, the diagram
automatically expresses P ∩S = ∅ and R∩S = ∅: we can see these two statements
are true ‘for free’. By contrast, one cannot read off P ∩ S = ∅ and R ∩ S = ∅
from P ∩ Q = ∅, R ∩ Q = ∅ and S ⊆ Q; instead, the first two statements must be
inferred. Therefore, the diagram has a perceived advantage over the set-theory
statements, with P ∩ S = ∅ and R ∩ S = ∅ being examples of free rides.
    Free rides are thought to be a major reason why diagrammatic notations
can outperform symbolic notations in reasoning tasks. In particular, free rides
indicate why we may wish to choose Euler diagrams over symbolic set-theory
as a notation for representing relationships between sets. A key requirement
that underpins the theory of free rides is the existence of a translation from one
set of statements, such as symbolic statements about sets, into another set of
1
    Readers interested in concrete syntax choices for Euler diagrams or linear diagrams
    are referred to [1–3].
statements, such as a set of (perhaps a single) Euler diagrams. Such a translation
essentially chooses a representation, at the abstract syntax level, of the originally
defined statements in the other notation. Here, this idea is generalized to the
notion of an observational advantage that does not require the existence of a
translation between notations. Instead, all that is required is for the two sets of
statements to be semantically equivalent.
    To illustrate, suppose we have the two set-theory statements P = Q and
P ∩ R = ∅. The two Euler diagrams d1 and d2 in figure 2 are, between them,
semantically equivalent to P = Q and P ∩ R = ∅:

                      D = {d1 , d2 } ≡ S = {P = Q, Q ∩ R = ∅}.

We can observe P ∩ R = ∅ from S because it is written explicitly in S. However,
we cannot observe P ∩R = ∅ from either d1 and d2 . In this case, from D we must
infer d, which directly expresses P ∩ R = ∅. Therefore, S has an observational
advantage over D. Moreover, D also has an observational advantage over S, since
we can observe Q ∩ R = ∅ from D but not S.


                           Fig. 2: A more complex example.


    Through defining what it means to be able to observe one statement from a
set of statements, we can precisely describe the observational advantages of one
set of statements over another. In particular, in the case of Euler diagrams and
set theory we can prove that Euler diagrams are observationally complete. That is
given a finite set of set-theory statements, S, there exists a single Euler diagram,
d, that is semantically equivalent to S such that any set-theory statement that
can be inferred from S can be observed from d [5]2 . Thus, it is possible to
characterise, formally, the observational advantages of Euler diagrams over set-
theory. Importantly, the theory of observational advantages allows us to explain
why diagrams are advantageous representations of sets in a more general way
than free rides.
Acknowledgement This extended abstract is based on research conducted col-
laboratively with Mateja Jamnik and Atsushi Shimojima [5, 6].

2
    We note that d must explicitly represent all sets occurring in S and that the inferred
    set-theory statements must only represent sets occurring in S; we refer the reader
    to [5, 6] for more details.
References
1. Blake, A., Stapleton, G., Rodgers, P., Howse, J.: The impact of topological and
   graphical choices on the perception of Euler diagrams. Information Sciences 330,
   455–482 (2015)
2. Rodgers, P., , Zhang, L., Purchase, H.: Wellformedness properties in Euler diagrams:
   Which should be used? IEEE Transactions on Visualization and Computer Graphics
   18(7), 1089–1100 (2012)
3. Rodgers, P., Stapleton, G., Chapman, P.: Visualizing sets with linear diagrams.
   ACM Transactions on Computer-Human Interaction 22, 27:1–27:39 (2015)
4. Shimojima, A.: Semantic Properties of Diagrams and Their Cognitive Potentials.
   CSLI Publications, Stanford, CA (2015)
5. Stapleton, G., Jamnik, M., Shimojima, A.: What makes an effective representation of
   information: A formal account of inferential advantages. Journal of Logic, Language
   and Information (under review)
6. Stapleton, G., Jamnik, M., Shimojima, A.: Effective representation of information:
   Generalizing free rides. In: Diagrams 2016. Springer (2016)