=Paper= {{Paper |id=Vol-2744/paper25 |storemode=property |title=Non-Empirical Metrics for Ontology Visualizations Evaluation and Comparing |pdfUrl=https://ceur-ws.org/Vol-2744/paper25.pdf |volume=Vol-2744 |authors=Ildar Baimuratov,Than Nguyen }} ==Non-Empirical Metrics for Ontology Visualizations Evaluation and Comparing== https://ceur-ws.org/Vol-2744/paper25.pdf
     Non-Empirical Metrics for Ontology Visualizations
              Evaluation and Comparing?

    Ildar Baimuratov[0000−0002−6573−131X] and Than Nguyen[0000−0002−6679−7839]

         ITMO University, Kronverksky Pr. 49, bldg. A, St. Petersburg, 197101, Russia
               {baimuratov.i,nguyenngocthan92}@gmail.com



        Abstract. There are numerous ontology visualization systems, however, the choice
        of a visualization system is non-trivial, as there is no method for evaluation and
        comparing them, except for empirical experiments, that are subjective and costly.
        In this research, we aim to develop non-empirical metrics for ontology visual-
        izations evaluation and comparing. First, we propose several half-formal met-
        rics that require expert evaluation. These metrics are completeness, semanticity,
        and conservativeness. We apply the proposed metrics to evaluate and compare
        VOWL and Logic Graphs visualization systems. And second, we develop a com-
        pletely computable measure for the complexity of ontology visualizations, based
        on graph theory and information theory. In particular, ontology visualizations are
        considered as hypergraphs and the information measure is derived from the Hart-
        ley function. The usage of the proposed information measure is exemplified by
        the evaluation of visualizations of the sample of axioms from the DoCO ontology
        in Logic Graphs and Graphol. These results can be practically applied for choos-
        ing ontology visualization systems in general and regarding a particular ontology.

        Keywords: Ontology Visualization, Expert Evaluation, Hypergraphs, Informa-
        tion Measure.


1     Introduction

Visualization of an ontology improves comprehension of knowledge it contains. There
are numerous ontology visualization systems, the reviews are presented in [1–3]. How-
ever, the choice of a visualization system is non-trivial, as there is no method for evalu-
ation and comparing of ontology visualization systems present at the literature, except
for empirical experiments, that are subjective and costly.
    Little researches consider the evaluation of visualization systems in general. In [4]
the authors propose some recommendations considering graph diagrams, like mini-
mization of crossings between edges. The authors of [5] present empirical research on
applying these criteria to automatic graph layout algorithms. In [6] several new shape-
based metrics are proposed for large graphs. All these metrics are based on empirical
experiments, i.e. on human assessments.
    Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons Li-
    cense Attribution 4.0 International (CC BY 4.0).
?
    Publication is supported by RFBR grant 20-01-00358
2 I. Baimuratov and Than N.

    Therefore, we aim to develop formal metrics for ontology visualizations estimation.
’Formal’ means that they must be objective and computable. Some metrics we propose
require external knowledge of the language being visualized, its semantics, and knowl-
edge of other visualization systems, therefore, they are half-formal and require expert
evaluation. Another criterion is based on graph theory and information theory and is
fully computable.
    The outline of the paper is as follows: in Section 2 we propose several metrics
for expert evaluation and in Section 3 we derive the information measure for ontology
visualizations complexity.


2     Expert evaluation
First, we propose to consider several features of visualization systems that, though re-
lated to the formal properties, like completeness, still require expert evaluation, as they
involve external knowledge.

2.1   Completeness
The most important property of a visualization system is its completeness with respect
to the language being visualized, because if a visualization system can not represent
some axioms of an ontology, the system can not be applied to the ontology. In addition,
a common reference language serves as a common denominator for comparing different
visualization systems.
    Ontologies are denoted on the OWL language [7]. The OWL 1 standard provided
three increasingly expressive sub-languages: OWL Lite, OWL DL, and OWL Full. In
this paper, we consider OWL DL language, as it provides the maximum expressiveness,
retaining decidability.
    The formal foundation of OWL is description logics (DLs) [8]. DLs are a family
of logic languages, that can be used to represent the terminological knowledge of an
application domain. We consider axioms, formulated with the SHOIN description logic
syntax, as it corresponds to OWL DL language. We evaluate completeness of a visual-
ization system by counting the number of SHOIN syntax entities that the system can
represent.

2.2   Semanticity
We suppose, the advantage of a visualization with respect to a reference language is
that it improves comprehension of a formula with representing its semantic. Therefore,
we propose to evaluate the ability of a visualization system to represent semantics of
expressions.
     We consider a diagram of a visualization system for a logical relation as semantical,
if it represents the semantic of the relation. For example, compare the visualization of
conjunction from Graphol [9], Fig. 1, with the corresponding Venn diagram [10], Fig. 2.
Venn diagram represents that these two sets have common elements, while in Graphol
conjunction is just labeled with a hexagon.
                           Non-Empirical Metrics for Ontology Visualizations Evaluation... 3




       Fig. 1. Conjunction in Graphol               Fig. 2. Conjunction in Venn diagrams


2.3   Conservativeness
Finally, we suppose that it is important to use existing graphic primitives from mathe-
matical theories, as in the other case, i.e. introducing new graphic primitives, instead of
helping a user to understand an ontology it forces him or her to learn just one more lan-
guage. Considering again the example above, in Graphol a user has to learn that hexagon
denotes conjunction, while if the Venn diagram was used, the user familiar with Venn
diagrams would have understood the diagram without additional instructions. There-
fore, we consider a visualization system as conservative, if it uses the existing graphic
primitives.

2.4   Example of evaluation
We provide an evaluation of the VOWL [11] visualization system as an example. We
examined its completeness with respect to OWL DL language, its semanticity, the abil-
ity to represent the semantics of relations, and whether its graphic primitives are new or
adopted from common visualization systems. See Table 1.
     Here we see that VOWL can represent only 12 of 15 entities of the SHOIN descrip-
tion logic, therefore, its completeness rate is 0.8. The diagrams for concepts, conjunc-
tion, disjunction, and equivalence are semantical and conservative, as they are based on
Venn diagrams. The diagram for roles can be considered as graph-theory based, there-
fore, it is also semantical and conservative. The diagram for negation is conservative, as
it uses the sign of negation from logic, but it is not semantical since it doesn’t represent
the semantic of negation.
     In addition, we provide an example of comparing of visualization systems, per-
formed by the authors. We compare VOWL with Logic graphs (LGs) [12], the seman-
tically oriented ontology visualization method, developed by us. The analogous evalu-
ation of the LGs is in Table 2, the scores of VOWL and LGs are presented in Table 3.
As wee see, LGs are complete and more semantical and conservative, than VOWL.


3     Information measuring
In the previous section, we proposed several metrics for expert-based evaluation. Prop-
erties like completeness and conservativeness are important for visualization systems
evaluation, but it is hard to imagine that they would be fully computable. Thus, we pro-
pose one more approach to ontology visualization systems evaluation intended to be
completely formal. This approach is based on information measuring.
4 I. Baimuratov and Than N.

                        Table 1. Expert-based evaluation of VOWL

               N      Entity        Complete      Semantical Conservative
               1 Concept                              1            1

               2 Role                                 1            1

               3 Negation                             0            1

               4 Conjunction                          1            1

               5 Disjunction                          1            1
                 Existential
               6                                      0            0
                 restriction
                 Universal
               7                                      0            0
                 restriction
               8 Transitive role                      0            0

               9 Inverse role                         0            0

               10 Role hierarchy                      0            0
                   Number
               11                                     0            0
                  restriction
               12 Nominal                             0            0
                  Functional
               13                                     0            0
                      role
                   Concept
               14                                     0            0
                  inclusion
                    Concept
               15                                     1            1
                  equivalence
                                       0.8           0.33          0.4



    The intended information measure should estimate not the content of the ontology,
as it is the same for each visualization, but the complexity of its form. It implies the
following requirement for the intended information measure:


 – as it should estimate the complexity of an ontology visualization, it should depend
   on the complexity of its structure, in other words, on the number of nodes, edges,
   and types of edges;
 – it should be normalized, as visualizations of one and the same ontology in different
   visualization systems can have a different number of nodes and edges;
 – as it should measure the visualization complexity, a visualization with a greater
   number of nodes, edges, or edges types should have a higher value of the measure.
                              Non-Empirical Metrics for Ontology Visualizations Evaluation... 5

                              Table 2. Expert-based evaluation of LGs

              N       Entity             Complete        Semantical Conservative

              1 Concept                                      1           1

              2 Role                                         1           1

              3 Negation                                     1           1

              4 Conjunction                                  1           1

              5 Disjunction                                  1           1

                  Existential
              6                                              1           1
                  restriction
                  Universal
              7                                              1           1
                  restriction


              8 Transitive role                              1           1


              9 Inverse role                                 1           1

             10 Role hierarchy                               1           1

                   Number
             11                                              0           1
                  restriction

             12 Nominal                                      1           1

                  Functional
             13                                              0           1
                     role
                   Concept
             14                                              0           0
                  inclusion
                    Concept
             15                                              1           1
                  equivalence
                                             1              0.8         0.93



3.1   Hypergraphs as the formal framework

Before defining the information measure, we have to define the formal framework. We
propose to consider ontology visualization as a hypergraph. Simple graphs are not suit-
able for our goals as many ontology visualization systems use edges connecting more
than two nodes. A hypergraph can be represented as an incidence matrix, therefore, an
ontology visualization can be represented as an incidence matrix as well.
6 I. Baimuratov and Than N.

                    Table 3. Expert-based comparing of VOWL and LG

                                 Complete Semantical Conservative
                        VOWL       0.8      0.33         0.4
                         LGs        1        0.8        0.93


    Let there is a hypergraph H = (X, E), where X – is a set of nodes and E is a set
of edges. It is represented with |X| × |E| incidence matrix A = (aij ), where
                                      (
                                        1, if xi ∈ ej
                                aij =                                            (1)
                                        0, otherwise

for undirected graph and
                                   
                                                   0
                                   −1, if (xi , x ) ∈ ej
                                   
                                             0
                              aij = 1, if (x , xi ) ∈ ej                              (2)
                                   
                                     0, otherwise
                                   

and
                                 aij = 2 if ej = (xi , xi ).                          (3)
    Consider the following axiom from the Document Components Ontology (DoCO)
[13] as the example

                     chapter v ∃contains.paragraph t section                          (4)

and its visualization in the Graphol system, see Fig. 3. Here edges denoting disjunction




                       Fig. 3. Visualization of the axiom 1 in Graphol


connect three nodes: ’graphol.paragraph’, ’graphol.section’ and ’or’, therefore, it is the
hypergraph. The incidence matrix for this hypergraph is Table 4. Each node of the
diagram corresponds to a row of the matrix and each relation – to a column. As nodes
’graphol.paragraph’, ’graphol.section’ and ’or’ are connected with the edge ’or’, the
corresponding cells have value 1. In this research, we ignore the direction of edges for
simplicity.
                          Non-Empirical Metrics for Ontology Visualizations Evaluation... 7

                  Table 4. The incidence matrix for the axiom 1 in Graphol

                                         subClassOf graphol.contains or
                           exists            1             1         0
                      graphol.chapter        1             0         0
                      graphol.contains       0             1         0
                     graphol.paragraph       0             0         1
                       graphol.section       0             0         1
                             or              0             1         1



3.2   The information measure

There are researches on graph information measuring. The survey of graph entropy
measures is in [14]. The authors of [15] perform information-theoretic analysis of edge
bundling visualizations in terms of adjacency matrices and mutual information. But
none of the measures presented there satisfy our requirement. Therefore, we develop a
new graph information measure.
   First, we define the set of all different values of an incidence matrix A:
Definition 1. For a given incidence matrix A the set of all values is {a}.
For undirected graph {a} = {0, 1}, for directed {a} = {−1, 0, 1}, for directed graph
with loops {a} = {−1, 0, 1, 2} and so on.
   Now we define the set of all possible edges E:
Definition 2. For a hypergraph H with a given set of nodes X the set of all possible
edges E = {a}X .
    We are ready to define the information measure for hypergraph complexity estima-
tion by deriving it from the Hartley function [16]

                                          logb |A|,                                    (5)

where A is an arbitrary set and b – an arbitrary number. We substitute the number of
edges |E| as |A| and the number of all possible edges |E| as b.

Definition 3. For a hypergraph H with a given set of nodes X, a given set of edges
E and a set of corresponding incidence matrix values {a}, the information I(H) is
following:
                                             1
                        I(H) = log|E| |E| =     log|{a}| |E|                   (6)
                                            |X|

    Consider several simple graphs for illustration, see Fig. 4, its information estimation
is at the Table 5. As wee see, H2 has a more complex structure comparing to H1 and,
therefore, its information value is higher. H3 is directed and each directed edge con-
tains less information, therefore, with the same number of edges its information value
is lower compared to H1 . Summing up, the information measure satisfies the desired
properties.
8 I. Baimuratov and Than N.


           H1             x2       H2              x2        H3            x2
                   e1                      e1                        e1

           x1                      x1                e3       x1

                   e2                      e2                        e2
                          x3                       x3                      x3


                                Fig. 4. The graph examples

                   Table 5. Information estimation of the graph examples

                                  H |X| |E| |{a}| I(H)
                                  1 3 2       2 0.33
                                  2 3 3       2 0.53
                                  3 3 3       3 0.21



3.3   Comparing visualizations with the information measure

We provide an example of comparing ontology visualizations with the developed in-
formation measure. Unlike expert-based evaluation, where we compared visualization
systems itself, for information measure we have to compare visualizations of a par-
ticular ontology. We use the DoCO ontology as it is a real ontology, used in different
applications, and it contains nontrivial axioms. We visualized some axioms of this on-
tology in Graphol and Logic Graphs (LGs) [12], the semantically oriented ontology
visualization method, developed by us. The list of axioms and their visualizations are
in Table 6.
    The example of an incidence matrix for Graphol was provided in Table 4. Now
consider the example of the incidence matrix for Logic Graphs. The incidence matrix
for the axiom 1 in Logic graphs is Table 7.
    We compare LGs with Graphol by measuring information of the corresponding vi-
sualizations for the sample of axioms, presented in Table 6. The result is in Table 8. As
we see, the average information of LGs on this sample is higher, than of Graphol.


4     Conclusion

In this research, we proposed several non-empirical metrics for ontology visualization
evaluation and comparing. These metrics are divided into two groups. The first group
includes three metrics: completeness, semanticity, and conservativeness. These metrics
require expert evaluation and, therefore, they are half-formal. As an example, we com-
pared two ontology visualization systems: VOWL and Logic Graphs.
    The second group consists of the completely computable information measure, de-
rived from the Hartley formula, that allows normalized measuring complexity of ontol-
ogy visualizations, represented as hypergraphs with incidence matrices. As an example,
we compared Logic Graphs with Graphol by measuring average information of visual-
izations of the sample of axioms from the DoCO ontology.
                         Non-Empirical Metrics for Ontology Visualizations Evaluation... 9

                Table 6. Visualizations of DoCO in LGs and Graphol

1                  chapter v ∃contains.paragraph t section




2   abstract v (chapter t section) u (∃ispartof.bodymatter t f rontmatter)




3               af terword v section u ∃ispartof.backmatter)



4     appendix v (section u headedcontainer) u (∃ispartof.backmatter)




5                backmatter v discourseelement u container



6                           chapterlabel v ¬sectionlabel



7                       chaptersubtitle v ∃ispartof.chapter




8                            f igure v mata t milestone



9        glossary v section u (∃ispartof.backmatter t f rontmatter)




                 Table 7. The incidence matrix for axiom 1 in LGs

                        subClassOf contains negation 1 negation 2 negation 3
            chapter         1         0         0          0          0
          conjunction       0         1         0          0          1
            domain          1         1         0          0          0
           paragraph        0         0         1          0          0
            section         0         0         0          1          0
10I. Baimuratov and Than N.

                  Table 8. Comparing information of ontology visualizations

                          N        LG              Graphol
                           |X| |E| |{a}| I(H) |X| |E| |{a}| I(H)
                          1 5 6      2 0.52 6 3         2 0.26
                          2 9 11 2 0.38 10 5            2 0.23
                          3 5 3      2 0.32 6 3         2 0.26
                          4 7 4      2 0.29 8 4         2 0.25
                          5 4 2      2 0.25 4 2         2 0.25
                          6 2 2      2    0.5 3 2       2 0.33
                          7 3 2      2 0.33 4 2         2 0.25
                          8 4 5      2 0.58 4 2         2 0.25
                          9 7 7      2    0.4 8 4       2 0.25
                                          0.4               0.26


     These results can be practically applied for choosing ontology visualization systems
in general and regarding a particular ontology. Considering the presented examples,
it is recommended to use LGs rather than VOWL in general, as it has higher scores
of completeness, semanticity, and conservativeness, and for visualizing the mentioned
fragment of the DoCO, as LGs has higher informativeness.


References
 1. Antoniazzi F., Viola F.: RDF graph visualization tools: A survey //2018 23rd Conference of
    Open Innovations Association (FRUCT).IEEE,25-36 (2018)
 2. Dudáš, M.,Lohmann, S., Svátek, V., Pavlov, D.: Ontology visualization methods and tools:
    A survey of the state of the art. The Knowledge Engineering Review, vol 33, (2018)
 3. Steffen, L., Stefan, N. , Florian, H. Thomas, E: Visualizing ontologies with VOWL. Semantic
    Web. 7. 399-419. (2016)
 4. R. Tamassia, C. Batini, and M. Talamo. An algorithm for automatic layout of entity-
    relationship diagrams. In C. G. Davis, S. Jajodia, P. A. Ng, and R. T. Yeh, editors, 3rd Int.
    Conf. on Entity-Relationship Approach (ER’83), pages 421-439. North-Holland, 1983.
 5. Purchase, Helen & Carrington, David & Allder, Jo-Anne. (2002). Empirical Evaluation of
    Aesthetics-based Graph Layout. Empirical Software Engineering. 7.
 6. Eades, Peter & Hong, Seok-Hee & Klein, Karsten & Nguyen, An. (2015). Shape-Based
    Quality Metrics for Large Graph Visualization. 502-514.
 7. Deborah, L., Frank, V.,, et al. Owl web ontology language overview. W3C recommendation,
    10(10):2004,(2004)
 8. Baader, F.,Calvanese, D. , McGuinness, D. ,Nardi, D. and Patel-Schneider,P.: The Descrip-
    tion Logic Handbook: Theory, Implementation, and Applications. Cambridge University
    Press, second edition, (2007)
 9. Graphol official website, Http://www.obdasystems.com/graphol. Last accessed 4 jul 2020
10. Ruskey, F.,Weston, M.:A Survey of Venn Diagrams. The Electronic Journal of Combina-
    torics, (2005)
11. VOWL official website, Http://vowl.visualdataweb.org/. Last accessed 4 jul 2020
12. Mouromtsev, D.,Baimuratov, I.: Logic Graphs: A complete visualization method for logical
    languages based on Ch. S. Peirce’s existential graphs. CEUR Workshop Proceedings, Vol.
    2344, 2019, pp. 1-10.
                            Non-Empirical Metrics for Ontology Visualizations Evaluation... 11

13. Ontology DoCO, https://sparontologies.github.io/doco/current/doco.html. Last accessed 4
    jul 2020
14. Dehmer, Matthias & Mowshowitz, Abbe. (2011). A History of Graph Entropy Measures.
    Information Sciences. 1. 57-78.
15. Wu, Jieting & Zhu, Feiyu & Liu, Xin & Yu, Hongfeng. (2018). An Information-Theoretic
    Framework for Evaluating Edge Bundling Visualization. Entropy. 20. 625.
16. R. V. L. Hartley, ”Transmission of information,” in The Bell System Technical Journal, vol.
    7, no. 3, pp. 535-563, July 1928.