<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Structural Equality Generating Dependencies and Definite Descriptions</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>David Toman</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Grant Weddell</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Cheriton School of Computer Science, University of Waterloo, 200 University Ave W.</institution>
          ,
          <addr-line>Waterloo, ON N2L 3G1</addr-line>
          ,
          <country country="CA">Canada</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <abstract>
        <p>We introduce a very general variety of path description dependencies (PDDs) for an expressive dialect of the FunDL family of description logics called structural PDDs. In general, PDDs enable capturing equality generating dependencies for an ontology in a progressively more fine-grained manner, starting with equality implied by simple alignment of facts about entities through to new structural PDDs in which equality only follows according to a structured alignment of non-empty sets of facts about an entity. We show that logical consequence for this new FunDL dialect is decidable if a given ontology appeals to an exclusive use of structural PDDs, but that logical consequence becomes undecidable when more course grained varieties of PDDs are also allowed in the ontology. An extension to a referring expression type language for defining concepts in this description logic to serve as referring expressions that depend on structural identification is also presented and is tied to a diagnosis of a singularity condition for such concepts to logical consequence of PDDs for an ontology.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;referring expressions</kwd>
        <kwd>path description dependencies</kwd>
        <kwd>structural equality</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Structured data sources abound, and ontology based data access is all about querying such sources via
an ontological understanding of their content. Here, efective ways to communicate answers to queries
will depend critically on communicating references to underlying entities via referring expressions, also
called definite descriptions [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Earlier work has introduced the notion of referring expression types,
each of which will define a set of possible referring expressions [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ]. That such descriptions achieve
unambiguous reference will turn depend on ontological knowledge of so-called equality generating
dependencies, for example, knowing that a person will have a unique social insurance number, or that
a room, when non-empty, will have a unique combination of people occupying the room.
      </p>
      <p>
        In this paper, we introduce a more expressive variety of such dependencies when an ontological
understanding is expressed in terms of a description logic, in particular, in terms of an expressive dialect
of the FunDL family of description logics [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. For a better alignment with common data sources such as
relational databases, all such logics are feature-based instead of role-based, that is, consider facts to be
captured with partial functions instead of more general binary relationships. Such logics have recently
included a concept constructor called a path description dependency (PDD) in which component path
descriptions can be annotated to define progressively richer conditions for equality generation [
        <xref ref-type="bibr" rid="ref6 ref7 ref8">6, 7, 8</xref>
        ].
There are two possible annotations that have been considered. Both relate to the respective non-empty
sets of entities reachable by a path description: a “set intersection” annotation that is satisfied when
there is at least one such entity in common, and a “set equality” annotation that is satisfied when
the respective non-empty sets of reachable entities are the same. In this paper, we introduce a new
more expressive “structural equality” annotation for path descriptions in which equality only follows
according to a structured alignment of non-empty sets of facts about an entity.
      </p>
      <p>For example, consider where a document will have a style consisting of sets of sets of keywords,
where each top level set is a group of keywords occurring in one of the document’s paragraphs. It will
now be possible to identity document styles with exactly the same keywords that are also grouped in
exactly the same way. This is illustrated below in which three graphs define how keywords kw1, kw2
and kw3 relate to three possible document styles ds1, ds2 and ds3 via a path description of the form
kw-grp− .kw-dom− .kw-ran:</p>
      <p>kw-grp ↗d s↑1 ↖
kw-dom</p>
      <p>↑
kw-ran ↓
kw1
↑
↓
kw2
↑
↓
kw3
↑
↓
kw1
↗d s↑2
↑ ↖
↓
kw2</p>
      <p>↓
kw3
↑
↓
kw1
d↗s3↖ kw-grp
↑
kw-dom
↓ kw-ran
kw2
Here, the path description consists of the three features kw-grp, kw-dom and kw-ran, and characterizes
how the keywords can be “reached” from a document style by following a path of feature values: first
the inverse of kw-grp to one of the document style’s keyword groups, and then the inverse of kw-dom
followed by kw-ran to the keywords in a group.</p>
      <p>A non-empty set intersection annotation for this path would imply that all three graphs must describe
the same document style while this would only hold for the left two graphs with the more fine-grained
non-empty set equality annotation. But now, with our new structural equality annotation, the left two
must also describe distinct document styles since the same keywords are now grouped by paragraphs
in diferent ways.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Summary of Definitions and Results</title>
      <p>
        We define a family of description logics set-ℒℱ ℐ that are members of the FunDL dialects of
description logic [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], We use standard symbols for and ways of interpreting primitive features and concepts as
functions and sets of objects. The main novelty of the set-ℒℱ ℐ family is allowing path descriptions
Pd to participate in PDDs.
      </p>
      <p>Definition 1 (Path Descriptions). A path description is defined by the grammar</p>
      <p>Pd ::= id | . Pd |  − . Pd | C?. Pd,
for  ∈ F, where  − is called the inverse of  ,  a concept, and with the stipulation that substrings of the
form . − and  − . do not appear in any path description Pd. □</p>
      <p>In this paper we study the notion of structural path description agreement in PDDs.
Definition 2 (Structural Pd Agreement). Let Pd be a path description, ℐ an interpretation and  and 
be two △ elements. We say that  and  structurally agree on Pd, Pd≃(, ), when:
 =  if Pd = id ,
∀1, 1.(1 =  ℐ ()) ∧ (1 =  ℐ ()) → Pd1≃(1, 1) if Pd = . Pd1,
∀1.( ℐ (1) = ) → ∃1.( ℐ (1) = ) ∧ Pd1≃(1, 1)</p>
      <p>∧ ∀1.( ℐ (1) = ) → ∃1.( ℐ (1) = ) ∧ Pd1≃(1, 1) if Pd =  − . Pd1,
 ∈ Cℐ ∧  ∈ Cℐ ∧ Pd1≃(, ) if Pd = C?. Pd1 .</p>
      <p>We introduce other notions of path equality (discussed in the introduction) in place of ≃ in the
definition of PDD below in Definition 4.</p>
      <p>Definition 3 (Concepts, Subsumptions, and TBoxes). A {≃}-ℒℱ ℐ (a member of the set-ℒℱ ℐ
family) concept description  is constructed from primitive concepts using Boolean concept constructors
⊓, ⊔, and ¬, value restrictions on features ∀., unqualified existential restrictions on features ∃ and
inverse features ∃ − , and the path description dependency (PDD) of the form  : Pd1≃, . . . , Pd≃ → Pd≃ .
The semantics of all the derived concept descriptions  is defined in the standard way; for the PDD concept
constructor the semantics is given by
( : Pd1≃, ..., Pd≃ → Pd≃)ℐ =</p>
      <p>{ | ∀  ∈ ℐ : Pdℐ ({}) ̸= ∅ ∧ Pdℐ ({}) ̸= ∅ ∧ (⋀︀=1 Pd≃(, )) → Pd≃(, )},
where, for a set  ⊆ △ , Pdℐ () is the set of △ elements reachable from  in ℐ via Pd. A subsumption
is an expression of the form 1 ⊑ 2, where the  are concepts, and where PDDs occur only in 2 but
not within the scope of negation.1 A terminology (TBox)  consists of a finite set of subsumptions, and a
posed question  is a single subsumption. The notions of satisfaction and entailment are standard. □</p>
      <p>
        The entailment in {≃}-ℒℱ ℐ can be shown decidable via mapping to (unsatisfiability of) an
Ackermann-prefix [
        <xref ref-type="bibr" rid="ref11 ref12">11, 12</xref>
        ] formula:
Theorem 1. The entailment problem in {≃}-ℒℱ ℐ is complete for EXPTIME.
      </p>
      <p>
        Alternative path-based Pd agreements, introduced in [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ], have been defined as follows:
Definition 4 (Alternative Path-Based PD Agreement(s)). Let ℐ be an interpretation and 1 and 2 be two
△ elements. We write Pd∩(1, 2) to express Pdℐ ({1}) ∩ Pdℐ ({2}) ̸= ∅ (the set intersection agreement)
and Pd≈ (1, 2) to express Pdℐ ({1}) = Pdℐ ({2}) ̸= ∅ (the non-empty set agreement). □
      </p>
      <p>The following Theorem shows that mixing path agreement variants in PDDs/TBoxes leads to
undecidability:
Theorem 2. The entailment problems in {≃, ≈} -ℒℱ ℐ and {≃, ∩}-ℒℱ ℐ are undecidable.</p>
      <p>
        The members of the set-ℒℱ ℐ family are designed to serve as the underlying ontological languages
that allow referring expressions [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] to be plural—a reference to an object now can be achieved by
specifying a set of appropriately related objects (that have explicit identifiers).
      </p>
      <p>Definition 5 (Referring Expression Types). A referring expression type () is defined by the following
grammar, where A is a primitive concept.</p>
      <p>::= {?} | A →  | ∃. | ∃ − . | 1 ⊓ 2 | 1 ; 2
The language of referring concepts inhabiting , ℒ(), is defined as follows:</p>
      <p>ℒ({?}) = {{} |  is a constant symbol}
ℒ(A → ) = {A ⊓  |  ∈ ℒ()}</p>
      <p>ℒ(∃.) = {∃. |  ∈ ℒ()}
ℒ(∃ − .) = {(∃ − .[⃗/⃗1]) ⊓ . . . ⊓ (∃ − .[⃗/]) |  ∈ ℒ()}
⃗
ℒ(1 ⊓ 2) = {1 ⊓ 2 | 1 ∈ ℒ(1) ∧ 2 ∈ ℒ(2)}</p>
      <p>ℒ(1; 2) = ℒ(1) ∪ ℒ(2)
where [⃗/⃗] is the concept  in which all nominals ⃗ in  have been replaced by ⃗; this replacement is
over all possible distinct choices of ⃗1, . . . , ⃗ for ⃗ and all  ∈  . Given a TBox  and referring expression
type , the singularity problem for  with respect to  is to determine if |ℐ | ≤ 1 for every  ∈ ℒ()
and every model ℐ of  . □
Example 1. Each of the three graphs in our introductory example are parse trees for concepts occurring
in ℒ() when  is “∃kw-grp− .∃kw-dom− .∃kw-ran.{?}”. For example, the middle graph would be the
concept
(∃kw-grp− .∃kw-dom− .∃kw-ran.{1}) ⊓
(∃kw-grp− .(∃kw-dom− .∃kw-ran.{2} ⊓ ∃kw-dom− .∃kw-ran.{3})).</p>
      <p>To formulate our result we need to normalize the referring expression types.</p>
      <p>Definition 6 (Normalized Referring Expression Types). We use Norm() to refer to an exhaustive
application of the following rewrite rules:</p>
      <p>
        A → (1; 2) ↦→ A → 1; A → 2
 ⊓ (1; 2) ↦→  ⊓ 1;  ⊓ 2
(1; 2) ⊓  ↦→ 1 ⊓  ; 2 ⊓ 
1Violating this latter condition leads immediately to undecidability [
        <xref ref-type="bibr" rid="ref10 ref9">9, 10</xref>
        ].
      </p>
      <p>∃.(1; 2) ↦→ ∃.1; ∃.2
∃ − .(1; 2) ↦→ ∃ − .1; ∃ − .2
□</p>
      <p>
        The definition of Norm is an adaptation of referring expression type normalization in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] with the
following consequences: (1) ℒ() = ℒ(Norm()), and (2) all preference operators (“;”) are at the top
level of Norm(). We call the maximal “;”-free parts of Norm() preference-free components. The
following auxiliary function will be used to formulate subsumptions in set-ℒℱ ℐ to statically test
for singularity of each preference free component.
      </p>
      <p>Pds({?}) = {(id )≃}
Pds(A → ) = {(A?. Pd)≃ | (Pd)≃ ∈ Pds()}</p>
      <p>Pds(∃.) = {(. Pd′)≃ | (Pd′)≃ ∈ Pds()}</p>
      <p>Pds(∃ − .) = {( − . Pd′)≃ | (Pd′)≃ ∈ Pds()}</p>
      <p>Pds(1 ⊓ 2) = Pds(1) ∪ Pds(2)
The function extracts a set of path descriptions adorned with “≃” leading to nominals from the
preferencefree referring expression type. The singularity test is now as follows:
Theorem 3. Let  be a TBox in set-ℒℱ ℐ and  a referring expression type. Then all referring
concepts in ℒ() are singular with respect to  if and only if  |= ⊤ ⊑ ⊤ : Pds(′) → id holds for
every preference-free component ′ of Norm(). □</p>
    </sec>
    <sec id="sec-3">
      <title>Declaration on Generative AI</title>
      <p>The author(s) have not employed any Generative AI tools.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>D.</given-names>
            <surname>Toman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. E.</given-names>
            <surname>Weddell</surname>
          </string-name>
          , Structural Equality Generating Dependencies in Definite Descriptions,
          <source>Technical Report CS-2025-05</source>
          , Cheriton School of Computer Science, University of Waterloo,
          <year>2025</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>B.</given-names>
            <surname>Russell</surname>
          </string-name>
          , On denoting,
          <source>Mind</source>
          <volume>14</volume>
          (
          <year>1905</year>
          )
          <fpage>479</fpage>
          -
          <lpage>493</lpage>
          . URL: http://www.jstor.org/stable/2248381.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Borgida</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Toman</surname>
          </string-name>
          , G. Weddell,
          <article-title>On referring expressions in query answering over first order knowledge bases</article-title>
          ,
          <source>in: Proc. Principles of Knowledge Representation and Reasoning</source>
          ,
          <source>KR</source>
          <year>2016</year>
          ,
          <year>2016</year>
          , pp.
          <fpage>319</fpage>
          -
          <lpage>328</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Borgida</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Franconi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Toman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. E.</given-names>
            <surname>Weddell</surname>
          </string-name>
          ,
          <article-title>Understanding document data sources using ontologies with referring expressions</article-title>
          ,
          <source>in: AI 2022: Advances in Artificial Intelligence</source>
          , volume
          <volume>13728</volume>
          <source>of LNCS</source>
          , Springer,
          <year>2022</year>
          , pp.
          <fpage>367</fpage>
          -
          <lpage>380</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S.</given-names>
            <surname>McIntyre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Toman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. E.</given-names>
            <surname>Weddell</surname>
          </string-name>
          , FunDL
          <article-title>- A family of feature-based description logics, with applications in querying structured data sources</article-title>
          , in: Description Logic,
          <string-name>
            <given-names>Theory</given-names>
            <surname>Combination</surname>
          </string-name>
          , and
          <string-name>
            <surname>All</surname>
          </string-name>
          That - Essays Dedicated to Franz
          <source>Baader on the Occasion of His 60th Birthday</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>404</fpage>
          -
          <lpage>430</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>E.</given-names>
            <surname>Feng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Borgida</surname>
          </string-name>
          , E. Franconi,
          <string-name>
            <given-names>P. F.</given-names>
            <surname>Patel-Schneider</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Toman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. E.</given-names>
            <surname>Weddell</surname>
          </string-name>
          ,
          <article-title>Path Description Dependencies in Feature-Based DLs</article-title>
          , in: Description Logics, volume
          <volume>3515</volume>
          <source>of CEUR Workshop Proceedings</source>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>E.</given-names>
            <surname>Feng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Toman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. E.</given-names>
            <surname>Weddell</surname>
          </string-name>
          ,
          <article-title>On mixed semantics of path description dependencies in FunDL</article-title>
          , in: Description Logics, volume in
          <source>press of CEUR Workshop Proceedings</source>
          ,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>E.</given-names>
            <surname>Feng</surname>
          </string-name>
          , E. Franconi,
          <string-name>
            <given-names>P. F.</given-names>
            <surname>Patel-Schneider</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Toman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. E.</given-names>
            <surname>Weddell</surname>
          </string-name>
          ,
          <article-title>Equality generating dependencies in description logics via path agreements</article-title>
          ,
          <source>in: AI (2)</source>
          , volume
          <volume>15443</volume>
          of Lecture Notes in Computer Science,
          <year>2024</year>
          , pp.
          <fpage>214</fpage>
          -
          <lpage>227</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>D.</given-names>
            <surname>Toman</surname>
          </string-name>
          , G. Weddell,
          <article-title>On Keys and Functional Dependencies as First-Class Citizens in Description Logics</article-title>
          ,
          <source>in: Proc. of Int. Joint Conf. on Automated Reasoning (IJCAR)</source>
          ,
          <year>2006</year>
          , pp.
          <fpage>647</fpage>
          -
          <lpage>661</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>D.</given-names>
            <surname>Toman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. E.</given-names>
            <surname>Weddell</surname>
          </string-name>
          ,
          <article-title>On Keys and Functional Dependencies as First-Class Citizens in Description Logics</article-title>
          ,
          <source>J. Aut. Reasoning</source>
          <volume>40</volume>
          (
          <year>2008</year>
          )
          <fpage>117</fpage>
          -
          <lpage>132</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>W.</given-names>
            <surname>Ackermann</surname>
          </string-name>
          , Uber die Erfullbarkeit gewisser Zahlausdrucke,
          <source>Mathematische Annalen</source>
          <volume>100</volume>
          (
          <year>1928</year>
          )
          <fpage>638</fpage>
          -
          <lpage>649</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Fürer</surname>
          </string-name>
          ,
          <article-title>Alternation and the Ackermann Case of the Decision Problem</article-title>
          ,
          <string-name>
            <surname>L</surname>
          </string-name>
          'Enseignement Math.
          <volume>27</volume>
          (
          <year>1981</year>
          )
          <fpage>137</fpage>
          -
          <lpage>162</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>