<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Knowledge representation for neuro-symbolic digital building twin querying</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Stéphane Reynaud</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anthony Dumas</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ana Roxin</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>B27-AI, R&amp;D</institution>
          ,
          <addr-line>2 rue René Char, 21000 Dijon</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Laboratoire d'Informatique de Bourgogne (LIB EA 7534), Université de Bourgogne</institution>
          ,
          <addr-line>21000 Dijon</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <fpage>4</fpage>
      <lpage>30</lpage>
      <abstract>
        <p>The complexity of modern construction projects necessitates collaboration among diverse stakeholders and the handling of substantial data volumes. Significant investments in digitization have occurred globally to address this complexity, emphasising the need for interoperability among standards and diverse knowledge sources. The emergence of Digital Building Twins (DBTs) further underscores the importance of integrating heterogeneous data to create comprehensive digital representations of buildings. DBTs enable real-time data integration and support various phases of architectural development, ofering practitioners access to historical, present, and predictive data. In this context, our research focuses on enhancing the accessibility and interpretability of DBT data through natural language querying. Leveraging domain-specific ontology and advanced AI techniques, our approach facilitates eficient communication between users and DBTs, enabling rapid extraction of specific building details. This paper presents our methodology, including knowledge representation, semantic analysis, and information extraction, along with evaluation results demonstrating its efectiveness in improving DBT querying performance.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Digital Building Twin (DBT)</kwd>
        <kwd>Building Information Modelling (BIM)</kwd>
        <kwd>Artificial Intelligence (AI)</kwd>
        <kwd>Neurosymbolic AI</kwd>
        <kwd>Ontology</kwd>
        <kwd>Knowledge representation</kwd>
        <kwd>Knowledge base question answering (KBQA)</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Construction projects are complex and require collaboration between various stakeholders to
develop increasingly intelligent structures. Smart constructions entail handling substantial data
volumes, which must be organised for services like digital archives, data analysis, and predictive
interpretation. Significant investments in digitisation occur globally to address the complexity
of information required for decision-making in today’s construction industry, necessitating
interoperability among the underlying standards used. Historically, GIS (Geographic Information
Systems) addresses the modelling of the environment surrounding a construction, while BIM
(Building Information Modelling) tackles the construction itself. To have a complete model of
constructions and their environment, one must integrate GIS and BIM standards, thus blurring
traditional distinctions between the two domains, highlighting the need to remove barriers to
information sharing across disciplines, ultimately optimising investments and enhancing data
quality [
        <xref ref-type="bibr" rid="ref1 ref50">1</xref>
        ].
      </p>
      <p>
        Additionally, Digital Building Twins (DBT) represent a crucial aspect of modern construction
processes. A DBT entails real-time communication among sensors to create a comprehensive
digital representation of a building at a specific moment, reflecting the evolving understanding
of construction in the digital era. Typically relying on Building Information Modelling (BIM), a
DBT represents the virtual counterpart of a physical structure, replicating its real attributes and
conditions [
        <xref ref-type="bibr" rid="ref1 ref50">1</xref>
        ]. DBTs enable seamless real-time data integration and support multiple phases
of architectural development, from design to operational and maintenance stages. DBTs allow
AEC (Architecture, Engineering, Construction) practitioners to gain access to historical, present,
and predictive data, empowering them to explore diverse optimisation possibilities. In the
era of Industry 4.0, DBTs are instrumental in smart buildings, streamlining data aggregation,
optimising operations, and facilitating predictive maintenance processes [
        <xref ref-type="bibr" rid="ref2 ref51">2</xref>
        ].
      </p>
      <p>To best understand the challenges associated with accessing information from a DBT, let us
consider the example of a large building with a surface of 10,000 square meters and 50 stories,
which is a structure that typically has numerous emergency exits. Querying information about
these exits, including their dimensions and compliance status, involves using some software tool
to navigate through diferent levels and inspect potential existing door labels such as "fire exit" or
"emergency exit." This task is time-consuming, taking at least a dozen minutes for experienced
users and up to 30 minutes for less seasoned practitioners, particularly if thoroughness is
required. Additionally, obtaining dimensions of door frames often requires guesswork, as this
information may not be readily available.</p>
      <p>
        Our approach aims to eficiently address such queries by providing specific building details
that may not be directly represented in the existing standard schema of the DBT. The approach
emphasises the efectiveness and user-centric design of the enhanced information retrieval
system presented in [
        <xref ref-type="bibr" rid="ref3 ref52">3</xref>
        ] by enabling users to obtain this information in seconds rather than
minutes.
      </p>
      <p>Following our definition of a DBT, this article presents our approach for an intuitive "dialogue"
between humans and DBTs by rendering the data within a DBT accessible and interpretable
via machine processing. Our approach allows expert and novice users to ask queries in natural
language by leveraging domain-specific and semantically rich ontology crafted by domain
experts, i.e. the OB27AI ontology. As such, it leverages advanced capabilities derived from the
domain of Artificial Intelligence (AI), namely symbolic AI (through knowledge representation),
deep learning and Natural Language Processing (NLP).</p>
      <p>
        In this paper, we focus on improving the performance of queries addressed in natural language
over knowledge representations of DBTs. Our contributions are summarised as follows:
• We describe a complete approach for natural language querying of DBTs
• We define a new knowledge representation of DBTs in the form of the OB27AI ontology
and detail its design and contents
• We enhance our approach presented in [
        <xref ref-type="bibr" rid="ref3 ref52">3</xref>
        ] in terms of semantic analysis and information
extraction
• We specify our implementation for querying the DBT along with its related phases,
i.e. query analysis, query parsing, grounding, extraction, generation, execution and
transformation
• We evaluate the performance improvement of our approach compared to [
        <xref ref-type="bibr" rid="ref3 ref52">3</xref>
        ], both on a
general level (in terms of recall, precision and F1 score) and on the level of each phase.
      </p>
      <p>The article is structured as follows: section 2 provides some definitions forming the necessary
scientific background for our research, and section 3 resumes existing research done in the
related domains. Our approach is introduced in section 4, while its implementation is provided
in section 5. Last but not least, section 6 gives the results of the evaluation of our prototype,
and section 7 concludes this paper.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Scientific background</title>
      <p>The following sections provide the necessary definitions and references to understand our
approach’s components.</p>
      <sec id="sec-2-1">
        <title>2.1. Modelling &amp; querying knowledge</title>
        <p>
          Semantic Web (SW) aims to make heterogeneous information accessible to a machine, allowing
to turn data into machine-processable knowledge. SW uses an ensemble of W3C standards
for annotating data present on the Web and enabling knowledge graphs (KG). Based on the
existing Web standards, e.g. URI (Universal Resource Identifier) for resource identification and
HTTP as a universal access protocol, SW standards build upon RDF (Resource Description
Framework) for representing simple facts about resources (identified by HTTP URIs) and upon
the OWL (Web Ontology Language) family for describing more complex knowledge in the form
of ontologies. In an ontology, knowledge is expressed as "Subject - Predicate - Object" triples
and can be queried or modified using SPARQL (SPARQL Protocol and RDF Query Language).
Additionally, Linked Data (LD) has been defined by the W3C 1 as four principles for enhancing
the share and reuse of such annotated data to form a global network of knowledge accessible
and comprehensible by a network of machines. This link between diferent silos of knowledge
can be expressed in diferent ways, depending on the need. If it is necessary to align knowledge
for inference purposes (e.g. owl:equivalentClass), the reasoning mechanics provided by OWL
are to be preferred but can be costly; if the link to be established is strictly semantic, using the
vocabulary of an existing ontology such as SKOS is suficient [
          <xref ref-type="bibr" rid="ref4 ref53">4</xref>
          ].
        </p>
        <p>The OWL family comprises several profiles for ontology description languages based on
Description logic (DL) with diferent levels of expressivity. They allow for defining complex
knowledge representations, combining concepts, relationships among concepts and constraints
over these elements, generally expressed as logical rules or necessary and suficient conditions
over class definitions. Given their symbolic and logical nature, it is possible to reason upon such
knowledge representations by applying specific algorithms (called reasoners) that allow implicit
axioms from the explicit ones contained in the ontology to be inferred. KGs are obtained when
using LD principles to connect elements from diferent ontologies.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. OpenBIM</title>
        <p>
          Building Information Modelling (BIM) has become vital in the last decade in AEC (Architecture,
Engineering and Construction). Following ISO 23386:2020 [
          <xref ref-type="bibr" rid="ref5 ref54">5</xref>
          ], BIM is defined as the "use of a
shared digital representation of an asset to facilitate design, construction and operation processes
to form a reliable basis for decisions". OpenBIM is a collaborative process supported by the
BuildingSMART International organisation, emphasising the interoperability of construction
project data through open standards and common vocabularies.
        </p>
        <p>
          Industry Foundation Classes (IFC) [
          <xref ref-type="bibr" rid="ref55 ref6">6</xref>
          ] is the openBIM standard exchange format for
exchanging building and infrastructure digital representations in the AEC industry. Initially designed
for sharing and exchanging product data, the diferent efective implementations of the IFC data
model have proven their limitations for querying and analysis tasks[
          <xref ref-type="bibr" rid="ref56 ref7">7</xref>
          ]. Indeed, the primary
goal of IFC is to efectively represent basic concepts [
          <xref ref-type="bibr" rid="ref57 ref8">8</xref>
          ] and descriptive building information,
often characterised by complex structures. This complexity has led to challenges in querying
and managing IFC instance data. While promoted as a vector for interoperability, IFC lacks the
lfexibility to integrate and process data from multiple sources [
          <xref ref-type="bibr" rid="ref58 ref9">9</xref>
          ].
        </p>
        <p>
          OpenBIM also defines the exchange requirements for every type of business process and
every BIM design stage through Information Delivery Manuals (IDMs) [
          <xref ref-type="bibr" rid="ref10 ref59">10</xref>
          ]. IDMs describe
the framework for implementing Model View Definitions (MVDs) 2, representing a subset
of the information strictly necessary for a specific business process [
          <xref ref-type="bibr" rid="ref11 ref60">11</xref>
          ]. For the IFC 2x3
TC1 specification, the most common MVD is the "Coordination View 2.0", which enables
BIM requirements to be matched to each discipline, such as architecture, MEP and structure,
represented respectively by the CV2.0-Arch, CV2.0-MEP and CV2.0-Struct views. Once the
MVD has been extracted from the IFC file, the IDM file is used within the visualisation software
to check the conformity of the MVD concerning the requirements criteria. Finally, the BIM
Collaboration Format (BCF), encapsulated in IFC, is used for reporting purposes to alert the
various stakeholders and ensure that the project runs smoothly.
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. Interoperability</title>
        <p>
          As seen in section 2.2, the AEC industry has wide heterogeneity in existing knowledge and
data. The diversity and variation in information types, structures, formats, and semantics pose
interoperability challenges, as defined by ISO/IEC 2382:2015 [
          <xref ref-type="bibr" rid="ref12 ref61">12</xref>
          ].
        </p>
        <p>
          Current standards outline three primary levels of interoperability: data, syntactic, and
semantic. These levels are interconnected and progressively enhance each other, with lower tiers
furnishing the necessary components for the functionalities of higher levels [
          <xref ref-type="bibr" rid="ref13 ref62">13</xref>
          ]:
• Data interoperability, as defined by ISO/IEC 20944-1:2013 [
          <xref ref-type="bibr" rid="ref14 ref63">14</xref>
          ], relates to the
comprehensive spectrum of activities involving data creation, interpretation, computation, utilisation,
transmission, and interchange.
• Syntactic interoperability, as defined by ISO 22378:2022 [
          <xref ref-type="bibr" rid="ref15 ref64">15</xref>
          ], refers to the capability of
two or more systems or services to exchange structured information seamlessly. It ensures
that the formats of the exchanged information are comprehensible to the participating
systems, as outlined by ISO/IEC 19941:2017 [
          <xref ref-type="bibr" rid="ref16 ref65">16</xref>
          ].
2https://technical.buildingsmart.org/standards/ifc/mvd/mvd-database/
• Semantic interoperability, as defined by ISO/TS 18308:2011 [
          <xref ref-type="bibr" rid="ref17 ref66">17</xref>
          ], refers to the ability of
data exchanged between systems to be understood in terms of fully specified domain
concepts. It enables systems or services to automatically interpret and utilise exchanged
information accurately, as outlined by ISO 22378:2022 [
          <xref ref-type="bibr" rid="ref15 ref64">15</xref>
          ]. Furthermore, it ensures that
the participating systems comprehend the meaning of the data model within the context
of a specific subject area, as outlined by ISO/IEC 19941:2017 [
          <xref ref-type="bibr" rid="ref16 ref65">16</xref>
          ].
        </p>
        <p>Data and syntactic interoperability are already extensively addressed by standard
protocols (e.g. TCP/IP, HTTP) and the adoption of syntax standards (e.g. XML, HTML). Semantic
interoperability, though, is yet to be fully addressed by existing standards and approaches.</p>
      </sec>
      <sec id="sec-2-4">
        <title>2.4. Natural Language Processing</title>
        <p>
          Although BIM is considered state-of-the-art in the AEC industry, its adoption rate remains
relatively low [
          <xref ref-type="bibr" rid="ref18 ref67">18</xref>
          ]. This slow adoption highlights the need to explore methods to enhance users’
interaction with BIM models, such as using Natural Language Processing (NLP).
        </p>
        <p>
          NLP is a branch of AI that enables computers to understand, interpret, and generate human
language meaningfully. NLP utilises techniques from various fields, including linguistics,
computer science, and machine learning, to bridge the gap between human communication
and computational systems. Contrary to the formal knowledge models provided by ontologies,
language models are ambiguous and vague, while the relation between symbols to objects is not
clearly defined as in DL. Language models are defined as a "probability distribution describing
the likelihood of any string" [
          <xref ref-type="bibr" rid="ref19 ref68">19</xref>
          ]. Given these issues, approaches based on neural networks
have encountered a large success in this field. Providing an extensive state-of-the-art of these
goes beyond the scope of this article, but to best understand our contribution, two approaches
are worth presenting: the Transformer architecture and Pre-trained Language Models (PLMs).
        </p>
        <p>
          The Transformer architecture, outlined in [
          <xref ref-type="bibr" rid="ref20 ref69">20</xref>
          ], introduces the concept of "self-attention",
allowing each word to consider all others, disregarding their positions. It enables the language
model to prioritise words efectively (using positional embedding), which is crucial for generating
accurate outputs if we consider the above definition of a language model. Comprising encoder
and decoder layers with self-attention and feed-forward neural networks, the Transformer
replaces traditional recurrent and convolutional layers, improving parallelisation and capturing
long-range dependencies more eficiently. It also incorporates positional encodings to maintain
sequence order information [
          <xref ref-type="bibr" rid="ref19 ref68">19</xref>
          ].
        </p>
        <p>
          Built on the Transformer architecture, several approaches called Pretrained Language Models
(PLMs) have been published, i.e. BERT (Bidirectional Encoder Representations from
Transformers) or GPT (Generative Pretrained Transformer). As an extension of the Transformer
architecture, PLMs are a form of transfer learning that uses large and general text datasets to
train an initial NLP model with self-supervised learning objectives. The initial NLP model is
refined using smaller domain-specific text datasets [
          <xref ref-type="bibr" rid="ref19 ref68">19</xref>
          ].
        </p>
        <p>There are three broad categories of PLM transformers:
• Encoder-only (e.g. Bidirectional Encoder Representations from Transformers aka BERT)
are meant to generate contextual representations of text. We aim to test SpaCy LLM,
based on OpenLLaMA3 which uses Large Language Model Meta AI (LLaMA), developed
by Meta.
• Decoder-only (e.g. Generative Pre-trained Transformer aka GPT) are purposed to generate
new text based on a context. We plan to test Falcon4, developed by TII (Technology
Innovation Institute) UAE and GPT, developed by OpenAI, which is only available
onpremises in version 25 (now in version 4).
• Encoder-decoder (e.g. Text-to-Text Transfer Transformer aka T5) are geared toward
transforming text. We seek to test Flan-T56, developed by Google, a T5 fine-tuned using
instruction-based prompts.</p>
        <p>
          NLP can be split into two main domains [
          <xref ref-type="bibr" rid="ref21 ref70">21</xref>
          ]: Natural Language Understanding (NLU) and
Natural Language Generation (NLG). NLU deciphers human language, parsing inputs into a
structured format for machines. It includes tasks like entity recognition, syntactic, and semantic
analysis. NLG[
          <xref ref-type="bibr" rid="ref22 ref71">22</xref>
          ], on the other hand, creates coherent text from structured data, performing
tasks such as report generation and personalized messaging. Together, NLU and NLG cover
language processing from comprehension to production.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Related work</title>
      <p>This section summarises existing standards and works in the literature about our approach’s
diferent components.</p>
      <sec id="sec-3-1">
        <title>3.1. Knowledge modelling for the digital building twin</title>
        <p>
          To address the limitations of the IFC standard as mentioned in section 2.2, bSI organisation
promotes the ifcOWL ontology, which serves as a foundational domain ontology for the AEC
industry [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ].
        </p>
        <p>
          ifcOWL provides an OWL2-DL semantic representation of the IFC schema, which can be
used not only to access information in an IFC model, but also to link it with external data not
explicitly described in the IFC schema. Due to the verbose structure underlying the IFC data
model (more than 900 classes), the resulting OWL structure has a high OWL2-DL expressivity
(SROIQ(D)) [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ], which inevitably leads to the collapse of reasoning engine performance due to
the number of axioms and assertions to be processed [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ].
        </p>
        <p>
          When considering the use of ifcOWL for handling and interpreting DBT data, several types
of research have highlighted the need for less expressive structures [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ] as well as for adapted
querying approaches [
          <xref ref-type="bibr" rid="ref56 ref7">7</xref>
          ]. Additionally, several methodologies have been published
addressing the simplification of DBT data conversion from IFC into an ontology, such as IFC2LD 7,
3https://github.com/openlm-research/open_llama
4https://falconllm.tii.ae/
5https://openai.com/research/gpt-2-1-5b-release
6https://github.com/google-research/t5x/blob/main/docs/models.md#flan-t5-checkpoints
7https://github.com/Web-of-Building-Data/ifc2ld.git
IFCtoRDF8, IFCtoLBD9, KGG10 [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ]. In addition, numerous works have led to ontologies with
narrower scopes and/or simplified structures [
          <xref ref-type="bibr" rid="ref25 ref27 ref28">25, 27, 28</xref>
          ].
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Approaches for aligning ontologies</title>
        <p>
          “Ontology alignment is the process of identifying correspondences between semantically related
entities of diferent ontologies” [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ] to address semantic heterogeneity and enable semantic
interoperability. This process leads to identifying various types of relationships among entities
(e.g. equivalence, subsumption, disjointness).
        </p>
        <p>
          As we have already seen, there is a wide disparity in the data around the field of AEC
construction engineering. Generally speaking, ontologies answer how to structure all this data,
enabling it to be expressed as knowledge. Moreover, depending on the formalism used (RDF
/ OWL / etc.), they enable us to define a balance between the expressiveness and decidability
of the knowledge they describe [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ], to express the axioms and rules of behaviour of each of
the entities within the domains in which they are represented, and to apply a logical reasoning
mechanism to them.
        </p>
        <p>
          Aligning 2 or more ontologies to map the entities (ABox / TBox) of each domain under study
addresses the problem of semantic heterogeneity in the data [
          <xref ref-type="bibr" rid="ref29 ref31">29, 31</xref>
          ].
        </p>
        <p>
          There are 4 main approaches to ontological alignment [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ]:
1. Terminological analysis methods focus on lexical, syntactic or linguistic similarities [
          <xref ref-type="bibr" rid="ref33">33</xref>
          ].
        </p>
        <p>
          They consist of comparing labels qualifying knowledge based on the notions of synonymy,
polysemy, translation or according to methods for calculating frequencies of occurrence
(TF-IDF, bags of words, etc.).
2. Approaches based on internal ontological structures (metadata, cardinalities, constraints,
attributes, etc.) and external structures (taxonomies) [
          <xref ref-type="bibr" rid="ref34">34</xref>
          ].
3. Instance-based approaches - extensions use statistical and data analysis methods to
determine a level of sharing of standard entity-to-entity information and resources between
the ontologies to be aligned.
4. Semantic similarity approaches are based on the meaning and context of entities. With
the rise of Large Language Models, many researchers are looking into ontology alignment
based on generative AI [
          <xref ref-type="bibr" rid="ref35">35</xref>
          ], as they are particularly well suited to natural language-based
information retrieval.
        </p>
        <p>Ontology matching approaches are evaluated by the OAEI (Ontology Alignment Evaluation
Initiative)11, which brings together all the research on the subject. Ontology alignment remains
a real challenge for the scientific community.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Knowledge Base Question Answering</title>
        <p>
          Knowledge Base Question Answering (KBQA) systems aim to find answers to queries expressed
in natural language from a Knowledge Base (KB). Present methodologies for KBQA can be
8https://github.com/pipauwel/IFCtoRDF
9https://github.com/jyrkioraskari/IFCtoLBD
10https://github.com/kyriakos-katsigarakis/openmetrics/tree/master/openmetrics-kgg
11http://oaei.ontologymatching.org/
broadly categorised into two main groups:
1. Semantic Parsing Based Methods: These approaches focus on a semantic parser that
translates questions into intermediate logic forms. These logical forms are then executed
against the KB to retrieve relevant answers [
          <xref ref-type="bibr" rid="ref36">36</xref>
          ].
2. Information Retrieval Based Methods: These methods involve retrieving potential answers
from the topic-entity-centric sub-graph within the KB. Subsequently, the answers retrieved
are ranked based on their relevance to the questions asked [
          <xref ref-type="bibr" rid="ref37">37</xref>
          ].
        </p>
        <p>
          Recently, there has been a notable surge in research interest towards combining these two
approaches for KBQA [
          <xref ref-type="bibr" rid="ref38">38</xref>
          ]. Indeed, the neuro-symbolic reasoning (NSR) approach gains
attention from the eficacy and enhanced interpretability ofered by semantic parsing and its
symbolic backbone and the powerful capabilities of neural networks to capture information
from unstructured data surrounding topics of interest as in information retrieval [
          <xref ref-type="bibr" rid="ref39">39</xref>
          ].
        </p>
        <p>
          When considering KBQA systems for querying DBT data, the semantic parsing of natural
language queries involves identifying IFC concepts, their relationships, and value restrictions.
As the existing ifcOWL ontology does not provide natural language descriptions or labels
for its elements (i.e. IFC concepts), the IFC Natural Language Expression (INLE) ontology
[
          <xref ref-type="bibr" rid="ref40">40</xref>
          ] was developed to add these necessary constructs, therefore supplementing ifcOWL with
natural language representations of IFC concepts, providing notably synonyms, hyponyms,
abbreviations, and morphological variations. Moreover, this INLE ontology has been developed
as a seed ontology for model-specific semantic interpretation of natural language queries,
focusing on all building elements (e.g., IfcWall, IfcDoor, IfcBeam) and 2 types of spatial elements
(IfcBuildingStorey and IfcSpace) from IFC 2x3 TC1, comprising 121 classes, 58 object properties,
446 individuals, and 3071 axioms in all.
        </p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Approaches for interoperability</title>
        <p>
          As seen in section 2.1, SW promotes using standardised vocabularies, ontologies and linked
data principles to enable machine-readable and interoperable information. Still, semantic
heterogeneity is persistent as knowledge representation is done through independent ontologies
with no existing links. Ontologies being formal knowledge models, one can consider standard
ISO 11354-1:2011 [
          <xref ref-type="bibr" rid="ref41">41</xref>
          ] interoperability approaches, i.e. integration, unification, or federation,
for addressing the related semantic interoperability issues.
        </p>
        <p>Integration harmonises elements/systems into a cohesive common whole, ensuring they
conform to a common format for seamless interaction. It prioritises interoperability from
the outset, ideal for new system designs requiring adherence to a common form. Specifically
regarding ontologies, integration involves defining a new ontology and defining outgoing links
to the elements of existing ontologies.</p>
        <p>
          Unification establishes a shared meta-model for semantic equivalence, aiding translation
between diferent models. The meta-model ranges from a foundational reference vocabulary
to an intricate ontology. It maps all alternative models to this common framework, enabling
translation but potentially leading to some loss of information. Studies show it enhances
interoperability among AEC stakeholders [
          <xref ref-type="bibr" rid="ref42 ref43">42, 43</xref>
          ].
        </p>
        <p>
          Federation is decentralised, allowing entities to maintain autonomy without conforming
to a common format. Interoperability is achieved through mappings and agreements despite
diferences in vocabularies or methodologies. Unlike integration and unification, it doesn’t
require a common format or meta-model, allowing each entity to retain autonomy. This
approach suits contexts with diverse or intricate vocabularies and methodologies. As such, it has
higher design and implementation costs and provides the greatest flexibility. FOWLA (Federated
Architecture for OWL Ontologies) is one of the few examples of an ontology federation approach
[
          <xref ref-type="bibr" rid="ref44">44</xref>
          ].
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Our approach</title>
      <sec id="sec-4-1">
        <title>4.1. Overview of the contribution</title>
        <p>
          We aim to enhance the general performance of the approach presented in [
          <xref ref-type="bibr" rid="ref3 ref52">3</xref>
          ] in terms of
precision, recall and F1 score. The overall system architecture is presented in 1 below. We
highlight in green the new elements added in this version compared to the previous one.
        </p>
        <p>The main modifications consist of a new domain ontology OB27AI (detailed in section 4.2)
along with a dependency tree providing a syntactic understanding of the input sentence through
constituency parsing coupled with an NL-enhanced model for grounding query elements
(detailed in section 4.3).</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Knowledge modelling</title>
        <p>To do so, we decided to develop a new domain ontology that would complement it, with input
from domain experts, to have a finer-grain description of the desired subdomains while keeping
the ontology simple enough for existing reasoners to work efectively.</p>
        <p>
          There are many methods to develop a new domain ontology (e.g. TOVE, Seven-Step, SENSUS):
among these methods, the Seven-Step [
          <xref ref-type="bibr" rid="ref45">45</xref>
          ] is chosen as it’s widely used and fits our approach.
It adapts Gruber’s five main principles of ontology construction [
          <xref ref-type="bibr" rid="ref46">46</xref>
          ].
        </p>
        <p>This methodology is iterative, and often, steps are revisited as the development process evolves.
Each step builds on the previous ones, contributing to a comprehensive and robust ontology
that accurately represents the knowledge in a particular domain. Additionally, continuous
validation with domain experts and potential ontology users is essential throughout these steps
to ensure its accuracy and usability.</p>
        <sec id="sec-4-2-1">
          <title>4.2.1. Our domain ontology</title>
          <p>The following paragraphs detail the methodology for conceiving our domain ontology (OB27AI).
According to the Seven-Step methodology, we first determine our ontology’s domain and scope.
We chose to focus on the following subdomains to encompass the DBT and its wider context
and surroundings:
• Structural works: the major works of a building (e.g. foundations, load-bearing walls and
beams).
• Heating, Ventilating and Air Conditioning (HVAC): all the fluids (e.g. water, air, gas)
within a building.
• Highways and miscellaneous external works: outside of a building (e.g. access and
pathways, roads and parking areas, drainage networks, tanks), and the outdoor structural
works (e.g. preparation of the site for foundations, fittings, excavation)
• Electrical works: requirements and layout of electrical equipment, high-voltage (e.g.
electrical power, lighting), low-voltage (e.g. telephone network, computer network), and
ifre safety (e.g. detectors, fire alarms)
• Materials: e.g. concrete, steel, aluminium, wood, cement.
• Woodwork: similar to structural works but specific to wood-based structural elements.
• All trade works: covers all of the above, as well as all the "second works" of a building
(e.g. partitions, non-load-bearing walls, doors and windows, paint, skirting boards).
For each domain above, domain experts enumerate the important terms and structure them into a
taxonomy built top-down from a single high-level concept. They follow simple recommendations
about naming conventions and guidelines regarding the expected structure. A group review
of these taxonomies is then done to help the domain experts finalise them, to ensure the level
of detail is in line with our objectives, and to check there are no duplicates of concepts across
domains. The taxonomies are then processed automatically to fill the ontology. A manual
review is done afterwards to check the consistency of the ontology, adapting it when needed
while not deviating from the taxonomies. As outlined in the Seven-Step methodology, the whole
process is iterative.</p>
          <p>The TBox represents the entities and their relationships on a conceptual level, as shown in
Fig. 2. The classes and the hierarchies we choose are similar to those found in the IFC ontology
(version 2x3 TC1), though the following diferences and additions drive our work:
• More classes are defined to materialise components of larger systems (e.g. door frame)
for finer granularity.
• The properties are simpler and straighter to keep the ontology more human-readable.
• The natural language description of objects is prominent in the "b27:Name" class and its
sub-classes, as it’s a key point to access the model and interact with the DBT.</p>
          <p>
            The "b27:Name" is the root for all natural language representations of the concepts we chose
to focus on in our domain ontology, providing synonyms, hyponyms, hypernyms, abbreviations
and morphological variations, along with a similarity index related to the main (i.e. origin) term
representing a concept. As we aim to go deeper with a finer grain in our chosen subdomains
than the INLE ontology, we rely on this ontology (and its alignments) for a natural language
expression of the broader IFC concepts that may not be covered by our ontology. This approach
has been chosen to ease the interoperability with ifcOWL [
            <xref ref-type="bibr" rid="ref23">23</xref>
            ] and INLE [
            <xref ref-type="bibr" rid="ref40">40</xref>
            ]. We provide an
example snippet of the description of a "b27:Name_DoorPanel", a subclass of "b27:Name" in
Listing 1.
          </p>
          <p>Listing 1: Example of a "b27:Name" subclass</p>
        </sec>
        <sec id="sec-4-2-2">
          <title>4.2.2. Ontology alignment</title>
          <p>Following the Seven-Step methodology, we identified the diferent ontologies we could reuse.
ifcOWL is the most comprehensive ontology for IFC, which, in our approach, is complemented by
INLE for handling natural language. As we create a new ontology, we maintain interoperability
with ifcOWL and INLE through unification for a faster and tighter alignment. To do so, we
embed the shared meta-model in our OB27AI ontology. A future venue would be a federation
approach for interoperability, allowing our ontology to be fully independent, transposing the
existing meta-model into mappings and agreements, and moving it out of our ontology.</p>
          <p>We choose to proceed with alignment based on the internal structures of ifcOWL and INLE,
which allows greater control over the inferences in the context of global reasoning. Our work is
easier because INLE has been built as an extension of ifcOWL. Therefore, they are both already
aligned. The considered ontologies have very few individuals in their original state, i.e., they
are not linked to a model. Therefore, we focus on the schema of these ontologies, i.e., their
classes and properties.</p>
          <p>
            Our methodology has been inspired by Euzenat et al. [
            <xref ref-type="bibr" rid="ref29">29</xref>
            ] and consists of the following steps.
1. Adjust the structure of our ontologies to facilitate easier association.
2. Assess the similarities to establish the matching along these axes:
• Linguistic: considers the textual similarities between entity labels
• Structural: considers the structure of the ontologies, i.e. the classes, their hierarchies,
and their relationships.
          </p>
          <p>• Semantic: considers the meaning of terms by relying primarily on domain experts.
3. Generate the meta-model to be embedded within our OB27AI ontology for most mapping
based on constraints, equivalence, and subsumption. A more complex mapping part
requires establishing inference rules outside our OB27AI ontology.
4. Validate the alignment using reasoning tools for inconsistencies and logical errors. A
manual review is done with the domain experts to finalise the validation.
5. Refine the alignment each time a new taxonomy is incorporated into our OB27AI ontology
by repeating this process on the extended ontology.
6. Integrate the alignments to produce our OB27AI ontology aligned with ifcOWL, and a
modified INLE ontology aligned with ours (Fig. 3).</p>
          <p>As briefly seen on Fig. 3, the INLE ontology has some generic individuals (e.g. "inle:ifcdoor1").
Our OB27AI ontology has such instances for properties, predefined values of properties,
materials, etc., that are shown with more details in Fig. 4</p>
          <p>However, most individuals are related to the names (and their variants, i.e. synonym,
hyponym, lemma, etc.) in natural languages used by the domain experts, including common terms.
Those are used to attach labels to the objects (e.g. door panel, door leaf), their properties (e.g.
door aperture), the values of properties (e.g. automatic or manual door operation), etc.</p>
          <p>As illustrated in Fig. 5, based on the labelling done in a building model (e.g. "Ridge beam"),
the precise type of object (e.g. RidgeBeam) from our ontology is retrieved for an individual
that is otherwise typed as a broader IFC class (e.g. IfcBeam). This process allows us to add
constraints of all kinds (e.g. material, dimension, position, property, etc.) on these individuals,
enabling us to reach our goal for a fine-grained description within the subdomains we chose.</p>
          <p>We provide an example snippet another example of alignment around "b27:Door" in Listing 2.</p>
          <p>Listing 2: Example of an OB27AI alignment
The complete list of alignments can be found as a link set on the GitHub - donner le lien.</p>
          <p>The complete definition of these alignments represents ongoing work and goes beyond the
scope of this article. Besides corresponding to a correct application of Linked Data principles,
such alignments provide the advantage of inheriting from properties and constraints defined in
the other ontologies, thus allowing us to further reason on DBT knowledge. For example,
aligning our classes with the BOT ontology allows us to take advantage of the diferent topological
properties defined in BOT (Building Topology Ontology) ref - https://w3c-lbd-cg.github.io/bot/,
thus allowing us to decompose the DBT into diferent spaces. Additionally, such alignments
allow other ontologies to reuse the natural language annotation of building elements, as permitted
by the B27AI ontology.</p>
        </sec>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Natural Language (NL) querying</title>
        <p>
          In our previous approach [
          <xref ref-type="bibr" rid="ref3 ref52">3</xref>
          ], we built a KBQA system based on semantic analysis methodology,
where the named entity recognition (NER) phase is a mixture of neural networks to decompose
the user’s query and symbolic reasoning to ground entities in the KG.
        </p>
        <p>To improve our KBQA system, diferent approaches for the NER phase needed to be tested. The
parsing of the input sentence is done on the semantic level using a convolutional neural network
(CNN), providing a dependency tree where nodes represent the words, and the directed edges
represent dependencies between the words. Each dependency is labelled with a relationship
type that describes the nature of the dependency (e.g., subject, object, modifier).</p>
        <p>We aim to test adding a syntactic understanding of the input sentence through constituency
parsing, where the output reflects the syntactic structure of a sentence in terms of a hierarchical
tree of phrases, sub-phrases, and words.</p>
        <p>
          Additionally, we aim to test diferent types of neural networks widely used for NLU. Indeed,
as mentioned in section 2.4, Transformer neural networks have gained considerable attention
in NLP-related tasks, as they show superior performance in handling of context and sequence
than CNNs and recurrent neural networks (RNNs) [
          <xref ref-type="bibr" rid="ref20 ref69">20</xref>
          ], due to their attention mechanisms and
ability to model bidirectional contexts eficiently [
          <xref ref-type="bibr" rid="ref47">47</xref>
          ].
        </p>
        <p>Our KBQA system outputs the items in the DBT corresponding to a query. This raw dry
output contrasts sharply with the user’s natural language query. Therefore, we decide to use
NLG to represent the output in a more user-friendly way, testing template-based sentence
generation and pre-trained language models (PLM) for text generation.</p>
        <p>In both the NLU and NLG tasks, our goal is to find the optimal balance between the relevance
of the output and the speed to obtain it.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Implementation</title>
      <p>To build and maintain our ontology and its alignments, we use Protégé12, an open-source tool
from Stanford University that facilitates creating, maintaining, and editing ontologies. It ofers
a graphical interface for modelling ontologies using OWL and supports testing inferences and
running SPARQL queries.</p>
      <sec id="sec-5-1">
        <title>5.1. Ontology’s population</title>
        <p>Another output built from the work done by the domain experts is embedding properties
and constraints into the DBT. Specifically, tools such as Autodesk Revit 13, ofer the capability to
add custom fields, predefined values, and sometimes even some basic rules (e.g. EXAMPLE) to
the model. This constitutes the preparatory step for practitioners to build and/or obtain a DBT
enriched with our domain ontology.</p>
        <p>To populate our KGs, the first step is to create an IFC-based repository in our graph DB
(DataBase), loaded with the ifcOWL (IFC 2x3 TC1) ontology from the oficial file (using the
turtle serialization format). The DBT is exported to an IFC raw file (STEP format), which is
then transformed into an RDF file (using IFCtoRDF) to allow the model to be imported into
the IFC-based repository of the graph DB. A small duplex apartment building contains 454,359
statements (364,216 explicit and 90,143 inferred) and has an expansion ratio of 1.25.</p>
        <p>The second step is to create an NL-based repository in the graph DB, loaded with our OB27AI
ontology aligned with IFC and the INLE ontology aligned with our ontology: both are imported
from their respective RDF files. A small duplex apartment building contains 34,727 statements
(18,856 explicit and 15,871 inferred) and has an expansion ratio of 1.84.</p>
        <p>The third step is to build and fill an NL-based representation of the DBT from the IFC-based
repository. Once our OB27AI ontology (aligned with IFC) is merged with the IFC-based graph
representation of the model, the entities and concepts of interest are filtered based on the INLE
ontology (aligned with ours), to fill the NL-based repository with the NL-enhanced model.</p>
        <p>These three steps, shown in Fig. 6, are done using Python scripts and the GraphDB14 APIs
(Application Programming Interfaces).</p>
        <p>GraphDB, edited by OntoText, is our chosen graph database engine and knowledge
management tool, equipped with reasoning and inferencing capabilities and compliant with RDF and
SPARQL.</p>
        <p>Python is selected as the programming language due to its versatility and the extensive array
of libraries and frameworks available, both third-party and native, in the relevant areas.</p>
        <p>The owlready215 module, licensed under GNU LGPL v3, is a Python module for loading,
browsing, and modifying ontologies, as well as for searching, inferencing, and executing SPARQL
queries.</p>
        <p>The minimalist and lightweight Flask16 framework is employed for developing web services
in Python. It provides the essential elements required to create lightweight web applications.</p>
        <p>Conda17 is an open-source package manager that creates isolated Python environments, each
containing packages and dependencies specific to various tools and experiments.</p>
        <p>Project management and sharing are facilitated through GitHub, Microsoft’s online platform
that uses the Git version control system.</p>
        <p>Microsoft’s Visual Studio Code (VS Code) is the source code editor. It is lightweight, fast, and
versatile, with particularly strong integration of Python, Conda, and Git.</p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. NL querying</title>
        <p>Our approach for elaborating an NL answer from an NL query comprises 7 steps and is shown
in Fig. 7.</p>
        <p>The "Analyse" step uses the user’s question as input, triggering multiple NLU analyses to
obtain one dependency and one constituency tree combined into a single tree representing the
query.</p>
        <p>The "Parse" step traverses the dependency and constituency tree to recognise potential entities.
It organises them in branches of diferent lengths, following the overall structure of the input
tree to generate an entity tree. This entity tree comprises all the possible branches between two
entities, including their alternatives above a threshold based on semantic similarities with the
terms from the origin sentence.</p>
        <p>The "Ground" step traverses the entity tree to look for the entities in the NL KG while also
searching for potential links between such entities in this KG. These grounded entities and links
(aka grounded triples) are stored in a tree structurally identical to the one of the entity tree,
though lighter as some branches have been pruned, either for the lack of possible relationships
between entities or because their local relevance dropped below the threshold to keep them.
The value for local relevance is computed on semantic similarities of entities and distances
between classes and domains or ranges.</p>
        <p>The "Extract" step explores the grounded triples, expands them in local SPARQL queries to be
executed against the IFC KG, and ranks them against the triple score computed for the pruning
in the previous step. The goal is to prune the grounded tree further when a query does not
return a result, except for comparison (e.g., walls higher than 3m). Each branch’s best constraint
(aka triples grounded in the IFC KG) is kept for the output list.</p>
        <p>The "Generate" step stitches the constraints together in a shallow tree-like structure to
account for the potential hierarchy of sub-queries to form the final SPARQL query.</p>
        <p>The "Execute" step executes the final SPARQL query against the IFC KG and collects the raw
list of IFC results.</p>
        <p>The "Transform" step parses the raw list of IFC results to obtain the labels and types of
the objects from the DBT so that very simple sentences can be created automatically using
templates.</p>
        <p>These sentences can be presented to the user, though more importantly, they can be used,
in combination with the user’s query for context, to generate more complex sentences using
PLMs (introduced in section 2.4).</p>
        <p>As input to these models, either the keywords from the questions and the answers are
provided, or the question along the template-generated answer and an instruction. For example,
with the question "List the exterior doors with a width larger than 1m." (#21 in appendix A),
there are 2 items in the ground truth:
• The raw keywords are "2 door is external has property width larger than 1 meter".
• The template-based generated answer is "There are 2 items of type door. These items are
’Door_6597’ and ’Door_6702’.".
• For models that can incorporate prompts (i.e. Falcon, GPT, Flan-T5), the full input is then:
"Question: List the exterior doors with a width larger than 1m.</p>
        <p>Answer: There are 2 items of type door. These items are ’Door_6597’ and ’Door_6702’.</p>
        <p>Generate a sentence to answer the question:"</p>
        <p>For NLU tasks to understand the user’s query, the primary open-source library we used is
spaCy18, renowned for its high execution speed and precision in processing text. Available for
Python, spaCy is designed to handle various NLP tasks, including tokenization, lemmatization,
parsing, and building dependency trees.</p>
        <p>
          For the specialised and precise task of constructing constituency trees that represent questions,
the BeNePar19 (Berkeley Neural Parser) is used for its exceptional eficiency [
          <xref ref-type="bibr" rid="ref48">48</xref>
          ]. Originally
developed at the University of Berkeley, this parser is designed to conduct syntactic analysis of
sentences using deep neural networks, enhancing the accuracy of such analyses. It captures
complex syntactic dependencies, enabling precise analysis in multiple languages. BeNePar is
integrated with spaCy.
        </p>
        <p>Hugging Face20 provides open-source libraries that ofer easy access to various pre-trained
models for diverse NLP tasks alongside a platform for sharing models and datasets.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Results &amp; evaluation</title>
      <p>The tests were conducted on the two versions of our KBQA system, using a set of 22 questions
like "Search for all the load bearing beams with slope equal to 0." (see appendix A for full list),
the ground truth having been manually extracted from the DBT.</p>
      <sec id="sec-6-1">
        <title>6.1. KBQA performance</title>
        <p>When the first version was evaluated against this new set of questions, longer and more complex,
the recall dropped from 95.0% to 50.5%, and the F1 score from 0.974 to 0.671, mostly because
this version has been using the small spaCy CNN model "en_core_web_sm", a model 35 times
smaller than its transformer counterpart.</p>
        <p>The two versions in our tests (see Table 1) have been configured to use the same NLP neural
network from the spaCy transformer library ("en_core_web_trf"). The tests were performed on
a computer with an Intel ® i5 CPU (2.5 GHz), 16 GB RAM, and the Windows 10 64-bit system.</p>
        <p>Our precision is lower by 2.7 points due to many false positives in a single question (#11 in
appendix A). The recall is improved by 18.6 points, while the F1 score, more balanced, shows an
improvement of 8.9 points.</p>
        <p>
          NSR-BIM v1 [
          <xref ref-type="bibr" rid="ref3 ref52">3</xref>
          ]
        </p>
        <p>NSR-BIM v2 (Ours)</p>
        <p>Variation (%)
Recall
Precision
F1 score
+10</p>
        <p>The time taken for knowledge data preparation, shown in Table 2, is drastically reduced by
44%, mostly thanks to the use of a graph database instead of file-based storage.</p>
        <p>NSR-BIM v2 (Ours)</p>
        <p>
          Variation (%)
NSR-BIM v1 [
          <xref ref-type="bibr" rid="ref3 ref52">3</xref>
          ]
        </p>
        <p>NSR-BIM v2 (Ours)</p>
        <p>Variation (%)
The query processing time, shown in Table 3, is further detailed below.</p>
        <p>The step #1 is slightly longer as a constituency tree is produced in version 2 while not in
version 1, although most of the time taken in the analysis of the question is linked to the
NLP model used ("en_core_web_trf"): using a smaller and simpler model ("en_core_web_sm")
results in a duration of 0.01s instead of 0.34s, though the trees obtained are syntactically and
semantically too weak and faulty when longer and more complex sentences are used.</p>
        <p>Step #2 in version 2 is 8 times longer (0.08s vs 0.01s) than in version 1 because two trees
have to be parsed and combined, and mainly because semantic similarities against terms in the
ontologies are computed at this stage.</p>
        <p>The step #3 is new to version 2, as no grounding against KG was done in version 1. When
this step was first added to the first version, it took 3.52s on average, more than twice the
current time, as the existing tree traversing algorithm was not designed to handle that many
tree branches and triples.</p>
        <p>The step #4 is more than 4 times longer, as there are more extracted constraints, each tested
against the IFC KG in version 2. Those constraints were taken as they were without further
pruning in version 1: when introducing constraints testing against the KG in version 1, the
average time climbed to 0.75s as too many constraints were tested.</p>
        <p>Step #5 is twice as fast in version 2, as step #4 already performed the expansion of constraints
into SPARQL sub-queries to enable executing them in the IFC KG.</p>
        <p>The step #6 is almost 20 times faster, as version 2 uses a graph database instead of files.</p>
        <p>Step #7 is new to version 2: the time measures have been done using template-based answer
generation, not transformer-based PLM.</p>
        <p>The overall time to process a query has doubled on average in version 2, largely due to
the intermediate grounding of the whole tree against both NL and IFC KGs. For example, the
question "List the exterior doors with a width larger than 1m." (#21 in appendix A), which used
to have 2 branches with 12 triples in the first version, has 6 branches with 57 triples in the
second version, each of them being tentatively grounded in the KGs in this second version.</p>
        <p>Nevertheless, in version 1, this question #21 found no answers as the 2 branches completely
separated "width" from "door": both branches were directly and separately attached to the
root. This issue, widespread in test questions, was fixed by creating a larger and deeper tree
representing the question: in the case of question #21, some branches were now linking both
concepts of "width" and "door". However, having larger and deeper trees required better pruning
to avoid the rise of wrong answers, hence the grounding and extraction steps (#3 and #4) that
would test the branches and their triples against NL and IFC KGs.</p>
        <p>Another example of a question that was failing in version 1 was "What are the dimensions of
the door frame of all the building’s emergency exits?". This question is not part of the test set
because version 1 could not possibly find its answer: indeed, this is the kind of question where
a concept (e.g. "door frame") doesn’t exist explicitly in the model. Therefore, such a concept is
inferred through our OB27AI ontology aligned with IFC, resulting in finding a relevant answer
with our second version.</p>
        <p>Regardless of implementation details, one major venue to improve the execution speed could
be to use Cython, which allows compiling Python code in C or C++ and the direct call to
functions written in C or C++. Another major venue could be loading the NLP neural network
model to a GPU. There is also room for improvement in the grounding queries made for each
possible triple.</p>
        <p>
          A contextual performance comparison between our first version and MOP-SP [
          <xref ref-type="bibr" rid="ref40">40</xref>
          ], which is
outside our scope here, shall be found in [
          <xref ref-type="bibr" rid="ref3 ref52">3</xref>
          ].
        </p>
      </sec>
      <sec id="sec-6-2">
        <title>6.2. NLG evaluation</title>
        <p>To generate a user-friendly answer, we tested 5 models of Transformer-based PLM (see Table 4),
available on-premises (as opposed to cloud-based), with both qualitative (rated "very poor",
"poor", "good" or "very good") and quantitative (typical time to generate the answer) criteria.
When possible, the prompt and the question in full were added as input to the template-based
generated sentence.</p>
        <p>For example, with the question "List the exterior doors with a width larger than 1m." the
template-based generated answer is "There are 2 items of type door. These items are ’Door_6597’
and ’Door_6702’." and the PLM generated answers range from "Two door width more than 1
meter, exterior." (very poor) to "There are two exterior doors with a width larger than 1 meter,
’Door_6597’ and ’Door_6702’." (very good).</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>7. Conclusion and future work</title>
      <p>Integrating multiple knowledge sources and standards becomes increasingly imperative as the
construction industry progresses into the digital era. This fusion blurs traditional distinctions
between domains and emphasises the necessity of interoperability to facilitate seamless
information sharing across disciplines. Moreover, the advent of Digital Building Twins (DBTs)
represents a pivotal advancement in modern construction processes, enabling real-time data
integration and supporting multiple architectural development phases. Our research contributes
to this evolving landscape by presenting an approach that enhances the accessibility and
interpretability of DBT data through natural language queries. By leveraging a domain-specific
ontology and advanced AI techniques, our methodology facilitates eficient communication
between humans and DBTs, allowing users to extract specific building details rapidly and
accurately. Our comprehensive approach, encompassing knowledge representation, semantic
analysis, and information extraction, represents a significant step forward in advancing the
capabilities of DBT querying systems.</p>
      <p>
        Future work Several strategies can be implemented when considering future work. Firstly,
grounding can be improved by setting a trust threshold to prevent systematic grounding.
Secondly, the number of NLP models loaded should be limited; for example, one could be
employed for constituency/dependency tree (NLU) and another for answer generation (NLG).
Additionally, maintaining context appears needed when accommodating multiple questions
about the same topic or object of interest. Further, extracting and sorting partial results can
provide explanations or clues leading to the final answer. To broaden the scope and robustness
of the system, an idea would be to expand test sets with questions sourced from practitioners
across diverse domains, along with generating various syntactic and semantic variations for
each. Lastly, as mentioned in section 4.2.2, aligning the OB27AI ontology with more existing
ontologies can contribute to its efectiveness and relevance across diferent domains while
enhancing the interpretability of the DBT, i.e. taking advantage of the topological knowledge
present in ontologies such as BOT (Building Ontology Topology) [
        <xref ref-type="bibr" rid="ref49">49</xref>
        ] or geoSPARQL26.
      </p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgments</title>
      <p>We want to express our gratitude to B27-AI for their financial and material contributions to this
study and to the French National Agency of Research and Technology ANRT for their CIFRE
subvention. Our thanks equally go to the Faculty of Science and Technology at the University
of Burgundy for their academic assistance and to the anonymous reviewers for their valuable
feedback.
To evaluate the versions of our KBQA system, we used the following list of questions:</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Roxin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Abdou</surname>
          </string-name>
          , W. Derigent,
          <article-title>Interoperable Digital Building Twins Through Communicating Materials</article-title>
          and
          <string-name>
            <surname>Semantic</surname>
            <given-names>BIM</given-names>
          </string-name>
          ,
          <source>SN COMPUT. SCI. 3</source>
          (
          <year>2021</year>
          )
          <article-title>23</article-title>
          . URL: https://doi.org/10.1007/s42979-021-00860-w. doi:
          <volume>10</volume>
          .1007/s42979-021-00860-w.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>O. A.</given-names>
            <surname>Al-Mufti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O. A.</given-names>
            <surname>Al-Isawi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. H.</given-names>
            <surname>Amirah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Ghenai</surname>
          </string-name>
          ,
          <article-title>Digital Twinning and ANN-based Forecasting Model for Building Energy Consumption</article-title>
          , in: 2023
          <source>Advances in Science and Engineering Technology International Conferences (ASET)</source>
          ,
          <year>2023</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          . URL: https: //ieeexplore.ieee.org/document/10180899. doi:
          <volume>10</volume>
          .1109/ASET56582.
          <year>2023</year>
          .
          <volume>10180899</volume>
          , iSSN:
          <fpage>2831</fpage>
          -
          <lpage>6878</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Reynaud</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Dumas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Roxin</surname>
          </string-name>
          ,
          <article-title>Neuro-symbolic approach for querying BIM models</article-title>
          ,
          <source>in: 2023 17th International Conference on Signal-Image Technology &amp; Internet-Based Systems (SITIS)</source>
          ,
          <year>2023</year>
          , pp.
          <fpage>62</fpage>
          -
          <lpage>69</lpage>
          . URL: https://ieeexplore.ieee.org/document/10472789. doi:
          <volume>10</volume>
          .1109/SITIS61268.
          <year>2023</year>
          .
          <volume>00019</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Stellato</surname>
          </string-name>
          , Dictionary, Thesaurus or Ontology?
          <article-title>Disentangling Our Choices in the Semantic Web Jungle</article-title>
          ,
          <source>Journal of Integrative Agriculture</source>
          <volume>11</volume>
          (
          <year>2012</year>
          )
          <fpage>710</fpage>
          -
          <lpage>719</lpage>
          . URL: https://www.sciencedirect.com/science/article/pii/S2095311912600604. doi:
          <volume>10</volume>
          . 1016/S2095-
          <volume>3119</volume>
          (
          <issue>12</issue>
          )
          <fpage>60060</fpage>
          -
          <lpage>4</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <source>[5] ISO</source>
          <volume>23386</volume>
          :
          <year>2020</year>
          ,
          <article-title>Building information modelling and other digital processes used in construction - Methodology to describe, author and maintain properties in interconnected data dictionaries</article-title>
          ,
          <year>2020</year>
          . URL: https://www.iso.org/obp/ui/en/#iso:std:iso:23386:ed-1:v1:
          <fpage>en</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <source>[6] ISO 16739-1</source>
          :
          <fpage>2024</fpage>
          ,
          <string-name>
            <given-names>Industry</given-names>
            <surname>Foundation</surname>
          </string-name>
          <article-title>Classes (IFC) for data sharing in the construction and facility management industries</article-title>
          ,
          <year>2024</year>
          . URL: https://www.iso.org/standard/84123.html.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>T. M. d. Farias</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Roxin</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Nicolle</surname>
          </string-name>
          ,
          <article-title>A rule-based methodology to extract building model views</article-title>
          ,
          <source>Automation in Construction</source>
          <volume>92</volume>
          (
          <year>2018</year>
          )
          <fpage>214</fpage>
          -
          <lpage>229</lpage>
          . URL: https://www.sciencedirect. com/science/article/pii/S0926580517306738. doi:
          <volume>10</volume>
          .1016/j.autcon.
          <year>2018</year>
          .
          <volume>03</volume>
          .035.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>P.</given-names>
            <surname>Pauwels</surname>
          </string-name>
          ,
          <string-name>
            <surname>D. Van Deursen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Verstraeten</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. De Roo</surname>
            , R. De Meyer, R. Van de Walle,
            <given-names>J. Van Campenhout</given-names>
          </string-name>
          ,
          <article-title>A semantic rule checking environment for building performance checking</article-title>
          ,
          <source>Automation in Construction</source>
          <volume>20</volume>
          (
          <year>2011</year>
          )
          <fpage>506</fpage>
          -
          <lpage>518</lpage>
          . URL: https://www.sciencedirect. com/science/article/pii/S0926580510001962. doi:
          <volume>10</volume>
          .1016/j.autcon.
          <year>2010</year>
          .
          <volume>11</volume>
          .017.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>C.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , J. Beetz, d. Vries,
          <article-title>BimSPARQL: Domain-specific functional SPARQL extensions for querying RDF building data</article-title>
          ,
          <source>Semantic Web</source>
          (
          <year>2018</year>
          )
          <fpage>1</fpage>
          -
          <lpage>27</lpage>
          . doi:
          <volume>10</volume>
          .3233/SW-180297.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <source>[10] ISO 29481-1</source>
          :
          <fpage>2016</fpage>
          ,
          <article-title>Building information models - Information delivery manual - Part 1: Methodology and format, 2016</article-title>
          . URL: https://www.iso.org/obp/ui/en/#iso:std:iso:29481:-1: ed-2:v1:
          <fpage>en</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <source>[11] ISO 29481-3:2022, Building information models - Information delivery manual - Part</source>
          <volume>3</volume>
          :
          <string-name>
            <surname>Data</surname>
            <given-names>schema</given-names>
          </string-name>
          ,
          <year>2022</year>
          . URL: https://www.iso.org/obp/ui/en/#iso:std:iso:29481:-3:ed-1:v1:
          <fpage>en</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12] ISO/IEC 2382:
          <year>2015</year>
          , Information technology - Vocabulary,
          <year>2015</year>
          . URL: https://www.iso. org/obp/ui/en/#iso:std:iso-iec:2382:ed-1:v2:
          <fpage>en</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>H.</given-names>
            <surname>Kubicek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cimander</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. J.</given-names>
            <surname>Scholl</surname>
          </string-name>
          , Layers of Interoperability, in: Organizational Interoperability in E-Government: Lessons from 77 European
          <string-name>
            <surname>Good-Practice</surname>
            <given-names>Cases</given-names>
          </string-name>
          , Springer Berlin Heidelberg, Berlin, Heidelberg,
          <year>2011</year>
          , pp.
          <fpage>85</fpage>
          -
          <lpage>96</lpage>
          . URL: https://doi.org/10.1007/ 978-3-
          <fpage>642</fpage>
          -22502-
          <issue>4</issue>
          _7. doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>642</fpage>
          -22502-
          <issue>4</issue>
          _
          <fpage>7</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14] ISO/IEC 20944-1:
          <fpage>2013</fpage>
          , Information technology - Metadata Registries Interoperability and
          <string-name>
            <surname>Bindings (MDR-IB</surname>
          </string-name>
          )
          <article-title>- Part 1: Framework, common vocabulary, and common provisions for conformance</article-title>
          ,
          <year>2013</year>
          . URL: https://www.iso.org/obp/ui/en/#iso:std:iso-iec:
          <volume>20944</volume>
          :-1:ed-1: v1:
          <fpage>en</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <source>[15] ISO</source>
          <volume>22378</volume>
          :
          <year>2022</year>
          ,
          <article-title>Security and resilience - Authenticity, integrity and trust for products and documents - Guidelines for interoperable object identification and related authentication systems to deter counterfeiting and illicit trade</article-title>
          ,
          <year>2022</year>
          . URL: https://www.iso.org/obp/ui/en/ #iso:std:iso:22378:ed-1:v1:
          <fpage>en</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16] ISO/IEC 19941:
          <year>2017</year>
          , Information technology -
          <source>Cloud computing - Interoperability and portability</source>
          ,
          <year>2017</year>
          . URL: https://www.iso.org/obp/ui/en/#iso:std:iso-iec:19941:ed-1:v1:
          <fpage>en</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <source>[17] ISO</source>
          <volume>18308</volume>
          :
          <year>2011</year>
          ,
          <article-title>Health informatics - Requirements for an electronic health record architecture</article-title>
          ,
          <year>2011</year>
          . URL: https://www.iso.org/obp/ui/en/#iso:std:iso:18308:ed-1:v1:
          <fpage>en</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>D.</given-names>
            <surname>Walasek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Barszcz</surname>
          </string-name>
          ,
          <article-title>Analysis of the Adoption Rate of Building Information Modeling</article-title>
          [BIM]
          <article-title>and its Return on Investment [ROI]</article-title>
          ,
          <source>Procedia Engineering</source>
          <volume>172</volume>
          (
          <year>2017</year>
          )
          <fpage>1227</fpage>
          -
          <lpage>1234</lpage>
          . URL: https://linkinghub.elsevier.com/retrieve/pii/S1877705817306501. doi:
          <volume>10</volume>
          .1016/j.proeng.
          <year>2017</year>
          .
          <volume>02</volume>
          .144.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>S. J.</given-names>
            <surname>Russell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Norvig</surname>
          </string-name>
          , Artificial Intelligence:
          <string-name>
            <given-names>A Modern</given-names>
            <surname>Approach</surname>
          </string-name>
          , Pearson,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>A.</given-names>
            <surname>Vaswani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Shazeer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Parmar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Uszkoreit</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. N.</given-names>
            <surname>Gomez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Kaiser</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Polosukhin</surname>
          </string-name>
          , Attention is All you Need, in: I. Guyon,
          <string-name>
            <given-names>U. V.</given-names>
            <surname>Luxburg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bengio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wallach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Fergus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Vishwanathan</surname>
          </string-name>
          , R. Garnett (Eds.),
          <source>Advances in Neural Information Processing Systems</source>
          , volume
          <volume>30</volume>
          ,
          <string-name>
            <surname>Curran</surname>
            <given-names>Associates</given-names>
          </string-name>
          , Inc.,
          <year>2017</year>
          . URL: https://proceedings.neurips.cc/ paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>D.</given-names>
            <surname>Khurana</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Koli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Khatter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <article-title>Natural language processing: state of the art, current trends and challenges</article-title>
          ,
          <source>Multimed Tools Appl</source>
          <volume>82</volume>
          (
          <year>2023</year>
          )
          <fpage>3713</fpage>
          -
          <lpage>3744</lpage>
          . URL: https: //link.springer.
          <source>com/10.1007/s11042-022-13428-4</source>
          . doi:
          <volume>10</volume>
          .1007/s11042-022-13428-4.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>H. K.</given-names>
            <surname>Skrodelis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Romanovs</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Zenina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Gorskis</surname>
          </string-name>
          ,
          <article-title>The Latest in Natural Language Generation: Trends, Tools and Applications in Industry</article-title>
          ,
          <source>in: 2023 IEEE 10th Jubilee Workshop on Advances in Information, Electronic and Electrical Engineering (AIEEE)</source>
          ,
          <year>2023</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>5</lpage>
          . URL: https://ieeexplore.ieee.org/document/10134841. doi:
          <volume>10</volume>
          .1109/AIEEE58915.
          <year>2023</year>
          .
          <volume>10134841</volume>
          , iSSN:
          <fpage>2689</fpage>
          -
          <lpage>7342</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>P.</given-names>
            <surname>Pauwels</surname>
          </string-name>
          , W. Terkaj,
          <article-title>EXPRESS to OWL for construction industry: Towards a recommendable and usable ifcOWL ontology</article-title>
          ,
          <source>Automation in Construction</source>
          <volume>63</volume>
          (
          <year>2016</year>
          )
          <fpage>100</fpage>
          -
          <lpage>133</lpage>
          . URL: https://www.sciencedirect.com/science/article/pii/S0926580515002435. doi:
          <volume>10</volume>
          .1016/j. autcon.
          <year>2015</year>
          .
          <volume>12</volume>
          .003.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>W.</given-names>
            <surname>Terkaj</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Pauwels</surname>
          </string-name>
          ,
          <article-title>A Method to Generate a Modular ifcOWL Ontology</article-title>
          ,
          <year>2017</year>
          . URL: https://www.semanticscholar.org/paper/ A-Method
          <article-title>-to-Generate-a-Modular-ifcOWL-</article-title>
          <string-name>
            <surname>Ontology-</surname>
          </string-name>
          Terkaj-Pauwels/
          <year>1c6411c9f9295dfc9503ab1a9e0055cead6804f3</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>T. M. Farias</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Roxin</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <article-title>Nicolle, IfcWoD, Semantically Adapting IFC Model Relations into OWL Properties, ArXiv (</article-title>
          <year>2015</year>
          ). URL: https://www.semanticscholar. org/paper/IfcWoD%2C-
          <string-name>
            <surname>Semantically-Adapting-IFC-Model-Relations-</surname>
          </string-name>
          Farias-Roxin/
          <year>8161912ecf67fd6d3f94d0cb0754f77f335ce56</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>M.</given-names>
            <surname>Bonduel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Oraskari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Pauwels</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Vergauwen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Klein</surname>
          </string-name>
          ,
          <article-title>The IFC to linked building data converter - current status</article-title>
          ,
          <year>2018</year>
          . URL: https://www.semanticscholar. org/paper/
          <article-title>The-IFC-to-linked-building-data-converter-current-Bonduel-Oraskari/ 945040b77455d09b3f37f975fb18ab12624982d6.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>P.</given-names>
            <surname>Pauwels</surname>
          </string-name>
          ,
          <string-name>
            <surname>A</surname>
          </string-name>
          . Roxin,
          <article-title>SimpleBIM: From full ifcOWL graphs to simplified building graphs</article-title>
          ,
          <source>in: eWork and eBusiness in Architecture, Engineering and Construction: ECPPM</source>
          <year>2016</year>
          , CRC Press,
          <year>2016</year>
          . Num Pages:
          <fpage>8</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>M.</given-names>
            <surname>Bonduel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Wagner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Pauwels</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Vergauwen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Klein</surname>
          </string-name>
          ,
          <article-title>Including widespread geometry formats in semantic graphs using RDF literals</article-title>
          ,
          <source>in: Proceedings of the 2019 European Conference for Computing in Construction, European Council on Computing in Construction</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>341</fpage>
          -
          <lpage>350</lpage>
          . URL: http://hdl.handle.net/1854/LU-8633665. doi:
          <volume>10</volume>
          . 35490/ec3.
          <year>2019</year>
          .
          <volume>166</volume>
          , iSSN:
          <fpage>2684</fpage>
          -
          <lpage>1150</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>J.</given-names>
            <surname>Euzenat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Shvaiko</surname>
          </string-name>
          , Ontology Matching, Springer, Berlin, Heidelberg,
          <year>2013</year>
          . URL: https: //link.springer.com/10.1007/978-3-
          <fpage>642</fpage>
          -38721-0. doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>642</fpage>
          -38721-0.
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>C. T.</given-names>
            <surname>Hoyt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Konotopez</surname>
          </string-name>
          , C. Ebeling,
          <article-title>PyBEL: a computational framework for Biological Expression Language</article-title>
          ,
          <source>Bioinformatics</source>
          <volume>34</volume>
          (
          <year>2018</year>
          )
          <fpage>703</fpage>
          -
          <lpage>704</lpage>
          . URL: https://doi.org/10.1093/ bioinformatics/btx660. doi:
          <volume>10</volume>
          .1093/bioinformatics/btx660.
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>X.</given-names>
            <surname>Xue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhou</surname>
          </string-name>
          , G. Mao,
          <string-name>
            <given-names>H.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <article-title>Matching heterogeneous ontologies with adaptive evolutionary algorithm</article-title>
          ,
          <source>Connection Science</source>
          <volume>34</volume>
          (
          <year>2022</year>
          )
          <fpage>811</fpage>
          -
          <lpage>828</lpage>
          . URL: https://doi. org/10.1080/09540091.
          <year>2021</year>
          .
          <volume>1991278</volume>
          . doi:
          <volume>10</volume>
          .1080/09540091.
          <year>2021</year>
          .
          <volume>1991278</volume>
          , publisher: Taylor &amp; Francis _eprint: https://doi.org/10.1080/09540091.
          <year>2021</year>
          .
          <volume>1991278</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>F.</given-names>
            <surname>Ardjani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Bouchiha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Malki</surname>
          </string-name>
          ,
          <string-name>
            <surname>Ontology-Alignment</surname>
            <given-names>Techniques</given-names>
          </string-name>
          :
          <article-title>Survey and Analysis</article-title>
          , IJMECS
          <volume>7</volume>
          (
          <year>2015</year>
          )
          <fpage>67</fpage>
          -
          <lpage>78</lpage>
          . URL: http://www.mecs-press.org/ijmecs/ijmecs-v7
          <source>-n11/v7n11-8</source>
          . html. doi:
          <volume>10</volume>
          .5815/ijmecs.
          <year>2015</year>
          .
          <volume>11</volume>
          .08.
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>M.</given-names>
            <surname>Gulić</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Magdalenić</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Vrdoljak</surname>
          </string-name>
          ,
          <article-title>Ontology Matching Using TF/IDF Measure with Synonym Recognition</article-title>
          , in: T. Skersys,
          <string-name>
            <given-names>R.</given-names>
            <surname>Butleris</surname>
          </string-name>
          , R. Butkiene (Eds.),
          <source>Information and Software Technologies</source>
          , Springer, Berlin, Heidelberg,
          <year>2013</year>
          , pp.
          <fpage>22</fpage>
          -
          <lpage>33</lpage>
          . doi:
          <volume>10</volume>
          .1007/ 978-3-
          <fpage>642</fpage>
          -41947-
          <issue>8</issue>
          _
          <fpage>3</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>S.</given-names>
            <surname>Zghal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. Ben</given-names>
            <surname>Yahia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. Mephu</given-names>
            <surname>Nguifo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Slimani</surname>
          </string-name>
          , SODA :
          <article-title>Une approche structurelle pour l'alignement d'ontologies OWL-DL</article-title>
          , in: B.
          <string-name>
            <given-names>D. e. B. P.</given-names>
            <surname>Gargouri Faïez</surname>
          </string-name>
          (Ed.),
          <source>1ères Journées Francophones sur les Ontologies (JFO'07)</source>
          , Sousse, Tunisia,
          <year>2007</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>20</lpage>
          . URL: https://hal.science/hal-00445724.
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>Y.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Dong</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Horrocks</surname>
          </string-name>
          ,
          <article-title>Exploring Large Language Models for Ontology Alignment</article-title>
          ,
          <source>in: Posters and Demos ISWC</source>
          <year>2023</year>
          , Athens, Greece,
          <year>2023</year>
          . URL: http://arxiv. org/abs/2309.07172, arXiv:
          <fpage>2309</fpage>
          .07172 [cs].
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>J.</given-names>
            <surname>Berant</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Chou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Frostig</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Liang</surname>
          </string-name>
          ,
          <article-title>Semantic Parsing on Freebase from QuestionAnswer Pairs</article-title>
          , in: D.
          <string-name>
            <surname>Yarowsky</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Baldwin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Korhonen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Livescu</surname>
          </string-name>
          , S. Bethard (Eds.),
          <source>Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing</source>
          , Association for Computational Linguistics, Seattle, Washington, USA,
          <year>2013</year>
          , pp.
          <fpage>1533</fpage>
          -
          <lpage>1544</lpage>
          . URL: https://aclanthology.org/D13-1160.
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>A.</given-names>
            <surname>Bordes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Chopra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Weston</surname>
          </string-name>
          ,
          <article-title>Question Answering with Subgraph Embeddings</article-title>
          , in: A.
          <string-name>
            <surname>Moschitti</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Pang</surname>
          </string-name>
          , W. Daelemans (Eds.),
          <source>Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)</source>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          , Doha, Qatar,
          <year>2014</year>
          , pp.
          <fpage>615</fpage>
          -
          <lpage>620</lpage>
          . URL: https://aclanthology.org/D14-1067. doi:
          <volume>10</volume>
          .3115/v1/
          <fpage>D14</fpage>
          -1067.
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [38]
          <string-name>
            <given-names>J.</given-names>
            <surname>Gomes</surname>
          </string-name>
          , R. C. de Mello,
          <string-name>
            <given-names>V.</given-names>
            <surname>Ströele</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. F. de Souza</surname>
          </string-name>
          ,
          <article-title>A study of approaches to answering complex questions over knowledge bases</article-title>
          ,
          <source>Knowledge and Information Systems</source>
          <volume>64</volume>
          (
          <year>2022</year>
          )
          <fpage>2849</fpage>
          -
          <lpage>2881</lpage>
          . URL: https://doi.org/10.1007/s10115-022-01737-x. doi:
          <volume>10</volume>
          .1007/ s10115-022-01737-x.
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [39]
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Ke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Ding</surname>
          </string-name>
          ,
          <article-title>Neural, symbolic and neuralsymbolic reasoning on knowledge graphs</article-title>
          ,
          <source>AI</source>
          Open 2
          <article-title>(</article-title>
          <year>2021</year>
          )
          <fpage>14</fpage>
          -
          <lpage>35</lpage>
          . URL: https:// www.sciencedirect.com/science/article/pii/S2666651021000061. doi:https://doi.org/ 10.1016/j.aiopen.
          <year>2021</year>
          .
          <volume>03</volume>
          .001.
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [40]
          <string-name>
            <given-names>M.</given-names>
            <surname>Yin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Tang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Webster</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Ying</surname>
          </string-name>
          ,
          <article-title>An ontology-aided, natural languagebased approach for multi-constraint BIM model querying</article-title>
          ,
          <source>Journal of Building Engineering</source>
          <volume>76</volume>
          (
          <year>2023</year>
          )
          <article-title>107066</article-title>
          . URL: https://linkinghub.elsevier.com/retrieve/pii/S2352710223012457. doi:
          <volume>10</volume>
          .1016/j.jobe.
          <year>2023</year>
          .
          <volume>107066</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          <source>[41] ISO 11354-1</source>
          :
          <fpage>2011</fpage>
          ,
          <article-title>Advanced automation technologies and their applications - Requirements for establishing manufacturing enterprise process interoperability - Part 1: Framework for enterprise interoperability</article-title>
          ,
          <year>2011</year>
          . URL: https://www.iso.org/obp/ui/en/#iso:std: iso:11354:-1:ed-1:v1:
          <fpage>en</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          [42]
          <string-name>
            <given-names>P.</given-names>
            <surname>Hagedorn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Block</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Zentgraf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Sigalov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>König</surname>
          </string-name>
          ,
          <article-title>Toolchains for Interoperable BIM Workflows in a Web-Based Integration Platform</article-title>
          ,
          <source>Applied Sciences</source>
          <volume>12</volume>
          (
          <year>2022</year>
          )
          <article-title>5959</article-title>
          . URL: https://www.mdpi.com/2076-3417/12/12/5959. doi:
          <volume>10</volume>
          .3390/app12125959, number: 12 Publisher: Multidisciplinary Digital Publishing Institute.
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          [43]
          <string-name>
            <given-names>L.</given-names>
            <surname>Höltgen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Cleve</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hagedorn</surname>
          </string-name>
          ,
          <article-title>Implementation of an Open Web Interface for the Container-based Exchange of Linked Building Data</article-title>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          [44]
          <string-name>
            <surname>T. Mendes de Farias</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Roxin</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <article-title>Nicolle, FOWLA, A Federated Architecture for Ontologies</article-title>
          .,
          <year>2015</year>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>319</fpage>
          -21542-6.
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          [45]
          <string-name>
            <given-names>N.</given-names>
            <surname>Noy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Mcguinness</surname>
          </string-name>
          ,
          <article-title>Ontology Development 101: A Guide to Creating Your First Ontology</article-title>
          ,
          <source>Knowledge Systems Laboratory</source>
          <volume>32</volume>
          (
          <year>2001</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          [46]
          <string-name>
            <given-names>T. R.</given-names>
            <surname>Gruber</surname>
          </string-name>
          ,
          <article-title>A translation approach to portable ontology specifications, Knowledge Acquisition 5 (</article-title>
          <year>1993</year>
          )
          <fpage>199</fpage>
          -
          <lpage>220</lpage>
          . URL: https://www.sciencedirect.com/science/article/pii/ S1042814383710083. doi:
          <volume>10</volume>
          .1006/knac.
          <year>1993</year>
          .
          <volume>1008</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          [47]
          <string-name>
            <given-names>J.</given-names>
            <surname>Devlin</surname>
          </string-name>
          , M.-
          <string-name>
            <given-names>W.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Toutanova</surname>
          </string-name>
          , BERT:
          <article-title>Pre-training of Deep Bidirectional Transformers for Language Understanding</article-title>
          , in: J.
          <string-name>
            <surname>Burstein</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Doran</surname>
          </string-name>
          , T. Solorio (Eds.),
          <source>Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</source>
          , Volume
          <volume>1</volume>
          (Long and Short Papers),
          <source>Association for Computational Linguistics</source>
          , Minneapolis, Minnesota,
          <year>2019</year>
          , pp.
          <fpage>4171</fpage>
          -
          <lpage>4186</lpage>
          . URL: https://aclanthology.org/N19-1423. doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>N19</fpage>
          -1423.
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          [48]
          <string-name>
            <given-names>N.</given-names>
            <surname>Kitaev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Klein</surname>
          </string-name>
          ,
          <article-title>Constituency Parsing with a Self-Attentive Encoder, in: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Association for Computational Linguistics</article-title>
          , Melbourne, Australia,
          <year>2018</year>
          , pp.
          <fpage>2676</fpage>
          -
          <lpage>2686</lpage>
          . URL: https://www.aclweb.org/anthology/P18-1249. doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>P18</fpage>
          -1249.
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          [49]
          <string-name>
            <given-names>M. H.</given-names>
            <surname>Rasmussen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lefrançois</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. F.</given-names>
            <surname>Schneider</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Pauwels</surname>
          </string-name>
          ,
          <string-name>
            <surname>BOT:</surname>
          </string-name>
          <article-title>The building topology ontology of the W3C linked building data group</article-title>
          ,
          <source>Semantic Web</source>
          <volume>12</volume>
          (
          <year>2021</year>
          )
          <fpage>143</fpage>
          -
          <lpage>161</lpage>
          . URL: https://content.iospress.com/articles/semantic-web/sw200385. doi:
          <volume>10</volume>
          .3233/SW-200385, publisher: IOS Press.
        </mixed-citation>
      </ref>
      <ref id="ref50">
        <mixed-citation>
          1.
          <article-title>Find external walls on floor 2 with heights smaller than the height of A201.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref51">
        <mixed-citation>
          <article-title>2. Search for external walls on the floor 2</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref52">
        <mixed-citation>
          <article-title>3. Select the windows with a width greater than 4m.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref53">
        <mixed-citation>
          4.
          <article-title>List all single swing right doors</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref54">
        <mixed-citation>
          5.
          <article-title>How many rooms are in this house?</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref55">
        <mixed-citation>
          6.
          <article-title>Identify all the beams with a span larger than 5m.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref56">
        <mixed-citation>
          7.
          <article-title>Find all the doors with a height greater than 2</article-title>
          .10 m.
        </mixed-citation>
      </ref>
      <ref id="ref57">
        <mixed-citation>
          <article-title>8. Locate the walls in brick.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref58">
        <mixed-citation>
          <article-title>9. Search for the slabs in wood.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref59">
        <mixed-citation>
          10.
          <article-title>Select the footing at ground 0</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref60">
        <mixed-citation>
          11.
          <article-title>Find all spaces connected to B201 and contained in floor 2</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref61">
        <mixed-citation>
          12.
          <article-title>Select all stairs on level 1</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref62">
        <mixed-citation>
          13.
          <article-title>Find the area of A103.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref63">
        <mixed-citation>
          14.
          <article-title>Are there load-bearing roofs?</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref64">
        <mixed-citation>
          15.
          <article-title>Locate walls whose width is greater than 0</article-title>
          .400m.
        </mixed-citation>
      </ref>
      <ref id="ref65">
        <mixed-citation>
          16.
          <article-title>List windows whose sill height is greater than 1</article-title>
          .5m.
        </mixed-citation>
      </ref>
      <ref id="ref66">
        <mixed-citation>
          17.
          <article-title>Find the room that has the largest area</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref67">
        <mixed-citation>
          18.
          <article-title>Search for all the load-bearing beams with slope equal to 0</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref68">
        <mixed-citation>
          19.
          <article-title>Select the windows on the roof</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref69">
        <mixed-citation>
          20.
          <article-title>Locate all the stairs</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref70">
        <mixed-citation>
          21.
          <article-title>List the exterior doors with a width larger than 1m.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref71">
        <mixed-citation>
          22.
          <article-title>Find the doors with automatic operation</article-title>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>