<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards usable ICDD containers for ontology-driven data linking and link validation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Philipp Hagedorn</string-name>
          <email>philipp.hagedorn-n6v@ruhr-uni-bochum.de</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Madhumitha Senthilvel</string-name>
          <email>senthilvel@dc.rwth-aachen.de</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hans Schevers</string-name>
          <email>hans.schevers@buildingbits.nl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lucas B. Verhelst</string-name>
          <email>lucas.verhelst@bim-connected.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>BIM-Connected</institution>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>BuildingBits</institution>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Chair of Computing in Engineering, Ruhr-Universität Bochum</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Chair of Design Computation, RWTH Aachen University</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <fpage>35</fpage>
      <lpage>46</lpage>
      <abstract>
        <p>Delivering data and documents of a building throughout its lifecycle is a common task in construction planning, engineering, and maintenance. The international standard Information Containers for linked Document Delivery (ICDD) has been created to link data and documents together for integrated delivery. This paper describes two independent software prototypes for creating these hybrid linksets in ICDD containers and validating them with the Shapes and Constraint Language (SHACL). This research focuses on ways to link data modeled in the Resource Description Framework (RDF) to non-RDF resources like documents and entities within documents. It argues that current ICDD mechanisms for linking to RDF resources are cumbersome and do not exploit the advantages of Linked Data. It also argues that the ICDD specification regarding the usage of Linked Data is very restrictive, thereby blocking the full benefits of Linked Data. The paper presents two strategies to alleviate these barriers to using ICDD; one approach is within the ICDD standard but adds additional agreements on top of the current ICDD standard. The other approach is technically outside the ICDD standard. Eventually, conclusions and recommendations are drawn from the presented implementations and strategies.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;ICDD</kwd>
        <kwd>ISO 21597</kwd>
        <kwd>Prototype</kwd>
        <kwd>Deep Linking</kwd>
        <kwd>Linked Building Data</kwd>
        <kwd>SHACL</kwd>
        <kwd>RDF Validation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Stakeholders of the built environment and the construction industry constantly share
heterogeneous information. In the complex landscape of the built environment, there is a need to
exchange information in a standard way without losing the context of the information and
only relying on vendor-specific formats [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Furthermore, with the increasing data volumes and
diferent types of data having to form a unified view, the need to transfer interconnected data is
growing [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        The ISO 21597 [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ] Information Container for linked Document Delivery – Exchange
Specification (ICDD) was released in 2020 to overcome the lack of a vendor-neutral structure for
exchanging heterogeneous distributed building data. The ISO standard specifies a container
0000-0002-6249-243X (P. Hagedorn); 0000-0003-0733-9157 (M. Senthilvel); 0000-0003-1651-8202 (L. B. Verhelst)
© 2023 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
      </p>
      <p>CPWrEooUrckResehdoinpgs IhStpN:/c1e6u1r3-w-0s.o7r3g CEUR Workshop Proceedings (CEUR-WS.org)
with a structure that holds the payload that has to be exchanged. In this container, a header
ifle is present to register and describe the contained files with metadata. Link files describe the
(deep) links between these files. Both metadata files are formatted using Linked Data (LD) and
Semantic Web formats, i.e., the Resource Description Framework (RDF) and the Web Ontology
Language (OWL). RDF and OWL are standardized by the World Wide Web Consortium (W3C).</p>
      <p>If the payload happens to be (partly) LD, the standard specifies that links should still be created
through the linkset files. However, because deep linking between datasets is natively supported
in LD, the proposed linking mechanism of ICDD is unnecessarily cumbersome. Furthermore,
the overhead of the linking structure in ICDD could lead to redundancy in the data if direct
links between the LD payload persist. Further, the standard does not specify how to transfer
the data model and requirements that might be encoded using Semantic Web standards, e.g.,
Shapes Constraint Language (SHACL). Such functionality would make the container even more
self-describing and enable the software to verify the exchanged data.</p>
      <p>In this paper, we discuss how a revision of the ICDD standard might change the container to
use the characteristics of LD better and consequently use the standard languages of LD (e.g.,
RDF, OWL, SHACL). However, because the conformance paragraph in the standard prohibits
any extensions of the structures specified in it, only one of the two proposed solutions will
conform to the standard. In the following, we will examine two independent software prototypes
for ICDD regarding their usability with LD payloads, their handling of the linkage between
RDF data and documents, and their ability to validate the contents of ICDD containers with
SHACL. After concluding problem statements from the analysis of the prototypes, strategies for
extensions of the ICDD standard are defined, and general recommendations are given in the
conclusions.</p>
    </sec>
    <sec id="sec-2">
      <title>2. The ICDD standard and Linked Data</title>
      <p>
        Information exchange in the construction industry is conducted according to ISO 19650 [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
Information containers are referred to for exchanging information as the smallest addressable
unit in a software environment. However, it is not defined what an information container is and
how it should be structured. A standardized information container is supplied with the ICDD
defined by ISO 21597-1:2020 [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. The standard contains a structured container schema in a ZIP
archive built upon Semantic Web technologies. ICDD containers can exchange interconnected
documents such as GML files, IFC files, Excel files, PDF files, CAD drawings, etc. Information
about documents, including links between them, can be asserted using LD in the container. Use
cases for the ICDD container are presented in recent research [
        <xref ref-type="bibr" rid="ref5 ref6 ref7">5, 6, 7</xref>
        ]. LD makes use of a part
of the Semantic Web technologies. There are increasingly more examples in research where
building data is converted to LD or building data is referred to via LD [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. The construction
industry increasingly uses these Semantic Web technologies to model knowledge in RDF and
OWL to facilitate a web-based provision and exchange of building data [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
      <p>
        LD is provided through publicly available base and domain-specific ontologies that can be
leveraged to represent building data in a modular ontology combination. Base ontologies contain
commonly agreed terminology that is valid for multiple domains, while domain ontologies
extend the base ontologies with domain-specific knowledge. These ontologies are also referred
to as Terminology-Box (T-Box). The W3C Linked Building Data (LBD) Community Group1
is one instance that proposes well-established base ontologies such as the Building Topology
Ontology (BOT) or the Ontology for Property Management (OPM) [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. LBD is thus an approach
to employ linked data-based applications across the life cycle of buildings using shared ontologies.
Besides these shared ontologies, Object Type Libraries (OTL) are increasingly used to describe
the semantics of buildings, including standardized object types and property specifications of
objects.
      </p>
      <p>
        The ICDD container is a successor of the Dutch COINS container, as described by Hoeber et
al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. The container supplies an index file defined by the vocabulary from the Container.rdf 2
and includes a folder Ontology resources for ontologies, Payload documents for arbitrary
documents, and Payload triples for RDF-serialized data and linksets.
      </p>
      <p>ls:DirectedLink
ls:hasFromLinkElement: ls:LinkElement [1..]
ls:hasToLinkElement: ls:LinkElement [1..]
Legend
Properties
inheritance
object property</p>
      <p>Classes</p>
      <p>Container Ontology (ct)</p>
      <p>Linkset Ontology (ls)
ls:Link
ls:hasLinkElement: ls:LinkElement[2..]
subClassOf
subClassOf
ls:hasLinkElement</p>
      <p>ls:LinkElement
ls:hasDocument: ct:Document [1..1]
ls:hasIdentifier: ls:Identifier [0..1]
ls:BinaryLink
ls:hasLinkElement: ls:LinkElement [2..2]</p>
      <p>subClassOf
ls:StringBasedIdentifier
ls:identifierField: string [0..1]
ls:identifier: string [1..1]
ls:hasDocument
ls:hasIdentifier
ls:Identifier
subClassOf
...</p>
      <p>ct:Document
subClassOf
ls:UriBasedIdentifier
ls:uri: anyUri [1..1]
ls:QueryBasedIdentifier
ls:queryExpression: string [1..1]
ls:queryLanguage: string [1..1]</p>
      <p>
        Documents registered in the container index can be linked using the vocabulary from the
Linkset.rdf 3. Specific link types can be used with the ExtendedLinkset.rdf 4 vocabulary, which is
standardized in the second part of the ISO standard [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The general structure of defining links
between documents is reported in Figure 1.
      </p>
      <p>
        Linking between two or more documents is provided through the link structure shown
in Figure 1, defining a ls:LinkElement class that can be referenced in several link types.
The ls:LinkElement class contains exactly one document reference and optionally one
identifier of the identifier types ls:StringBasedIdentifier, ls:UriBasedIdentifier,
and ls:QueryBasedIdentifier. These identifiers allow the deep linking of entities inside
documents. Deep linking uses the above-mentioned identifiers, which form a generic mechanism
that arguably needs supplementary specifications for each file type. For example, deep linking
to a GML file could mean that ICDD’s identifier refers to a gml:id within a GML file, although
this is not documented or specified. It could also mean that the ICDD identifier refers to the
objectID property within a GML file. For linking to IFC files, ICDD’s identifier could use the
IFC-GUID property, but the line number (in the case of an IFC step physical file) is also valid.
The deep linkage of drawings and IFC models in ICDD containers has also been analyzed by
Borrmann et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        Validation of ICDD containers is provided by the second part of the ISO 21597-2:2020 [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
Therein, SHACL [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] is used to validate every instance of the ICDD ontologies inside a container.
To conduct the validation, the appendix delivers SHACL shapes to verify the container. SHACL
validation for LBD using generated SHACL shapes has been investigated by Senthilvel et al. [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
The documentation of SHACL can be found in [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. The usage of SHACL for ICDD container
conformity validation is also addressed by Schevers [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. However, the approaches in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and
[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] only validate the conformity with the ISO 21597 standard and do not touch the contents
of a container. The conformity criteria of an ICDD container are specified in ISO 21597 and
forbid the extension of any class or property from the given ICDD ontologies. According to the
conformance criteria in 6.2 f) of ISO 21597-2 [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], RDF data must be placed in the Payload triples
folder and is thus not registered in the index file of the container. A consequence of this is that
no links to entities can be made in the RDF data, as a ct:Document instance is mandatory for
this (see Figure 1).
      </p>
      <p>
        An approach for validating the linksets inside ICDD containers content-wise to ensure the link
validity based on the linked entities is provided by Hagedorn and König [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. In their approach,
building data is transferred into RDF data and validated in the synopsis of all resources.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Prototype software systems</title>
      <p>In the following section, two independent software prototypes for viewing and editing ICDD
containers are presented and analyzed regarding their handling of RDF-based data and linking
between these RDF-based data and documents inside the container.</p>
      <sec id="sec-3-1">
        <title>3.1. RUB ICDD Platform (RUB-IP)</title>
        <p>
          The RUB-IP is a viewing, editing, and collaboration platform based on ICDD information
containers. Hagedorn et al. [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] provide the underlying software architecture for this web
platform. ICDD containers are maintained on the platform in projects. According to information
management using ISO 19650, projects can contain ICDD containers with specific states and
purposes. The platform implements rights management for projects and containers. Using
RUB-IP, ICDD containers can be queried using SPARQL and validated using SHACL. Therefore,
payload documents available as RDF data, container metadata, linksets, and all other RDF
data from the Payload triples folder are integrated into a triples store. Thus, validation can
be performed not only on the schema level but also on the content level and regarding the
integrity and plausibility of linksets. Moreover, RUB-IP renders IFC files into a 3D viewer, to
enable a visual selection of IFC elements and to query, e.g., linked documents or entities, for the
selection by using SPARQL queries. The platform relies on the open-source published IIB.ICDD
framework [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] for handling ICDD containers using C# and .NET.
        </p>
        <p>
          RUB-IP shows the contents of an ICDD container featuring all the links between documents
following the ICDD specifications. Extra functionality is present when ICDD links refer to
other RDF files, e.g., showing the graph structure of the link data structure, the linked RDF
individuals, and direct L1 level properties of the entities. Rasmussen et al. [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] define these
property levels for LBD from L1 to L3 according to the respective steps (number of triples) from
a feature of interest to the value of a property.
        </p>
        <sec id="sec-3-1-1">
          <title>3.1.1. Handling of RDF-based data</title>
          <p>
            Loading RDF files into ICDD containers on the RUB-IP can be done threefold. First, RDF
documents can be stored in the Payload documents folder as ct:InternalDocument instance.
Moreover, RDF files can also be referenced as ct:ExternalDocument instances via their
respective URI, under which an RDF serialized format is retrievable. Both options store the LD
in the Payload documents folder. The third option to add LD to ICDD containers in the RUB-IP is
to store it under Payload triples as defined in Part 2 of ISO 21597. As of the standards definition,
only the linksets from the Payload triples are registered in the container index file, but neither
the other RDF files in the Payload triples folder nor the ontologies in the Ontology resources
folder are considered. For registering RDF files from the Payload triples folder in the container,
the ICDD ontologies are extended, as described in [
            <xref ref-type="bibr" rid="ref15">15</xref>
            ]. A class exdoc:PayloadProxy is
defined as a subclass of ct:ExternalDocument with the derived property ls:uri from the
ct:ExternalDocument class. With this, an indication is made that an RDF document is
added in the Payload triples folder referring to the respective ls:uri as a base URI of the
contained RDF graph. All uploaded RDF files on the RUB-IP are automatically generated as
exdoc:PayloadProxy instances and registered in the index file. Through RDFS reasoning, all
instances of exdoc:PayloadProxy can be inferred to ct:ExternalDocument to conform
to ISO 21597 again.
3.1.2. Handling of the linking between RDF data and documents
Depending on how RDF data is stored in containers on the RUB-IP, there are possible ways
to link RDF data with other documents in the container using ls:StringBasedIdentifier
or ls:UriBasedIdentifier. The identifier type ls:QueryBasedIdentifier is not yet
implemented into the system. For RDF data stored in the Payload documents, links can easily
be created using a ls:LinkElement referring to the internal or external document instance
and either a URI-based identifier to address the RDF entity directly via its URI or a string-based
identifier to refer to an RDF entity via a combination of an identifier field and identifier value.
The latter option can be used to look up respective triples where the identifier field is used
as a predicate and the identifier value as an object. This delivers one possible subject if the
identifier value is unique in the RDF data (e.g., the IFC-GUID when using ifcOWL) or multiple
values if the identifier is not unique (e.g., a property value of entities with a particular label or
numeric value). If the RDF data is stored in Payload triples, both methods to identify instances
can be used as described before. The only diference is that the ls:LinkElement refers to the
generated PayloadProxy instance reflecting the RDF document in the container index file.
However, none of these two solutions for handling RDF data and linking to it is intended by the
ISO 21597 standard.
          </p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Wistor ICDD Viewer</title>
        <p>
          The Wistor ICDD Viewer [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] enables users to upload an ICDD container and browse through
the data. An IFC ‘Adaptor’ enables visualization of IFC files within the ICDD container. The
adaptor resolves deep links towards IFC elements and other documents. Related content can be
found using the 3D IFC viewer; consequently, extra documents can be attached to IFC elements.
The goal of this ICDD viewer, besides handling ICDD content, is also to handle extra LD based
on a Object Type Library (OTL) specified in SHACL. ‘Instances’ of OTL classes are present in
an additional RDF file within the ICDD Container, enabling the exchange of objects, properties,
and their relations connected to other documents using ICDD data structures. Figure 3 shows
the Wistor ICDD Viewer of a substation design document in multiple IFC files where objects in
red have extra information available using an RDF file in the Payload documents folder.
        </p>
        <p>The properties panel (bottom left widget) shows the extra properties of the selected object.
These properties are gathered from an RDF file. The connection between the selected IFC
element and the RDF file resources is specified using the ICDD classes.</p>
        <p>Multiple verification steps are available to ensure the exchanged data complies with the
specifications. First, the ICDD container is verified to meet the ISO 21597 conditions. As the
client primarily wants IFC files using their specifications related to (necessary) documents and
wants extra object information according to their SHACL OTL, additional verification steps are
required. IFC files are checked to ensure they comply, and the extra LD is validated against the
SHACL OTL.</p>
        <sec id="sec-3-2-1">
          <title>3.2.1. Handling of RDF-based data</title>
          <p>The Wistor ICDD Viewer had to make its own extra decisions to enable a more holistic
veriifcation of ICDD data encompassing IFC files, ICDD links, and additional LD based upon an
OTL (in SHACL). The first decision was to load extra LD in the Payload documents folder within
the ICDD specification. Primarily the idea was to put additional RDF files in the Payload triples
folder, but Wistor’s interpretation was that this was not according to the ISO standard. A second
decision was to load an LD OTL to make more information on object types available to the
viewer. As the OTL was based upon SHACL, the extra LD set could be validated, ensuring that
the exchanged data conforms. This was the third decision (validating the supplementary dataset
against a SHACL/OTL).</p>
        </sec>
        <sec id="sec-3-2-2">
          <title>3.2.2. Handling of the linking between RDF data and documents</title>
          <p>Another decision was to use ls:LinkElement and ls:StringBasedIdentifier to link to
an LD data resource. Instead, the foundation of LD proposes to use IRIs to link to resources.
However, this approach was at least complying with the ICDD standard. As multiple ways are
present to handle a link to an LD resource (e.g., ls:UriBasedIdentifier), extra guidelines
are necessary for ICDD container creators to ensure they use the same approach. Data structures
for linking to external documents were also present within the OTL, enabling cardinality
restrictions or simply enabling the specification of mandatory documents. However, these
data structures are present in the supplementary RDF files and not related to ICDD links,
consequently introducing redundancy and potential conflicts.</p>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Problem statements</title>
        <p>
          Based on the two prototype implementations, remarks and problem statements on the usability
of ICDD for ontology-driven data linking and link validation for the construction industry
are drawn. Both prototypes evaluated use cases where RDF data was available inside the
container in addition to the container metadata and linksets, allowing users to load RDF
ifles into the container. Both prototypes demonstrated a cumbersome way to link to an RDF
resource using the ICDD linking structure with either the ls:StringBasedIdentifier or
the ls:UriBasedIdentifier according to the standard and consequently ignoring very basic
LD solutions. In the second prototype, this leads to a definition of links in the LD itself and
not in the linksets of the ICDD, so the advantage of separating links in the linksets from the
original data in the payload is not maintained anymore. This can lead to duplicate linking and
additional efort in maintaining the container and linksets. Using the ICDD linking structure
combined with LD in a linkset or payload LD is not allowed by the standard and would fail the
SHACL validation provided by ISO 21597-2 [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
        </p>
        <p>Moreover, both prototypes use SHACL for validating and rule execution to validate and
visualize the interconnections to additional LD in the ICDD containers. However, link validation
is not genuinely possible for deep linking, as especially the ls:StringBasedIdentifier as
a key-value-pair representing the identifier does not convey a unique meaning or identification.
Thus, it is only usable within the defining system unless there is an agreement on how the
identifiers are used. As the standard provides schema validation, the next step is validating the
linking and context, which the standard does not cover yet. The standard does not deliver a
solution where to store the SHACL shapes that should be evaluated for the specific container.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Extending ICDD for linking and validation</title>
      <p>After comparing the implementations and considering the lack of solutions for deep linking
RDF data in ICDD, the challenges are: (1) to overcome the quite complex standard for simple
data linking, (2) to uniquely identify deep linked elements with the built-in identifiers, (3) to
exploit the added value of LD and (4) to enable identification of SHACL files that are meant for
data validation. To overcome these challenges, it is necessary to acknowledge that RDF data
difers from other payloads in the container. The authors identified potential solutions in two
strategies to put the strengths of the ICDD data structure together with the full power of LD.</p>
      <p>The first strategy (stay-within-the-standard) contains solutions within the ICDD standard
resulting in best practices or even additional agreements on dealing with LD datasets within ICDD
containers. This could result in LD modeling agreements and an LD adapter implementation
agreement. The second strategy (extend-the-standard) contains solutions that are technically
not compliant with the ICDD standard because they extend or change the schema provided by
the standard. These two strategies will be discussed further in the following paragraphs.</p>
      <sec id="sec-4-1">
        <title>4.1. Compliant solutions (stay-within-the-standard)</title>
        <p>Compliant solutions can be met by storing RDF data and SHACL files in the Payload documents
folder without having them treated specially. Linking to RDF data must then be done via the
ls:URIBasedIdentifier provided by the standard to make the linking resolvable for other
container recipients. This linking mechanism is present in the current standard and is the closest
way to bridge the gap to a handy and usable application of LD in ICDD. Regarding the unique
identification of entities in other file types, e.g., IFC models, GIS data, or calculation spreadsheets,
a common understanding of how to use the ls:StringBasedIdentifier using adapters and
implementation guidelines is necessary. The two prototypical implementations have shown
that even for IFC documents, diferent key-value pairs for referring to the IFC GUID can be
used for linking, e.g., when utilizing diferent keys: "GUID", "globalID", "IFC GUID". This makes
it hard for software systems to implement ICDD in an interoperable way.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Ideal solutions (non-compliant, extend-the-standard)</title>
        <p>In the above section, solutions compliant with the current specifications in ISO 21597 were
elaborated. This section presents approaches that deviate from the specifications of ISO 21597
while retaining the concept of information containers.</p>
        <p>The first solution could be to allow extensions of the linkset ontology, including the
necessary modifications to accommodate this. For instance, the ranges and restrictions on the
predicates ls:hasLinkElement, ls:hasFromLinkElement, and ls:hasToLinkElement
could be adjusted to not only allow instances of ls:LinkElement. One possibility would be a
generic restriction to any rdfs:Resource so that either ls:LinkElement referencing the
documents or RDF entities could directly be addressed, which is depicted in Figure 4.
ls:hasFromLinkElement
:linker_xx</p>
        <p>PAYLOAD_TRIPLES_1
inst:BuildingElement_01</p>
        <p>The advantage of this approach is that the ls:LinkElement, ls:Identifier, and the
identifier declaration can entirely be omitted for one side of the link, and only the reference
to the entity is referred to via ls:hasFromLinkElement. With this change in the modeling
principle of the links, RDF data must not necessarily be registered in the index but must be
present in the Payload triples folder.</p>
        <p>
          T-Box files, like SHACL shapes and extensions of the linkset ontology, must be placed
in the Ontology resources folder. RDF datasets (A-Box) must be put into the Payload triples
folder. Company-specific OTLs or T-Box datasets like the upcoming Semantic Modeling and
Linking (SML) standard draft could arguably be used with the ICDD standard. This could bridge
the gap between hybrid modeling of semantics and documents at this point [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ].
        </p>
        <p>Eventually, a more generic solution is an approach that uses a combination of the above
proposals and also considers the usage of the link type classes (defined in part 2) as property
values. In this solution, the original container structure (of the three folders for Ontology
resources, Payload documents, and Payload triples along with the overarching index file) is
ct:ContainerDescription</p>
        <p>LinkEllesm:hansTo
rdf:type et</p>
        <p>
          ls:hLainskFErolemment
:link002
rdf:type
:linker_xx
rdf:type
els:IsControlledBy
ls:LinkElement
retained as the essential information container structure. To illustrate this, Figure 5 showcases
an example container index file snippet (green oval), with the linkset file (purple oval), in
which two documents :document1 and :document2 are linked using the two hierarchy
levels: first declaring documents as ls:LinkElement and this is then used to link (by using
ls:hasToLinkElement and ls:hasFromLinkElement) to an instance :linker_xx which
is finally assigned to a link class els:IsControlledBy. If this mandatory declaration of both
ls:Link and ls:LinkElement can be omitted, and the current restrictions on the usage of
link types are removed, it is possible to assign a link class as a property value directly (this is
illustrated by the blue dotted arrows in the figure) [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. However, in this solution, though the
linkset structure is partially retained, the usage of link classes beyond the assigned domain and
range will be recognized as a non-conformance.
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions and Recommendations</title>
      <p>Adopting LD for the built environment results in the need to exchange data using LD in
conjunction with other documents. The development and adaption of the ICDD standard for the
construction industry is a logical step toward data-driven storage and exchange of information.
Two independent prototype developments were presented within this research focusing on the
linking of data, the storage of LD datasets, and the validation of the container contents. However,
the authors have recorded that the ICDD linking structure for RDF resources is cumbersome and
does not exploit the full potential of LD. In the current standard state, additional agreements are
necessary to facilitate an ICDD software ecosystem that maintains and verifies ICDD containers
containing extra LD datasets. Possible solutions for improving the ICDD standards were given.
The recommendations are summarized, and an outlook on future research is shown in the
following. Based on the research carried out in this paper, several recommendations for action
are outlined:
• When using ICDD in its current standardization, supplemental agreements for structuring
link elements and identifiers for specific file types, e.g., IFC, GIS, or spreadsheets, could
help interpret the links inside ICDD in diferent systems uniquely.
• The basic idea of ICDD should be that the potential of LD is not limited by its use as a
simple ZIP archive without touching the benefits of LD.
• For extending the ICDD standard for an ideal interaction with LD, the extensions of
classes and properties of the ICDD ontologies should be allowed.
• The storage of internal and external RDF files should be registered in the index file.
• SHACL shapes residing in a dedicated folder or referenced from a web resource should
be registered inside the index file as requirements for container exchange.</p>
      <p>
        Regardless of the presented solutions for improving the linking of LD datasets in ICDD, the
validation of containers is needed and should be supported by the standard for the conformity
check and the validation of content. Regarding further research, there is also ongoing
standardization regarding the use of LD for modeling and linking building data as SML resulting
in a standard draft of BS EN 17632 [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. In addition, there are overlaps and touching points
between modeling documents and non-document data. Further overlap is between general LD
concepts such as the Linked Data Platform, which allows defining containers for maintaining
heterogeneous data [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. Both approaches can be analyzed in further work and could be adapted
for implementing a usable container-based information exchange in the construction industry.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.</given-names>
            <surname>Werbrouck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Pauwels</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Beetz</surname>
          </string-name>
          , E. Mannens,
          <article-title>Mapping Federated AEC projects to Industry Standards using dynamic Views</article-title>
          ,
          <source>in: Proceedings of the 10th Linked Data in Architecture and Construction Workshop</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>65</fpage>
          -
          <lpage>76</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <article-title>[2] ISO 21597-1, Information container for linked document delivery: Exchange specification - Part 1</article-title>
          : Container, 1 ed., International Organization for Standardization, Geneva, Switzerland,
          <year>2020</year>
          . URL: https://www.iso.org/standard/74389.html.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <article-title>[3] ISO 21597-2, Information container for linked document delivery: Exchange specification - Part 2: Link types</article-title>
          , 1 ed., International Organization for Standardization, Geneva, Switzerland,
          <year>2020</year>
          . URL: https://www.iso.org/standard/74390.html.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <article-title>[4] ISO 19650-1, Organization of information about construction works - Information management using building information modelling</article-title>
          .
          <source>Part 1: Concepts and principles</source>
          , 1 ed., International Organization for Standardization, Geneva, Switzerland,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Borrmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Abualdenien</surname>
          </string-name>
          , T. Krijnen,
          <article-title>Information containers providing deep linkage of drawings and BIM models, The 38th CIB W78 conference on Information and Communication Technologies for AECO (</article-title>
          <year>2021</year>
          ). URL: http://itc.scix.net/paper/w78-2021
          <string-name>
            <surname>-</surname>
          </string-name>
          paper-082.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.</given-names>
            <surname>Göbels</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Beetz</surname>
          </string-name>
          ,
          <article-title>Using ICDD Containers for documentation data archives: Semiautomated creation of linked documentation data archives using an Information Container for linked Document Delivery</article-title>
          ,
          <source>Proceedings of the 27th International Conference on Cultural Heritage and New Technologies</source>
          ,
          <year>November 2022</year>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J.</given-names>
            <surname>Karlapudi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Prathap</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Menzel</surname>
          </string-name>
          ,
          <article-title>An explanatory use case for the implementation of Information Container for linked Document Delivery in Common Data Environments</article-title>
          ,
          <source>in: EG-ICE 2021 Workshop on Intelligent Computing in Engineering</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>76</fpage>
          -
          <lpage>86</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>H.</given-names>
            <surname>Schevers</surname>
          </string-name>
          , J. Mitchell,
          <string-name>
            <given-names>P.</given-names>
            <surname>Akhurst</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Marchant</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bull</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>McDonald</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Drogemuller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Linning</surname>
          </string-name>
          ,
          <article-title>Towards digital facility modelling for Sydney opera house using IFC and semantic web technology</article-title>
          ,
          <source>ITcon</source>
          <volume>12</volume>
          (
          <year>2007</year>
          )
          <fpage>347</fpage>
          -
          <lpage>362</lpage>
          . URL: https://www.itcon.org/
          <year>2007</year>
          /24.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>P.</given-names>
            <surname>Pauwels</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.</surname>
          </string-name>
          McGlinn (Eds.),
          <source>Buildings and Semantics: Data Models and Web Technologies for the Built Environment</source>
          , 1 ed., CRC Press, London,
          <year>2022</year>
          . doi:
          <volume>10</volume>
          .1201/9781003204381.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>H.</given-names>
            <surname>Hoeber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Alsem</surname>
          </string-name>
          ,
          <article-title>Life-cycle information management using open-standard BIM, Engineering</article-title>
          ,
          <source>Construction and Architectural Management</source>
          <volume>23</volume>
          (
          <year>2016</year>
          )
          <fpage>696</fpage>
          -
          <lpage>708</lpage>
          . doi:
          <volume>10</volume>
          . 1108/ECAM-01-2016-0023.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>H.</given-names>
            <surname>Knublauch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kontokostas</surname>
          </string-name>
          ,
          <source>Shapes Constraint Language (SHACL): W3C Recommendation 20 July</source>
          <year>2017</year>
          ,
          <year>2017</year>
          . URL: https://www.w3.org/TR/shacl/.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Senthilvel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Beetz</surname>
          </string-name>
          ,
          <article-title>A Visual Programming Approach for Validating Linked Building Data</article-title>
          , in: L.
          <string-name>
            <surname>-C. Ungureanu</surname>
          </string-name>
          , T. Hartmann (Eds.),
          <source>EG-ICE 2020 proceedings: International Workshop on Intelligent Computing in Engineering</source>
          , Berlin,
          <year>2020</year>
          , pp.
          <fpage>403</fpage>
          -
          <lpage>411</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>H.</given-names>
            <surname>Schevers</surname>
          </string-name>
          ,
          <article-title>Towards a webbased service for NEN-EN-ISO 21597-1:2020 container validation</article-title>
          :
          <source>Bimloket Governance documentation Draft 19 september</source>
          <year>2022</year>
          ,
          <year>2022</year>
          . URL: https://bimloket.github.io/ICDD-NL/validation/.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>P.</given-names>
            <surname>Hagedorn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>König</surname>
          </string-name>
          ,
          <article-title>Rule-Based Semantic Validation for Standardized Linked Building Models</article-title>
          ,
          <source>in: Proceedings of the 18th International Conference on Computing in Civil and Building Engineering</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>772</fpage>
          -
          <lpage>787</lpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>030</fpage>
          -51295-8_
          <fpage>53</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>P.</given-names>
            <surname>Hagedorn</surname>
          </string-name>
          , L. Liu,
          <string-name>
            <given-names>M.</given-names>
            <surname>König</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Hajdin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Blumenfeld</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Stöckner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Billmaier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Grossauer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Gavin</surname>
          </string-name>
          ,
          <article-title>BIM-enabled Infrastructure Asset Management using Information Containers and Semantic Web</article-title>
          ,
          <source>ASCE Journal of Computing in Civil Engineering</source>
          <volume>37</volume>
          (
          <year>2023</year>
          ). doi:
          <volume>10</volume>
          .1061/(ASCE)CP.
          <fpage>1943</fpage>
          -
          <volume>5487</volume>
          .
          <fpage>0001051</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>P.</given-names>
            <surname>Hagedorn</surname>
          </string-name>
          ,
          <string-name>
            <surname>IIB.</surname>
          </string-name>
          <article-title>ICDD framework: a framework to open, create, validate, edit, and export containers defined by ISO 21597-1:2020: MIT license</article-title>
          ,
          <year>2022</year>
          . URL: https://doi.org/10.5281/ zenodo.7256174. doi:
          <volume>10</volume>
          .5281/zenodo.7256174.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>M. H.</given-names>
            <surname>Rasmussen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lefrancois</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Bonduel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Hviid</surname>
          </string-name>
          , J. Karlshøj,
          <string-name>
            <surname>OPM:</surname>
          </string-name>
          <article-title>An ontology for describing properties that evolve over time</article-title>
          ,
          <source>Proceedings of the 6th Linked Data in Architecture and Construction Workshop</source>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <article-title>BIM-Connected and BuildingBits</article-title>
          , Wistor platform,
          <year>2023</year>
          . URL: https://www.wistor.nl/en/, last accessed:
          <volume>29</volume>
          .
          <fpage>03</fpage>
          .
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <source>[19] BS EN 17632</source>
          ,
          <string-name>
            <surname>Building Information Modelling (BIM) - Semantic Modelling</surname>
          </string-name>
          and
          <article-title>Linking (SML): Part 1: Generic modelling patterns</article-title>
          [Draft], 1 ed.,
          <source>British Standards Institution</source>
          , London, United Kingdom,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>N.</given-names>
            <surname>Noy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Uschold</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Welty</surname>
          </string-name>
          ,
          <source>Representing Classes As Property Values on the Semantic Web</source>
          ,
          <year>2005</year>
          . URL: https://www.w3.org/TR/2005/NOTE-swbp
          <article-title>-classes-as-values-</article-title>
          <volume>20050405</volume>
          /.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>S.</given-names>
            <surname>Speicher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Arwe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Malhotra</surname>
          </string-name>
          ,
          <source>Linked Data Platform</source>
          <volume>1</volume>
          .0,
          <string-name>
            <given-names>W3C</given-names>
            <surname>Recommendations</surname>
          </string-name>
          (
          <year>2015</year>
          ). URL: http://www.w3.org/TR/2015/REC-ldp-
          <volume>20150226</volume>
          /.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>