<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Fourth International Workshop On Knowledge Graph Construction Co-located with the ESWC</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Preface for the 4th Edition of the International Knowledge Graph Construction Workshop</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>David Chaves-Fraga</string-name>
          <email>david.chaves@upm.es</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anastasia Dimou</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ana Iglesias-Molina</string-name>
          <email>ana.iglesiasm@upm.es</email>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Umutcan Serles</string-name>
          <email>umutcan.serles@sti2.at</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dylan Van Assche</string-name>
          <email>dylan.van.assche@ugent.be</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Flanders Make - DTAI-FET</string-name>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Crete, Greece</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>IDLab, Dept of Electronics and Information Systems, Ghent University - imec</institution>
          ,
          <country country="BE">Belgium</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>KU Leuven, Department of Computer Science</institution>
          ,
          <addr-line>Sint-Katelijne-Waver</addr-line>
          ,
          <country country="BE">Belgium</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Leuven.AI - KU Leuven institute for AI</institution>
          ,
          <addr-line>B-3000 Leuven</addr-line>
          ,
          <country country="BE">Belgium</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Semantic Technology Institute Innsbruck, Universität Innsbruck</institution>
          ,
          <country country="AT">Austria</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Universidad Politécnica de Madrid</institution>
          ,
          <addr-line>Campus de Montegancedo, Boadilla del Monte</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>28</volume>
      <abstract>
        <p>More and more knowledge graphs are constructed for private use, e.g., the Amazon Product Graph [1] or the Fashion Knowledge Graph by Zalando1,or public use, e.g., DBpedia2 or Wikidata3. While techniques to automatically construct KGs from existing Web objects exist (e.g., scraping Web tables), there is still room for improvement. So far, constructing knowledge graphs was considered an engineering task, however, more scientifically robust methods keep on emerging. These methods were widely questioned for their verbosity, low performance or dificulty of use, while the data sources' variety and complexity cause further syntax and Declarative methods (mapping languages) for describing rules to construct knowledge graphs and approaches to execute those rules keep on emerging. Nevertheless constructing knowledge graphs is still not a straightforward task because several existing challenges remain and yet the barriers to construct knowledge graphs are not lowered enough to be easily and broadly adopted by industry. These reasons and the vastly populated knowledge graph construction W3C Community Group4 show that there are still open questions that require further investigation to come up with groundbreaking solutions.</p>
      </abstract>
      <kwd-group>
        <kwd>Workshop</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CEUR
Workshop
Proceedings
Besides R2RML-based extensions, alternatives were proposed, e.g., SPARQL-Generate [11] and
ShExML [12], as well as methods to perform data transformations while constructing knowledge
graphs, e.g., FnO [13] and FunUL [14].</p>
      <p>The fourth edition of the knowledge graph construction workshop5 has a special focus on
the benchmarking of knowledge graph construction methods. We aim to put the focus of this
edition to methods for eficient construction of KGs that evaluate or analyze the trade-ofs of
diferent approaches and when to choose which system. It also included:
• Keynote. The workshop includes the keynote from Benjamin Cogrel (Ontopic): “From</p>
      <p>Ontop to Ontopic: a virtual-first perspective on knowledge graph construction”
• The Knowledge Graph Construction Challenge. For the first time, the workshop includes
a challenge that aims at benchmarking systems to find which RDF graph construction
system optimizes for metrics e.g. execution time, CPU, memory usage, or a combination
of these metrics.</p>
      <p>The final goal of the event is to provide a venue for scientific discourse, systematic analysis
and rigorous evaluation of languages, techniques and tools, as well as practical and applied
experiences and lessons-learned for constructing knowledge graphs from academia and industry.</p>
      <p>Fourteen papers were submitted. The reviews were open and public, and hosted at Open
Review6. Each paper received at least three reviews from reviewers with diferent background
and status. Each paper received a review from a senior, a junior and an industry researcher.</p>
      <p>Six papers were accepted and four were conditionally accepted. Five of the accepted papers
were long papers and the other five were short papers. The following papers were accepted for
publication and presented at the workshop:
• Composable Semantic Data Transformation Pipelines with Chimera [15].
• Test-driven Knowledge Graph Construction [16].
• Towards Semantic Interpretation of Structured Data Sources in Privacy-Preserving
Environments [17].
• Declarative RDF Construction from In-Memory Data Structures with RML [18].
• Reference Conditions: Relating Mapping Rules Without Joining [19].
• Preserving the Alignment of LD with Source Data [20].
• Designing NORIA: a Knowledge Graph-based Platform for Anomaly Detection and
Incident Management in ICT Systems [21].
• Towards a Mapping Framework for the Tenders Electronic Daily Standard Forms [22].
• Meta2KG: An Embeddings-based Approach for Transforming Metadata to Knowledge</p>
      <p>Graphs [23].</p>
      <p>• Scaling RML and SPARQL-based Knowledge Graph Construction with Apache Spark [24].</p>
      <p>For the first time, a Knowledge Graph Construction Challenge was organized during the
workshop to evaluate the performance of diferent knowledge graph construction approaches
in terms of execution time and resources e.g. CPU, RAM, etc. The goal of this challenge was
5http://w3id.org/kg-construct/workshop/2023
6https://openreview.net/group?id=eswc-conferences.org/ESWC/2023/Workshop/KGCW
to identify existing gaps of diferent approaches and not only put the focus on execution time,
but also resources. The challenge consisted of 2 parts: (i) artificial data for analyzing specific
parameters of the construction process e.g. joins, data size, mappings, and (ii) real-life data of the
GTFS Madrid Benchmark to evaluate approaches in real use cases. We received 4 participants:
CARML, SDM-RDFizer, RDFProcessingToolkit / Sansa, and RMLStreamer of which 3 submitted
a final report included in the proceedings:
• Knowledge Graph Creation Challenge: Results for SDM-RDFizer [25]
• KGCW2023 Challenge Report: RDFProcessingToolkit / Sansa [26]
• RMLStreamer with Reference Conditions in the KGCW Challenge 2023 [27]
Organizing Committee
• David Chaves-Fraga, Universidad Politécnica de Madrid &amp; KU Leuven
• Anastasia Dimou, KU Leuven, Flanders Make, Leuven.AI
• Dylan Van Assche, Ghent University – imec – IDLab
• Ana Iglesias-Molina, Universidad Politécnica de Madrid
• Umutcan Serles, University of Innsbruck
Program Committee
[8] J. F. Sequeda, D. P. Miranker, Ultrawrap: SPARQL execution on relational data, Web</p>
      <p>Semantics: Science, Services and Agents on the WWW (2013).
[9] F. Priyatna, O. Corcho, J. Sequeda, Formalisation and experiences of r2rml-based sparql to
sql query translation using morph, in: Proceedings of the 23rd International Conference
on World Wide Web, 2014.
[10] D. Calvanese, B. Cogrel, S. Komla-Ebri, R. Kontchakov, D. Lanti, M. Rezk, M.
RodriguezMuro, G. Xiao, Ontop: Answering SPARQL Queries over Relational Databases, Semantic
Web Journal (2017).
[11] M. Lefrançois, A. Zimmermann, N. Bakerally, A SPARQL extension for generating RDF
from heterogeneous formats, in: The Semantic Web: 14th International Conference, 2017.
[12] H. García-González, I. Boneva, S. Staworko, J. E. Labra-Gayo, J. M. C. Lovelle, Shexml:
improving the usability of heterogeneous data mapping languages for first-time users,
PeerJ Computer Science 6 (2020) e318.
[13] B. De Meester, A. Dimou, R. Verborgh, E. Mannens, An ontology to semantically declare
and describe functions, in: European Semantic Web Conference, 2016, pp. 46–49.
[14] A. C. Junior, C. Debruyne, R. Brennan, D. O’Sullivan, Funul: a method to incorporate
functions into uplift mapping languages, in: Proceedings of the 18th International Conference
on Information Integration and Web-based Applications and Services, 2016, pp. 267–275.
[15] M. Grassi, M. Scrocca, A. Carenini, M. Comerio, I. Celino, Composable Semantic Data
Transformation Pipelines with Chimera, in: Proceedings of the 4th International Workshop
on Knowledge Graph Construction, 2023.
[16] J. Mynarz, K. Haniková, V. Svátek, Test-driven knowledge graph construction, in:
Proceedings of the 4th International Workshop on Knowledge Graph Construction, 2023.
[17] C. Karalka, G. Meditskos, N. Bassiliades, Towards Semantic Interpretation of Structured
Data Sources in Privacy-Preserving Environments, in: Proceedings of the 4th International
Workshop on Knowledge Graph Construction, 2023.
[18] I. Dasoulas, D. Chaves-Fraga, D. Garijo, A. Dimou, Declarative RDF construction from
in-memory data structures with RML, in: Proceedings of the 4th International Workshop
on Knowledge Graph Construction, 2023.
[19] E. de Vleeschauwer, S. M. Oo, B. De Meester, P. Colpaert, Reference conditions: relating
mapping rules without joining, in: Proceedings of the 4th International Workshop on
Knowledge Graph Construction, 2023.
[20] A. Randles, D. O’Sullivan, Preserving the Alignment of LD with Source Data, in:
Proceedings of the 4th International Workshop on Knowledge Graph Construction, 2023.
[21] L. Tailhardat, Y. Chabot, R. Troncy, Designing NORIA: a Knowledge Graph-based Platform
for Anomaly Detection and Incident Management in ICT Systems, in: Proceedings of the
4th International Workshop on Knowledge Graph Construction, 2023.
[22] E. Costetchi, A. Vassiliades, C. Nyulas, Towards a Mapping Framework for the Tenders
Electronic Daily Standard Forms, in: Proceedings of the 4th International Workshop on
Knowledge Graph Construction, 2023.
[23] N. Abdelmageed, B. König-Ries, Meta2KG: An Embeddings-based Approach for
Transforming Metadata to Knowledge Graphs, in: Proceedings of the 4th International Workshop
on Knowledge Graph Construction, 2023.
[24] C. Stadler, L. Bühmann, L.-P. Meyer, M. Martin, Scaling RML and SPARQL-based Knowledge
Graph Construction with Apache Spark, in: Proceedings of the 4th International Workshop
on Knowledge Graph Construction, 2023.
[25] E. Iglesias, V. Maria-Esther, Knowledge Graph Creation Challenge: Results for
SDMRDFizer, in: Proceedings of the 4th International Workshop on Knowledge Graph
Construction, 2023.
[26] S. Bin, C. Stadler, L. Bühmann, KGCW2023 Challenge Report: RDFProcessingToolkit /
Sansa, in: Proceedings of the 4th International Workshop on Knowledge Graph
Construction, 2023.
[27] E. de Vleeschauwer, G. Haesendonck, D. Van Assche, B. De Meester, RMLStreamer with
Reference Conditions in the KGCW Challenge 2023, in: Proceedings of the 4th International
Workshop on Knowledge Graph Construction, 2023.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>X. L.</given-names>
            <surname>Dong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Liang</surname>
          </string-name>
          , J. Ma,
          <string-name>
            <given-names>Y. E.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. Blanco</given-names>
            <surname>Saldana</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Deshpande</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Michetti</given-names>
            <surname>Manduca</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. P.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Xiao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.-S.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Karamanolakis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Mao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Faloutsos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>McCallum</surname>
          </string-name>
          , J. Han,
          <article-title>Autoknow: Self-driving knowledge collection for products of thousands of types</article-title>
          ,
          <source>KDD '20</source>
          ,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>2020</year>
          , p.
          <fpage>2724</fpage>
          -
          <lpage>2734</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Dimou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. V.</given-names>
            <surname>Sande</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Colpaert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Verborgh</surname>
          </string-name>
          , E. Mannens, R. V. de Walle,
          <article-title>Rml: A generic language for integrated rdf mappings of heterogeneous data</article-title>
          ,
          <source>in: Proceedings of the 7th Workshop on Linked Data on the Web (LDOW)</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D.</given-names>
            <surname>Chaves-Fraga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Priyatna</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Perez-Santana</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Corcho</surname>
          </string-name>
          ,
          <article-title>Virtual statistics knowledge graph generation from CSV files</article-title>
          , in: Emerging Topics in Semantic Technologies:
          <article-title>ISWC 2018 Satellite Events, Studies on the Semantic Web</article-title>
          , IOS Press,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>F.</given-names>
            <surname>Michel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Djimenou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Faron-Zucker</surname>
          </string-name>
          ,
          <source>J. Montagnat, xR2RML: Relational and NonRelational Databases toRDF Mapping Language</source>
          ,
          <source>Technical Report</source>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>E.</given-names>
            <surname>Iglesias</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jozashoori</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Chaves-Fraga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Collarana</surname>
          </string-name>
          , M.-E. Vidal,
          <article-title>SDM-RDFizer: An RML Interpreter for the Eficient Creation of RDF Knowledge Graphs</article-title>
          ,
          <source>in: Proceedings of the 29th ACM International Conference on Information &amp; Knowledge Management</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>3039</fpage>
          -
          <lpage>3046</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>U.</given-names>
            <surname>Şimşek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Kärle</surname>
          </string-name>
          , D. Fensel, RocketRML
          <article-title>- A NodeJS implementation of a Use-Case Specific RML Mapper</article-title>
          ,
          <source>in: Proceedings of the 1st Workshop on Knowledge Graph Building</source>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Jozashoori</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Chaves-Fraga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Iglesias</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.-E. Vidal</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          <string-name>
            <surname>Corcho</surname>
          </string-name>
          , Funmap:
          <article-title>Eficient execution of functional mappings for knowledge graph creation</article-title>
          , in: International Semantic Web Conference, Springer,
          <year>2020</year>
          , pp.
          <fpage>276</fpage>
          -
          <lpage>293</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>