<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>RMLMapper supported by RML-view-to-CSV in the KGC W Challenge 2025</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Els de Vleeschauwer</string-name>
          <email>els.devleeschauwer@ugent.be</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dylan Van Assche</string-name>
          <email>dylan.vanassche@ugent.be</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ben De Meester</string-name>
          <email>ben.demeester@ugent.be</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Workshop</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Ghent University - imec</institution>
          ,
          <addr-line>Technologiepark-Zwijnaarde 122, 9052 Ghent</addr-line>
          ,
          <country country="BE">Belgium</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>IDLab, Department of Electronics and Information Systems</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>KGCW'25: 6th International Workshop on Knowledge Graph Construction</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <abstract>
        <p>This paper presents the results of the Knowledge Graph Construction Workshop 2025 Challenge with RMLMapper, a reference in-memory JAVA implementation of the RML specification (RMLio) before RML got taken up by the W3C Knowledge Graph Construction (KGC) Community Group (RMLkgc). We updated RMLMapper to translate R2RML and RMLio into the latest version of RMLkgc. We participated in the challenge for the modules RML-Core, RML-IO and RML-LV. RMLMapper largely supports RML-Core and RML-IO (90% and 65%). For RML-LV, we added RML-view-to-CSV as a first step to our knowledge graph construction pipeline to materialize RML logical views (RML-LV) as CSV files. This way, RMLMapper largely supports RML-LV (81%). We currently cannot claim full coverage of the RMLkgc specification, however, this challenge enabled us to uncover the current gaps in our implementation, and allowed us to provide basic RMLkgc support in a mature JAVA implementation. Our continued participation in the community group will allow us to remain up to date with the latest developments in RMLkgc and further identify future implementation priorities for the RMLMapper.</p>
      </abstract>
      <kwd-group>
        <kwd>RMLMapper</kwd>
        <kwd>RML-view-to-CSV</kwd>
        <kwd>challenge</kwd>
        <kwd>knowledge graph construction</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>W</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>
        The third edition of the Knowledge Graph Construction Workshop (KGCW) challenge1 is a next iteration
of the KGCW2024 challenge’s conformance track2. Its continued aim is to assess the compliance with
the RML specification as established by the W3C Community Group on Knowledge Graph Construction
(from hereon dubbed “RMLkgc”) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The RML specification is made modular and extends both the RDB
to RDF Mapping Language (R2RML) specification [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and the initial RML specification [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] (from hereon
dubbed “RMLio”).
      </p>
      <p>RML’s modules encompass (i) RML-Core, an iteration of the original R2RML functionality (exempt
database connection and query functionality); (ii) RML-IO, an iteration of the RMLio data source
reference functionality; (iii) RML-IO-Registry, a living document of reference formulations; (iv)
RMLFNML, an iteration of the RMLio FnO integration for data transformation functionality; (v) RML-LV, a
new module that provides an abstraction layer between RML-IO and RML-Core; (vi) RML-CC, a new
module that provides support for RDF Collection and RDF Container generation; and (vii) RML-Star,
a new module that provides support for RDF 1.2 triple term generation. The challenge contains test
cases for each of these modules, to determine their feasibility and correctness by applying them in
implementations.</p>
      <p>
        In this paper, we present the results for RMLMapper, a reference in-memory JAVA implementation of
the RMLio specification. Since our previous submission [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], we further updated RMLMapper to translate
R2RML and RMLio into the latest version of RMLkgc, and combined it with RML-view-to-CSV [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], a
(B. De Meester)
      </p>
      <p>CEUR</p>
      <p>ceur-ws.org</p>
      <p>CHALLENGE INPUT:</p>
      <p>RML mapping
CHALLENGE INPUT:
source data</p>
      <p>RML mapping 
without logical views
logical view 
converted to</p>
      <p>CSV files
RML-view-to-CSV</p>
      <p>RMLMapper</p>
      <p>OUTPUT KGC pipeline:
RDF knowledge graph
preprocessing library for RML-LV that materializes Logical Views as CSV files and produces an adapted
mapping document that can be processed by any RMLkgc engines supporting CSV files as source data
format (Figure 1). With release v1.1.03, RML-view-to-CSV has been updated to support most features of
the latest version of the RML-LV specification.</p>
    </sec>
    <sec id="sec-3">
      <title>2. Results</title>
      <p>We participated in the challenge for the modules RML-Core, RML-IO and RML-LV. We employed
RMLMapper v7.3.34 and RML-view-to-CSV v1.1.03.</p>
      <p>RMLMapper supports 53 out of 59 RML-Core test cases (89.83%). Failing cases are due to (i) using
JSONPath expressions for keys that contain special characters (RMLTC010a-JSON, RMLTC0010b-JSON,
RMLTC0010c-JSON), whereas the employed RMLMapper library does not follow these latest IETF
draft specifications yet; (ii) no support for the shortcut property rml:subject in combination with a
blank node (RMLTC0025a-JSON); (iii) being too lenient in wrongfully ignoring the rml:termType in a
constant-valued term map (RMLTC0025b-JSON).</p>
      <p>Considering the failing of test case RMLTC0023e-JSON, this is deemed an incorrect test case, and
discussion is ongoing5.</p>
      <p>RMLMapper supports 41 out of 73 RML-IO test cases (56.16%). Failing cases are due to (i) no
support for compressed data sources (RMLSTC0002b, RMLSTC0002c, RMLSTC0002d, RMLSTC0002e); (ii) no
support for logical source serialization n-triples (RMLSTC0003); (iii) incorrectly handling CSWV null
values (RMLSTC0004a, RMLSTC0004c); (iv) testing implementation of RMLMapper misses support for
templated values for SQL test cases (RMLSTC0006a); (v) no support for rml:CurrentWorkingDirectory
(RMLSTC0006b); (vi) no support for XML reference formulation with namespaces (RMLSTC0007d);
(vii) returning warnings instead of errors (RMLSTC0010a, RMLSTC0010b); (viii) no support for targets in
rml:DatatypeMap (RMLTTC0001f, RMLTTC0002q); and (ix) no support for logical target serializations
N3, RDF/JSON, and RDF/XML (RMLTTC0004b, RMLTTC0004e, RMLTTC0004f), UTF-16 encoding
(RMLTTC0005b), and compression (RMLTTC0006b, RMLTTC0006c, RMLTTC0006d, RMLTTC0006e).</p>
      <p>Considering the failing of test cases RMLTTC0002f, RMLTTC0002g, RMLTTC0002h, RMLTTC0002i,
RMLTTC0002k, RMLTTC0002l, RMLTTC0002m, RMLTTC0002n, RMLTTC0002r: mistakes in the test
cases were discovered concerning correct graph assignment6. Considering the failing of test case
RMLTTC0004g, a mistake in the test case was discovered concerning incorrect test data7. By fixing
these mistakes8, RMLMapper supports 51 out of 73 RML-IO test cases (69.86%).</p>
      <p>RMLMapper in combination with RML-view-to-CSV supports 26 out of 32 RML-LV test
cases (81.25%). Failing cases are due to (i) no support for constant-valued and tempate-valued
rml:ExpressionFields in RML-view-to-CSVW (RMLLVTC0001b, RMLLVTC0001c); and (ii) no support
3https://github.com/RMLio/rml-view-to-csv/releases/tag/v1.1.0
4https://github.com/RMLio/rmlmapper-java/releases/tag/v7.3.3
5https://github.com/kg-construct/rml-core/issues/203
6https://github.com/kg-construct/rml-io/issues/129, https://github.com/kg-construct/rml-io/issues/130, and https://github.
com/kg-construct/rml-io/issues/131
7https://github.com/kg-construct/rml-io/issues/132
8https://github.com/kg-construct/rml-io/pull/133
for natural data type mapping in RML-view-to-CSV (RMLLVTC0004a, RMLLVTC0004b, RMLLVTC0004c,
RMLLVTC0004d).</p>
      <p>RML-view-to-CSV can not support natural data type mapping because it materializes logical views
as CSV files. CSV does not provide any native data types, therefore there is no natural RDF mapping of
CSV values upon XSD data types9. A mistake in RML-LV test case RMLLVTC0008b was discovered and
corrected10.</p>
    </sec>
    <sec id="sec-4">
      <title>3. Conclusion</title>
      <p>We currently can not claim full coverage of the RMLkgc specification for modules RML-Core, RML-IO,
or RML-LV, however, this challenge enabled us to uncover the current gaps in our implementation,
and allowed us to provide basic RMLkgc support in a mature JAVA implementation. Finally, some
errors in the test cases have been discovered. Our continued participation in the community group
will allow us to remain up to date with the latest developments in RMLkgc and further identify future
implementation priorities for the RMLMapper.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>The described research activities were supported by SolidLab Vlaanderen (Flemish Government, EWI
and RRF project VV023/10), and the European Union’s Horizon Europe research and innovation program
under grant agreement no. 101058682 (Onto-DESIDE).</p>
    </sec>
    <sec id="sec-6">
      <title>Declaration on Generative AI</title>
      <p>The author(s) have not employed any Generative AI tools.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Iglesias-Molina</surname>
          </string-name>
          ,
          <string-name>
            <surname>D. Van Assche</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Arenas-Guerrero</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>De Meester</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Debruyne</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jozashoori</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Maria</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Michel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Chaves-Fraga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Dimou</surname>
          </string-name>
          ,
          <article-title>The RML Ontology: A Community-Driven Modular Redesign After a Decade of Experience in Mapping Heterogeneous Data to RDF</article-title>
          , in
          <source>: Proceedings of the International Semantic Web Conference (ISWC), Lecture Notes in Computer Science</source>
          , Springer, Cham,
          <year>2023</year>
          . doi:
          <volume>10</volume>
          .1007/978- 3-
          <fpage>031</fpage>
          - 47243-
          <issue>5</issue>
          _
          <fpage>9</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>S.</given-names>
            <surname>Das</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sundara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cyganiak</surname>
          </string-name>
          ,
          <article-title>R2RML: RDB to RDF Mapping Language</article-title>
          , Working Group Recommendation,
          <source>World Wide Web Consortium (W3C)</source>
          ,
          <year>2012</year>
          . URL: http://www.w3.org/TR/r2rml/.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Dimou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. Vander</given-names>
            <surname>Sande</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Colpaert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Verborgh</surname>
          </string-name>
          , E. Mannens, R. Van de Walle,
          <article-title>RML: A Generic Language for Integrated RDF Mappings of Heterogeneous Data</article-title>
          ,
          <source>in: Proceedings of the 7th Workshop on Linked Data on the Web</source>
          , volume
          <volume>1184</volume>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Van</given-names>
            <surname>Assche</surname>
          </string-name>
          ,
          <source>Dylan and Jankaj</source>
          , Jozef and De Meester, Ben, Backwards or forwards? [R2]
          <article-title>RML backwards compatibility in RMLMapper</article-title>
          , in: KGCW 2024 :
          <article-title>Knowledge Graph Construction 2024 :</article-title>
          <source>Proceedings of the 5th International Workshop on Knowledge Graph Construction colocated with 21th Extended Semantic Web Conference (ESWC</source>
          <year>2024</year>
          ), volume
          <volume>3718</volume>
          ,
          <year>2024</year>
          , p.
          <fpage>7</fpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3718</volume>
          /.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>E. de Vleeschauwer</surname>
          </string-name>
          , B. De Meester, P. Colpaert,
          <article-title>RML-view-to-CSV: A Proof-of-Concept Implementation for RML Logical Views</article-title>
          ,
          <source>in: Proceedings of the 5th International Workshop on Knowledge Graph Construction (KGCW</source>
          <year>2024</year>
          )
          <article-title>co-located with 20th Extended Semantic Web Conference (ESWC</article-title>
          <year>2024</year>
          ),
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>