<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Preface of MEPDaW 2020: Managing the Evolution and Preservation of the Data Web</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Fabrizio Orlandi</string-name>
          <email>orlandif@tcd.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Damien Graux</string-name>
          <email>grauxd@tcd.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maria-Esther Vidal</string-name>
          <email>mvidal@umiacs.umd.edu</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Javier D. Fernández</string-name>
          <email>javier_d.fernandez@roche.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jeremy Debattista</string-name>
          <email>jdebattista@topquadrant.com</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ADAPT SFI Centre, Trinity College Dublin</institution>
          ,
          <country country="IE">Ireland</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>F. Hoffmann-La Roche AG</institution>
          ,
          <country country="CH">Switzerland</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Technische Informationsbibliothek (TIB)</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>TopQuadrant Inc</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>The MEPDaW workshop series targets one of the emerging and fundamental problems of the Web, specifically the management and preservation of evolving knowledge graphs. During the past six years, the workshop series has been gathering a community of researchers and practitioners around these challenges. To date, the series has successfully published more than 30 articles allowing more than 50 individual authors to present and share their ideas. This 6th edition, virtually co-located with the International Semantic Web Conference (ISWC 2020), gathered the community around nine research publications and one invited keynote presentation. The event took place online on the 1st of November, 2020.</p>
      </abstract>
      <kwd-group>
        <kwd>Web Data evolution</kwd>
        <kwd>Data preservation</kwd>
        <kwd>provenance and lineage</kwd>
        <kwd>Temporal &amp; Evolving Knowledge Graphs</kwd>
        <kwd>RDF archiving and versioning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        There is a vast and rapidly increasing quantity of scientific, corporate,
government, and crowd-sourced data openly published on the Web. Open Data acts
as a catalyst for the large-scale exploitation of structured information.
A traditional view of digitally preserving these datasets by “pickling and locking
them away” for future use, like groceries, conflicts with their evolution. There
are several approaches and frameworks (e.g. Linked Data Stack [
        <xref ref-type="bibr" rid="ref1">10</xref>
        ], PoolParty
Suite, Metaphactory, etc.) targeted at managing the life-cycle of the Data
Web. More specifically, these solutions are expected to tackle major issues such
as the synchronisation problem (monitoring changes) [
        <xref ref-type="bibr" rid="ref3 ref7">12,16</xref>
        ], the curation
problem (repairing data imperfections) [
        <xref ref-type="bibr" rid="ref5">14</xref>
        ], the appraisal problem (assessing the
quality of a dataset) [
        <xref ref-type="bibr" rid="ref2">11</xref>
        ], the citation problem (how to cite a particular
version of a dataset) [4], the archiving problem (retrieving a specific version of a
dataset) [
        <xref ref-type="bibr" rid="ref4 ref6">13,15</xref>
        ], and the sustainability problem (preserving at scale, ensuring
long-term access) [4].
      </p>
      <p>The sixth edition of this workshop was organised for the first time at the
International Semantic Web Conference (ISWC) and followed the structure of
the previous editions. We invited a number of experts in the field of Linked
Data and Data Evolution &amp; Preservation to suggest and advise on the
topics covered by the workshop. At ISWC 2020, we gathered more than 60
participants for our half-day event. Like most academic events this year,
MEPDaW was held virtually, which required us to re-think the interactions
between participants.</p>
    </sec>
    <sec id="sec-2">
      <title>MEPDaW Scientific programme</title>
      <p>The workshop started with the keynote entitled “Sharing, Tracking, and
Enhancing Highly Dynamic Knowledge Graphs” given by Prof. Philippe Cudré-Mauroux
from the eXascale Infolab (https://exascale.info/) at the University of Fribourg,
Switzerland. He described how Knowledge Graphs are, in practice, highly dynamic
and incomplete, and the challenges that this entails. In particular,
Prof. Cudré-Mauroux gave an overview of some of the recent techniques developed
in his lab to improve the automated processing of large-scale and evolving
Knowledge Graphs. First, he described data-driven techniques to identify
information gaps in Knowledge Graphs (e.g., in terms of missing classes or
properties). Then, he presented a series of methods to impute missing values in
the graphs. Finally, he gave insights into two of his lab’s funded projects and
large-scale system deployments: one for Swiss open research data, and one for
knowledge tracking on Microsoft Azure. Overall, the keynote gave the audience
in-depth details on practical (and industrial) use cases backed by cutting-edge
research techniques.</p>
      <p>The first article presented dealt with an approach to detect and assess the
semantic drift among temporally distinct versions of an ontology [1]. It was followed
by [4], which proposes the implementation of a global, persistent identifier
system built upon time-based immutable resource revisioning of generic HTTP
resources, as identified by their URL and resolved via time-based HTTP content
negotiation, building upon existing Web standards.</p>
      <p>Following the keynote, three research efforts dedicated to specific use-cases
and systems were presented. Both Jamal A. Nasir [7] and André Regino [8]
focused on Linked Open Data related challenges. The former presented iLOD: a
decentralised file system dedicated to the LOD cloud [7]. The latter described
a new strategy to discover semantically broken links within LOD datasets [8].
In [9], the authors tackled the representation of scientific literature evolution
across time. As new research is conducted, knowledge evolves, getting
documented in dissertations, theses and articles. They presented new methods
exploiting Temporal Knowledge Graphs (TKGs) to model knowledge evolution in
corpora of unstructured texts.</p>
      <p>In addition, MEPDaW 2020 gathered three publications addressing
challenges related to the RDF standard. First, Cuevas &amp; Hogan [2] explored solutions
for querying archives of versioned RDF data using SPARQL and off-the-shelf
engines; in particular they considered multiple representations of RDF archives,
and described how input queries can be automatically rewritten to return
solutions for a particular version (or solutions that change between versions).
Second, in [6], the authors presented a framework for an automatic adaptation of
RDF-based semantic annotations when RDF graphs are modified. Third, Gleim
et al. [5] focused on provenance tracking and presented a concrete alignment
of all roles and relations in the FactDAG model to the W3C PROV
provenance standard, allowing future software implementations to directly produce
standard-compliant provenance information.</p>
      <p>Finally, wrapping up the article sessions and starting the open discussion,
Lars Gleim and Stefan Decker [3] presented a review of open challenges for the
management and preservation of evolving data on the Web.</p>
      <sec id="sec-2-1">
        <title>Organizing Committee</title>
        <p>– Fabrizio Orlandi, ADAPT SFI Centre, Trinity College Dublin, Ireland
– Damien Graux, ADAPT SFI Centre, Trinity College Dublin, Ireland
– Maria-Esther Vidal, TIB, Hannover, Germany
– Javier D. Fernández, F. Hoffmann-La Roche AG, Switzerland
– Jeremy Debattista, TopQuadrant Inc.</p>
      </sec>
      <sec id="sec-2-2">
        <title>Advisory Board</title>
        <p>– Laure Berti-Equille, IRD Marseille, France
– Declan O’Sullivan, ADAPT Centre, Trinity College Dublin, Ireland
– James Anderson, Dydra - Datagraph, USA
– Axel Polleres, Vienna University of Economics and Business, Austria
</p>
      </sec>
      <sec id="sec-2-3">
        <title>Programme Committee</title>
        <p>
– Natanael Arndt, Leipzig University, Germany
– Ioannis Chrysakis, FORTH-ICS, Greece &amp; Ghent Uni. (imec), Belgium
– Diego Collarana, Fraunhofer IAIS, Germany
– Pieter Colpaert, Ghent University, Belgium
– Christophe Debruyne, Trinity College Dublin, Ireland
– Luis Ibanez-Gonzalez, University of Southampton, England
– Harshvardhan J. Pandit, ADAPT Centre - Trinity College Dublin, Ireland
– George Papastefanatos, IMIS / RC "Athena", Greece
– Giuseppe Pirrò, Sapienza University of Rome, Italy
– Julio Cesar dos Reis, University of Campinas, Brazil
– Ruben Taelman, Ghent University – imec, Belgium
– Brecht Van de Vyvere, Ghent University, Belgium</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Acknowledgements</title>
      <p>We would like to thank all the authors, reviewers, committee members and the
invited speaker for their contributions, support and commitment during this
particularly challenging year.</p>
      <p>These research activities were conducted with the financial support of the
European Union’s Horizon 2020 research and innovation programme under the Marie
Skłodowska-Curie Grant Agreements No. 801522 and No. 713567 at the ADAPT
SFI Research Centre at Trinity College Dublin. The ADAPT SFI Centre for
Digital Media Technology is funded by Science Foundation Ireland through the SFI
Research Centres Programme and is co-funded under the European Regional
Development Fund (ERDF) through Grant #13/RC/2106.</p>
    </sec>
    <sec id="sec-4">
      <title>Articles presented at MEPDaW 2020</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          10.
          <string-name>
            <surname>Auer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bühmann</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dirschl</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Erling</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hausenblas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Isele</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mendes</surname>
            ,
            <given-names>P.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Van Nuffelen</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , et al.:
          <article-title>Managing the life-cycle of linked data with the LOD2 stack</article-title>
          . In: International Semantic Web Conference. pp.
          <fpage>1</fpage>
          -
          <lpage>16</lpage>
          . Springer (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          11.
          <string-name>
            <surname>Debattista</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Auer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lange</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Luzzu-a methodology and framework for linked data quality assessment</article-title>
          .
          <source>J. Data and Information Quality</source>
          <volume>8</volume>
          (
          <issue>1</issue>
          ) (Oct
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          12.
          <string-name>
            <surname>Endris</surname>
            ,
            <given-names>K.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Faisal</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Orlandi</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Auer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scerri</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Interest-based RDF update propagation</article-title>
          .
          <source>In: Proceedings of the 14th International Conference on The Semantic Web - ISWC 2015 - Volume</source>
          <volume>9366</volume>
          . pp.
          <fpage>513</fpage>
          -
          <lpage>529</lpage>
          . Springer-Verlag, Berlin, Heidelberg (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          13.
          <string-name>
            <surname>Fernández</surname>
            ,
            <given-names>J.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Polleres</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Umbrich</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Towards efficient archiving of dynamic linked open data</article-title>
          . In:
          <source>MEPDaW workshop at ESWC'15</source>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          14.
          <string-name>
            <surname>Freitas</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Curry</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          :
          <article-title>Big data curation</article-title>
          . In:
          <source>New Horizons for a Data-Driven Economy</source>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          15.
          <string-name>
            <surname>Pelgrin</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Galárraga</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Towards fully-fledged archiving for RDF datasets</article-title>
          .
          <source>Semantic Web Journal</source>
          , accepted with minor revisions (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          16.
          <string-name>
            <surname>Tasnim</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Collarana</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Graux</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Orlandi</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vidal</surname>
            ,
            <given-names>M.E.</given-names>
          </string-name>
          :
          <article-title>Summarizing entity temporal evolution in knowledge graphs</article-title>
          .
          <source>In: Companion Proceedings of The 2019 World Wide Web Conference</source>
          . pp.
          <fpage>961</fpage>
          -
          <lpage>965</lpage>
          . WWW '19, Association for Computing Machinery, New York, NY, USA (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>