<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Building a French Revolution Narrative from Wikidata</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Inès Blin</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science</institution>
          ,
          <addr-line>Vrije Universiteit Amsterdam</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>SONY Computer Science Laboratories</institution>
          ,
          <addr-line>Paris</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Historical reconstructions can help us better understand important events. This is essential for bettering our knowledge on specific topics and this enables us to build coherent sequences of information. Knowledge graphs like Wikidata contain information about generic knowledge that includes historical events, but the information is often entity-centric rather than event-centric. This makes the analysis and understanding of events related to a same topic harder, since it is less easily accessible. In this demonstration paper, we developed a system to build a timeline about the French Revolution in the form of a graph of a sequence of events, also called a narrative in this work. The system comes with an interface, where a user can first select the types of events to be retrieved, and is then walked through the steps of the system. The output of the system is a timeline of the French Revolution.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        We constantly create narratives to provide explana- A narrative is one way to understand or explain a
tions or justifications for why and how somethings series of related events. In this work, a narrative is
happens. This capability of creating such narratives technically defined as a sequence of events in the
is even suggested to be part of what makes us human form of a graph. A narrative in this context is thus
[
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. AI systems able to create narratives could a type of knowledge graph. Therefore, our work is
therefore help in the realisation of human-centric related to narratives and to how events are
strucsystems. In this demonstration paper, we present a turally represented, especially in event knowledge
system that can build a timeline of the French Rev- graphs structures. The output of our system is a
olution by exploring Wikidata. The system takes timeline generated from a knowledge graph, hence
the user’s inputs to select nodes in Wikidata, walks our work is also related to timeline generation from
the users through the diferent steps and outputs a knowledge bases. We describe the aforementioned
timeline. research domains in this section.
      </p>
      <p>
        The system presented in this demonstration pa- Narratives are a means for better understanding
per builds a narrative in the form of a graph. A how a situation unfolds over time. The events are
narrative is defined here as a sequence of events. the atomic concepts underlying these narratives.
The main novelty of the paper is a system that walks Towards formally defining such structures, Bartalesi
the user through the diferent steps of the process and Meghini [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] for example propose to define formal
and lets the user choose the events to include for components of narratives using event calculus theory
the narrative. Earlier systems could let the user [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
navigate through events and entities without a fi- From a resource perspective, the closest to our
nal timeline output [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], or could build the timeline work would be event knowledge graphs that focus
without interacting with the user, for example by on describing events and links between them. Guan
retrieving most significant events for a biography et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] survey event knowledge graphs from four
description [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. diferent perspectives: history, ontology, instance
and application. More recent research has focused
on providing better access to semantic structured
data about events, and has emphasised the need to
have such event-centric information for applications
such as timeline generation or history
reconstrucIJCAI 2022: Workshop on semantic techniques for tion. Available resources now include (i) EventKG
tnraiarrative-based understanding, July 24, 2022, Vienna, Aus- [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] built from generic knowledge graphs, (ii) a
knowl$ ines.blin@sony.com (I. Blin) edge graph built from textual news data [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] (iii) a
0000-0003-0956-9466 (I. Blin) knowledge graph built from a Sherlock Holmes novel
©Cr2e0a2t2ivCeoCpyormigmhtonfosrLtihciesnpsaepAerttbryibiuttsioaunt4h.o0rsI.nUtesrenapteiromniatlte(dCCunBdeYr [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] (iv) semantic data for cultural heritage [11] and
CPWrEooUrckReshdoinpgs IhStpN:/c1e6u1r3-w-0s.o7r3g 4C.0E).UR Workshop Proceedings (CEUR-WS.org) (v) GDELT [12] and ICEWS [13].
      </p>
      <p>
        From an application perspective, the closest to with a description of each event as well as event
our work would be systems that use knowledge bases type and participants. The demonstrator currently
for timeline or hypothesis generation. The aim of outputs a timeline for the French Revolution only,
such systems is to help a user understand better one but could be extended to other types of revolutions
topic. For quality, Althof et al. [14] define three for instance. The code is publicly available1.
criteria for a timeline quality: relevance, tempo- The backend is written in Python 3.9.4, the
interrally diverse and content diverse. For applications, face was developed with Streamlit2. Wikidata was
Metilli et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] use Wikidata to build biographies, accessed through the Wikidata SPARQL endpoint.
e.g. Dante’s one. Kroll et al. [15] furthermore build Figure 1 shows the home page of the demonstrator
narratives as hypotheses on top of diferent knowl- that explains the purpose of the application and the
edge bases, with an application in the biomedical main steps to extract the timeline.
domain.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Demonstrator Features</title>
      <p>In the demonstration system, the user is walked
through the diferent steps to build a narrative
graph: 1) Collect events from Wikidata and
select paths to extract nodes from 2) Extract features
from Wikidata 3) Extract additional information
from Wikipedia info boxes 4) Convert the data
into RDF triples compatible with the Simple Event
Model 5) Display the narrative graph as a timeline,</p>
      <sec id="sec-3-1">
        <title>3.1. Event Collection from Wikidata</title>
        <sec id="sec-3-1-1">
          <title>The first step to build the timeline is to collect</title>
          <p>events from Wikidata. In the demonstrator, the
user can select a collection of paths to extract events
from, depicted in Figure 2. The user can then press
the button “Collect events” to query the Wikidata
knowledge graph.</p>
          <p>Events and links to Wikidata and Wikipedia are
stored in the backend, and the user is provided with
some additional information, such as the number of
events retrieved.</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Extracting narrative features</title>
        <p>Once the events have been extracted from Wikidata
depending on the user’s input paths, the next step
is to extract narrative features for the events. A
narrative feature for an event contains information
about either a participant, a timestamp, a location,
a type of event, or a temporal or causal link to
another event. In Section 2. and 3. displayed in
Figure 1, the user can press a button to extract
these features.</p>
        <p>In the demonstrator, the user can see the features
that were extracted, as well as some additional
statistics on the distribution of the types of features.</p>
        <p>We found that features extracted from Wikidata
contained mostly information about time and
location, whereas features extracted from Wikipedia
contained more information about participants and
causal links.</p>
        <p>Features in Wikipedia also contain hyper links
to other Wikipedia pages, which can therefore be
linked back to Wikidata. We use and retrieve these
corresponding Wikidata pages to have consistent
URIs for the final graph.
1https://github.com/SonyCSLParis/
building-fr-narrative-from-wikidata
2https://streamlit.io
were therefore discarded from the timeline.</p>
        <p>Figure 3 shows one event description for the 10
August. The user can read a summary of what
hapFigure 2: Collecting events from Wikidata pened during this event, as well as some additional
information retrieved from Wikidata and Wikipedia.</p>
        <p>The user therefore understands that 10 August was
3.3. Building the network a riot implying diferent actors like Louis XVI of
France as a commanding oficer, or the French First
The next step is to build a narrative graph of Republic as a combatant. Figure 4 lastly displays
the French Revolution events and their descrip- the interactive timeline overview. The user can
tion, in the form of RDF triples. The rules manually slide events over time and click on each of
to convert the extracted features for each event them to better understand what happened. On the
to triples were designed manually. The ontol- bottom of the figure, one can see that it is also
posogy used is the Simple Event Model [16]. The sible to see which events happened during a coarser
four main classes of this model are: sem:Event period of time, like for example the Kingdom of
(what), sem:Actor (who), sem:Place (where), France.
and sem:Time (when). Further constraints classes
sem:Role, sem:Temporary and sem:View can
add information on the role of an actor, a temporal 4. Conclusion
constraint or on a specific viewpoint respectively.</p>
        <sec id="sec-3-2-1">
          <title>In this demonstration paper, we presented a sys</title>
          <p>tem that is able to retrieve events and features of
3.4. Timeline output the French Revolution to construct a timeline of
The final output in this demonstrator is an interac- ordered events and descriptions. During this
protive timeline. This timeline contains ordered events cess, the user can choose how events can be selected
extracted from Wikidata, as well as three addi- and is walked through the diferent steps of the
tional components: (i) a brief description taken from process, that makes the system transparent. The
Wikipedia (ii) event type information (iii) partici- user lastly has access to an interactive timeline to
pant information. The events with no timestamps better understand the series of events during the</p>
        </sec>
        <sec id="sec-3-2-2">
          <title>French Revolution.</title>
          <p>The system described in this paper furthermore
helped to identify some first challenges for building
narratives from knowledge graphs. These challenges
are linked to the main steps described in Section 3:
1) collect data 2) extract features 3) build the
network. The main challenge for collecting data is to
assess how to best choose the set of events that
optimally describe the coarser event. Should such
a set contain all events that happened, or only the
most relevant, or only the ones that are interesting
for one user? When extracting features, there is the
challenge of completing missing information, like
participants or locations. What additional resources
or algorithms are best to complete the narrative
graph? Lastly, the challenge when building the
graph network is to find or define the ontology that
is best suited for the application, and to populate
this ontology.</p>
          <p>Takeaways from this paper are also about possible
future work directions. One first direction could be
to study the temporality of the narrative, to better
understand how events unfold. In the demonstrator,
the timeline output provides a sequence of events
ordered in time, but it does not provide explanations
for how one event might have triggered changes
that in turn caused another event. One second
challenge and direction is about the evaluation of
the narrative. For instance, one assumption here is
that Wikidata and Wikipedia contain objective
noncontradictory facts, but one interesting track would
be to compare diferent input resources, or to add
confidence scores to each triple. Another direction
would be to see how well the system would scale to
other events such as other revolutions. Lastly, one
direction could be linked to pattern identification
and event forecasting, to generalise from
instantiated examples.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgments</title>
      <sec id="sec-4-1">
        <title>The work reported in this paper was</title>
        <p>funded by the European MUHAI project
from the Horizon 2020 research and
innovation programme under grant number
951846 and the Sony Computer Science
Laboratories Paris.</p>
        <p>We thank Ilaria Tiddi, Annette ten Teije, Frank
van Harmelen and Remi van Trijp for their time
and support.
tic Technology Conference, Springer, 2019, pp.
18–34.
[11] I. C. Dorobăt,, V. Posea, Enriching the cultural
heritage metadata using historical events: a
graph-based representation, in: International
Conference on Theory and Practice of Digital
Libraries, Springer, 2019, pp. 344–347.
[12] K. Leetaru, P. A. Schrodt, Gdelt: Global data
on events, location, and tone, 1979–2012, in:
ISA annual convention, volume 2, Citeseer,
2013, pp. 1–49.
[13] E. Boschee, J. Lautenschlager, S. O’Brien,
S. Shellman, J. Starz, M. Ward, ICEWS Coded
Event Data, 2015. URL: https://doi.org/10.
7910/DVN/28075. doi:10.7910/DVN/28075.
[14] T. Althof, X. L. Dong, K. Murphy, S. Alai,
V. Dang, W. Zhang, Timemachine:
Timeline generation for knowledge-base entities, in:
Proceedings of the 21th ACM SIGKDD
International Conference on Knowledge Discovery
and Data Mining, 2015, pp. 19–28.
[15] H. Kroll, D. Nagel, W.-T. Balke, Modeling
narrative structures in logical overlays on top
of knowledge repositories, in: International
Conference on Conceptual Modeling, Springer,
2020, pp. 250–260.
[16] W. R. Van Hage, V. Malaisé, R. Segers,
L. Hollink, G. Schreiber, Design and use of
the simple event model (sem), Journal of Web
Semantics 9 (2011) 128–136.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>B.</given-names>
            <surname>Boyd</surname>
          </string-name>
          ,
          <article-title>On the origin of stories: Evolution, cognition, and fiction</article-title>
          , Harvard University Press,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J.</given-names>
            <surname>Gottschall</surname>
          </string-name>
          ,
          <article-title>The storytelling animal: How stories make us human</article-title>
          ,
          <source>Houghton Miflin Harcourt</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>V.</given-names>
            <surname>d. Boer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Melgar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Inel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. M.</given-names>
            <surname>Ortiz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Aroyo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Oomen</surname>
          </string-name>
          ,
          <article-title>Enriching media collections for event-based exploration</article-title>
          ,
          <source>in: Research Conference on Metadata and Semantics Research</source>
          , Springer,
          <year>2017</year>
          , pp.
          <fpage>189</fpage>
          -
          <lpage>201</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>D.</given-names>
            <surname>Metilli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Bartalesi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Meghini</surname>
          </string-name>
          ,
          <article-title>A wikidata-based tool for building and visualising narratives</article-title>
          ,
          <source>International Journal on Digital Libraries</source>
          <volume>20</volume>
          (
          <year>2019</year>
          )
          <fpage>417</fpage>
          -
          <lpage>432</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>V.</given-names>
            <surname>Bartalesi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Meghini</surname>
          </string-name>
          ,
          <article-title>Formal components of narratives</article-title>
          ,
          <source>in: Italian Research Conference on Digital Libraries</source>
          , Springer,
          <year>2016</year>
          , pp.
          <fpage>112</fpage>
          -
          <lpage>121</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>R.</given-names>
            <surname>Kowalski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Sergot</surname>
          </string-name>
          ,
          <article-title>A logic-based calculus of events, in: Foundations of knowledge base management</article-title>
          , Springer,
          <year>1989</year>
          , pp.
          <fpage>23</fpage>
          -
          <lpage>55</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Guan</surname>
          </string-name>
          , X. Cheng, L.
          <string-name>
            <surname>Bai</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Zeng</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          <string-name>
            <surname>Jin</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Guo</surname>
          </string-name>
          ,
          <article-title>What is event knowledge graph: A survey</article-title>
          ,
          <source>arXiv preprint arXiv:2112.15280</source>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>S.</given-names>
            <surname>Gottschalk</surname>
          </string-name>
          , E. Demidova,
          <article-title>Eventkg: A multilingual event-centric temporal knowledge graph</article-title>
          ,
          <source>in: European Semantic Web Conference</source>
          , Springer,
          <year>2018</year>
          , pp.
          <fpage>272</fpage>
          -
          <lpage>287</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Rospocher</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. van Erp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Vossen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Fokkens</surname>
          </string-name>
          , I. Aldabe,
          <string-name>
            <given-names>G.</given-names>
            <surname>Rigau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Soroa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ploeger</surname>
          </string-name>
          , T. Bogaard,
          <article-title>Building event-centric knowledge graphs from news</article-title>
          ,
          <source>Journal of Web Semantics</source>
          <volume>37</volume>
          (
          <year>2016</year>
          )
          <fpage>132</fpage>
          -
          <lpage>151</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>T.</given-names>
            <surname>Kawamura</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Egami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Tamura</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Hokazono</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ugai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Koyanagi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Nishino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Okajima</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Murakami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Takamatsu</surname>
          </string-name>
          , et al.,
          <source>Report on the first knowledge graph reasoning challenge</source>
          <year>2018</year>
          , in: Joint International Seman-
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>