=Paper= {{Paper |id=Vol-3322/short1 |storemode=property |title=Building a French Revolution Narrative from Wikidata |pdfUrl=https://ceur-ws.org/Vol-3322/short1.pdf |volume=Vol-3322 |authors=Inès Blin |dblpUrl=https://dblp.org/rec/conf/ijcai/Blin22 }} ==Building a French Revolution Narrative from Wikidata== https://ceur-ws.org/Vol-3322/short1.pdf
Building a French Revolution Narrative from Wikidata
Inès Blin1,2
1
    SONY Computer Science Laboratories, Paris, France
2
    Department of Computer Science, Vrije Universiteit Amsterdam, The Netherlands


                                       Abstract
                                       Historical reconstructions can help us better understand important events. This is essential for bettering our
                                       knowledge on specific topics and this enables us to build coherent sequences of information. Knowledge graphs
                                       like Wikidata contain information about generic knowledge that includes historical events, but the information
                                       is often entity-centric rather than event-centric. This makes the analysis and understanding of events related
                                       to a same topic harder, since it is less easily accessible. In this demonstration paper, we developed a system
                                       to build a timeline about the French Revolution in the form of a graph of a sequence of events, also called a
                                       narrative in this work. The system comes with an interface, where a user can first select the types of events
                                       to be retrieved, and is then walked through the steps of the system. The output of the system is a timeline of
                                       the French Revolution.


1. Introduction                                                                          2. Related Work
We constantly create narratives to provide explana-                          A narrative is one way to understand or explain a
tions or justifications for why and how somethings                           series of related events. In this work, a narrative is
happens. This capability of creating such narratives                         technically defined as a sequence of events in the
is even suggested to be part of what makes us human                          form of a graph. A narrative in this context is thus
[1, 2]. AI systems able to create narratives could                           a type of knowledge graph. Therefore, our work is
therefore help in the realisation of human-centric                           related to narratives and to how events are struc-
systems. In this demonstration paper, we present a                           turally represented, especially in event knowledge
system that can build a timeline of the French Rev-                          graphs structures. The output of our system is a
olution by exploring Wikidata. The system takes                              timeline generated from a knowledge graph, hence
the user’s inputs to select nodes in Wikidata, walks                         our work is also related to timeline generation from
the users through the different steps and outputs a                          knowledge bases. We describe the aforementioned
timeline.                                                                    research domains in this section.
   The system presented in this demonstration pa-                               Narratives are a means for better understanding
per builds a narrative in the form of a graph. A                             how a situation unfolds over time. The events are
narrative is defined here as a sequence of events.                           the atomic concepts underlying these narratives.
The main novelty of the paper is a system that walks                         Towards formally defining such structures, Bartalesi
the user through the different steps of the process                          and Meghini [5] for example propose to define formal
and lets the user choose the events to include for                           components of narratives using event calculus theory
the narrative. Earlier systems could let the user                            [6].
navigate through events and entities without a fi-                              From a resource perspective, the closest to our
nal timeline output [3], or could build the timeline                         work would be event knowledge graphs that focus
without interacting with the user, for example by                            on describing events and links between them. Guan
retrieving most significant events for a biography                           et al. [7] survey event knowledge graphs from four
description [4].                                                             different perspectives: history, ontology, instance
                                                                             and application. More recent research has focused
                                                                             on providing better access to semantic structured
                                                                             data about events, and has emphasised the need to
                                                                             have such event-centric information for applications
                                                                             such as timeline generation or history reconstruc-
IJCAI 2022: Workshop on semantic techniques for
                                                                             tion. Available resources now include (i) EventKG
narrative-based understanding, July 24, 2022, Vienna, Aus-
tria                                                                         [8] built from generic knowledge graphs, (ii) a knowl-
$ ines.blin@sony.com (I. Blin)                                               edge graph built from textual news data [9] (iii) a
 0000-0003-0956-9466 (I. Blin)                                              knowledge graph built from a Sherlock Holmes novel
         © 2022 Copyright for this paper by its authors. Use permitted under
         Creative Commons License Attribution 4.0 International (CC BY [10] (iv) semantic data for cultural heritage [11] and
         4.0).
    CEUR

         CEUR Workshop Proceedings (CEUR-WS.org)
    Workshop
    Proceedings
                  http://ceur-ws.org
                  ISSN 1613-0073
                                                                             (v) GDELT [12] and ICEWS [13].




                                                                                                                                                        1
   From an application perspective, the closest to      with a description of each event as well as event
our work would be systems that use knowledge bases      type and participants. The demonstrator currently
for timeline or hypothesis generation. The aim of       outputs a timeline for the French Revolution only,
such systems is to help a user understand better one    but could be extended to other types of revolutions
topic. For quality, Althoff et al. [14] define three    for instance. The code is publicly available1 .
criteria for a timeline quality: relevance, tempo-        The backend is written in Python 3.9.4, the inter-
rally diverse and content diverse. For applications,    face was developed with Streamlit2 . Wikidata was
Metilli et al. [4] use Wikidata to build biographies,   accessed through the Wikidata SPARQL endpoint.
e.g. Dante’s one. Kroll et al. [15] furthermore build   Figure 1 shows the home page of the demonstrator
narratives as hypotheses on top of different knowl-     that explains the purpose of the application and the
edge bases, with an application in the biomedical       main steps to extract the timeline.
domain.
                                                        3.1. Event Collection from Wikidata
3. Demonstrator Features                                The first step to build the timeline is to collect
                                                        events from Wikidata. In the demonstrator, the
                                                        user can select a collection of paths to extract events
                                                        from, depicted in Figure 2. The user can then press
                                                        the button “Collect events” to query the Wikidata
                                                        knowledge graph.
                                                           Events and links to Wikidata and Wikipedia are
                                                        stored in the backend, and the user is provided with
                                                        some additional information, such as the number of
                                                        events retrieved.

                                                        3.2. Extracting narrative features
                                                        Once the events have been extracted from Wikidata
                                                        depending on the user’s input paths, the next step
                                                        is to extract narrative features for the events. A
                                                        narrative feature for an event contains information
                                                        about either a participant, a timestamp, a location,
                                                        a type of event, or a temporal or causal link to
                                                        another event. In Section 2. and 3. displayed in
                                                        Figure 1, the user can press a button to extract
                                                        these features.
                                                           In the demonstrator, the user can see the features
                                                        that were extracted, as well as some additional
                                                        statistics on the distribution of the types of features.
                                                        We found that features extracted from Wikidata
                                                        contained mostly information about time and lo-
                                                        cation, whereas features extracted from Wikipedia
                                                        contained more information about participants and
                                                        causal links.
Figure 1: Home page of the application.                    Features in Wikipedia also contain hyper links
                                                        to other Wikipedia pages, which can therefore be
                                                        linked back to Wikidata. We use and retrieve these
   In the demonstration system, the user is walked      corresponding Wikidata pages to have consistent
through the different steps to build a narrative        URIs for the final graph.
graph: 1) Collect events from Wikidata and se-
lect paths to extract nodes from 2) Extract features
from Wikidata 3) Extract additional information
from Wikipedia info boxes 4) Convert the data
                                                        1
into RDF triples compatible with the Simple Event           https://github.com/SonyCSLParis/
                                                            building-fr-narrative-from-wikidata
Model 5) Display the narrative graph as a timeline,     2
                                                            https://streamlit.io




                                                                                                                   2
                                                     Figure 3: Event description example for 10 August



                                                      were therefore discarded from the timeline.
                                                         Figure 3 shows one event description for the 10
                                                      August. The user can read a summary of what hap-
Figure 2: Collecting events from Wikidata             pened during this event, as well as some additional
                                                      information retrieved from Wikidata and Wikipedia.
                                                      The user therefore understands that 10 August was
3.3. Building the network                             a riot implying different actors like Louis XVI of
                                                      France as a commanding officer, or the French First
The next step is to build a narrative graph of Republic as a combatant. Figure 4 lastly displays
the French Revolution events and their descrip- the interactive timeline overview. The user can
tion, in the form of RDF triples. The rules manually slide events over time and click on each of
to convert the extracted features for each event them to better understand what happened. On the
to triples were designed manually. The ontol- bottom of the figure, one can see that it is also pos-
ogy used is the Simple Event Model [16]. The sible to see which events happened during a coarser
four main classes of this model are: sem:Event period of time, like for example the Kingdom of
(what), sem:Actor (who), sem:Place (where), France.
and sem:Time (when). Further constraints classes
sem:Role, sem:Temporary and sem:View can
add information on the role of an actor, a temporal 4. Conclusion
constraint or on a specific viewpoint respectively.
                                                      In this demonstration paper, we presented a sys-
                                                      tem that is able to retrieve events and features of
3.4. Timeline output                                  the French Revolution to construct a timeline of
The final output in this demonstrator is an interac- ordered events and descriptions. During this pro-
tive timeline. This timeline contains ordered events cess, the user can choose how events can be selected
extracted from Wikidata, as well as three addi- and is walked through the different steps of the
tional components: (i) a brief description taken from process, that makes the system transparent. The
Wikipedia (ii) event type information (iii) partici- user lastly has access to an interactive timeline to
pant information. The events with no timestamps better understand the series of events during the




                                                                                                            3
                                                      Acknowledgments
                                                                 The work reported in this paper was
                                                                 funded by the European MUHAI project
                                                                 from the Horizon 2020 research and in-
                                                                 novation programme under grant number
                                                      951846 and the Sony Computer Science Laborato-
                                                      ries Paris.
                                                         We thank Ilaria Tiddi, Annette ten Teije, Frank
                                                      van Harmelen and Remi van Trijp for their time
                                                      and support.
Figure 4: Output timeline overview

                                                      References
French Revolution.                                    [1] B. Boyd, On the origin of stories: Evolu-
   The system described in this paper furthermore         tion, cognition, and fiction, Harvard University
helped to identify some first challenges for building     Press, 2010.
narratives from knowledge graphs. These challenges    [2] J. Gottschall, The storytelling animal: How
are linked to the main steps described in Section 3:      stories make us human, Houghton Mifflin Har-
1) collect data 2) extract features 3) build the net-     court, 2012.
work. The main challenge for collecting data is to    [3] V. d. Boer, L. Melgar, O. Inel, C. M. Ortiz,
assess how to best choose the set of events that          L. Aroyo, J. Oomen, Enriching media collec-
optimally describe the coarser event. Should such         tions for event-based exploration, in: Research
a set contain all events that happened, or only the       Conference on Metadata and Semantics Re-
most relevant, or only the ones that are interesting      search, Springer, 2017, pp. 189–201.
for one user? When extracting features, there is the  [4] D. Metilli, V. Bartalesi, C. Meghini, A
challenge of completing missing information, like         wikidata-based tool for building and visualising
participants or locations. What additional resources      narratives, International Journal on Digital
or algorithms are best to complete the narrative          Libraries 20 (2019) 417–432.
graph? Lastly, the challenge when building the        [5] V. Bartalesi, C. Meghini, Formal components
graph network is to find or define the ontology that      of narratives, in: Italian Research Conference
is best suited for the application, and to populate       on Digital Libraries, Springer, 2016, pp. 112–
this ontology.                                            121.
   Takeaways from this paper are also about possible  [6] R. Kowalski, M. Sergot, A logic-based calculus
future work directions. One first direction could be      of events, in: Foundations of knowledge base
to study the temporality of the narrative, to better      management, Springer, 1989, pp. 23–55.
understand how events unfold. In the demonstrator, [7] S. Guan, X. Cheng, L. Bai, F. Zhang, Z. Li,
the timeline output provides a sequence of events or-     Y. Zeng, X. Jin, J. Guo, What is event
dered in time, but it does not provide explanations       knowledge graph: A survey, arXiv preprint
for how one event might have triggered changes            arXiv:2112.15280 (2021).
that in turn caused another event. One second         [8] S. Gottschalk, E. Demidova, Eventkg: A
challenge and direction is about the evaluation of        multilingual event-centric temporal knowledge
the narrative. For instance, one assumption here is       graph, in: European Semantic Web Confer-
that Wikidata and Wikipedia contain objective non-        ence, Springer, 2018, pp. 272–287.
contradictory facts, but one interesting track would  [9] M. Rospocher, M. van Erp, P. Vossen,
be to compare different input resources, or to add        A. Fokkens, I. Aldabe, G. Rigau, A. Soroa,
confidence scores to each triple. Another direction       T. Ploeger, T. Bogaard, Building event-centric
would be to see how well the system would scale to        knowledge graphs from news, Journal of Web
other events such as other revolutions. Lastly, one       Semantics 37 (2016) 132–151.
direction could be linked to pattern identification [10] T. Kawamura, S. Egami, K. Tamura, Y. Hoka-
and event forecasting, to generalise from instanti-       zono, T. Ugai, Y. Koyanagi, F. Nishino, S. Oka-
ated examples.                                            jima, K. Murakami, K. Takamatsu, et al., Re-
                                                          port on the first knowledge graph reasoning
                                                          challenge 2018, in: Joint International Seman-




                                                                                                             4
     tic Technology Conference, Springer, 2019, pp.
     18–34.
[11] I. C. Dorobăt, , V. Posea, Enriching the cultural
     heritage metadata using historical events: a
     graph-based representation, in: International
     Conference on Theory and Practice of Digital
     Libraries, Springer, 2019, pp. 344–347.
[12] K. Leetaru, P. A. Schrodt, Gdelt: Global data
     on events, location, and tone, 1979–2012, in:
     ISA annual convention, volume 2, Citeseer,
     2013, pp. 1–49.
[13] E. Boschee, J. Lautenschlager, S. O’Brien,
     S. Shellman, J. Starz, M. Ward, ICEWS Coded
     Event Data, 2015. URL: https://doi.org/10.
     7910/DVN/28075. doi:10.7910/DVN/28075.
[14] T. Althoff, X. L. Dong, K. Murphy, S. Alai,
     V. Dang, W. Zhang, Timemachine: Time-
     line generation for knowledge-base entities, in:
     Proceedings of the 21th ACM SIGKDD Inter-
     national Conference on Knowledge Discovery
     and Data Mining, 2015, pp. 19–28.
[15] H. Kroll, D. Nagel, W.-T. Balke, Modeling
     narrative structures in logical overlays on top
     of knowledge repositories, in: International
     Conference on Conceptual Modeling, Springer,
     2020, pp. 250–260.
[16] W. R. Van Hage, V. Malaisé, R. Segers,
     L. Hollink, G. Schreiber, Design and use of
     the simple event model (sem), Journal of Web
     Semantics 9 (2011) 128–136.




                                                         5