=Paper= {{Paper |id=Vol-2829/paper5 |storemode=property |title=OEKG: The Open Event Knowledge Graph |pdfUrl=https://ceur-ws.org/Vol-2829/paper5.pdf |volume=Vol-2829 |authors=Simon Gottschalk,Endri Kacupaj,Sara Abdollahi,Diego Alves,Gabriel Amaral,Elisavet Koutsiana,Tin Kuculo,Daniela Major,Caio Mello,Gullal S. Cheema,Abdul Sittar,Swati,Golsa Tahmasebzadeh,Gaurish Thakkar |dblpUrl=https://dblp.org/rec/conf/www/GottschalkKAAAK21 }} ==OEKG: The Open Event Knowledge Graph== https://ceur-ws.org/Vol-2829/paper5.pdf
      OEKG: The Open Event Knowledge Graph

 Simon Gottschalk1 , Endri Kacupaj2 , Sara Abdollahi1 , Diego Alves3 , Gabriel
  Amaral4 , Elisavet Koutsiana4 , Tin Kuculo1 , Daniela Major5 , Caio Mello5 ,
   Gullal S. Cheema6 , Abdul Sittar7 , Swati7 , Golsa Tahmasebzadeh6 , and
                              Gaurish Thakkar3
           1
              L3S Research Center, Leibniz Universität Hannover, Germany
                    {gottschalk,abdollahi,kuculo}@L3S.de
                            2
                               University of Bonn, Germany
                              kacupaj@cs.uni-bonn.de
                            3
                               University of Zagreb, Croatia
                     dfvalio@ffzg.hr, gthakkar@m.ffzg.hr
                       4
                         King’s College London, United Kingdom
       {gabriel.maia rocha amaral,elisavet.koutsiana}@kcl.ac.uk
         5
           School of Advanced Study, University of London, United Kingdom
                    {Daniela.Major,caio.mello}@sas.ac.uk
6
  TIB – Leibniz Information Centre for Science and Technology, Hannover, Germany
                {gullal.cheema,golsa.tahmasebzadeh}@tib.eu
7
  Jožef Stefan Institute and Jožef Stefan International Postgraduate School, Slovenia
                          {abdul.sittar,swati}@ijs.si



        Abstract. Accessing and understanding contemporary and historical
        events of global impact such as the US elections and the Olympic Games
        is a major prerequisite for cross-lingual event analytics that investigate
        event causes, perception and consequences across country borders. In this
        paper, we present the Open Event Knowledge Graph (OEKG), a mul-
        tilingual, event-centric, temporal knowledge graph composed of seven
        different data sets from multiple application domains, including ques-
        tion answering, entity recommendation and named entity recognition.
        These data sets are all integrated through an easy-to-use and robust
        pipeline and by linking to the event-centric knowledge graph EventKG.
        We describe their common schema and demonstrate the use of the OEKG
        at the example of three use cases: type-specific image retrieval, hybrid
        question answering over knowledge graphs and news articles, as well as
        language-specific event recommendation. The OEKG and its query end-
        point are publicly available.


1     Introduction
Contemporary and historical events such as the US presidential elections, the
Olympic Games and major earthquakes change the world. Their media coverage,
their varying perception by different communities, their historical evolution and
    Copyright © 2021 for this paper by its authors. Use permitted under Creative
    Commons License Attribution 4.0 International (CC BY 4.0).
2         Gottschalk et al.

potentially global impact make cross-lingual event analytics a significant research
topic in various fields of studies, including social science, computer science and
digital humanities [4, 19].
    When performing cross-lingual event analytics, the requirements towards
event knowledge representation are manifold, given the heterogeneity, dynamic-
ity and multilingualism of events [9]. Until now, there exists a large variety of
event-related data sets [1, 4, 11, 20] that may help understand specific character-
istics of events, but they are barely connected by now. This calls for new models
and processes that enable intuitive access to the event-related knowledge spread
across the world.
    In this paper, we present the OEKG, the Open Event Knowledge Graph,
which makes a step towards a holistic representation of event knowledge by
the integration of event-related data sets from multiple and diverse application
domains such as Question Answering, entity recommendation and Named En-
tity Recognition. Also, these data sets originate from different data collections,
including knowledge graphs and news articles. One of these knowledge graphs
is EventKGlight , a new version of the event-centric and multilingual knowledge
graph EventKG [11]. The OEKG is built on top of EventKGlight , allowing for
easier integration of additional data sets using RDF named graphs. We propose
an efficient and robust pipeline facilitating this integration of several data sets
in an easy-to-use manner.
    Fig. 1 shows four example resources of OEKG and thus demonstrates its
versatility resulting from the integration of several data sets:


    – Events (Fig. 1a): Events are at the core of the OEKG. For example, the fire
      of the Notre-Dame in Paris is covered with its locations, labels in multiple
      languages, related events such as “The Notre Dame Cathedral holds its first
      mass since the April 15 fire”, and more event characteristics.
    – Places (Fig. 1b): Most events happen at specific event locations which are
      also part of the OEKG. Such places do not only hold labels and coordinates,
      but also images and further characteristics.
    – News articles (Fig. 1c): Events are often reported in the media [5,13]. There-
      fore, the OEKG provides access to annotated news articles. For example, the
      news article entitled “Boris Johnson takes charge of Olympic Park’s future”
      is related to the Olympic Games.
    – Questions and answers (Fig. 1d): Question Answering over knowledge graphs
      is an important natural language understanding task. The OEKG provides
      questions about events such as the Apollo 11 spaceflight, plus their answers
      (here, Neil Armstrong, Michael Collins and Buzz Aldrin).

    Furthermore, the OEKG covers several other event-related aspects, including
but not limited to (temporal) event relations, language-specific relevance scores
and specialised class hierarchies. Put together, this makes the OEKG a versatile
resource targeting a variety of potential information needs.
                                OEKG: The Open Event Knowledge Graph            3




    (a) Example event in the OEKG.            (b) Example place in the OEKG.




 (c) Example news article in the OEKG.      (d) Example question in the OEKG.

        Fig. 1. Example resources in the OEKG (not all triples are shown).8



    The OEKG contains more than 400 million triples from seven data sets and
is publicly available: We provide the triple dumps for download, a SPARQL end-
point and access to all nodes on the OEKG website9 . We also provide permanent
access to the OEKG on Zenodo10 .

8
   The photos of the example place are taken from Wikimedia Commons, with the
   second photo being licensed under the Creative Commons Attribution-Share Alike
   3.0 Unported license. The maps of the example event and the example place are
   licensed under the Open Data Commons Open Database License (ODbL) by the
   OpenStreetMap Foundation (OSMF).
 9
   http://oekg.l3s.uni-hannover.de
10
   https://zenodo.org/record/4503163
4         Gottschalk et al.

    The remainder of this paper is organised as follows: First, we present our
integration pipeline (Section 2). Then, we describe the data sets integrated into
the OEKG (Section 3) and the OEKG schema (Section 4). In Section 5, we
provide two example use cases of the OEKG. Finally, we conclude in Section 6.


2      Creation of the OEKG
The creation of the OEKG requires an integration pipeline where a set of data
sets is transformed into a single, integrated knowledge graph that provides links
between all the involved resources. EventKGlight – a multilingual, event-centric
knowledge graph later described in Section 3 – serves as the base data set of the
OEKG that contains nodes representing real-world entities and events.
     Our integration pipeline is driven by the goal to make the inclusion of a new
data set into the OEKG as simple as possible, which allows a robust and efficient
process. Only then, it is possible to integrate a large variety of data sets in an
efficient and faultless way. To do so, we follow a strategy defined by Galkin et
al. [8] where the data from different sources is stored under respective named
graphs. Starting from EventKGlight , new data sets are added consecutively, each
accompanied by a unique named graph. Fig. 2 exemplifies this integration process
when adding the first new data set to EventKGlight , under the named graph
new graph.


    new_graph                                                           event_kg




                  Entity              Graph            Graph
                 Linking             Creation          Upload


                                                            new_graph

                                                                        OEKG
     event_kg




Fig. 2. Example of the OEKG integration pipeline where a new, tabular, data set is
added to the OEKG under the named graph new graph.


     In detail, the integration process follows the following three steps:

1. Entity Linking: We require that each graph added to OEKG is connected to
   EventKGlight . That means any resource representing a real-world entity or
   event is represented by an OEKG resource URI. To facilitate this linking,
   we provide a web API that allows easy access to the OEKG resource URIs
   given Wikidata or DBpedia URIs. In our example in Fig. 2, some input table
   cells are successfully linked to EventKGlight .
                                 OEKG: The Open Event Knowledge Graph                5

Algorithm 1 Example: Extension of the OEKG with a data set news that has
an article about Barack Obama
 1: procedure ExtendOEKG(e)
 2:    graphName ← ”news” .
                                                                       . Entity Linking
 3:    entityId ← getId(”en”,”Barack Obama”)
                                                                      . Graph Creation
 4:    G ← new Graph(graphName)
 5:    articleId ← ”article1”
 6:    G.add(oekg-r:articleId, rdf:type, so:Article)
 7:    G.add(oekg-r:articleId, so:mentions, oekg-r:entityId)
 8:    fileName ← storeGraphIntoFile(G)
                                                                       . Graph Upload
 9:    uploadGraph(fileName, graphName)



 2. Graph Creation: After retrieval of the OEKG resource URIs, a set of triples
    is created for each data set and serialised as an N-Triples11 file, using the
    RDFLib Python library12 . In our example, a graph consisting of five nodes
    is created, two of them being already part of the OEKG.
 3. Graph Upload: We provide another API method that allows uploading an
    N-Triples file together with the identifier of a named graph. The respective
    triples are then added to the OEKG. In our example, the resulting graph
    consists of two subgraphs that can be queried in isolation or together.

2.1   Example
Consider Algorithm 1 for an example of our integration pipeline. In this example,
the new data set to be added to the OEKG under the named graph news contains
one news article about Barack Obama. First, the OEKG URI of Barack Obama
is retrieved via the provided API method using the English Wikipedia label (line
3). Second, a graph is created consisting of two triples and serialised into an RDF
file (lines 4 - 8)13 . Third, this file is uploaded via the provided API method (line
9). In this example, one new node is added to the OEKG (oekg-r:articleId)
connected to an existing node (oekg-r:entityId).

2.2   Schema Extension
If possible, the data sets were transformed into triples using the EventKGlight
schema of the base graph. Otherwise, the use of standard vocabularies such as
schema.org14 was encouraged. In every other case, schema extensions were up-
loaded into the OEKG through separate schema files using the same procedure.
We will present the resulting OEKG schema in Section 4.
11
   https://www.w3.org/TR/n-triples/
12
   https://rdflib.dev/
13
   Relevant prefixes used by the OEKG are later defined in Table 2.
14
   https://schema.org/
6         Gottschalk et al.

3      Data Sets

The OEKG integrates seven data sets which are described in this section. Table
1 provides an overview of these data sets, including the number of triples in the
OEKG within their respective named graph. While some of these data sets are
implicitly related to events, others add to the event knowledge from a different
perspective, which will also prove useful as we will later show at the example of
three use cases.

          Table 1. Statistics of the different data sets contained in the OEKG.


Data Set              Short Description                                         Triples
EventKGlight [11]     A light-weight version of EventKG, a multilingual, event- 434, 752, 387
                      centric, knowledge graph.
EventKG+Click [1] A data set of language-specific event-centric user inter- 118, 662
                  action traces
VQuAnDa [12]          A verbalization question answering dataset                38, 243
MLM [3]               A benchmark dataset for multitask learning with multi- 942, 753
                      ple languages and modalities
InfoSpread [20]       A data set for information spreading over the news        277, 992
TIME [4]              Two collections of news articles related to the Olympic 70, 754
                      legacy and Euroscepticism
UNER [2]              The universal named-entity recognition framework          206, 622
OEKG                  The Open Event Knowledge Graph                            436, 407, 413




    – EventKGlight [11]: The EventKG is a multilingual resource incorporat-
      ing event-centric information extracted from several large-scale knowledge
      graphs such as Wikidata, DBpedia and YAGO, as well as less structured
      sources such as the Wikipedia Current Events Portal and Wikipedia event
      lists in 15 languages. It contains nodes representing real-world entities and
      events and (temporal) relations between them. For the OEKG, we have cre-
      ated EventKGlight , a light-weight version of EventKG that omits provenance
      information denoting the origin of relations, favouring an easier integration
      with other data sets.
      In the OEKG, EventKGlight serves as the base graph that other data sets
      are connected to. That is to establish an agreement concerning the identifi-
      cation of event-related real-world objects such as persons, places and events
      themselves.
    – EventKG+Click [1]: EventKG+Click is a cross-lingual dataset that reflects
      the language-specific relevance of events and their relations and aims to
                               OEKG: The Open Event Knowledge Graph            7

  provide a reference source to train and evaluate novel models for event-
  centric cross-lingual user interaction. It directly builds upon EventKG and
  language-specific information on user interactions with events, entities, and
  their relations derived from the Wikipedia clickstream.
  In the OEKG, EventKG+Click can be used for recommending events to
  users based on actual user interaction traces. Examples of particular relevant
  events from a language-specific view include the 2016 Berlin truck attack
  from the German perspective and the 2009 Russian Premier League from
  the Russian perspective [1].
– VQuAnDa [12]: The Verbalization QUestion ANswering DAtaset is a dataset
  for Question Answering (QA) over knowledge graphs that includes the ver-
  balization of each answer. Through this verbalisation, VQuAnDa intends to
  completely hide any semantic technologies and provides a fluent experience
  between the users and the knowledge graph. VQuAnDa consists of 5, 000
  questions accompanied by SPARQL queries and DBpedia entity links.
  QA over Knowledge Graphs is a common task in natural language processing
  [6]. Via the integration of question/answer pairs into the OEKG, both the
  question/answers pairs and the background knowledge are encapsulated into
  the same resource, enabling seamless training and application of QA systems.
– MLM [3]: The Multiple Languages and Modalities data set is a resource
  for training and evaluating multitask systems in multiple modalities, for
  example, cross-modal (text/image) retrieval and location estimation. MLM
  comprises text in three languages, images and location data, extracted from
  the Wikidata entries of 236, 000 human settlements.
  MLM is added to the OEKG for adding images as an additional modality to
  the knowledge graph. As locations are typical event characteristics, photos
  of locations are an immediate benefit to the representation of events.
– InfoSpread [20]: The data set for Information Spreading over the News
  provides news articles covering three contrasting events (Global Warming,
  FIFA world cups and earthquakes). Initially, the goal of this data set was
  to understand information spreading patterns over news articles. InfoSpread
  contains 7, 773 news articles related to these events in five languages.
  News articles are often used as a means to identify events [13] and oftentimes
  it is the media itself that makes events known to the public [5]. Therefore,
  the inclusion of news articles into the OEKG is an important step towards
  coverage of event-centric data from different viewpoints.
– TIME [4]: The temporal discourse analysis applied to media articles data
  set is a collection of Brazilian, British and Spanish news articles covering the
  concept of Olympic legacy and the concept of Euroscepticism.
  With the collection of news articles to specified events, the OEKG serves as
  an example for in-depth analysis of single events through knowledge graphs.
– UNER [2]: The Universal Named Entity Recognition framework proposes
  a 4-level class hierarchy for training and testing Named Entity Recognition
  tools. For example, UNER contains the class Earthquake, which is a leaf
  node of the following branch of superclasses: Natural, NaturalPhenome-
  non, Event and Name.
8         Gottschalk et al.

      In the OEKG, UNER adds to the already given class hierarchy from the
      DBpedia ontology. Given how challenging it is to recognise named events in
      texts [16], we envision that the inclusion of UNER classes into the OEKG
      can help training and evaluating NER systems in the specific context of
      event-centric data.

    Following the integration pipeline described in Section 2, the described data
sets were added to the OEKG. For additional information or increased interlink-
age with EventKGlight , some data sets were extended before:

    – Via the Wikifier15 and spaCy16 , entities and events mentioned in news arti-
      cles (TIME and InfoSpread ) were identified. This is to establish a connection
      between the news articles and EventKGlight : Given this connection, one may
      query for news articles about specific events or entities.
    – Sentiment analysis, i.e., the computational study of people’s opinions, sen-
      timents, emotions, moods, and attitudes [14], contributes towards the un-
      derstanding of natural-language texts and can, in particular, facilitate an
      analysis of news articles across languages [15]. In the OEKG, we enrich news
      articles by employing the sentiment detection system SentiStrength [21] on
      their headlines. That way, the OEKG enables queries for particularly pos-
      itive or negative news articles, potentially initiating further event-centric
      analyses of the news articles in the context of specific events.
    – To further increase the linkage between different sources, the UNER classes
      were aligned to the DBpedia ontology using the skos vocabulary17 when
      possible.


4      Schema
Fig. 3 shows the OEKG schema. As described in Section 2.2, this schema is
based on the EventKGlight schema and then extended by demand. Prefixes used
in the OEKG schema and in the remainder of this paper are listed in Table 218 .
    In detail, the different data sets contribute to the following parts of the OEKG
schema:

    – EventKGlight : The EventKG schema is based on the Simple Event Model
      (sem)19 and its three main classes sem:Event, sem:Actor and sem:
      Place, that are connected via sem:hasPlace and (temporal) relations
      modeled by oekg-s:Relation (omitted from Fig. 3 for brevity). Event-
      KG further distinguishes between different types of events (oekg-s:Text-
      Event, oekg-s:EventSeries and oekg-s:EventSeriesEdition). In
15
   http://wikifier.org/
16
   https://spacy.io/
17
   https://www.w3.org/TR/swbp-skos-core-spec/
18
   For a full list of prefixes used in the OEKG, see oekg.l3s.uni-hannover.de/
   sparql.
19
   https://semanticweb.cs.vu.nl/2009/11/sem/
                              OEKG: The Open Event Knowledge Graph            9

                Table 2. Selected prefixes used by the OEKG.

            Prefix URI
            oekg-r: http://oekg.l3s.uni-hannover.de/resource/
            oekg-s: http://oekg.l3s.uni-hannover.de/schema/
            oekg-g: http://oekg.l3s.uni-hannover.de/graph/
            uner: http://oekg.l3s.uni-hannover.de/uner/
            so:     http://schema.org/
            rdf:    http://www.w3.org/1999/02/22-rdf-syntax-ns#
            rdfs:   http://www.w3.org/2000/01/rdf-schema#
            xs:     http://www.w3.org/2001/XMLSchema#
            sem:    http://semanticweb.cs.vu.nl/2009/11/sem/
            onyx: http://www.gsi.dit.upm.es/ontologies/onyx/ns#
            skos: http://www.w3.org/2004/02/skos/core#




  comparison to the EventKG schema, EventKGlight omits link count relations
  and adds the skos:prefLabel to entities for a more efficient access to their
  labels.
– EventKG+Click : To model language-specific, weighted relations for the
  representation of event-centric cross-lingual user interaction, we have intro-
  duced two new classes: oekg-s:LanguageSpecificRelation that as-
  signs one or more instances of oekg-s:LanguageSpecificRelation-
  Score to a source entity and a target entity. Such instances hold the score
  between the source and target entity in a specific language.
– VQuAnDa: A question, its suggested answer and their verbalisation are
  represented using schema.org’s classes so:Question and so:Answer. En-
  tities that appear in the question text are linked to EventKGlight instances
  via so:mainEntity, entities in the answer via so:mentions.
– MLM : Images are assigned to places via so:image, descriptions via so:
  description.
– InfoSpread and TIME : News articles are represented via so:Article
  and the respective properties denoting the headline (so:headline), for
  instance. News articles are connected to EventKGlight instances via so:
  mentions, which denote the appearance of an OEKG entity or event in
  the text. For the representation of news articles’ sentiment, we follow the
  schema of the TweetsKB [7], using the onyx vocabulary and its classes
  onyx:EmotionSet, onyx:Emotion and onyx:EmotionCategory to
  assign a set of emotions of different strengths to a news article.
– UNER: Entities are assigned UNER classes using rdf:type. Furthermore,
  the UNER class hierarchy and its connection to the DBpedia ontology are
  established using the owl and the skos vocabulary.
10            Gottschalk et al.

                                                                                      so:inLanguage
                                                                  xs:language
                                                                                                               TIME &              onyx:Emotion
                                                                                        so:date
                                                                     xs:date                                 InfoSpread              Category
                                                                                       Published

                                                                                      so:publisher                        onyx:hasEmo
                                                                     xs:string                                             tionCategory
        VQuAnDa                     UNER
                                                    skos:narrower
                                                                                          so:head                                     onyx:
                                                                          xs:string                        onyx:hasEmo               Emotion
             xs:string                                                                      line
                                  rdfs:sub                                                              tionIntensity
                                  ClassOf            owl:Class
          so:text                                                                          so:url
                                                                       xs:anyURI                             xs:double          onyx:has
                           owl:equivalent                                                                                        Emotion
                                    Class
            so:Question
                             so
                               :m                 rdf:type
                                                                                                                  onyx:has
                                                                                                                                      onyx:
    so:suggested                    ain                                                        so:Article                            Emotion
                                        En                                          ns                           EmotionSet
         Answer                                                                ntio                                                    Set
                                          tity
                                                                        so :me

            so:Answer       so:mentions              sem:Core
                                                                      rdf:
                                                                           su
          so:text                        skos:                        rdf: bject
                                                                           obje /
                                                                                                                 oekg-s:lang          oekg-s:
                                  prefLabel                                    ct        oekg-s:Language
                                                                                                                 uageScore       LanguageSpecific
             xs:string                                                                    SpecificRelation
                                                                                                                                   RelationScore
                              xs:string               sem:Actor
                                                                                                                    oekg-s:score           oekg-s:
     dbo:previousEvent /                                                                                                                   score
                                                                                                so:                Language
         dbo:nextEvent /                                                                                                                    Value
       sem:hasSubEvent             sem:Event        sem:hasPlace          sem:Place             contained
                                                                                                InPlace
                                                                                                                  xs:language         xs:double
                                                               so:image                    so:
        EventKG                                                                              description
                                                                                                                               EventKG+
                                                                    xs:anyURI               xs:string                            Click
                             oekg-s:
         oekg-s:           EventSeries
                                                   oekg-s:
        TextEvent                                EventSeries
                             Edition                                                MLM




Fig. 3. Excerpt of the OEKG schema. _ marks owl:subClassOf relations. Regular
arrows mark the rdfs:domain and rdfs:range restrictions on properties. Classes are
coloured w.r.t. the data set for which they have been added. For brevity, we have
omitted classes regarding relations between entities and events, as well as temporal
attributes from the EventKG schema.


5       Example Use Cases

In this section, we demonstrate the OEKG and its ability to enable integrated
access over multiple datasets via three example use cases.


5.1        Image Retrieval: EventKGlight , MLM & UNER

Event classification in images is an important task for various applications in
the fields of computer vision, including geolocation estimation and place clas-
sification [17]. Such tasks typically rely on the existence of a well-defined class
hierarchy and the availability of images. The OEKG facilitates queries both for
the UNER type hierarchy specifically designed for Named Entity Recognition,
and for images of locations, using the MLM data. In combination, event loca-
tions in EventKGlight , MLM ’s image links, and the UNER type hierarchy enable
retrieval of images relevant for specific event types.
                                     OEKG: The Open Event Knowledge Graph              11

    We demonstrate the OEKG’s potential for image retrieval by an example
query for images from earthquake regions shown in Listing 1.1: It queries for
entities typed as earthquakes using the uner:Earthquake class, their loca-
tions (EventKGlight ) and the images assigned to such locations (MLM ). Table 3
presents selected results of this query, including a photo of the port of Messina
and more.


SELECT DISTINCT ?Location ?Image WHERE {
  ?earthquake rdf:type uner:Earthquake ;
    sem:hasPlace ?Location .
  ?Location so:image ?Image .
}
     Listing 1.1. SPARQL query: Images of locations where earthquakes happened.




         Table 3. Selected OEKG results of the SPARQL query in Listing 1.1.20

 Location       Ferrara                  Messina                  Guaranda




 Image




5.2      Question Answering over News Articles: EventKGlight ,
         VQuAnDa, InfoSpread & TIME

Question Answering (QA) is the task of supplying precise answers to questions,
posed by users in natural language, and is typically divided into QA over free
text and QA over knowledge graphs [6]. Through the integration of EventKGlight ,
VQuAnDa, TIME and InfoSpread into the OEKG, the OEKG facilitates a com-
bination of these two tasks, i.e., hybrid approaches: We can query for news ar-
ticles which specifically mention the entities part of the question/answer pair.
20
     These photos are taken from Wikimedia Commons. They are licensed under the
     following licenses. Ferrara: Creative Commons Attribution 2.5 Italy license. Messina:
     Creative Commons Attribution-Share Alike 3.0 Unported, 2.5 Generic, 2.0 Generic
     and 1.0 Generic. Guaranda: Creative Commons Attribution 2.0 Generic license.
12       Gottschalk et al.

This way, two sources for answering the question can be provided: the OEKG
itself, as well as the news article potentially holding the answer to the initially
posed question.
    For example, the query in Listing 1.2 asks for a question in VQuAnDa (?ques-
tion) that is about an event (?questionEntity rdf:type sem:Event).
The query then searches for news articles (?article) mentioning both that event
and one of the suggested answer entities. It returns the question “Whose wife is
a presenter at WWE? (en)” and its verbalised answer “The people whose part-
ners are presenters at WWE are John Cena, Dwayne Johnson.” together with
the Spanish news articles entitled “¿Qué luchador tiene el mayor porcentaje de
victorias en la historia de WWE?” (Which wrestler has the highest percentage
of victories in in the history of WWE? ). The question entity “WCE (en)” is
mentioned in the news article, as well as both answers: John Cena and Dwayne
Johnson.


SELECT DISTINCT ?questionText ?answerText ?headline
                ?questionEntity ?answerEntity WHERE {

     ?question so:suggestedAnswer ?answer;
       so:mainEntity ?questionEntity ;
       so:text ?questionText .
     ?questionEntity rdf:type sem:Event .

     ?answer so:mentions ?answerEntity ;
       so:text ?answerText .

     ?article rdf:type so:Article ;
       so:mentions ?questionEntity, ?answerEntity ;
       so:headline ?headline .
}
Listing 1.2. SPARQL query: News articles that mention entities of a question/answer
pair.




5.3    Event Recommendation: EventKGlight & EventKG+Click

As defined by Ni et. al, entity recommendation is the problem of suggesting
a contextually-relevant list of entities in a particular context [18]. This task
is particularly relevant in Web search. With the OEKG, we can specifically
create language-specific recommendations for events and further enrich them
with relevant event characteristics.
    The query in Listing 1.3 asks for events relevant to the First World War,
from the Russian point of view. We filter for the most relevant related events
(FILTER(?value >= 0.8)) and retrieve EventKGlight ’s event characteristics
                                  OEKG: The Open Event Knowledge Graph               13

SELECT ?Label ?StartDate WHERE {
  ?event owl:sameAs dbr:World_War_I.
  ?r oekg-s:source ?event ;
    oekg-s:target ?target ;
    oekg-s:hasLanguageSpecificRelationScore [
      oekg-s:scoreValue ?value ;
      oekg-s:scoreLanguage ’ru’ˆˆxsd:language
    ] .
  ?target skos:prefLabel ?Label ;
    sem:hasBeginTimeStamp ?StartDate .
  FILTER(?value >= 0.8) .
}
ORDER BY ?StartDate
Listing 1.3. SPARQL query: Events related to the First World War from a Russian
point of view.



to order the resulting list of events chronologically. Table 4 lists the results of this
query, that clearly show a Russian focus. This result could be used for creating a
language-specific event timeline similar to the link-based EventKG+TL system
[10], but now inferred from actual user interaction traces in EventKG+Click .


          Table 4. All OEKG results for the SPARQL query in Listing 1.3.

                                  Label             StartDate
                       Brusilov Offensive (en)      1916-05-22
                       Russian Civil War (en)       1917-11-07
                       Treaty of Brest-Litovsk (en) 1918-03-03




6     Conclusion
In this paper, we have introduced the OEKG – the Open Event Knowledge
Graph21 . The OEKG comprises event-related knowledge from seven data sets
of various application domains. We have presented an easy-to-use, efficient and
robust pipeline that facilitated a seamless integration of seven data sets into the
OEKG. At the examples of image retrieval, question answering over text and
event recommendation, we have exemplified three use cases of the OEKG.

Acknowledgements The project leading to this publication has received fund-
ing from the European Union’s Horizon 2020 research and innovation programme
under the Marie Sklodowska-Curie grant agreement No. 812997 (Cleopatra).
21
     http://oekg.l3s.uni-hannover.de/
14      Gottschalk et al.

References

 1. Abdollahi, S., Gottschalk, S., Demidova, E.: EventKG+Click: A Dataset of
    Language-specific Event-centric User Interaction Traces. In: CLEOPATRA Work-
    shop @ ISWC (2020)
 2. Alves, D., Kuculo, T., Amaral, G., Thakkar, G., Tadic, M.: Uner: Universal named-
    entity recognitionframework. In: CLEOPATRA Workshop @ ISWC (2020)
 3. Armitage, J., Kacupaj, E., Tahmasebzadeh, G., Maleshkova, M., Ewerth, R.,
    Lehmann, J.: MLM: A Benchmark Dataset for Multitask Learning with Multi-
    ple Languages and Modalities. In: Proceedings of the 29th ACM International
    Conference on Information & Knowledge Management. pp. 2967–2974 (2020)
 4. Cheema,       G.S.,      Major,     D.,    Mello,      C.,    Sittar,   A.:  TIME:
    Temporal        Discourse       Analysis      applied      to      Media    Articles.
    http://cleopatra-project.eu/index.php/2020/06/01/
    time-temporal-discourse-analysis-applied-to-media-articles/
    (2020)
 5. Dayan, D., Katz, E.: Media Events. harvard university press (1994)
 6. Dimitrakis, E., Sgontzos, K., Tzitzikas, Y.: A Survey on Question Answering Sys-
    tems over Linked Data and Documents. Journal of Intelligent Information Systems
    pp. 1–27 (2019)
 7. Fafalios, P., Iosifidis, V., Ntoutsi, E., Dietze, S.: TweetsKB: A Public and Large-
    scale RDF Corpus of Annotated Tweets. In: European Semantic Web Conference.
    pp. 177–190. Springer (2018)
 8. Galkin, M., Auer, S., Kim, H., Scerri, S.: Integration Strategies for Enterprise
    Knowledge Graphs. In: 2016 IEEE Tenth International Conference on Semantic
    Computing (ICSC). pp. 242–245 (2016)
 9. Gottschalk, S., Bernacchi, V., Rogers, R., Demidova, E.: Towards Better Under-
    standing Researcher Strategies in Cross-lingual Event Analytics. In: International
    Conference on Theory and Practice of Digital Libraries. pp. 139–151. Springer
    (2018)
10. Gottschalk, S., Demidova, E.: EventKG+TL: Creating Cross-lingual Timelines
    from an Event-centric Knowledge Graph. In: European Semantic Web Conference.
    pp. 164–169. Springer (2018)
11. Gottschalk, S., Demidova, E.: EventKG–the Hub of Event Knowledge on the Web–
    and Biographical Timeline Generation. Semantic Web 10(6), 1039–1070 (2019)
12. Kacupaj, E., Zafar, H., Lehmann, J., Maleshkova, M.: VQuAnDa: Verbalization
    QUestion ANswering DAtaset. In: European Semantic Web Conference. pp. 531–
    547. Springer (2020)
13. Leban, G., Fortuna, B., Brank, J., Grobelnik, M.: Event Registry: Learning about
    World Events from News. In: Proceedings of the 23rd International Conference on
    World Wide Web. pp. 107–110 (2014)
14. Liu, B.: Sentiment Analysis: Mining Opinions, Sentiments, and Emotions. Cam-
    bridge University Press (2020)
15. Lo, S.L., Cambria, E., Chiong, R., Cornforth, D.: Multilingual Sentiment Analysis:
    from Formal to Informal and Scarce Resource Languages. Artificial Intelligence
    Review 48(4), 499–527 (2017)
16. Marujo, L., Ling, W., Gershman, A., Carbonell, J.G., Neto, J.P., Matos, D.: Recog-
    nition of Named-Event Passages in News Articles. In: Proceedings of COLING
    2012: Demonstration Papers. pp. 329–336 (2012)
                                 OEKG: The Open Event Knowledge Graph             15

17. Muller-Budack, E., Springstein, M., Hakimov, S., Mrutzek, K., Ewerth, R.:
    Ontology-driven Event Type Classification in Images. In: Proceedings of the
    IEEE/CVF Winter Conference on Applications of Computer Vision. pp. 2928–
    2938 (2020)
18. Ni, C.C., Sum Liu, K., Torzec, N.: Layered Graph Embedding for Entity Rec-
    ommendation using Wikipedia in the Yahoo! Knowledge Graph. In: Companion
    Proceedings of the Web Conference 2020. pp. 811–818 (2020)
19. Rogers, R.: Digital Methods. MIT press (2013)
20. Sittar, A., Mladenić, D., Erjavec, T.: A Dataset for Information Spreading over
    the News. In: Conference on Data Mining and Data Warehouses (SiKDD) (2020)
21. Thelwall, M., Buckley, K., Paltoglou, G., Cai, D., Kappas, A.: Sentiment Strength
    Detection in Short Informal Text. Journal of the American society for information
    science and technology 61(12), 2544–2558 (2010)