<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Mastering the Media Hype: Methods for Deduplication of Conflict Events from News Reports</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Vanni Zavarella</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jakub Piskorski</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Camelia Ignat</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hristo Tanev</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Martin Atkinson</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>In: A. Jorge, R. Campos, A. Jatowt, A. Aizawa (eds.): Proceedings of the first AI4Narratives Workshop</institution>
          ,
          <addr-line>Yokohama</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Joint Research Centre of the European Commission</institution>
          ,
          <addr-line>Ispra</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Polish Academy of Sciences</institution>
          ,
          <addr-line>Warsaw</addr-line>
          ,
          <country country="PL">Poland</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Machine coding of conflict event datasets has recently emerged as a time-effective method which can back up predictive models for conflict escalation at national and sub-national level. However, the event record duplication issue, caused by large news coverage of major conflict events, significantly degrades the accuracy of these datasets and makes them unreliable for micro-analysis of conflict processes. In this paper, we assess the effectiveness of two automatic approaches for mitigating the event duplication issue. The first approach (Cluster Linking) consists of linking news article clusters across time, prior to event extraction, while the second one (Event Linking) is based on classification and aggregation of related events. The comparative evaluation is performed by measuring the correlation of the output from an automatic event detection system with human-coded conflict events from the ACLED project, spatially aggregated on administrative units. We find out that, while both methods effectively reduce the automatic system's large outlier event and victim counts (with a slight prevalence of Event Linking), they can only increase the correlation coefficients with human-coded data significantly if coupled with an accurate and fine-grained geocoding module.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>The last decade has seen a surge of interest in models of
socio-political violence and conflicts integrating the standard
static indicators (e.g. census data) with more dynamic
indicators such as time-stamped event records. This has
stimulated the creation of several machine-coded event datasets,
fully- or semi-automatically generated from news reports
with relatively rich semantic representations (see [Leetaru
⇤ Corresponding Author.</p>
      <p>
        Copyright c 2020 by the paper’s authors. Use permitted under
Creative Commons License Attribution 4.0 International (CC BY 4.0).
and Schrodt, 2013], [Lorenzini et al., 2016]). At the same
time, several concerns have emerged towards the usability
of machine-coded event datasets for micro-level modelling,
particularly in the domain of political violence where
spatial analysis has become standard. These range from
inconsistency of data schemas, making datasets not directly
comparable
        <xref ref-type="bibr" rid="ref10 ref15">([Wang et al., 2016])</xref>
        , through source inconsistency
over time, up to the more general problem of the validity
of machine-coded data with respect to a Gold Standard of
unique real-world events [Hammond and Weidmann, 2014].
      </p>
      <p>
        In particular, a low correlation has been found with respect
to human-coded reference data, upon quantitative analysis
based on temporal-geographical aggregation
        <xref ref-type="bibr" rid="ref5">(e.g. see
[Hammond and Weidmann, 2014] for GDELT)</xref>
        . This seems to be
due to news-based event-coding systems being too sensitive
to the intensity of media reporting and generating large sets
of event duplicates or near-duplicates. In this paper we
experiment with two automatic approaches for mitigating the event
duplication issue. The first approach is based on linking
clusters of news items, while the second one is based on
classification and aggregation of related events. The comparative
evaluation is performed by measuring the correlation of the
output from an automatic event detection system with
humancoded conflict events from the ACLED project [Raleigh et al.,
2010].
      </p>
      <p>By making the extracted representation of complex,
evolving processes such as conflicts closer to gold standards, these
techniques contribute to the larger endeavour in the NLP
community on automatic narrative extraction and
construction from text [Jorge et al., 2019].</p>
      <p>The paper is structured as follows. Section 2 describes
some background work on event dataset correlation studies
and some existing approaches on tackling the event
duplication problem. Section 3 briefly introduces an existing
automatic event extraction engine and presents a correlation
analysis vis-a-vis a Gold Standard dataset. Section 4 presents two
approaches we deployed to increase the baseline correlation
figures. In Section 5 we report on the impact of these methods
on our target datasets. Finally, we end up with conclusions in
Section 6.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related work</title>
      <p>Several publications recently focused on assessing the
correlation of event datasets based on disaggregated event counts,
for example [Ward et al., 2013]. [Schrodt and Analytics,
2015] presents an extensive comparison of large well-known
event datasets. [Hammond and Weidmann, 2014] apply
spatio-temporal disaggregation of events incidents to assess
whether GDELT data can approximate the spatial pattern of
conflicts. We applied an adapted version of their correlation
analysis in this paper.</p>
      <p>[Schutte et al., ] address the usability and the presence of
duplicates in various popular datasets of political and conflict
events in the research community, such as ACLED, GDELT,
and ICEWS1. A number of other papers refer to usability of
the event conflict databases. For example, [Demarest and
Langer, 2018] analyzed conflicts and social unrest in Africa,
using event datasets.</p>
      <p>The event duplication issue that we tackle here has been
approached in Computational Linguistics by several works on
co-reference resolution applied to event mentions [Lu and Ng,
2018]. Full-fledged event co-reference resolution is a harder
semantic task than the one we deal with here and requires to
take into consideration various deep linguistic features, due to
the complexity of the event mentions. For example, [Lee et
al., 2012] resolve simultaneously noun phrase co-references
and cross-document event co-reference, using an original
algorithm which exploits clustering, pronoun resolution and
semantic roles labeling. An unsupervised graph-based method
for event co-reference is presented in [Bejan and Harabagiu,
2010]. [Zhang et al., 2015] use both textual and visual scene
similarity features, when resolving co-reference news
captions. Another interesting method for linking similar events
based on machine learning and various similarity features has
been presented in [Piskorski et al., 2018]. We deploy a
modified version of this approach for our Event Linking in
Section 4.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Correlation across datasets</title>
      <p>For our correlation analysis we focus on two major violent
conflicts that recently plagued the African region: the Libyan
Civil War and the Mali War. For each of them, we compare
the datasets generated by a fully automatic event detection
engine with Gold Standard data coded by human experts within
the ACLED project [Raleigh et al., 2010]. For our
experiments we use the output of the English language instance
of NEXUS [Atkinson et al., 2017], a Joint Research Center
in-house multilingual system that has been running
continuously since 2007. NEXUS is a finite-state rule based event
extraction engine that processes in real-time the title and lead
sentences of monolingual clusters of news articles (for up to
10 languages) and outputs an event description template
corresponding to the main event reported in each cluster. The
clusters are computed every 10 minutes on a 4 hour window
of RSS feeds (title and lead sentences) of news sources by
the Europe Media Monitor (EMM), a large-scale multilingual
news aggregation engine that gathers articles from ca. 7000
sources (from local to global level) in 60 languages on a 24/7
basis [Atkinson and Van der Goot, 2009]. Event templates
1See http://www.lockheedmartin.com/us/products/W-ICEWS/
W-ICEWS overview.html for more information.
include two main slots: Event Type and Event Location,
together with other event-type specific descriptive and
numerical slots such as Number Dead,Number Injured etc. NEXUS
features a rule-based event geocoding algorithm that
integrates (a subset of) the Geonames gazetteer2 for place
name matching and a number of article-level disambiguation
heuristics for geo-disambiguation.</p>
      <p>Because NEXUS does not feature the same data schema
and event taxonomy as ACLED, some type mapping was
performed. Table 1 describes the datasets and the type
filtering that generated them. Moreover, while both NEXUS and
ACLED encode geographical information as a hierarchy of
administrative level components, like in the following
example:</p>
      <p>Populated Place=Kidal
Admin1=Tessalit
Admin2=AmiAdjelhoc</p>
      <p>Country=Mali
the components are not id-indexed. Therefore we normalized
geographical references by matching the name variants on the
high coverage Geonames gazetteer 3.</p>
      <p>The machine-coded event datasets (including instructions
on how to access the underlying news stories from which
they were extracted) can be accessed at: http://labs.emm4u.
eu/events.html</p>
      <p>In order to set a baseline correlation between Nexus and the
ACLED data, we aggregate event counts per time/space cells,
where time is either a week or a month range, and the space
is either a Province or Region level administrative unit.
Figure 1a and 1b below visualize the dynamics of the Libyan and
Malian conflict escalation/de-escalation by showing on each
week (month) the total number of province and regions,
respectively, experiencing one or more violent event incidents.</p>
      <p>While rather standard, this analysis is highly affected
by the relatively more coarse-grained event geocoding of
NEXUS compared to the human-coded Gold Standard 4.
Moreover, it does not consider variance in conflict intensity
estimation within time/space units, which is crucial for
microlevel analysis of conflicts at sub-national level. Conflict
intensity can be measured by absolute event counts and by total
victim counts. Therefore, in Figure 2a and Figure 2b we plot
total weekly event counts and victim counts (respectively) of
NEXUS compared to statistics from ACLED data. The same
figures are reported, on a monthly base, in Figure 3a and
Figure 3b for Mali War.</p>
      <p>Table 2 reports a number of error rate measures and
correlation coefficients between ACLED and NEXUS datasets for
the two target conflicts.</p>
      <p>Overall, the analysis shows a moderate to strong level of
correlation for event counts and a correlation from negligible
2http://geonames.org/.</p>
      <p>3We filtered out a total of 5% of ACLED events that could not be
normalized with respect of the Geonames resource.</p>
      <p>4The range of geographical distribution of NEXUS events is
much lower because whenever it fails to locate an event at the
exact populated place level, it backs off to the capital city of the most
specific administrative unit it could detect.
Libyan Civil War</p>
      <p>Mali War
342
to weak for victim counts, according to standard
interpretation of correlation coefficients [Hinkle et al., 2003].</p>
      <p>By looking at the curves, one can notice that the automatic
system is less sensitive to low conflict signals. This might be
due to both a general recall deficit of rule-based approaches
and a granularity deficit of the underlying geocoding
algorithm. Minor conflict incidents receive relatively lower
media reporting and small sized news clusters are more likely
to produce false negatives from a low-recall automatic
system. Moreover, even in cases where such small signals are
detected, there is a significant chance that the system lacks
the geographical knowledge to correctly locate them. On the
other hand, NEXUS tends to over-generate events at high
signal intensity points, possibly because it is unable to normalize
the increased stream of media reporting coverage on major
event incidents. This, combined with the invalid extraction of
outlier figures, makes the error rates particularly poor for
victim counts. These two drawbacks combined highly hinder the
usability of NEXUS-generated datasets for quantitative
conflict modelling at sub-national level.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Deduplication methods</title>
      <p>We now experiment with combining Nexus with two
alternative methods, in order to mitigate the issues underlined
by the correlation analysis. The first one, Cluster Linking,
is a pre-processing step that consists of further aggregating
over a given time window the daily news clusters on which
NEXUS is run. The rationale is that major event incidents
generate complex news stories spanning over several days
and following a range of related topic threads: being able to
track a priori these stories spares the downstream event
extraction engine the burden of detecting and aggregating event
co-references.</p>
      <p>The second method, Event Linking, is a post-processing
step that approximates event co-reference resolution by
deploying a classifier for event Relatedness, and then clusters
the resulting graph, aggregating the output event classes.
4.1</p>
      <sec id="sec-4-1">
        <title>Cluster linking</title>
        <p>The cluster linking is part of a larger multilingual application
that identifies equivalent news clusters across languages and
over time. It uses language-independent features as weighted
vectors, and calculates the news cluster similarity as their
linear combination. In this experiment we only apply the
historical cluster links on English news.</p>
        <p>The cluster representation is based on the following
features and their weights: (1) Named Entities: person names
and organisations; (2) Geolocations: geographical places
detected in each cluster; (3) Content categories: predefined
thematic categories that are assigned to each cluster; (4)
Automatically produced translations into English for cross-lingual
linking, or word tokens for monolingual linking, based on the
titles and the short descriptions of each article in a cluster; (5)
Eurovoc categorisation: the whole clusters are automatically
indexed with Eurovoc categories, using the freely available
software JEX5. (6) Combined feature: Name Entities +
Geolocations. We have explored and evaluated different
weighting methods: (a) normalized frequency; (b) log-likelihood,
that compares the term frequency in the cluster with the
frequency in a reference corpus and (c) TF-IDF, that is
proportional to the term frequency quantified by the inverse
function of the number of documents in which the term occurs.
A different weighting technique is selected for each feature:
log-likelihood (using a reference corpus) for feature (2) and
(4) and tf-idf for (1),(2) and (6). For each of these features
separately, the similarity is computed between two clusters
as the cosine between the weighted vectors generated by the
feature modified by a penalty metrics:</p>
        <p>SimF EAT URE (x, y) = cosine(x, y)⇤ penalty(x, y)
The penalty metrics are: (1) Dimension Penalty, that
decreases the similarity value in case of low dimensionality of
the feature vectors (low numbers of entities found), (2)
Jaccard Penalty, based on Jaccard index - that considers the ratio
between the number of common terms and the total number
of terms, (3) combination of 1 and 2 as the product, the
minimum or the maximum of the values. After optimising each
individual feature vector similarity, the global similarity
between two clusters is calculated as a linear combination of the
individual feature vector similarities:
SimCluster = w1⇤ SimN E + w2⇤ SimGL + w3⇤ SimCat
+ w4⇤ SimT ransl + w5⇤ SimEvcDesc + w6⇤ SimN G</p>
        <p>Further, for a given period of time, a graph of clusters is
generated by selecting the ones with the similarity above a
given threshold and the graph is expanded to its transitive
closure. Each graph represents a group of clusters that are
similar.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2 Event Linking</title>
        <p>For the sake of computing pairs of related events we used
a Random Forest-based classifier trained on a corpus of ca.
23K event pairs tagged as related or unrelated, where the
5https://ec.europa.eu/jrc/en/language-technologies/jrc-eurovocindexer.</p>
        <p>Libya Civil War</p>
        <p>Mali War</p>
        <p>Events
Dead
Events
Dead
events are represented as texts consisting of the title and
12 lead sentences from news articles reporting on crisis and
security-related events. In particular, for training the
classifier a set of about 15 features was exploited including, i.a.,
string distance metrics (e.g., Longest Common Substrings),
features that exploit knowledge bases (e.g.,WordNet,
BabelNet) to compute, e.g., WORDNET-word overlap,
NamedEntity overlap, Hypernym overlap, and some corpus-based
event similarity metrics, e.g., Weighted Word overlap, which
measures the overlap of words between the two texts, where
words bearing more content (i.e., appear more frequently in
the domain corpus) are assigned higher weight. The trained
classifier obtained 91.5% f measure on hold-out test data
consisting of approximately 20% of the entire event corpus.
Further details on the classifier can be found in [Piskorski et al.,
2018].
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Evaluation</title>
      <sec id="sec-5-1">
        <title>5.1 Cluster Linking application</title>
        <p>In our experiment, the cluster linking module was applied
to all news clusters geo-coded in Libya from the period
02/2011-11/2011. Only four (out of six) features were
con(a) Total monthly event counts for Mali war.</p>
        <p>(b) Total monthly victim counts for Mali war.
sidered: named entities, geolocations, tokens and the
combination of named entities and geolocations. The usage of
the other two features (categories and Eurovoc descriptors)
are recommended in cross-lingual experiments with poor or
no translation resources. Otherwise, in the monolingual case,
they are not increasing significantly the cluster linking
precision (and Eurovoc categorisation is a heavy computation
task). We have used two thresholds: one as a cut-off for the
clusters in the same day (monolingual daily threshold 0.72)
and the second for the historical linking, to select similar
cluster over time (historical linking threshold 0.62). Firstly, we
have selected all the pairs of clusters with a similarity above
the thresholds and then the new cluster links were added by
transitivity. Groups of clusters have been generated
considering all similar clusters. NEXUS was then run on these
expanded clusters and one main event was extracted from each.
This produced a lower size dataset for the Libyan conflict, as
it can be seen in Column Nexus CL of Table1).</p>
        <p>Event Linking application The pairs of related events
returned by the classifier were transformed into an undirected
graph and path distance was used as similarity metrics for
applying Agglomerative Clustering (with cluster cardinality
set to 50 for both datasets6). We finally applied this
clustering to time-based partitions of the datasets comprising events
within 3 days intervals and geocoded in the same region and
merged the resulting aggregated events. The final datasets are
referred to as Nexus RLT in Table1).</p>
        <p>As it can be visually seen in Figures 2a through 3b, both
methods seems to get the curves for cumulative weekly and
monthly event and victim counts closer to the ACLED data,
for both conflicts. This is particularly true with respect to
event counts, while victim counts seem to suffer from some
outlier values extracted by NEXUS. Root Mean Squared Error
figures in Table 2 confirm that the application of both
deduplication techniques produces a systematic reduction of the
absolute error rate of NEXUS, more significant for event than
for victim counts.</p>
        <p>6We used https://networkx.github.io/ and https://scikit-learn.org/
libraries for graph modelling and clustering implementation,
respectively.</p>
        <p>Event Linking seems to be more robust to outliers as these
can be mitigated by slot value merging heuristics, applied
downstream of the extraction engine. This explains why
Nexus RLT almost doubles the correlation coefficients for
Libya victim counts with respect to Nexus, as outliers are
more likely to be extracted for victims. For event counts
instead, Cluster Linking produces the higher drop in error rates.</p>
        <p>On the other hand, none of the methods is able to
consistently increase the correlation coefficients Nexus with
ACLED data by a significant factor. We hypothesize this
might be due to the inaccuracy of the event location
information. As we mentioned in Section 3, the range of
provincelevel spread of NEXUS events is only 10% and 20% of the
ACLED data for Libyan and Mali conflicts, respectively. In
this scenario, the information aggregation achieved by either
Cluster Linking or Event Linking is applied at a too
coarsegrained geographical level and might actually over-compress
the signal, by underestimating the total event counts.
6</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusions</title>
      <p>The usability of news-based, automatic coding of event
datasets for conflict analysis at sub-national level has been
questioned in the conflict analysis research community.</p>
      <p>We have shown how the application of two linguistically
light-weight text processing modules can mitigate, although
only partially, some of the standard flaws of news-based event
coding engines, namely the over-generation of event
duplicates at peaks of news reporting intensity.</p>
      <p>The cluster linking method is unsupervised and the only
customization to the event duplicate detection task consisted
of setting up plausible similarity thresholds, with no
optimization performed.</p>
      <p>The event linking is based on a trained supervised
classifier, however the training set was not overlapping with the
two target conflict datasets, which means that it shared
virtually no semantic context (e.g. named entities) with it.</p>
      <p>Therefore, we estimate that both approaches can well
generalize and be deployed to mitigate event duplication across
different datasets.</p>
      <p>On the other hand, customizing the presented methods so
as to optimize the boost in correlation coefficients of target
datasets is a promising direction to explore. For instance, we
plan to sample the NEXUS event records at peaks of
generation (with respect to Gold Standard) in order to collect
annotated data for training a more specialized event duplicate
classifier. In this respect, we expect semantic features such
as geolocation and time stamp might to turn out being highly
discriminative.</p>
      <p>Overall, we estimate that the effectiveness of the presented
methods can be better assessed when coupled with an event
extraction engine with an underlying high-accuracy,
finegrained geocoding module. Therefore we plan to re-run the
correlation benchmark after moderating the NEXUS event
location slots.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [Atkinson and
          <string-name>
            <surname>Van der Goot</surname>
            , 2009]
            <given-names>Martin</given-names>
          </string-name>
          <string-name>
            <surname>Atkinson and Erik Van der Goot</surname>
          </string-name>
          .
          <article-title>Near real time information mining in multilingual news</article-title>
          .
          <source>In Proceedings of the 18th international conference on World wide web</source>
          , pages
          <fpage>1153</fpage>
          -
          <lpage>1154</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [Atkinson et al.,
          <year>2017</year>
          ]
          <string-name>
            <given-names>Martin</given-names>
            <surname>Atkinson</surname>
          </string-name>
          , Jakub Piskorski, Hristo Tanev, and
          <string-name>
            <given-names>Vanni</given-names>
            <surname>Zavarella</surname>
          </string-name>
          .
          <article-title>On the creation of a security-related event corpus</article-title>
          .
          <source>In Proceedings of the Events and Stories in the News Workshop</source>
          , pages
          <fpage>59</fpage>
          -
          <lpage>65</lpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <source>[Bejan and Harabagiu</source>
          , 2010]
          <article-title>Cosmin Adrian Bejan</article-title>
          and
          <string-name>
            <given-names>Sanda</given-names>
            <surname>Harabagiu</surname>
          </string-name>
          .
          <article-title>Unsupervised event coreference resolution with rich linguistic features</article-title>
          .
          <source>In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics</source>
          , pages
          <fpage>1412</fpage>
          -
          <lpage>1422</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <source>[Demarest and Langer</source>
          , 2018]
          <string-name>
            <given-names>Leila</given-names>
            <surname>Demarest</surname>
          </string-name>
          and
          <string-name>
            <given-names>Arnim</given-names>
            <surname>Langer</surname>
          </string-name>
          .
          <article-title>The study of violence and social unrest in africa: A comparative analysis of three conflict event datasets</article-title>
          .
          <source>African Affairs</source>
          ,
          <volume>117</volume>
          (
          <issue>467</issue>
          ):
          <fpage>310</fpage>
          -
          <lpage>325</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <source>[Hammond and Weidmann</source>
          , 2014]
          <string-name>
            <given-names>Jesse</given-names>
            <surname>Hammond and Nils</surname>
          </string-name>
          <string-name>
            <surname>B Weidmann.</surname>
          </string-name>
          <article-title>Using machine-coded event data for the micro-level study of political violence</article-title>
          .
          <source>Research &amp; Politics</source>
          ,
          <volume>1</volume>
          (
          <issue>2</issue>
          ):
          <fpage>2053168014539924</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [Hinkle et al.,
          <year>2003</year>
          ]
          <string-name>
            <given-names>Dennis E</given-names>
            <surname>Hinkle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>William</given-names>
            <surname>Wiersma</surname>
          </string-name>
          , and Stephen G Jurs.
          <article-title>Applied statistics for the behavioral sciences (vol</article-title>
          .
          <volume>663</volume>
          ),
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [Jorge et al.,
          <year>2019</year>
          ] Al´ıpio
          <string-name>
            <surname>M Jorge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Ricardo</given-names>
            <surname>Campos</surname>
          </string-name>
          , Adam Jatowt, and Se´rgio Nunes.
          <source>Information processing &amp; management journal special issue on narrative extraction from texts (text2story)</source>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>[Lee</surname>
          </string-name>
          et al.,
          <year>2012</year>
          ]
          <string-name>
            <given-names>Heeyoung</given-names>
            <surname>Lee</surname>
          </string-name>
          , Marta Recasens,
          <string-name>
            <given-names>Angel</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Mihai</given-names>
            <surname>Surdeanu</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Dan</given-names>
            <surname>Jurafsky</surname>
          </string-name>
          .
          <article-title>Joint entity and event coreference resolution across documents</article-title>
          .
          <source>In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning</source>
          , pages
          <fpage>489</fpage>
          -
          <lpage>500</lpage>
          . Association for Computational Linguistics,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <source>[Leetaru and Schrodt</source>
          , 2013]
          <article-title>Kalev Leetaru and Philip A Schrodt. GDELT: Global data on events, location</article-title>
          , and tone, 1979-
          <fpage>2012</fpage>
          .
          <article-title>In ISA annual convention</article-title>
          , volume
          <volume>2</volume>
          , pages
          <fpage>1</fpage>
          -
          <lpage>49</lpage>
          . Citeseer,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [Lorenzini et al.,
          <year>2016</year>
          ]
          <string-name>
            <given-names>Jasmine</given-names>
            <surname>Lorenzini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Peter</given-names>
            <surname>Makarov</surname>
          </string-name>
          , Hanspeter Kriesi, and
          <string-name>
            <given-names>Bruno</given-names>
            <surname>Wueest</surname>
          </string-name>
          .
          <article-title>Towards a Dataset of Automatically Coded Protest Events from Englishlanguage Newswire Documents</article-title>
          .
          <source>In Paper presented at the Amsterdam Text Analysis Conference</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <source>[Lu and Ng</source>
          , 2018]
          <string-name>
            <given-names>Jing</given-names>
            <surname>Lu</surname>
          </string-name>
          and
          <string-name>
            <given-names>Vincent</given-names>
            <surname>Ng</surname>
          </string-name>
          .
          <article-title>Event coreference resolution: A survey of two decades of research</article-title>
          .
          <source>In IJCAI</source>
          , pages
          <fpage>5479</fpage>
          -
          <lpage>5486</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [Piskorski et al.,
          <year>2018</year>
          ]
          <string-name>
            <given-names>Jakub</given-names>
            <surname>Piskorski</surname>
          </string-name>
          , Fredi Sˇ aric´, Vanni Zavarella, and
          <string-name>
            <given-names>Martin</given-names>
            <surname>Atkinson</surname>
          </string-name>
          .
          <article-title>On training classifiers for linking event templates</article-title>
          .
          <source>In Proceedings of the Workshop Events and Stories in the News</source>
          <year>2018</year>
          , pages
          <fpage>68</fpage>
          -
          <lpage>78</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [Raleigh et al.,
          <year>2010</year>
          ]
          <string-name>
            <given-names>Clionadh</given-names>
            <surname>Raleigh</surname>
          </string-name>
          , Andrew Linke, Ha˚vard Hegre, and
          <string-name>
            <given-names>Joakim</given-names>
            <surname>Karlsen</surname>
          </string-name>
          .
          <article-title>Introducing acled: an armed conflict location and event dataset: special data feature</article-title>
          .
          <source>Journal of peace research</source>
          ,
          <volume>47</volume>
          (
          <issue>5</issue>
          ):
          <fpage>651</fpage>
          -
          <lpage>660</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <source>[Schrodt and Analytics</source>
          , 2015]
          <article-title>Philip A Schrodt and Parus Analytics</article-title>
          .
          <article-title>Comparing methods for generating large scale political event data sets. In Text as Data meetings</article-title>
          , New York University,
          <fpage>16</fpage>
          -
          <lpage>17</lpage>
          ,
          <year>2015</year>
          , pages
          <fpage>1</fpage>
          -
          <lpage>32</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>[Wang</surname>
          </string-name>
          et al.,
          <year>2016</year>
          ]
          <string-name>
            <given-names>Wei</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <surname>Ryan Kennedy</surname>
            , David Lazer,
            <given-names>and Naren</given-names>
          </string-name>
          <string-name>
            <surname>Ramakrishnan</surname>
          </string-name>
          .
          <article-title>Growing pains for global monitoring of societal events</article-title>
          .
          <source>Science</source>
          ,
          <volume>353</volume>
          (
          <issue>6307</issue>
          ):
          <fpage>1502</fpage>
          -
          <lpage>1503</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [Ward et al.,
          <year>2013</year>
          ]
          <string-name>
            <given-names>Michael D</given-names>
            <surname>Ward</surname>
          </string-name>
          , Andreas Beger, Josh Cutler, Matthew Dickenson, Cassy Dorff, and
          <string-name>
            <given-names>Ben</given-names>
            <surname>Radford</surname>
          </string-name>
          .
          <article-title>Comparing gdelt and icews event data</article-title>
          .
          <source>Event Data Analysis</source>
          ,
          <volume>21</volume>
          (
          <issue>1</issue>
          ):
          <fpage>267</fpage>
          -
          <lpage>297</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [Zhang et al.,
          <year>2015</year>
          ]
          <string-name>
            <given-names>Tongtao</given-names>
            <surname>Zhang</surname>
          </string-name>
          , Hongzhi Li,
          <string-name>
            <given-names>Heng</given-names>
            <surname>Ji</surname>
          </string-name>
          , and
          <string-name>
            <surname>Shih-Fu Chang</surname>
          </string-name>
          .
          <article-title>Cross-document event coreference resolution based on cross-media features</article-title>
          .
          <source>In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing</source>
          , pages
          <fpage>201</fpage>
          -
          <lpage>206</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>