<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Junbo Huang</string-name>
          <email>junbo.huang@uni-hamburg.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Longquan Jiang</string-name>
          <email>longquan.jiang@uni-hamburg.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cedric Möller</string-name>
          <email>ller@uni-hamburg.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ricardo Usbeck</string-name>
          <email>ricardo.usbeck@leuphana.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Event Deduplication, Event Temporal Ordering, Media Narrative Discourse</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>In: R. Campos, A. Jorge, A. Jatowt, S. Bhatia, M. Litvak (eds.): Proceedings of the Text2Story'24 Workshop</institution>
          ,
          <addr-line>Glasgow</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Leuphana University Lüneburg</institution>
          ,
          <addr-line>Universitätsallee 1, 21335 Lüneburg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Hamburg, Department of Computer Science</institution>
          ,
          <addr-line>Vogt-Kölln-Straße 30, 22527 Hamburg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <abstract>
        <p>With the growing amount of online data, distinguishing between similar events and news about them poses a significant challenge for both companies and crisis reaction units. To discriminate event instances, we present Eventist, a silver-standard event instance dataset from news in English, containing 23,304 news headlines from 90 countries covering in total 113 storm-related events between 1 January 2021 and 1 September 2023. Sampled data is validated by two human raters. Additionally, we propose to adopt a sentence-level event representation for modeling media narrative discourse. Finally, we provide two pairwise comparison benchmarks on event deduplication and event temporal ordering, enabling the practicality of event extraction.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CEUR
ceur-ws.org</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>
        With climate change, a noticeable uptick in the frequency and severity of tropical storms and
their associated impacts is observed [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. Notably, storm surges, including extreme floods and
tsunamis, often result in more significant damage than the storms themselves. To automatically
identify storm events holds the potential for fast crisis responses. Consequently, event extraction
from media texts has emerged as a pivotal task in Natural Language Processing (NLP) [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ].
      </p>
      <p>
        We view media texts as narratives describing specific events with their often implicit causal
or temporal relations. We address news headlines as compressed narratives containing partial
event information. An event, in this context, denotes the real-world occurrence of a particular
incident, attributed to its constituent elements such as participants, temporal attributes, and
geographical location, often resulting in a change of state of a set of geopolitical entities (GPEs)
[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. In this study, we represent events with English-language news headlines, with a focus
on storm-related narratives. Specifically, storm-related narratives are texts with at least one
storm-related event mention.
      </p>
      <p>
        In NLP, events are referred to as word-based event mentions, typically represented as verbs
or nouns [
        <xref ref-type="bibr" rid="ref3 ref4 ref6">3, 4, 6</xref>
        ]. For instance, in the headline “Latest from the Tropics: Tropical Storm Bret
(a) Event distribution over temporally overlapping (b) Sentence-level event representation on 2023
Cyevents. Note that multiple events can have a clone Gabrielle in diferent stages of event
decompound efect on a set of geopolitical entities. velopment. This representation captures event
Such compound efect can not be captured by attributes, the efect of events and the telling of
extracting only word-based event mentions. the events, known as narrative discourse.
weakens while Cindy strengthens”, the extracted word-based event mentions are “Tropical
Storm Bret” (formed on June 19, 2023) and “Cindy” (formed on June 27, 2023). We argue for
the significance of recognizing narrative discourses involves nuances in news storytelling. In
this example, the narrative’s temporal focus shifts from the decline of tropical storm Bret to
the emergence of tropical storm Cindy. We consider the self-referential change of state, of one
being weakened and another being strengthened, as the efect of an event, represented by a
single news headline containing two word-based event mentions.
      </p>
      <p>
        A significant yet often overlooked challenge lies in determining whether diferent news
articles pertain to the same event. Discrepancies between word-based event mentions and
events present a formidable obstacle known as event instance discrimination [
        <xref ref-type="bibr" rid="ref7 ref8 ref9">7, 8, 9</xref>
        ]. Despite
its critical importance, there exists no standardized benchmark or task formulation to address
this issue. Prior attempts have involved entity matching based on factors such as location and
time [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8, 10</xref>
        ], or linking event mentions to entries in a knowledge graph (KG) like Wikidata
[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. However, heuristic-driven methods often sufer from poor generalization, while KG-driven
approaches are limited by the inability to represent events absent from the KG. More importantly,
extracting only word-based event mentions fails to represent the efect of events.
      </p>
      <p>
        Furthermore, annotating mentions at the word level is a costly process, often resulting in low
agreement among annotators [11, 12, 13]. Evidence suggests that word-based event mentions
are not crucial for efective event detection [ 14, 15]. In our approach, we utilize sentence-level
event representation, as illustrated in Figure 1b. Sentence-level event representation can better
capture (1) event attributes, (2) the efect of events, and (3) the telling of the events, also known
as narrative discourse [16]. Narrative discourse varies before, during, and after the event, which
is practically important in many disciplines [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4, 17, 18</xref>
        ].
      </p>
      <p>To achieve this goal of distinguishing event instances in news streams (Figure 1a), drawing
inspiration from the success of deep distance metric learning [19, 20], we suggest a baseline
system for two pairwise comparison tasks. Our contributions include:
• Introducing a large-scale silver-standard event instance dataset named Eventist. The
dataset comprises 23,304 English news headlines, covering a total of 113 storm-related
real-world events from 90 countries between January 1, 2021, and September 1, 2023.
• Providing two benchmarks on event deduplication and event temporal ordering. We have
made the dataset, baseline models, and code openly accessible on https://github.com/
semantic-systems/paper-event-instance-discrimination.</p>
    </sec>
    <sec id="sec-3">
      <title>2. Related Work</title>
      <p>
        Dataset To our knowledge, the most closely related dataset concerning crisis-related news
articles linked to KGs is Crisisfacts [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], initially designed for extracting atomic facts for temporal
summarization. Events are linked to Wikidata, and publication dates are available as
metainformation. It is theoretically possible to re-formulate the temporal summarization task into
event deduplication and event temporal ordering. However, as depicted in Table 1, Crisisfacts
encompasses only 8 events spread across 31 unique dates (all confined to the United States),
limiting its applicability. In contrast, Eventist contains news headlines describing 113 diferent
events, covering 688 unique dates, and a significantly longer narrative duration.
      </p>
      <p>
        Methods for Event Deduplication Heuristic-based event deduplication methods first
extract entities from texts, which are used as features for event deduplication. [10] deployed
a graph-based event merging strategy. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] considered multiple metrics measuring temporal
similarity, string similarity, and entity similarity. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] utilized external entities information
in a KG. KG-based approaches aim to directly link word-based event mentions to Wikipedia
[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. While heuristic-based approaches can only identify texts of high similarity, KG-based
approaches fail to capture events not included in the KG.
      </p>
      <p>Representation of Event Temporal Relations Freksa’s cognitive perspective on time
and temporal reasoning, as elucidated in his work [21], ofers a simplified version of Allen’s
interval-based temporal relations, comprising 13 distinct types [22]. Unlike Allen’s approach,
which presents a compositionally complete framework for temporal reasoning, Freksa’s
semiinterval-based representation acknowledges the uncertainty inherent in event boundaries and
measures temporal relations based on the occurrence of events. While Allen’s model may
be theoretically robust, it may not always align with the intricacies of narrative studies. In
this work, the temporality of events refers to the narrated time of events rather than their
actual occurrence. Viewing narratives as a point in time ofers a coherent perspective, which
recognizes that narratives encapsulate a specific temporal snapshot rather than a comprehensive
depiction of events as they unfold in the real world.</p>
    </sec>
    <sec id="sec-4">
      <title>3. Dataset Construction</title>
      <p>
        News Acquisition News articles are initially collected from the Global Database of Events,
Language, and Tone (GDELT) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], utilizing storm-related keywords such as storm, flood,
hurricane, typhoon, and tornado. The search spans from January 1, 2021, to September 1, 2023,
with English language restrictions, yielding 1,978,483 raw articles from 149 countries. After
identifying and removing 50.28% of duplicate entries, 983,775 unique news articles remain. Our
preliminary studies showed that the regular expressions-based GDELT search includes pages
where at least one keyword appears in the title, body, image caption, or advertisements. To refine
the selection to storm-related news, additional steps are taken, including cosine similarity-based
clustering, event type annotation, entity recognition, and temporal clustering.
      </p>
      <sec id="sec-4-1">
        <title>3.1. Denoising Pipeline</title>
        <p>The steps in denoising data are designed to construct a dataset with high precision. This entails
ensuring that each cluster uniquely refers to a specific event, and that all headlines within any
cluster reference the same event, despite potential narrative variations.</p>
        <p>Clustering on headlines To generate representations for event mentions, we employ a
pre-trained sentence transformer (all-MiniLM-L6-v21). Subsequently, we calculate a distance
matrix  ∈ ℝ × based on cosine similarity for  sentence pairs. Clusters are established with
two criteria: 1) each cluster must comprise a minimum of 50 instances, and 2) cosine similarities
between every pair of instances must exceed 0.7. This process results in the retrieval of 786,944
news instances, organized into 264 clusters.</p>
        <p>
          Annotation on Event Type and GPE We used the Text REtrieval Conference Incident
Stream (TREC-IS) dataset to train an event type classifier for soft labeling clusters. TREC-IS is a
collection of microblog posts about pre-disaster, in-disaster, and post-disaster discussions [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. It
contains gold-standard annotation of 118 events. Clusters are chosen if the majority of headlines
are predicted as tropical storms, hurricanes, floods, typhoons, or tornadoes. Additionally,
spaCy’s NER (en_core_web_md2) is used to annotate GPEs. Given the anticipated variations in
1https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2
2https://spacy.io/models/en#en_core_web_md
narrative discourse within each cluster, diferences in GPEs among headlines belonging to the
same cluster are expected. The annotated GPEs play a crucial role in the subsequent merging of
clusters.
        </p>
        <p>Temporal Clustering For each cluster, a one-dimensional temporal clustering algorithm
(DBSCAN) is employed to eliminate temporal outliers, with temporal information represented by
the news publish date. We adopted a temporal granularity of one day, setting min_samples = 3
and  = 1 , resulting in the removal of 752,429 headlines.</p>
        <p>Merging Clusters The remaining 34,515 continuous mentions form 278 clusters representing
storm-related disaster instances. These clusters encompass various narratives, including
predisaster, in-disaster and post-disaster information. The maximum overlapping ratio, denoted as
 , between the GPEs in any pair of clusters (  and   ) is computed using Equation 1.
  =</p>
        <p>|  ∩   |
max(|  |, |  |)
(1)</p>
        <p>To capture long events such as 2023 Cyclone Freddy,3 we allow a temporal gap of 10 days
between clusters. Two clusters are merged if (1) GPEs overlap ratio   ≥ 0.5, and (2) minimum
between-cluster temporal distance   = min∈  , ′∈  (,  ′) ≤ 10. As a result, 263 unique
clusters are returned. To further examine the clustering quality, one domain expert manually
linked each cluster to a Wikidata entity of type “occurrence”4 and checked for geospatial and
temporal consistency between mentions and the actual event occurrences, concluding 113
unique clusters.</p>
      </sec>
      <sec id="sec-4-2">
        <title>3.2. Dataset Validation</title>
        <p>We randomly selected 5 headlines from each cluster, resulting in a total of 565 sampled headlines
for assessing cluster coherence and uniqueness. All headlines are shown at once. Cluster
coherence was evaluated using a 5-point Likert scale, with two human raters indicating the
extent to which media narratives described the same event, ranging from ”Strongly Disagree”
(1) to ”Strongly Agree” (5). The mean Likert response score for coherence was   = 4.03,
Krippendorf’s alpha  = 0.736 . The uniqueness of events within each cluster was assessed
by determining whether each event was distinct within the set. Cohen’s kappa statistic for
uniqueness was  = 1 .</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>4. Benchmarks</title>
      <p>Event deduplication is a binary classification task 5 and event temporal ordering is a multiclass
classification task 6. We use a bi-encoder (siamese network structure) comparing
DistilRoBERTabase, DistilBERT-base-cased, RoBERTa-base and BERT-base-cased. Consistent with [19], a
vector concatenation considering element-wise diference between both embedded vectors,
3The longest recorded tropical cyclone
4https://www.wikidata.org/wiki/Q1190554
5Two labels for event deduplication: same_event_instance, diferent_event_instance
6Three labels for temporal ordering: before, equal and after
 ⊕  ⊕ | −  | , is later fed into a softmax classification layer. We used Adam optimizer with
learning rate 2e−5, and a linear learning rate warm-up over 10% of the training data. Number of
training epoch is 5. The dataset is partitioned to include clusters of varying sizes in each split
(Table 2), consisting of 9 large clusters (  ≥ 500), 48 medium-sized clusters (500 &gt;   ≥ 200),
and 58 small clusters (  &lt; 200).</p>
      <p>Within each split, stratified sampling is applied to sample headline pairs, balancing date
distribution, label distribution, and event instance distribution in headline pairs. Each model is
run with three seeds, and all experiments are run with an NVIDIA RTX A6000 48 GB graphics
card. Table 3 shows the benchmark result over 4 seeds (0, 1, 2, 3).</p>
      <p>The baseline result under the pairwise comparison task formulation reveals two significant
implications. Firstly, it demonstrates the feasibility of discriminating event instances using
only news headlines, as evidenced by the result in event deduplication. This suggests that the
denoising pipeline and event type detection mechanisms are efective in distinguishing how
diferent storm-related events are mentioned in the headlines. Secondly, the result highlights
the challenge of identifying the temporal ordering of headlines mentioning storm-related events.
This task proves to be much more complex compared to discriminating between individual event
instances. The nuanced temporal relationships and contextual dependencies within narratives
pose a greater dificulty in determining the chronological sequence of events discussed in news
articles.</p>
    </sec>
    <sec id="sec-6">
      <title>5. Limitations and Discussions</title>
      <sec id="sec-6-1">
        <title>5.1. Generalizability of the Proposed Denoising Pipeline</title>
        <p>One noticeable ad-hoc component in the denoising pipeline that hinders generalizability is
the domain-specific event type detector, trained in a supervised learning fashion. We used
an event type detector on a gold-standard human-annotated dataset (TREC-IS) to annotate
event types to select news headlines classified as being storm-related. While this approach
prioritizes the accuracy of event type annotations, it limits the generalizability to diferent event
types. We highlight the importance of showcasing the practical implications of the relatively
under-explored task of event instance discrimination, exemplified by storm-related events.
However, advancements in zero-shot learning techniques, such as prompt-based approaches
leveraging Large Language Models (LLMs), ofer promising avenues to replace the ad-hoc event
type detector, potentially enhancing the pipeline’s generalizability across diverse event types.
It is crucial to conduct a thorough evaluation of zero-shot event type detectors to ensure the
quality of annotations.</p>
      </sec>
      <sec id="sec-6-2">
        <title>5.2. Event Temporal Ordering Beyond News Publication Date</title>
        <p>
          We used news publication dates as features to identify the temporal order of any pair of news
headlines. This representation ignores the temporal lag between news publication and the
actual occurrence of events. We acknowledge the limitations of this approach, particularly in
extracting precise absolute temporal attributes of events. A more accurate method would involve
linking event mentions to well-structured data sources such as Knowledge Graphs (KGs) to
extract strictly precise temporal attributes. However, the task of event linking remains relatively
unexplored, with existing works primarily linking event mentions to community-driven sources
like Wikipedia, which may contain inaccurate, or false information [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ].
        </p>
        <p>Despite the inherent limitations, leveraging news publication dates for temporal ordering
provides an approximate timeline of event developments within media narratives. This
approach allows for the construction of a detailed narrative timeline, facilitating the identification
of in-disaster news reports and further extraction, normalization, and analysis of temporal
expressions.</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>6. Conclusion</title>
      <p>In conclusion, we introduced the Eventist dataset, a comprehensive silver-standard dataset that
serves as a benchmark for two essential pairwise comparison tasks: event deduplication and
event temporal ordering. While we validated the dataset with human raters, it’s important to
note that it may contain noise introduced by biases inherent in the denoising pipeline. However,
our focus on event instance discrimination underscores the dataset’s significance and opens
avenues for further research in refining and enhancing these critical NLP tasks.</p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgments</title>
      <p>The authors acknowledge the financial support by the Federal Ministry for Economic Afairs
and Energy of Germany in the project CoyPu (project number 01MK21007G).
Chapter of the Association for Computational Linguistics, EACL 2023, Dubrovnik, Croatia,
May 2-6, 2023, Association for Computational Linguistics, 2023, pp. 2671–2680. URL:
https://doi.org/10.18653/v1/2023.eacl-main.196. doi:10.18653/V1/2023.EACL-MAIN.196.
[10] W. Ai, J. Xu, H. Shao, Z. Wang, T. Meng, An entity event deduplication method based on
connected subgraph, in: J. Yang, K. Li, W. Tu, Z. Xiao, L. Wang (Eds.), 7th International
Conference on Systems and Informatics, ICSAI 2021, Chongqing, China, November 13-15,
2021, IEEE, 2021, pp. 1–6. URL: https://doi.org/10.1109/ICSAI53574.2021.9664040. doi:10.
1109/ICSAI53574.2021.9664040.
[11] O. Inel, L. Aroyo, Validation methodology for expert-annotated datasets: Event annotation
case study, in: M. Eskevich, G. de Melo, C. Fäth, J. P. McCrae, P. Buitelaar, C. Chiarcos,
B. Klimek, M. Dojchinovski (Eds.), 2nd Conference on Language, Data and Knowledge,
LDK 2019, May 20-23, 2019, Leipzig, Germany, volume 70 of OASIcs, Schloss Dagstuhl
Leibniz-Zentrum für Informatik, 2019, pp. 12:1–12:15. URL: https://doi.org/10.4230/OASIcs.</p>
      <p>LDK.2019.12. doi:10.4230/OASICS.LDK.2019.12.
[12] Z. Song, A. Bies, S. M. Strassel, J. Ellis, T. Mitamura, H. T. Dang, Y. Yamakawa, S. Holm,
Event nugget and event coreference annotation, in: M. Palmer, E. H. Hovy, T. Mitamura,
T. O’Gorman (Eds.), Proceedings of the Fourth Workshop on Events,
EVENTS@HLTNAACL 2016, San Diego, California, USA, June 17, 2016, Association for Computational
Linguistics, 2016, pp. 37–45. URL: https://doi.org/10.18653/v1/W16-1005. doi:10.18653/
V1/W16-1005.
[13] C. Colruyt, O. D. Clercq, T. Desot, V. Hoste, EventDNA: A dataset for dutch news event
extraction as a basis for news diversification, Lang. Resour. Evaluation 57 (2023) 189–221.</p>
      <p>URL: https://doi.org/10.1007/s10579-022-09623-2. doi:10.1007/S10579-022-09623-2.
[14] S. Liu, Y. Li, F. Zhang, T. Yang, X. Zhou, Event detection without triggers, in: J. Burstein,
C. Doran, T. Solorio (Eds.), Proceedings of the 2019 Conference of the North American
Chapter of the Association for Computational Linguistics: Human Language Technologies,
NAACL-HLT 2019, Minneapolis, MN, USA, June 2-7, 2019, Volume 1 (Long and Short
Papers), Association for Computational Linguistics, 2019, pp. 735–744. URL: https://doi.
org/10.18653/v1/n19-1080. doi:10.18653/V1/N19-1080.
[15] T. Ling, L. Chen, H. Sheng, Z. Cai, H. Liu, Sentence-level event detection without
triggers via prompt learning and machine reading comprehension, CoRR abs/2306.14176
(2023). URL: https://doi.org/10.48550/arXiv.2306.14176. doi:10.48550/ARXIV.2306.14176.
arXiv:2306.14176.
[16] A. Piper, R. J. So, D. Bamman, Narrative theory for computational narrative
understanding, in: M. Moens, X. Huang, L. Specia, S. W. Yih (Eds.), Proceedings of the 2021
Conference on Empirical Methods in Natural Language Processing, EMNLP 2021, Virtual
Event / Punta Cana, Dominican Republic, 7-11 November, 2021, Association for
Computational Linguistics, 2021, pp. 298–311. URL: https://doi.org/10.18653/v1/2021.emnlp-main.26.
doi:10.18653/V1/2021.EMNLP-MAIN.26.
[17] S. Diaf, J. Döpke, U. Fritsche, I. Rockenbach, Sharks and minnows in a shoal of words:
Measuring latent ideological positions based on text mining techniques, European Journal
of Political Economy 75 (2022) 102179. URL: https://doi.org/10.1016/j.ejpoleco.2022.102179.
doi:10.1016/j.ejpoleco.2022.102179.
[18] T. Zhang, A. M. Schoene, S. Ji, S. Ananiadou, Natural language processing applied to
mental illness detection: A narrative review, npj Digit. Medicine 5 (2022). URL: https:
//doi.org/10.1038/s41746-022-00589-7. doi:10.1038/S41746-022-00589-7.
[19] N. Reimers, I. Gurevych, Sentence-BERT: Sentence embeddings using siamese
BERTnetworks, in: K. Inui, J. Jiang, V. Ng, X. Wan (Eds.), Proceedings of the 2019 Conference
on Empirical Methods in Natural Language Processing and the 9th International Joint
Conference on Natural Language Processing, EMNLP-IJCNLP 2019, Hong Kong, China,
November 3-7, 2019, Association for Computational Linguistics, 2019, pp. 3980–3990. URL:
https://doi.org/10.18653/v1/D19-1410. doi:10.18653/V1/D19-1410.
[20] S. Nishikawa, R. Ri, I. Yamada, Y. Tsuruoka, I. Echizen, EASE: Entity-aware contrastive
learning of sentence embedding, in: M. Carpuat, M. de Marnefe, I. V. M. Ruíz (Eds.),
Proceedings of the 2022 Conference of the North American Chapter of the Association
for Computational Linguistics: Human Language Technologies, NAACL 2022, Seattle,
WA, United States, July 10-15, 2022, Association for Computational Linguistics, 2022, pp.
3870–3885. URL: https://doi.org/10.18653/v1/2022.naacl-main.284. doi:10.18653/V1/2022.</p>
      <p>NAACL-MAIN.284.
[21] C. Freksa, Temporal reasoning based on semi-intervals, Artif. Intell. 54 (1992) 199–227. URL:
https://doi.org/10.1016/0004-3702(92)90090-K. doi:10.1016/0004-3702(92)90090-K.
[22] J. F. Allen, Maintaining knowledge about temporal intervals, Commun. ACM 26 (1983)
832–843. URL: https://doi.org/10.1145/182.358434. doi:10.1145/182.358434.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>R.</given-names>
            <surname>Méndez-Tejeda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Hernández-Ayala</surname>
          </string-name>
          ,
          <article-title>Links between climate change and hurricanes in the North Atlantic, PLOS Climate 2 (</article-title>
          <year>2023</year>
          )
          <article-title>e0000186</article-title>
          . URL: https://doi.org/10.1371/journal. pclm.0000186. doi:
          <volume>10</volume>
          .1371/journal.pclm.
          <volume>0000186</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>D.</given-names>
            <surname>Xi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gori</surname>
          </string-name>
          ,
          <article-title>Increasing sequential tropical cyclone hazards along the US East and Gulf coasts</article-title>
          ,
          <source>Nature Climate Change</source>
          <volume>13</volume>
          (
          <year>2023</year>
          )
          <fpage>258</fpage>
          -
          <lpage>265</lpage>
          . URL: https://doi.org/10.1038/ s41558-023-01595-7. doi:
          <volume>10</volume>
          .1038/s41558-023-01595-7.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>R.</given-names>
            <surname>McCreadie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Buntain</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Soborof</surname>
          </string-name>
          ,
          <article-title>TREC incident streams: Finding actionable information on social media</article-title>
          , in: Z.
          <string-name>
            <surname>Franco</surname>
            ,
            <given-names>J. J.</given-names>
          </string-name>
          <string-name>
            <surname>González</surname>
            ,
            <given-names>J. H.</given-names>
          </string-name>
          <string-name>
            <surname>Canós</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the 16th International Conference on Information Systems for Crisis Response and Management</source>
          , València, Spain, May
          <volume>19</volume>
          -22,
          <year>2019</year>
          ,
          <string-name>
            <given-names>ISCRAM</given-names>
            <surname>Association</surname>
          </string-name>
          ,
          <year>2019</year>
          . URL: http://idl.iscram.org/files/richardmccreadie/2019/1867_RichardMcCreadie_etal2019.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>R.</given-names>
            <surname>McCreadie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Buntain</surname>
          </string-name>
          ,
          <article-title>Crisisfacts: Buidling and evaluating crisis timelines</article-title>
          ,
          <source>in: 20th International Conference on Information Systems for Crisis Response and Management (ISCRAM</source>
          <year>2023</year>
          ), Omaha,
          <string-name>
            <surname>NE</surname>
          </string-name>
          , USA,
          <year>2023</year>
          , pp.
          <fpage>320</fpage>
          -
          <lpage>339</lpage>
          . URL: https://doi.org/10.59297/JVQZ9405. doi:
          <volume>10</volume>
          .59297/JVQZ9405.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>G. R.</given-names>
            <surname>Doddington</surname>
          </string-name>
          , A. Mitchell,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Przybocki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. A.</given-names>
            <surname>Ramshaw</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Strassel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. M.</given-names>
            <surname>Weischedel</surname>
          </string-name>
          ,
          <article-title>The automatic content extraction (ACE) program - Tasks, Data, and Evaluation</article-title>
          , in
          <source>: Proceedings of the Fourth International Conference on Language Resources and Evaluation</source>
          ,
          <string-name>
            <surname>LREC</surname>
          </string-name>
          <year>2004</year>
          , May 26-28,
          <year>2004</year>
          , Lisbon, Portugal,
          <source>European Language Resources Association</source>
          ,
          <year>2004</year>
          . URL: http://www.lrec-conf.org/proceedings/lrec2004/summaries/5.htm.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>X.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Han</surname>
          </string-name>
          ,
          <string-name>
            <surname>W</surname>
          </string-name>
          . Jiang, R. Han,
          <string-name>
            <surname>Z</surname>
          </string-name>
          . Liu,
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <article-title>MAVEN: A massive general domain event detection dataset</article-title>
          , in: B.
          <string-name>
            <surname>Webber</surname>
            , T. Cohn,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>He</surname>
          </string-name>
          , Y. Liu (Eds.),
          <source>Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, EMNLP</source>
          <year>2020</year>
          , Online,
          <source>November 16-20</source>
          ,
          <year>2020</year>
          , Association for Computational Linguistics,
          <year>2020</year>
          , pp.
          <fpage>1652</fpage>
          -
          <lpage>1671</lpage>
          . URL: https://doi.org/10.18653/v1/
          <year>2020</year>
          .emnlp-main.
          <volume>129</volume>
          . doi:
          <volume>10</volume>
          .18653/V1/
          <year>2020</year>
          .EMNLP-MAIN.
          <year>129</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>F.</given-names>
            <surname>Rollo</surname>
          </string-name>
          , L. Po,
          <article-title>Crime event localization and deduplication</article-title>
          , in: J.
          <string-name>
            <given-names>Z.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. A. M.</given-names>
            <surname>Tamma</surname>
          </string-name>
          , C. d'Amato,
          <string-name>
            <given-names>K.</given-names>
            <surname>Janowicz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Fu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Polleres</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Seneviratne</surname>
          </string-name>
          , L. Kagal (Eds.),
          <source>The Semantic Web - ISWC 2020 - 19th International Semantic Web Conference</source>
          , Athens, Greece, November 2-
          <issue>6</issue>
          ,
          <year>2020</year>
          , Proceedings,
          <string-name>
            <surname>Part</surname>
            <given-names>II</given-names>
          </string-name>
          , volume
          <volume>12507</volume>
          of Lecture Notes in Computer Science, Springer,
          <year>2020</year>
          , pp.
          <fpage>361</fpage>
          -
          <lpage>377</lpage>
          . URL: https://doi.org/10.1007/978-3-
          <fpage>030</fpage>
          -62466-8_
          <fpage>23</fpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>030</fpage>
          -62466-8\_
          <fpage>23</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>V.</given-names>
            <surname>Zavarella</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Piskorski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Ignat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Tanev</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Atkinson, Mastering the media hype: Methods for deduplication of conflict events from news reports</article-title>
          , in: A.
          <string-name>
            <surname>M. Jorge</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Campos</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Jatowt</surname>
            ,
            <given-names>A</given-names>
          </string-name>
          . Aizawa (Eds.),
          <source>Proceedings of AI4Narratives - Workshop on Artificial Intelligence for Narratives in conjunction with the 29th International Joint Conference on Artificial Intelligence and the 17th Pacific Rim International Conference on Artificial Intelligence (IJCAI</source>
          <year>2020</year>
          ), Yokohama, Japan,
          <source>January 7th and 8th</source>
          ,
          <year>2021</year>
          <article-title>(online event due to Covid-19 outbreak)</article-title>
          , volume
          <volume>2794</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>29</fpage>
          -
          <lpage>34</lpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2794</volume>
          /paper6.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>X.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Yin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Roth</surname>
          </string-name>
          ,
          <article-title>Event linking: Grounding event mentions to wikipedia</article-title>
          , in: A.
          <string-name>
            <surname>Vlachos</surname>
          </string-name>
          , I. Augenstein (Eds.),
          <source>Proceedings of the 17th Conference of the European</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>