<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Multi-label Infectious Disease News Event Corpus</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jakub Piskorski</string-name>
          <email>jpiskorski@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nicolas Stefanovitch</string-name>
          <email>nicolas.stefanovitch@ec.europa.eu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Brian Doherty</string-name>
          <email>brian.doherty@ec.europa.eu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jens P. Linge</string-name>
          <email>jens.linge@ec.europa.eu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sopho Kharazi</string-name>
          <email>sopho.kharazi@ext.ec.europa.eu</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jas Mantero</string-name>
          <email>jasmantero@gmail.com</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Guillaume Jacquet</string-name>
          <email>guillaume.jacquet@ec.europa.eu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alessio Spadaro</string-name>
          <email>alessio.spadaro@ec.europa.eu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giulia Teodori</string-name>
          <email>giulia.teodori@ec.europa.eu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>European Commission Joint Research Centre</institution>
          ,
          <addr-line>Ispra</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Polish Academy of Sciences</institution>
          ,
          <addr-line>Warsaw</addr-line>
          ,
          <country country="PL">Poland</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <abstract>
        <p>This paper describes a new corpus consisting of circa 4.5K news snippets (multi-)labelled with finegrained infectious disease-related event types. The paper presents the underlying event taxonomy consisting of 25 fine-grained event types grouped into 8 main categories, the process of creating the corpus, related statistics and reports on the performance of SVM- and RoBERTa transformer-based baseline models for multi-label event classification. The former model obtains macro  1 score of 0.56 and 0.68 for fine- and coarse-grained classification, respectively, whereas the corresponding macro</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>1 scores
for the latter model are 0.65 and 0.76, respectively.</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>Surveillance and quick response to situations emerging from outbreaks of infectious diseases,
e.g. Covid-19, relies on comprehension of all related events, which, among others, are reported
in large amounts in news articles being published every day. Automated solutions that facilitate
extraction and classification of such events is crucial in order to leverage such sources of
information, especially for early-warning systems.</p>
      <p>In this paper, we describe a new corpus consisting of news snippets multi-labelled with
ifne-grained infectious disease-related event types reported therein. The main drive behind
this endeavour was to create material for training and building respective ML-based models for
event detection/classification in epidemics-related online news gathered by a large-scale news
aggregation and analysis engine, and to share such a resource with the scientific community,
since, to the best of our knowledge, no similar publicly accessible event-centred corpus exists
for this specific domain. Event detection and classification constitutes a key enabling technique
to build higher-level applications, e.g. event extraction, news summarization, and narrative
understanding.</p>
      <p>
        Since the beginning of the Covid-19 pandemic, a vast amount of work on studying
Covid-19related online media and automated analysis thereof has been reported, which mainly focused
on exploiting topic detection [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], fake news/misinformation narrative analysis [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], entity and
demographic-based analysis [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], and sentiment detection [
        <xref ref-type="bibr" rid="ref1 ref4">4, 1</xref>
        ], whereas relatively little work
on automated event detection and extraction has been published in this context.
      </p>
      <p>
        A corpus of 10K tweets containing public reports of Covid-19 events centered around reporting
cases, deaths, prevention measures, and cures was presented in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. A large hand-coded dataset
of over 13K policy measures introduced worldwide related to Covid-19, gathered among others
from online news, is presented in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Other online media resources related to Covid-19 have
been listed on the CLARIN Covid-19 response web page. 1
      </p>
      <p>
        [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] presented a BERT-based system that extracts and classifies Covid-19 related events and
relations between them, using a semi-automatically created event taxonomy consisting of
76 event types. The event taxonomy in the aforementioned work exhibits, to some degree,
similarity with the one presented in this paper; however, no event-labelled corpora have been
released by the authors. Furthermore, our event taxonomy was not created automatically, but
emerged from a business requirement analysis by public health experts and has been designed
upfront to cover any infectious diseases, going beyond the Covid-19 pandemic. Finally, the
news snippets in our corpus are multi-label annotated.
      </p>
      <p>
        Related to our work, some short news text classification datasets have been published, e.g.
[
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] introduce a corpus of ca. 200k news headlines labelled with 40 general news categories, and
work related to exploring ML-based models (accompanied with datasets) for the detection and
classification of natural disasters [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], financial [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and socio-political events [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] reported in
the news, covering domains that, however, have little in common with pandemics and infectious
diseases.
      </p>
    </sec>
    <sec id="sec-3">
      <title>2. News Snippet Event Corpus</title>
      <p>This section describes the event taxonomy, creation of the corpus of news snippets with labels
corresponding to events referred to in these snippets, and provides some corpus statistics. We
consider an event2, a situation (or a group thereof) that has either: occurred, is currently taking
place, or is planned or considered to happen in the future, in some place and at a certain point
in time (punctual events) or spanned/spans a time period with a start date and potential end
date. Furthermore, references to a state (of play) of a situation (an ongoing event) that has not
yet ended, statements and opinions made about it are also considered events.</p>
      <sec id="sec-3-1">
        <title>2.1. Infectious Disease-related Event Taxonomy</title>
        <p>The events are grouped into 8 main categories that revolve around: reporting on the disease
outbreak development, impact, measures, violations, research, support, communication, which</p>
        <sec id="sec-3-1-1">
          <title>1https://www.clarin.eu/content/clarin-responses-covid-19</title>
          <p>
            2Our notion of events is based on the TimeML standard specifications [
            <xref ref-type="bibr" rid="ref12">12</xref>
            ]
          </p>
          <p>REPORTING: reporting single/multiple infection cases and deaths that occurred within a short period of time and
provision of general situation overview (in terms of people afected) spanning a longer time period.</p>
          <p>IMPACT: all events that are impacted by the outbreak of the infectious disease/pandemic, e.g. cancellation of events
MEASURE: introduction and changes to legislation, restrictions and recommendations of preventive nature necessary
to combat the disease, i.e. the number of infected/afected people and spread of the disease, roll-out of related vaccines,
medicines and equipment.</p>
          <p>VIOLATION: any illegal activity, fraud, fake product discovery, unrest related to the introduced measures, and spread of
misinformation.</p>
          <p>RESEARCH &amp; DEVELOPMENT: reporting on the phenomena observed during the spread of the disease, progress on
vaccines, medicine and relevant equipment development, and support to research and development related to diagnose or
treat the disease.</p>
          <p>COMMUNICATION: high-level meetings to discuss the situation, impacts and/or introduce measures, and launch of new
information sharing/collection instruments concerning the disease and related phenomena.</p>
          <p>SUPPORT: provision of financial and other type of support to the afected entities, community, economy, etc., and
mentions of the need or lack of such support.</p>
          <p>MISCELLANEOUS: any other events related (not covered above) or unrelated to infectious diseases, and non-events, i.e.
texts not referring to any actual event nor a state of an event, e.g. descriptions of processes.
are all further subdivided into 25 fine-grained event types that refer to specific aspects of the
main categories, e.g. Reporting is subdivided into Reporting cases and Reporting situation.
The brief description of the main event categories is provided in Figure 1, whereas the one
for the fine-grained types is provided in Annex A in Figure 6. The event definitions are to a
large extent ‘inclusive’, e.g., the Support: goods category covers not only the factual provision
of goods to the afected people, but also plans and intents to do so, and expression of the needs
of those in need to receive such support.</p>
          <p>The Miscellaneous category is envisaged to capture everything that does not fit anywhere
else, and is subdivided into: (a) other events that are related to the domain, but do not fall under
any other type, (b) events that are unrelated, and (c) non events, e.g., descriptions of certain
generic processes and phenomena that are neither tailored in time nor refer to any specific
event instances, although relevant for the domain though. It is important to emphasize at this
stage that, in a practical set-up, a diferent merging and subdivision of Miscellaneous might be
more beneficial for ML modelling purposes; however, the main drive behind this subdivision
was to explore how well the 3 diferent fine-grained classes can be distinguished. Furthermore,
the Miscellaneous: Other category was deemed as relevant from end-user perspective, i.e.
constituting a source of providing ‘interesting’ information.</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>2.2. Data Sampling</title>
        <p>
          The input data for annotation was randomly sampled from news articles gathered by MEDISYS3,
a large-scale health-related news aggregation engine [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] from a period that spans 2016-2021.
Apart from conventional media sources, MEDISYS also monitors news on hundreds of oficial
public health websites such as ministry of health and public health agency websites.
10n(OR(economy, economic, economies, financial, unemployment, bankrupt, bankruptcy, unemployed),
        </p>
        <p>OR(pandemic, lockdown, disease, diseases, infection, infections, infectious, virus, viruses))</p>
        <p>
          News articles were sourced using keywords, and snippets were further extracted from them
by selecting up to max. first 4 sentences comprised within the first 500 characters of the
article4. The rationale behind considering the initial part of news articles was the assumption
of inverted-pyramid style [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] of writing news articles, i.e. the most relevant events are placed
in the beginning and the least important ones are left toward the end. First, news articles were
randomly sourced using a list of circa 800 infectious disease names5, e.g. Covid-19, ebola, zika,
malaria, etc., and relevant name variants and acronyms. Given that a large fraction of text
snippets acquired in this way fell under Miscellaneous in order to populate proportionally the
other classes in the taxonomy, an additional document sampling for each category was carried
out through the use of a more ‘focused’ combination of keywords (including synonyms) which
were required to be found within a specific text window anywhere in the body of a news article.
An example of such a keyword query is provided in Figure 2. This allowed to improve the
precision (i.e. ca. 50% of the fetched articles were reporting on events in the taxonomy that fall
into non-Miscellaneous categories). The potential bias that might have been introduced by the
use of specific keywords is mitigated by extracting text snippets only from the beginning of
articles, which do not necessarily contain any of the keywords of the query and instead use a
diferent wording to report on an event.
        </p>
        <p>In addition, circa 10% of text snippets were further manually selected from the news articles
to ensure the corpus is even more balanced.</p>
      </sec>
      <sec id="sec-3-3">
        <title>2.3. Data Annotation</title>
        <p>From the sampled text snippets described above, circa 4.5K were randomly selected for
annotation. 7 annotators were involved in this process, all of which had prior experience of annotating
news texts. 2 of the annotators have a background in NLP and computational linguistics, whereas
5 others were news analysts. Initially, circa 400 randomly selected snippets were annotated
by 5 annotators, who subsequently jointly resolved the conflicts. The main motivation behind
this part of the annotation process was to revise the event codebook comprising the event
definitions, which turned to be overlapping or incomplete to some degree. Next, the remainder
of the snippets was annotated, each by at least 2 annotators. Given the fact that annotations are
sets of labels, we have computed strict and loose Cohen’s  , where for the former an agreement
is considered only for identical label sets, whereas in the latter case, a non-empty overlap of the
label sets is considered an agreement. The average strict and loose  for a pair of annotators are</p>
        <sec id="sec-3-3-1">
          <title>4Some snippets are longer than 500 characters in order to respect sentence boundaries.</title>
          <p>5This list contains diseases considered as the most common public health threats created for the MEDISYS platform
for the purpose of retrieving relevant news articles.</p>
          <p>Amid rising vaccination rates across the European Union, the 27 EU leaders on Tuesday
committed to collectively donate at least 100 million doses of Covid-19 vaccine to countries in need by
the end of 2021. The bloc, which described itself in a joint statement signed of at summit .
0.59 and 0.63 resp. The conflict resolution in the annotations was jointly carried out by 2 to 4
annotators.</p>
          <p>An example of a news snippet annotated with two event labels is provided in Figure 3. Further
examples are provided in Figure 7 in Annex A.</p>
        </sec>
      </sec>
      <sec id="sec-3-4">
        <title>2.4. Data Statistics</title>
        <p>The corpus consists of 4441 text snippets, whose average length is 412 characters. The average
number of fine- and coarse-grained labels per snippet is 1.26 and 1.19, respectively. Concerning
ifne-grained labels, circa 77.1% of the snippets have only one such label assigned to them, whereas
the percentage of the snippets with 2, 3 and 4 labels are 19.65%, 2.9% and 0.34%, respectively.
The corpus is relatively well-balanced. The statistics for the coarse- and fine-grained labels
are provided in Table 1. The columns labelled with ‘Co-occurrence’ provide the percentage of
instances of the given class that are labelled with at least one other label. While this figure is
maximum 6.15% (Communication class) for the coarse-grained types, it can reach up to 20.56%
(Impact: displacement of people class) for the fine-grained types. The snippets labelled with
Miscellaneous do not co-occur with other labels by definition of the former. Figure 4 presents
the text snippet length histogram.</p>
        <p>Table 2 provides a list of most frequently co-occurring pairs of fine-grained event types, while
the complete event co-occurrence matrix is shown in Figure 8 in Annex A. Interestingly, the
two Reporting classes are the most co-occurring ones and co-occur most frequently together;
Measure: Authority Regulation and Impact: Health System tend to frequently co-occur with
them as well. The other Measure classes tend to co-occur with Measure: Authority Regulation.
Given the fact that Covid-19 has triggered a vast amount of news articles over the last 3 years,
a large part (more than 70%) of the snippets in the corpus are related to the Covid-19 pandemic.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>3. Benchmark Models</title>
      <p>
        We have evaluated two benchmark models, namely: (a) L2-regularized linear SVM 6 using the
One-vs-the-Rest strategy, with log TFIDF-weighted 3-5 character n-grams as features, using
vector normalization and  = 0.2 resulting from parameter optimization, and (b) RoBERTA
base [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], a transformer-based model, using a batch size of 32, learning rate of 2−5 and 100
warming steps with 5 training epochs.
6We used the Liblinear implementation provided in Scikit-learn library: https://scikit-learn.org/
      </p>
      <p>For the purpose of evaluation of these models, we use micro, macro, weighted and samples
 1 scores, where the latter is computed as an average of  1 scores computed for each pair of
sets of ground-truth and system-response labels for each instance in the training data. 5-fold
cross-validation was used.</p>
      <p>The overall results for both fine- and coarse-grained classification are provided in Table 3,
whereas the per-class performance of the benchmark models for the fine- and coarse-grained
scenarios is provided in Table 4 and 5, respectively.</p>
      <p>For both models, the overall performance shows little variation between the  1 measures.
The performance of RoBERTa vis-à-vis SVM is better in both the coarse- and the fine-grained
classification scenario, with improvements of up to 9 and 13 points in  1 measures, respectively.
 1 scores for benchmark models for fine- and coarse-grained event classification.
 1 scores for benchmark models per class for the
fine-grained event types.</p>
      <p>SVM</p>
      <p>RoBERTa</p>
      <p>As regards the models’ performance on individual classes, one can observe that, for both
SVM and RoBERTa, the three worst performing classes, namely, Impact: Other, Measure: Other,
Miscellaneous: Other, have almost all an  1 &lt; 0.45, and reduce the global  1 scores. The
performance behaviour might be linked to the more open and less-focused nature of the
definition of the Other classes.</p>
      <p>Studying the most common confusion between labels, when both classifier and ground truth
have only one label, shows (see Figure 5) that Miscellaneous has the most false positives and
that the classes Impact, Measure and Research have more false positives than all the other classes.</p>
    </sec>
    <sec id="sec-5">
      <title>4. Conclusions</title>
      <p>This paper briefly described the creation of a new corpus consisting of circa 4.5K news snippets
(multi-)labelled with fine-grained infectious disease-related event types and reported on the
performance of SVM- and transformer-based baseline models trained using the corpus. We
intend to enlarge the corpus in the future, in particular using snippets that cover a wider range
of diseases.</p>
      <p>The news event corpus, accompanied by the full-fledged Codebook and annotation guidelines
is publicly available at https://github.com/jpiskorski/infectious-diseases-events to the scientific
community for research purposes. All future extensions and updates of the corpus will be made
available under the same link.</p>
    </sec>
    <sec id="sec-6">
      <title>A. Supplementary corpus information</title>
      <p>The definition (in a simplified form) of the fine-grained event types related to infectious diseases
is provided in Figure 6. Some examples of news snippets annotated with these labels are
provided in Figure 7. Figure 8 presents event type co-occurrence matrix.</p>
      <p>Reporting cases: reporting on cases of infections, hospitalizations, deaths, recoveries of single persons and groups,
provision of updates thereon, which covers a short time span and specific location.</p>
      <p>Reporting situation: provision of updates on the overall situation of the outbreak, current total figures, observed trends,
forecast, which spans longer period of time, and also covers cross-regional and cross-country comparisons.
Impact: Displacement of people: reporting on movement of persons/groups that were either forced, obliged or
voluntarily fled or left their homes of places of habitual residence as a consequence of the spread of the infectious disease
and/or introduction of measures to combat the disease. Bringing back displaced people to the place of origin falls under
this category as well.</p>
      <p>Impact: Health system: covers events related to the impact the disease has on the health-care system, e.g. deployment
of additional staf, shortage of medical equipment, high bed occupancy rate, establishment of new facilities, etc.
Impact: Economy: covers events related to impact on the economy, e.g., decline/growth of certain sectors,
reducing/increasing production, gains/losses, unveiling studies on the analysis and prognosis of the economic situation.
Impact: Events: reporting on cancellation, postponement, and changing of modi operandi in the context of political,
sport, cultural and other mass events, etc.</p>
      <p>Impact: Other: reporting on other impacts of the disease, e.g., societal phenomena, political situation, future predictions,
etc.</p>
      <p>Measure: Authority Regulation/Recommendation: covers events related to the introduction of measures like, e.g.
law, formal regulations, restrictions, and recommendations by competent government authorities and international bodies
which are specifically put in place to decrease the number of infected/afected people and thwart further spread of the
disease.</p>
      <p>Measure: Facilities: covers closures of facilities (e.g. schools, universities, museums, parks) resulting from regulations
and/or situations, re-openings, changing related modi operandi, e.g. the introduction of teleworking, etc.
Measure: Travel: introduction of travel restrictions, recommendations, closure of borders, cancellation of flights, closure
of airports, provision of specific transportation means to facilitate travel, etc.</p>
      <p>Measure: Vaccine/Medicine Roll-out: covers events revolving around the roll-out of vaccines, medicines, equipment
to combat the disease or mitigate the consequences, and includes also events related to sharing experience, measure
hesitancy, anti-vax movements, etc.</p>
      <p>Measure: Other: covers any other events related to measures, resulting from non-governmental organization decisions,
private sector, e.g. linked to introduced laws and regulations.</p>
      <p>Violation: Restrictions and Unrest: covers violations against introduced laws, regulations, measures and potential
lockdowns, and protests against the introduced laws and measures.</p>
      <p>Violation: Fake product or Fraud: covers events related to unveiling or warning on fake medicine or any counterfeits,
falsified or substandard disease-related material/equipment being sold and/or distributed, and infectious disease-related
fraud.</p>
      <p>Violation: Misinformation: embraces events related to revealing misinformation incidents and attempts, and issuing
warnings about disease-related misinformation.</p>
      <p>Research &amp; Development: Medicine Progress: dissemination of information and updates on the progress of research
and development of medicines, vaccines and equipment to combat and/or protect against infectious diseases.
Research &amp; Development: Phenomena: reporting on research on specific phenomena observed in the context of
infectious diseases and findings which might potentially contribute to the development of medicines, vaccines, etc.
Research &amp; Development: Funding: raising funding, launching programmes and resources for R&amp;D of technologies
and materials related to fight infectious diseases.</p>
      <p>Communication: Meeting: covers oficial meetings, conferences and meetings, press conferences of authorities, states,
international organizations, task forces, experts, etc., to discuss topics related to the (outbreak of) infectious diseases and
related topics
Communication: Launch Instrument: reporting on new communication, information sharing and gathering
instruments and methods related to infectious diseases, e.g. online platforms, databases, smartphone apps, etc.
Support: Financial: launching, proposing and elaborating financial instruments to support afected people,
organizations, economy, etc., e.g. the introduction of changes in tax regulations to relieve the most vulnerable groups.
Support: Goods: providing afected people with goods, materials, and services to help and alleviate the problems resulting
from the outbreak of the disease.</p>
      <p>Miscellaneous: Other: is a placeholder to capture other events related to infectious diseases, which do not fall under any
of the above categories, e.g. recruitment of new experts by a company that develops infectious disease-related vaccines.
Miscellaneous: Unrelated: covers events that are not related to infectious diseases in any way.</p>
      <p>Miscellaneous: Non Events: covers texts that do not refer to any event that could be tailored to a particular point in
time, e.g. general descriptions of processes, etc.</p>
      <p>DUBAI, United Arab Emirates Dubai’s Expo 2020 world’s fair will be postponed to Oct. 1, 2021, over the new
coronavirus pandemic, a Paris-based body behind the events said Monday. The announcement by the Bureau
International des Expositions came just hours after police in Kuwait dispersed what they described as a riot by
stranded Egyptians unable to return home amid the coronavirus pandemic. The riot was the first reported sign of
unrest from the region’s vast population of foreign workers who have lost their jobs over the crisis
EVENTS: Impact: Events, Impact: Displacement of people, Violation: restrictions and unrest
The World Health Organization (WHO) has confirmed the first three cases of Zika virus disease in India. Health
Ministry oficials said Sunday that the three patients in western Gujarat state had recovered. ”There is no need to
panic,” Dr. Soumya Swaminathan, a top health ministry oficial, told reporters. The World Health Organization
said in a statement released Friday that the three cases that India reported to the WHO on May 15 were detected
through routine blood surveillance in a hospital in Ahmadabad, Gujarat’s capital”
EVENTS: Reporting: cases
The Gates Foundation will give Rotary $255 million, with Rotary pledging to raise $100 million, and the UK and
Germany contributing $150 million and $130 million respectively to the global initiative. It is the second
such grant from the foundation to Rotary International — in 2007, it gave Rotary a $100 million
grant for a polio eradication programme, which Rotary matched dollar for dollar. The new money
will go to vaccination programmes, better disease surveillance and research on new vaccines.</p>
      <p>EVENTS: Research: funding Support: financial
Warsaw (dpa) - Czech Prime Minister Andrej Babis said on Sunday that he would like residents over
the age of 60 to be able to register for a Covid-19 vaccination from March. The move would see the ofer of
vaccinations extended beyond the current priority groups of health care workers, nursing home residents and staf
and all citizens aged over 80.</p>
      <p>EVENTS: Measure: vaccine/medicine roll-out
Little air cleansers are digital gadgets that are utilized to tidy up the air by decreasing or removing interior toxins
such as germs, odours, smoke and chemicals that could be hazardous to the wellness. These small air purifier
cleansers have diferent types such as the HEPA air cleanser, ozone air cleanser, or the ionic air cleanser.
EVENTS: Miscellaneous: non event
RAI News 24 reports that, as of January 7, Italy will go back to the colour-coded system sub-dividing Regions on the
basis of Covid-19 restrictions. The government will decide on the colour zone for each Region on the basis of 21
Covid-19-related criteria. However, the Regions are calling on the government to revise these criteria. Meanwhile,
up until January 6, all of Italy will be in a red zone, meaning that bars and restaurants will stay closed
EVENTS: Measure: authority regulation, Measure: facilities</p>
      <p>Figure 8: Event type co-occurrence matrix.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>P.</given-names>
            <surname>Ghasiya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Okamura</surname>
          </string-name>
          ,
          <article-title>Investigating covid-19 news across four nations: A topic modeling and sentiment analysis approach</article-title>
          ,
          <source>IEEE Access 9</source>
          (
          <year>2021</year>
          )
          <fpage>36645</fpage>
          -
          <lpage>36656</lpage>
          . doi:
          <volume>10</volume>
          .1109/ACCESS.
          <year>2021</year>
          .
          <volume>3062875</volume>
          ,
          <string-name>
            <surname>publisher</surname>
            <given-names>Copyright</given-names>
          </string-name>
          : ©
          <year>2013</year>
          IEEE.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>T.</given-names>
            <surname>Marcoux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Agarwal</surname>
          </string-name>
          ,
          <source>Narrative Trends of COVID-19 Misinformation, in: Proceedings of the 4ℎ Workshop on Narrative Extraction From Texts (Text2Story</source>
          <year>2021</year>
          ),
          <article-title>held in conjunction with the 43</article-title>
          <source>European Conference on Information Retrieval (ECIR</source>
          <year>2021</year>
          ), Association for Computational Linguistics,
          <year>2021</year>
          , pp.
          <fpage>77</fpage>
          -
          <lpage>80</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A. E.</given-names>
            <surname>Varol</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Kocaman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. U.</given-names>
            <surname>Haq</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Talby</surname>
          </string-name>
          , Understanding covid-19
          <source>news coverage using medical nlp</source>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>R.</given-names>
            <surname>Chandra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Krishna</surname>
          </string-name>
          , Covid
          <article-title>-19 sentiment analysis via deep learning during the rise of novel cases</article-title>
          ,
          <source>PLOS ONE 16</source>
          (
          <year>2021</year>
          )
          <fpage>1</fpage>
          -
          <lpage>26</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S.</given-names>
            <surname>Zong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Baheti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ritter</surname>
          </string-name>
          ,
          <article-title>Extracting a knowledge base of covid-19 events from social media</article-title>
          ,
          <year>2020</year>
          . URL: https://arxiv.org/abs/
          <year>2006</year>
          .02567. doi:
          <volume>10</volume>
          .48550/ARXIV.
          <year>2006</year>
          .
          <volume>02567</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>C.</given-names>
            <surname>Cheng</surname>
          </string-name>
          , J.
          <string-name>
            <surname>Barceló</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Hartnett</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Kubinec</surname>
          </string-name>
          , L. Messerschmidt, COVID-19
          <source>Government Response Event Dataset (CoronaNet v.1.0)</source>
          ,
          <source>Nat Hum Behav</source>
          .
          <volume>4</volume>
          (
          <year>2020</year>
          )
          <fpage>756</fpage>
          -
          <lpage>768</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>B.</given-names>
            <surname>Min</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Rozonoyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Qiu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Zamanian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Xue</surname>
          </string-name>
          , J. MacBride, ExcavatorCovid:
          <article-title>Extracting events and relations from text corpora for temporal and causal analysis for COVID-19</article-title>
          , in:
          <source>Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations</source>
          ,
          <article-title>Association for Computational Linguistics, Online</article-title>
          and
          <string-name>
            <given-names>Punta</given-names>
            <surname>Cana</surname>
          </string-name>
          , Dominican Republic,
          <year>2021</year>
          , pp.
          <fpage>63</fpage>
          -
          <lpage>71</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>R.</given-names>
            <surname>Misra</surname>
          </string-name>
          , News category dataset,
          <year>2018</year>
          . URL: https://www.kaggle.com/datasets/rmisra/ news-category-dataset.
          <source>doi:10.13140/RG.2.2.20331.18729.</source>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>T.</given-names>
            <surname>Nugent</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Petroni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Raman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Carstens</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Leidner</surname>
          </string-name>
          ,
          <article-title>A comparison of classification models for natural disaster and critical event detection from news</article-title>
          ,
          <source>in: 2017 IEEE International Conference on Big Data (Big Data)</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>3750</fpage>
          -
          <lpage>3759</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>E.</given-names>
            <surname>Lefever</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Hoste</surname>
          </string-name>
          ,
          <article-title>A classification-based approach to economic event detection in Dutch news text</article-title>
          ,
          <source>in: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)</source>
          ,
          <source>European Language Resources Association (ELRA)</source>
          , Portorož, Slovenia,
          <year>2016</year>
          , pp.
          <fpage>330</fpage>
          -
          <lpage>335</lpage>
          . URL: https://aclanthology.org/L16-1051.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>J.</given-names>
            <surname>Haneczok</surname>
          </string-name>
          , G. Jacquet,
          <string-name>
            <given-names>J.</given-names>
            <surname>Piskorski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Stefanovitch</surname>
          </string-name>
          ,
          <article-title>Fine-grained event classification in news-like text snippets - shared task 2</article-title>
          ,
          <source>CASE 2021, in: Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE</source>
          <year>2021</year>
          ),
          <article-title>Association for Computational Linguistics</article-title>
          , Online,
          <year>2021</year>
          , pp.
          <fpage>179</fpage>
          -
          <lpage>192</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>R.</given-names>
            <surname>Saurí</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Littman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Knippen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Gaizauskas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Setzer</surname>
          </string-name>
          , J. Pustejovsky,
          <article-title>TimeML annotation guidelines</article-title>
          , https://www.researchgate.net/publication/248737128_TimeML_Annotation_ Guidelines_Version_
          <fpage>121</fpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>J.</given-names>
            <surname>Linge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Steinberger</surname>
          </string-name>
          , T. Weber,
          <article-title>Internet surveillance systems for early alerting of health threats</article-title>
          ,
          <source>Euro Surveill</source>
          <volume>14</volume>
          (
          <year>2009</year>
          )
          <fpage>1</fpage>
          -
          <lpage>2</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>J.</given-names>
            <surname>Canavilhas</surname>
          </string-name>
          ,
          <article-title>Web journalism : from the inverted pyramid to the tumbled pyramid</article-title>
          , https://www.bocc.ubi.pt/pag/canavilhas-joao
          <article-title>-inverted-pyramid</article-title>
          .pdf,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>A.</given-names>
            <surname>Conneau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Khandelwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Goyal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Chaudhary</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Wenzek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Guzmán</surname>
          </string-name>
          , E. Grave,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ott</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zettlemoyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Stoyanov</surname>
          </string-name>
          ,
          <article-title>Unsupervised cross-lingual representation learning at scale</article-title>
          , arXiv preprint arXiv:
          <year>1911</year>
          .
          <volume>02116</volume>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>