<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Detection of Adverse Drug Events from Social Media Texts - Research Project Overview</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Simone Scaboro</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Beatrice Portelli</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giuseppe Serra</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Mathematics</institution>
          ,
          <addr-line>Computer Science and Physics</addr-line>
          ,
          <institution>University of Udine</institution>
          ,
          <addr-line>Udine UD 33100, IT</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper presents an overview of the current research project on the detection of Adverse Drug Events from social media texts led by the Artificial Intelligence Laboratory of Udine (AILAB Udine). In the latest years, patients started reporting Adverse Drug Events (ADEs) on social media and similar online outlets, making it necessary to monitor them for pharmacovigilance purposes. For this reason, systems for the automatic extraction of ADEs are becoming an important research topic in the Natural Language Processing community. In this paper we present our research project, focused on the Detection, Extraction and Normalization of ADEs, detailing its objectives, achievements and future directions.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Adverse Drug Events</kwd>
        <kwd>Social Media</kwd>
        <kwd>Deep Learning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Each year, dozens of new drugs and medical compounds are developed, tested and released in
the market. As an example, in 2021, the Food and Drug Administration (FDA) approved the use
of 50 new drugs [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], while the European Medicines Agency (EMA) recommended 92 medicines
for marketing authorization [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. These regulators are in charge of supervising the process that
leads to the release of these new medicines, authorizing only the products that are deemed safe.
For this reason, new drugs are approved only after rigorous medical trials, which aim to prove
their therapeutic eficacy and their safety. In particular, regulatory authorities are interested
in documenting possible Adverse Drug Events (ADE), that are unexpected reactions or efects
related to the correct use of a medicine.
      </p>
      <p>However, unexpected ADEs might emerge once the new medication is used by a larger
population. This is why pharmaceutical and governance agencies have looked into new ways
to detect these events in large populations, creating Pharmacovigilance programs.</p>
      <p>
        Traditionally, the process to collect ADEs relies on formal reporting methods, based on the
communication between patients, healthcare providers, pharmaceutical companies, and local
PV authorities. ADEs can also be extracted (either manually of automatically) from formal
medical documents, such as Electronic Health Records (EHR) (see [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] for a recent overview).
      </p>
      <p>
        Recently, a lot of efort has been put into applying AI models to automatically extract useful
information. In particular, user-generated content such as social media posts present a great
opportunity: an increasing number of patients discuss their health on forums and
microblogging platforms, including details on their physical and mental health, as well as feedback
on medications and medical procedures. AI systems could be used to analyze this health-related
chatter on social media, extract useful information, and perform automatic tasks including ADE
detection [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>However, social media texts introduce a series of new challenges, such as dealing with
highly informal speech and the presence of layman terms, typos, and linguistic phenomena like
humour, irony, speculations and negations that could afect the meaning of the message. These
characteristics make the texts dificult to understand for current natural language understanding
techniques, which often rely on models pretrained on large general-domain corpora based on
Wikipedia, English literature or other resources. These models have an hard time adapting to
the informal language of social media, as they are not used to encounter typos and they also
need to learn new contextual meanings for the words used in metaphors and slang. To tackle
these problems, a growing branch of the Natural Language Processing (NLP) community has
focused on the extraction of ADE mentions from social media texts, encouraging research in
this field through conferences and dedicated challenges. One of the most significant examples
is the Social Media Mining for Health Applications (SMM4H) Shared Task, which is co-located
with top-tier international conferences such as ACL and COLING.</p>
      <p>Our international research team led by the Artificial Intelligence Laboratory of Udine (AILAB
Udine)1 is working on an ongoing research project focused on deep learning methods for
automatic handling of ADE mentions in social media texts. In this paper we present a summary
of our activities and the future directions of our work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Current Research Project</title>
      <p>Our research project stemmed from the growing importance of social media in the field of
digital pharmacovigilance. The aim of pharmacovigilance systems is to identify Adverse Drug
Events, document them and analyze them. Traditionally, this task has been broken down into
three sub-tasks (see Figure 1):
1. ADE Detection: the binary classification of pieces of text as either containing or not
containing a possible ADE. This initial step is especially needed when working with social
media posts, due to the large volume of input data.
2. ADE Extraction: the extraction of the exact ADE mention(s) from a given texts. During
this step, the model is given a full text and is expected to return only the excerpt which is
relevant to describe the ADE.
3. ADE Normalization: the ADE extracted at the previous step, which is usually written
in informal or colloquial language, is mapped to a formal term belonging to a medical
ontology. This is the last step of process is necessary to identify precisely what kind of
reaction was caused by the drug, and allows medical professionals to perform further
analyses on the aggregated data.
Unilateral
neck swelling
prior to
tsen sNtuarbteiqnag.the
m
cou I've got the
d Mirena, I've
tpu experienced
in cramping
and a little
bit of
bleeding.</p>
      <p>original text
I've got the
Mirena, I've
experienced
cramping
and a little
bit of
bleeding.</p>
      <p>ADE
Detection</p>
      <p>ADE
Extraction</p>
      <p>ADEs
cramping
a little bit
of bleeding</p>
      <p>ADE
Normalization</p>
      <p>MedDRA</p>
      <p>terms
Abdominal
pain lower</p>
      <p>Genital
haemorrhage</p>
      <p>Among the three tasks, Extraction and Normalization are the ones on which the community
has focused more in the latest years, as they present multiple challenges connected to the
peculiarities of informal social media language. Additionally, texts mined from diferent sources,
such as internet forums, tweets, and reddit threads, might have diferent linguistic characteristics,
which make it harder to build a unique system that works for all of them.</p>
      <p>Our research group briefly tackled the task of ADE Detection, to then focus mainly on the
ADE Extraction, its limits and its applications. Finally, we recently tackled the problem of ADE
Normalization, with interesting findings on how to improve the generalization capabilities of
models on large ontologies.</p>
      <sec id="sec-2-1">
        <title>2.1. Materials and Methods</title>
        <p>
          Most of the resources and datasets used in our research are publicly available. The datasets
used in our works are widely used benchmarks for ADE-related tasks on social media texts.
The following is a list of the main resources used in the projects:
• CADEC [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. Public dataset containing 1,250 posts from the health-related forum
“AskaPatient”. Each post is annotated for the presence of ADEs, and each ADE is mapped to the
corresponding medical term in various medical ontologies (MedDRA and SNOMED CT).
• SMM4H’19 [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], SMM4H’20 [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] and SMM4H’22 [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. Public datasets for the following
shared tasks: SMM4H 2019 - Task 1–3, SMM4H 2020 - Task 3, SMM4H 2022 - Task 1.
They contains 2,000 to 23,000 tweets which mention at least one drug name. A portion of
the tweet contain one or more ADEs, which are also mapped to a term (Preferred Term
or Lowest Level Term) in the MedDRA ontology.
        </p>
        <p>The texts contained in the SMM4H datasets are short, highly informal and rich of typos
and/layman terms. Diferently, the forum posts contained in CADEC are longer (up to 5 times
the length of the tweets), more descriptive and sometimes contain more precise medical terms,
as they were posted on an health-related forum. Using datasets with diferent textual styles
allows us to test and develop models in diferent scenarios, working towards the creation of
models that can deal with varying levels of informality within texts.</p>
        <p>
          Most of the systems we developed built upon Transformer-based models which use BERT
variants [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. These large language models proved to be extremely efective in several NLP
tasks, including ADE Detection/Extraction from social media texts. Moreover, over the years
researchers developed several BERT variants pre-trained on medical texts which hold useful
in-domain knowledge for the these tasks (e.g., PubMedBERT [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]).
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. ADE Detection</title>
        <p>As previously mentioned, our group briefly touched on the topic of ADE Detection, mainly to
test the capabilities of current BERT-based models on the task, and to have an in-house system
ready to use for future tasks.</p>
        <p>We developed a simple BERT-based model the binary classification of texts as containing or
not containing an ADE. It consists in a bert-base-uncased model with a binary classification
head. If the text contains more than one sentence, it is split in single sentences and each one is
evaluated for the presence of ADEs. If at least one of the sentences is labeled as positive, the
whole sample is marked as positive for the presence of ADEs.</p>
        <p>In Table 1, we compared the performance of this model with the best systems reported in
literature on two datasets: SMM4H’19 and CADEC. The model reaches a good performance on
SMM4H’19, surpassing in Precision the two best models which took part in the shared task.
On CADEC, our system largely outperforms previously reported results in Recall and F1-score.
Since most of the texts in CADEC are composed by several sentences, with an average of 5
sentences per sample, we believe that splitting them and evaluating them separately is what
contributed the most to the increase in performance.</p>
        <p>In following works, we empirically replaced the BERT module with other BERT variants
pre-trained on medical texts (e.g., PubMedeBERT) to get small performance boosts in specific
tasks and datasets.</p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. ADE Extraction</title>
        <p>
          We addressed the task of ADE extraction as a tagging task (or Named Entity Recognition task):
given a text, our goal is to identify whether it contains an ADE and its precise span. To this
end, we applied for the first time SpanBERT [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ], a BERT variant specialized in multi-token
text spans, such as those describing ADE. In our experiments [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ], we use a SpanBERT model
to generate one embedding for each word in the input text. The embeddings of each word are
then fed to a linear layer to output a 3-class BIO label, using the Begin-Inside-Outside tagging
scheme employed for NER tasks. The B label marks the beginning of an entity, while the I
label is used to mark the following tokens in a multi-token entity. Figure 2 (left) illustrates this
architecture. We also experimented the combination of SpanBERT with a Conditional Random
Field (CRF) [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] module to further de-noise its output. The CRF is applied to the output of the
linear layer, producing another set of BIO labels, as represented in Figure 2 (right).
        </p>
        <p>Input text
(as a sequence of tokens)</p>
        <p>Input text
(as a sequence of tokens)
BERT-based</p>
        <p>Model</p>
        <p>Token-level labels</p>
        <p>O
B
O
I
I
B
O
O</p>
        <p>BERTbased
Model</p>
        <p>CRF</p>
        <p>Token-level labels</p>
        <p>O
B
O
I
I
B
O
O</p>
        <p>The performance of the system was evaluated on SMM4H’19 and CADEC. The results we
of our experiments show that this approach obtains competitive performance compared to
other models that took part in the SMM4H’19 task, and also other BERT variants. Moreover
SpanBERT+CRF outperforms all the other approaches on both datasets.</p>
        <p>The system we developed in currently at the first place on the public leaderboard of the
SMM4H’19 ADE Extraction task.2</p>
        <sec id="sec-2-3-1">
          <title>2.3.1. Application: A Web Platform to Monitor Adverse Reactions to COVID-19</title>
        </sec>
        <sec id="sec-2-3-2">
          <title>Vaccines on Twitter.</title>
          <p>
            The development of our ADE Extraction system coincided with the start of the COVID-19
vaccination campaigns at the end of 2020. The difusion of COVID-19 vaccines through massive
vaccination campaigns brought a lively debate on social media, with thousands of public
discussions about possible ADEs. This was the perfect environment to test the ability of an
automatic system for ADE Extraction, which could also provide insightful information to general
users. In this context, our international research team developed a web platform3 for monitoring
English tweets about COVID-19 vaccines [
            <xref ref-type="bibr" rid="ref18">18</xref>
            ].
          </p>
          <p>The objective of the platform is to collect and process tweets, providing visual representations
of the information extracted. This is possible thanks to a set of modules (shown in Figure 3)
that interact with each other. These modules include the ADE Extraction module described
in the previous section. This extracted information is then aggregated and visualized through
2https://competitions.codalab.org/competitions/20798, Sub-Task 2, Post-evaluation
3http://ailab.uniud.it/covid-vaccines/
an interactive word-cloud that can be filtered by date and vaccine name, permitting the user
to investigate of trends across time and vaccines. In fact, thanks to this tool, is possible to
monitor the tweets that mention a specific drug and obtain an overview of the possible side
efects reported by the users, their frequency, and the original texts from which the ADEs
were extracted. Despite the promising performance of the model on benchmark datasets, it is
important to highlight that the data produced by these current automatic systems still need an
expert’s supervision to be validated, and should not be used without human revision. However,
they are a powerful tool to redirect the experts’ attention to potentially interesting phenomena.</p>
          <p>As previously mentioned, the portal is also enriched with other modules that give diferent
information on the population of users, such as their geographical location, an important piece
of information which could be used to identify patterns in ADE reports. The system also
analyzes the URLs that the users share together with information about the vaccines, which is
important to take into consideration in the fight against misinformation.</p>
          <p>The web portal proved to be useful to perform some analyses on social media chatter
surrounding the AstraZeneca vaccines, and it is still actively processing data.</p>
        </sec>
        <sec id="sec-2-3-3">
          <title>2.3.2. Focus: The Efect of Negations and Speculation on ADE Extraction</title>
          <p>
            Analyzing the incorrect outputs of our ADE Extraction system and other baselines systems
we discovered that a great number of False Positives (incorrectly extracted ADEs) were caused
by the presence of particular linguistic phenomena, such as irony, negations and speculations.
These phenomena are pervasive in social media texts, and they seriously hamper the ability of
an automated system to discriminate between factual and nonfactual statements. Therefore, we
studied methods to develop more robust models for ADE Extraction in face of negations and
speculations [
            <xref ref-type="bibr" rid="ref19 ref20">19, 20</xref>
            ].
          </p>
          <p>In our research, we took into consideration some systems for ADE detection on social media
texts and introduced SNAX, a benchmark dataset to test their performance against negations
and speculations on ADE Extraction from social media texts. The dataset is composed by a set of
samples (tweets) belonging to four categories: tweets containing ADEs, tweets not containing
ADEs, tweets that explicitly negate the presence of an ADE, and tweets that speculate an ADE.
We then introduced two possible strategies to increase the robustness of these models:
• data augmentation: the use of artificially negated and/or speculated samples during the
training of the model;
• negation/speculation detection module: the use of a negation and/or speculation module
as part of the ADE Extraction pipeline;</p>
          <p>All the baseline ADE Extraction models that we analyzed are not robust against negations
and speculations. The two strategies that we introduced successfully lower the number of False
Positive predictions of the baseline models. However, they have the drawback of lowering the
Recall of the models, which is especially noticeable when using the second strategy. Combining
both strategies leads to the best results in terms of FP reduction but exacerbates the drop in
Recall, so it is not recommended in pracical scenarios.</p>
          <p>With these works, we proved the lack of robustness of the current state-of-the-art models for
the ADE Extraction task, providing a challenging setting to test future models. Furthermore, we
introduced two strategies to address the problem, highlighting the improvements they achieved
and also their weaknesses. In conclusion, we believe that these kinds of studies, which explore
the lack of robustness against diferent natural language phenomena, should be the backbone
of the analysis of models that deal with informal language.</p>
        </sec>
      </sec>
      <sec id="sec-2-4">
        <title>2.4. ADE Normalization</title>
        <p>Currently, our research project is focused on ADE Normalization, that is mapping the
extracted mentions to large formal medical ontologies (e.g., MedDRA, containing over 24K unique
Preferred Terms).</p>
        <p>
          We [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ] recently took part in the SMM4H’22 Shared Task for ADE Normalization, using a
system based on GPT-2, a text generation model. It ranked third in the task [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ], reaching 76%
normalization accuracy.
        </p>
        <p>However, one of the main challenges of this task is the high cardinality of the output space,
and long tail distribution of the labels in available datasets. This is because current datasets
for this task contain at most 5,000 samples, covering only 200-500 of the possible 24K output
classes, and the label distribution is highly skewed towards few very frequent terms. For this
reason, the models usually perform well on examples that are seen in the training set, but are
unable to generalize on rare or unseen ones.</p>
        <p>
          Therefore, we focused our research on ways to overcome the long tail distribution of the
training terms and increase the generalization capabilities of ADE Normalization models. We
proposed “Ontology Pretraining” (OP) [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ], a pre-training strategy based on the hierarchical
nature of the MedDRA medical ontology, that improves the generalization capabilities of large
language models. This pretraining step injects domain knowledge about all the output labels of
the ontology before performing a classical fine-tuning using just the few labels present in the
actual training dataset. The OP strategy uses the Lowest Level Terms (LLT) present in MedDRA
and teaches the ADE Normalization model to map them to MedDRA Preferred Terms (PT), our
actual output labels. LLTs sub-categories of PTs, and are more informal names, which make
them stylistically closer to ADEs. Therefore, a model trained with OP will have an advantage
when presented with ADEs as input.
        </p>
        <p>We performed several experiments on CADEC, SMM4H’20 and a proprietary dataset, using
several ADE Normalization models. The experiments show that using the OP strategy before
classical fine-tuning drastically improves the generalization abilities of all models without
damaging their performance on seen concepts. The models trained using OP also showed promising
results for zero-shot cross-dataset normalization, which is interesting in the perspective of
developing a model that can work well across diferent text typologies.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Conclusions and Future Directions</title>
      <p>Our research group tackled the tasks of ADE Detection, Extraction and Normalization on social
media texts. The results consisted in the development of efective deep learning models for the
tasks, some of which were also applied to create an online web platform for ADE monitoring.</p>
      <p>The activities also highlighted several limitations of current models, which still require further
exploration. For example, the performance of ADE Detection models is still scarce on highly
informal texts, such as tweets. However, these are kind of texts for which the community
needs a robust ADE Detection model, as they could become the input of real-time digital
pharmacovigilance systems. Moreover, ADE Extraction models still lack the ability to deal with
some pervasive linguistic phenomena (e.g., irony and humor), and the proposed solutions of
negation/speculation detection could be improved to reduce drops in recall. Right now, ADE
Extraction models cannot be blindly applied to real-time streams of social media posts, as
they lack to ability to distinguish personal ADE reports from news pieces and second-hand
reports (i.e., recounting what happened to another person). Even more importantly, they
are not able to distinguish real ADE reports from maliciously-constructed posts that spread
misinformation. Therefore, there is a need to look into ways to carefully integrate them in online
pharmacovigilance activities, with the aid of other systems and expert human supervision.</p>
      <p>As regards the future directions of this project, one of the top priorities is extending the
current systems to other languages. SMM4H recently introduced small datasets in Spanish,
French and Russian, but there is still a lack in resources and models for languages other than
English. Medical ontologies, on the other hand, come with several oficial translations, so it
would be interesting to explore the idea of leveraging them to solve other tasks. We also plan to
focus more on term normalization, incorporating diferent ontologies (e.g., SNOMED CT and
UMLS). This will help in two ways: by adding new complementary knowledge to train the AI
systems, and by creating more bridges between the outputs of the NLP systems and the formal
medical world. Finally, it would be interesting to move towards to extraction and normalization
of other medical entities in informal texts, such as drugs and diseases.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>B.</given-names>
            <surname>G. de la Torre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Albericio</surname>
          </string-name>
          ,
          <article-title>The pharmaceutical industry in 2021. an analysis of fda drug approvals from the perspective of molecules</article-title>
          ,
          <source>Molecules</source>
          <volume>27</volume>
          (
          <year>2022</year>
          ). doi:
          <volume>10</volume>
          .3390/ molecules27031075.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>European</given-names>
            <surname>Medicines</surname>
          </string-name>
          <string-name>
            <surname>Agency</surname>
          </string-name>
          ,
          <source>Human medicines highlights</source>
          <year>2021</year>
          , https://www.ema.europa. eu/documents/report/human-medicines-highlights
          <article-title>-2021_en</article-title>
          .pdf,
          <year>2022</year>
          . Accessed:
          <fpage>2022</fpage>
          -10- 07.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C.</given-names>
            <surname>Feng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Le</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. B. McCoy</surname>
          </string-name>
          ,
          <article-title>Using Electronic Health Records to Identify Adverse Drug Events in Ambulatory Care: A Systematic Review</article-title>
          ,
          <source>Applied Clinical Informatics</source>
          <volume>10</volume>
          (
          <year>2019</year>
          )
          <fpage>123</fpage>
          -
          <lpage>128</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>H.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. C.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <article-title>Using health-consumer-contributed data to detect adverse drug reactions by association mining with temporal analysis</article-title>
          ,
          <source>ACM Trans. Intell. Syst. Technol</source>
          .
          <volume>6</volume>
          (
          <year>2015</year>
          ). URL: https://doi.org/10.1145/2700482. doi:
          <volume>10</volume>
          .1145/2700482.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S.</given-names>
            <surname>Karimi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Metke-Jimenez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kemp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Cadec: A Corpus of Adverse Drug Event Annotations</article-title>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Biomed</surname>
          </string-name>
          . Inform.
          <volume>55</volume>
          (
          <year>2015</year>
          )
          <fpage>73</fpage>
          -
          <lpage>81</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>D.</given-names>
            <surname>Weissenbacher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sarker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Magge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Daughton</surname>
          </string-name>
          ,
          <string-name>
            <surname>K. O'Connor</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Paul</surname>
          </string-name>
          , G. Gonzalez,
          <article-title>Overview of the Social Media Mining for Health (SMM4H) Shared Tasks at ACL 2019</article-title>
          ,
          <source>in: Proceedings of the ACL Workshop on Social Media Mining for Health Applications</source>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>G.</given-names>
            <surname>Gonzalez-Hernandez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Z.</given-names>
            <surname>Klein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Flores</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Weissenbacher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Magge</surname>
          </string-name>
          ,
          <string-name>
            <surname>K. O'Connor</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Sarker</surname>
            ,
            <given-names>A.-L.</given-names>
          </string-name>
          <string-name>
            <surname>Minard</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Tutubalina</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          <string-name>
            <surname>Miftahutdinov</surname>
            ,
            <given-names>I. Alimova</given-names>
          </string-name>
          ,
          <source>Proceedings of the COLING Social Media Mining for Health Applications Workshop &amp; Shared Task</source>
          ,
          <year>2020</year>
          . URL: https://aclanthology.org/
          <year>2020</year>
          .smm4h-
          <fpage>1</fpage>
          .0.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>D.</given-names>
            <surname>Weissenbacher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Banda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Davydova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Estrada</given-names>
            <surname>Zavala</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. Gasco</given-names>
            <surname>Sánchez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Ge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Klein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Krallinger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Leddin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Magge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Rodriguez-Esteban</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sarker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          , E. Tutubalina,
          <string-name>
            <surname>G.</surname>
          </string-name>
          <article-title>Gonzalez-Hernandez, Overview of the seventh social media mining for health applications (#SMM4H) shared tasks at COLING 2022</article-title>
          ,
          <source>in: Proceedings of The Seventh Workshop on Social Media Mining for Health Applications</source>
          , Workshop &amp; Shared Task, Association for Computational Linguistics, Gyeongju, Republic of Korea,
          <year>2022</year>
          , pp.
          <fpage>221</fpage>
          -
          <lpage>241</lpage>
          . URL: https://aclanthology.org/
          <year>2022</year>
          .smm4h-
          <fpage>1</fpage>
          .
          <fpage>54</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>J.</given-names>
            <surname>Devlin</surname>
          </string-name>
          , M.-
          <string-name>
            <given-names>W.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Toutanova</surname>
          </string-name>
          , BERT:
          <article-title>Pre-training of Deep Bidirectional Transformers for Language Understanding</article-title>
          ,
          <source>in: Proceedings of NAACL</source>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Gu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Tinn</surname>
          </string-name>
          , H. Cheng, M. Lucas,
          <string-name>
            <given-names>N.</given-names>
            <surname>Usuyama</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Naumann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Poon</surname>
          </string-name>
          ,
          <article-title>Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing</article-title>
          , arXiv preprint arXiv:
          <year>2007</year>
          .
          <volume>15779</volume>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Qin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Yan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Tang</surname>
          </string-name>
          , HITSZ-ICRC:
          <article-title>A report for SMM4H shared task 2019-automatic classification and extraction of adverse efect mentions in tweets, in: Proceedings of the Fourth Social Media Mining for Health Applications (#SMM4H) Workshop</article-title>
          &amp; Shared Task, Association for Computational Linguistics, Florence, Italy,
          <year>2019</year>
          , pp.
          <fpage>47</fpage>
          -
          <lpage>51</lpage>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>W19</fpage>
          -3206.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>T.</given-names>
            <surname>Ellendorf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Furrer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Colic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Aepli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Rinaldi</surname>
          </string-name>
          ,
          <article-title>Approaching SMM4H with merged models and multi-task learning</article-title>
          ,
          <source>in: Proceedings of the Fourth Social Media Mining for Health Applications (#SMM4H) Workshop</source>
          &amp; Shared Task, Association for Computational Linguistics, Florence, Italy,
          <year>2019</year>
          , pp.
          <fpage>58</fpage>
          -
          <lpage>61</lpage>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>W19</fpage>
          -3208.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>I.</given-names>
            <surname>Alimova</surname>
          </string-name>
          , E. Tutubalina,
          <article-title>Automated detection of adverse drug reactions from social media posts with machine learning</article-title>
          , in: W. M.
          <string-name>
            <surname>van der Aalst</surname>
            ,
            <given-names>D. I.</given-names>
          </string-name>
          <string-name>
            <surname>Ignatov</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Khachay</surname>
            ,
            <given-names>S. O.</given-names>
          </string-name>
          <string-name>
            <surname>Kuznetsov</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Lempitsky</surname>
            ,
            <given-names>I. A.</given-names>
          </string-name>
          <string-name>
            <surname>Lomazova</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Loukachevitch</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Napoli</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Panchenko</surname>
            ,
            <given-names>P. M.</given-names>
          </string-name>
          <string-name>
            <surname>Pardalos</surname>
            ,
            <given-names>A. V.</given-names>
          </string-name>
          <string-name>
            <surname>Savchenko</surname>
          </string-name>
          , S. Wasserman (Eds.),
          <source>Analysis of Images, Social Networks and Texts</source>
          , Springer International Publishing, Cham,
          <year>2018</year>
          , pp.
          <fpage>3</fpage>
          -
          <lpage>15</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>I.</given-names>
            <surname>Alimova</surname>
          </string-name>
          , E. Tutubalina,
          <article-title>Detecting adverse drug reactions from biomedical texts with neural networks</article-title>
          ,
          <source>in: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop</source>
          , Association for Computational Linguistics, Florence, Italy,
          <year>2019</year>
          , pp.
          <fpage>415</fpage>
          -
          <lpage>421</lpage>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>P19</fpage>
          -2058.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>M.</given-names>
            <surname>Joshi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. S.</given-names>
            <surname>Weld</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zettlemoyer</surname>
          </string-name>
          ,
          <string-name>
            <surname>O. Levy,</surname>
          </string-name>
          <article-title>SpanBERT: Improving Pre-training by Representing and Predicting Spans, Transactions of the Association for Computational Linguistics 8 (</article-title>
          <year>2020</year>
          )
          <fpage>64</fpage>
          -
          <lpage>77</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>B.</given-names>
            <surname>Portelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Passabì</surname>
          </string-name>
          , E. Lenzi,
          <string-name>
            <given-names>G.</given-names>
            <surname>Serra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Santus</surname>
          </string-name>
          , E. Chersoni,
          <source>Improving Adverse Drug Event Extraction with SpanBERT on Diferent Text Typologies</source>
          , Springer International Publishing, Cham,
          <year>2022</year>
          , pp.
          <fpage>87</fpage>
          -
          <lpage>99</lpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>030</fpage>
          -93080-6\_8.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>J.</given-names>
            <surname>Laferty</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mccallum</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Pereira</surname>
          </string-name>
          , Conditional Random Fields:
          <article-title>Probabilistic Models for Segmenting and Labeling Sequence Data</article-title>
          ,
          <source>in: Proc. of ICML</source>
          <year>2001</year>
          , Morgan Kaufmann Publishers Inc., San Francisco, CA, USA,
          <year>2001</year>
          , pp.
          <fpage>282</fpage>
          --
          <lpage>289</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>B.</given-names>
            <surname>Portelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Scaboro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Tonino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Chersoni</surname>
          </string-name>
          , E. Santus, G. Serra,
          <article-title>Monitoring user opinions and side efects on covid-19 vaccines in the twittersphere: Infodemiology study of tweets</article-title>
          ,
          <source>J Med Internet Res</source>
          <volume>24</volume>
          (
          <year>2022</year>
          )
          <article-title>e35115</article-title>
          . doi:
          <volume>10</volume>
          .2196/35115.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>S.</given-names>
            <surname>Scaboro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Portelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Chersoni</surname>
          </string-name>
          , E. Santus,
          <string-name>
            <surname>G.</surname>
          </string-name>
          <article-title>Serra, NADE: A benchmark for robust adverse drug events extraction in face of negations</article-title>
          , in: Proceedings of the Seventh Workshop on Noisy User-generated
          <string-name>
            <surname>Text</surname>
          </string-name>
          (
          <article-title>W-NUT</article-title>
          <year>2021</year>
          ),
          <article-title>Association for Computational Linguistics</article-title>
          , Online,
          <year>2021</year>
          , pp.
          <fpage>230</fpage>
          -
          <lpage>237</lpage>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <year>2021</year>
          .wnut-
          <volume>1</volume>
          .
          <fpage>26</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>S.</given-names>
            <surname>Scaboro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Portelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Chersoni</surname>
          </string-name>
          , E. Santus, G. Serra,
          <article-title>Increasing adverse drug events extraction robustness on social media: case study on negation and speculation</article-title>
          ,
          <year>2022</year>
          . doi:
          <volume>10</volume>
          .48550/ARXIV.2209.02812.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>B.</given-names>
            <surname>Portelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Scaboro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Chersoni</surname>
          </string-name>
          , E. Santus, G. Serra, AILAB-Udine@
          <article-title>SMM4H'22: Limits of transformers and BERT ensembles</article-title>
          ,
          <source>in: Proceedings of The Seventh Workshop on Social Media Mining for Health Applications</source>
          , Workshop &amp; Shared Task, Association for Computational Linguistics, Gyeongju, Republic of Korea,
          <year>2022</year>
          , pp.
          <fpage>130</fpage>
          -
          <lpage>134</lpage>
          . URL: https://aclanthology.org/
          <year>2022</year>
          .smm4h-
          <fpage>1</fpage>
          .
          <fpage>36</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>B.</given-names>
            <surname>Portelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Scaboro</surname>
          </string-name>
          , E. Santus,
          <string-name>
            <given-names>H.</given-names>
            <surname>Sedghamiz</surname>
          </string-name>
          , E. Chersoni, G. Serra,
          <article-title>Generalizing over Long Tail Concepts for Medical Term Normalization</article-title>
          ,
          <source>in: Proceedings of The 2022 Conference on Empirical Methods in Natural Language Processing</source>
          , Association for Computational Linguistics, Abu Dhabi, United Arab Emirates,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>