<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Overview of TASS 2018: Opinions, Health and Emotions</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Eugenio Mart nez-Camara</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yudivian Almeida-Cruz</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Manuel Carlos D az-Galiano</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Suilan Estevez-Velarde</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Miguel A. Garc a-Cumbreras</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Manuel Garc a-Vega</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yoan Gutierrez</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Arturo Montejo-Raez</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andres Montoyo</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rafael Mun~oz</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alejandro Piad-Mor s</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Julio Villena-Roman</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Andalusian Research Institute in Data Science and Computational Intelligence (DaSCI) Universidad de Granada</institution>
          ,
          <addr-line>Espan~a</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Centro de Estudios Avanzados en Tecnolog as de la Informacion y de la Comunicacion (CEATIC) Universidad de Jaen</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>MeaningCloud</institution>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Universidad de Alicante</institution>
          ,
          <addr-line>Espan~a</addr-line>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Universidad de La Habana</institution>
          ,
          <country country="CU">Cuba</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <fpage>13</fpage>
      <lpage>27</lpage>
      <abstract>
        <p>This is an overview of the Workshop on Semantic Analysis at the SEPLN congress held in Sevilla, Spain, in September 2018. This forum proposes to participants four di erent semantic tasks on texts written in Spanish. Task 1 focuses on polarity classi cation; Task 2 encourages the development of aspect-based polarity classi cation systems; Task 3 provides a scenario for discovering knowledge from eHealth documents; nally, Task 4 is about automatic classi cation of news articles according to safety. The former two tasks are novel in this TASS's edition. We detail the approaches and the results of the submitted systems of the di erent groups in each task.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>The Workshop on Semantic Analysis at
the SEPLN1 (in Spanish Taller de Analisis
Semantico en la SEPLN, TASS) is the
evolution of the Workshop on Sentiment Analysis
at the SEPLN which is being held since 2012.</p>
      <p>1http://www.sepln.org/workshops/tass
The aim of the workshop is the furtherance of
the research in semantic tasks on texts
written in Spanish, roughly speaking in Spanish
data. The edition 2018 has proposed two new
challenges (Tasks 3 and 4), and provided
several linguistic resources.</p>
      <p>The processing of health data is attracting
the attention of the Natural Language
Processing (NLP) research community (Denecke,</p>
      <p>Copyright © 2018 by the paper's authors. Copying permitted for private and academic purposes.
2015). In this line, Task 3 proposes
modelling the human language in a scenario in
which Spanish electronic health documents
could be machine readable from a semantic
point of view. This Task 3 consists of
detecting and classifying concepts for
semantic relating them. Task 4 is related to the
brand safety concept, which is crucial for the
reputation of a brand or the company of the
brand. Task 4 proposes the classi cation of
the level of safety of a news for the
publication of a ads spot of a brand according to the
headline of that news.</p>
      <p>
        Tasks 3 and 4 provided speci c datasets
for accomplishing the proposed challenge,
and are described in Sections 2.3.1 and 2.4.1
respectively. Task 1 provided an extension of
the InterTASS corpus, that was presented in
the edition of 2017
        <xref ref-type="bibr" rid="ref19">(Mart nez-Camara et al.,
2017)</xref>
        . The main novelty of the new version
of InterTASS is the incorporation of tweets
written in the Spanish language spoken in
Spain and in the several other countries of
America. Since the di culty of Task 2 is
high, the organisation proposed the same
setting of the task as in previous editions.
      </p>
      <p>The paper is organised as follows: Section
2 describes all the tasks proposed in the
edition of year 2018. The speci c details of each
Subtask are in Section 2.1, 2.2, 2.3 and 2.4
respectively. Section 3 exposes the conclusions
of the paper.
2</p>
      <p>Spanish Semantic Analysis</p>
      <p>Tasks
As mentioned before, TASS is a relevant
workshop for semantic analysis tasks,
particularly for Spanish. In 2018, new resources
and challenges were introduced to evolve
Sentiment Analysis systems to a semantic
level. In the last editions, several research
groups from di erent countries, like Uruguay
or Costa Rica, presented their systems, and
it was mandatory to make an e ort to build
adequate resources for their languages.</p>
      <p>In addition, society and companies are
interested in new speci c challenges, and for
this reason new tasks arise, while
maintaining the main task (global polarity).</p>
      <p>In this Section, we describe the four tasks
of the edition of 2018, namely Section 2.1
expose the details of Task 1; Section 2.2
describes the corpus and the systems
submitted to Task 2; Section 2.3 is focused on the
Task 3; and Section 2.4 describes all details
of Task 4.
2.1 Task 1
This task focused on the evaluation of
polarity classi cation systems at tweet level of
tweets written in Spanish.</p>
      <p>The submitted systems had to face, as
usual, the lack of context due to length of
tweets written in an informal language with
misspelling or emojis, even onomatopeias.</p>
      <p>But this edition brought new challenges to
this task:</p>
      <p>Multilinguality: training, tests and
development corpus contain tweets written
in Spanish from Spain, Peru and Costa
Rica.</p>
      <p>Generalization: Several corpora have
been used. One of them is the
development set, so it follows a similar
distribution. The second corpus is the test set
of the General Corpus of TASS, which
was compiled some years ago, so it may
be lexically and semantically di erent
from the training and development data.</p>
      <p>Furthermore, the system will be
evaluated with test sets of tweets written in
the Spanish language spoken in di erent</p>
      <p>American countries.</p>
      <p>
        The General Corpus of TASS has been
provided in the same way as previous
editions. Further details in
        <xref ref-type="bibr" rid="ref19">(Mart nez-Camara
et al., 2017)</xref>
        .
      </p>
      <p>However, International TASS Corpus
(InterTASS) is a corpus released in 2017 that
has been updated for this edition with new
texts. It is composed of tweets written in
di erent varieties of Spanish (for Spain, Peru
and Costa Rica), so it exhibits a large amount
of lexical and even structural di erences in
each variant. The main purpose of compiling
and using an inter-varietal corpus of Spanish
for the evaluation tasks is to challenge
participating systems to cope with the many faces
of this language worldwide.</p>
      <p>Datasets were annotated with 4 di erent
polarity labels positive, negative,
neutral and none), and systems had to
identify the orientation of the opinion expressed
in each tweet in any of those 4 polarity levels.</p>
      <p>
        The Spanish variety part was released in
2017 and its description can be found in
        <xref ref-type="bibr" rid="ref19">(Mart nez-Camara et al., 2017)</xref>
        . Table 1
shows the tweets distribution for training,
development (dev.) and test corpora.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Training Dev.</title>
      <p>The Peru and Costa Rica varieties have
been released for this edition. The tweets
distributions are shown in Tables 2 and 3
respectively for both variants.</p>
    </sec>
    <sec id="sec-3">
      <title>Training Dev.</title>
      <p>Four sub-tasks were proposed, working
with the datasets of the di erent countries:</p>
      <p>Subtask-1: Monolingual ES. training
and test were the InterTASS ES datasets.</p>
      <p>Subtask-2: Monolingual PE. training
and test were the InterTASS PE datasets.</p>
      <p>Subtask-3: Monolingual CR. training
and test were the InterTASS CR datasets.</p>
      <p>Subtask-4: Cross-lingual. The training
could be done with any dataset, but using
a di erent one for the evaluation, in order
to test the dependency of systems on a
language.</p>
      <p>Results were submitted in a plain text le
with the following format:
t w e e t i d n t p o l a r i t y</p>
      <p>Accuracy and the macro-averaged
versions of Precision, Recall and F1 were used
as evaluation measures. Systems were ranked
by the Macro-F1 and Accuracy measures.</p>
    </sec>
    <sec id="sec-4">
      <title>2.1.1 Analysis of the Results</title>
      <p>For task 1 ve system were presented. Most
of them make use of deep learning algorithms,
combining di erent ways of obtaining the
word embeddings.</p>
      <p>
        INGEOTEC.
        <xref ref-type="bibr" rid="ref22">Moctezuma1 et al. (2018)</xref>
        present a polarity classi cation system based
on the combination of di erent labelling
systems. The main component is the EvoMSA
system, based on genetic algorithms, which
combines the outputs of the other systems.
EvoMSA is based on the B4MSA system for
the adjustment of the di erent parameters
(how the text is normalised, how the
tokens are calculated or how the tokens are
weighted) and on the EvoDAG program that
carries out the classi cation. As for the
input systems, various systems are used based
on lexicons of a ectivity or aggressiveness.
It also uses the algorithm of word
embeddings called FastText, using the Wikipedia
in Spanish to train it. Vectors are generated
for each document and SVM is used for
training. Their approach performs better when it
is trained with tweets from Spain and test
with other Spanish varieties.
      </p>
      <p>
        RETUYT-InCo.
        <xref ref-type="bibr" rid="ref3">Chiruzzo and Rosa
(2018)</xref>
        submitted three approaches: SVM
using word embedding centroids and
manually crafted features, CNN using word
embeddings as input, and Long Short
Term Memory (LSTM) using word
embeddings, trained with focus on improving the
recognition of neutral tweets. In all cases,
embedding improves results and LSTM has
the best behaviour for neutral tweets. The
use of a mixed-balanced training method
for the LSTM resulted in a signi cant
improvement in the detection of neutral
tweets.
      </p>
      <p>ITAINNOVA. Montanes, Aznar, and del
Hoyo (2018) analyse the use of convolutional
network models (CNN), LSTM, Bidirectional
LSTM (BiLSTM) and a hybrid approach
between CNN and LSTM. The combination
CNN-LSTM has been chosen as it integrates
the bene ts of both models. They choose
the CNN-LSTM combination because it
integrates the bene ts provided from both
models.
ELiRF-UPV. Gonzalez, Hurtado, and
Pla (2018b) explore di erent approaches
based on Deep Learning. Speci cally, they
study the behaviour of the CNN,
Attention Bidirectional Long Short Term Memory
(Att-BLSTM) and Deep Averaging Networks
(DAN). In order to study the behaviour of
the di erent models, they carry out an
adjustment process. They get the best
results in InterTASS-ES. However, linguistic
variability a ects the choice of architecture
and its hyperparameters, so the application
of the same system to InterTASS-CR and
InterTASS-PE tasks, without making any
adjustment, has not allowed to obtain results as
competitive as in InterTASS-ES.</p>
      <p>ATALAYA. Luque and Perez (2018)
presented a system that uses a weighted scheme
to average the subword-aware embeddings
obtained from preprocessed tweets that have
been enriched with data obtained from
machine translation. This novel solution
involves translating tweets into another
language and back into the source language, to
lexically and grammatically increase them.</p>
      <p>Tables 4, 5 and 6 show the results
obtained in the monolingual subtasks (Spain,
Costa Rica and Peru variants).</p>
      <p>For the cross-lingual runs, the participants
selected an InterTASS dataset to train their
systems and a di erent one to test, in order
to test the dependency of systems on a
language. Tables 7, 9 and 8 show the results
obtained in these cross-lingual subtasks.</p>
      <p>The overall results, in terms of F1,
obtained with the monolingual and multilingual
systems for the Spanish and Costa Rica
collections are quite comparable, but the one
with the Peru collection fall by around 10%.
2.2</p>
      <p>
        Task 2
Task 2, Aspect-based Sentiment Analysis,
proposes the development of aspect-based
polarity classi cation systems. Similar to
previous editions
        <xref ref-type="bibr" rid="ref19">(Mart nez-Camara et al.,
2017)</xref>
        , two datasets were used to evaluate the
di erent approaches: Social-TV and
STOMPOL. Both datasets were annotated with
Run
retuyt-lstm-cr-2
retuyt-svm-cr-2
retuyt-svm-cr-1
elirf-cr-run-2
retuyt-cnn-cr-1
atalaya-cr-lr-50-2
ingeotec-run1
retuyt-lstm-cr-1
retuyt-cnn-cr-2
elirf-intertass-cr-run-1
atalaya-mlp-300-sentiment
atalaya-mlp-ubav3-50-3
ingeotec-run1
elirf-cr-run-1
Run
retuyt-cnn-pe-1
atalaya-pe-lr-50-2
retuyt-lstm-pe-2
retuyt-svm-pe-2
ingeotec-run1
elirf-intertass-pe-run-2
atalaya-mlp-sentimentubav3-50-3
retuyt-svm-pe-1
elirf-intertass-pe-run-1
atalaya-mlp-300-sentiment
atalaya-mlp-50-sentiment
retuyt-svm-pe-2
retuyt-cnn-pe-2
retuyt-lstm-pe-1
elirf-intertass-pe-run-1
M. F1
sion of Precision, Recall, F1, and Accuracy
were considered, and Macro-F1 was used for
a nal ranking of proposed systems.
      </p>
    </sec>
    <sec id="sec-5">
      <title>2.2.1 Collections</title>
      <p>
        The Social-TV corpus was collected during
the 2014 Final of \Copa del Rey"
championship in Spain. After ltering out
useless information, a subset of 2,773 tweets
was obtained. The details of the corpus
are described in
        <xref ref-type="bibr" rid="ref19 ref28 ref7">(Villena-Roman et al., 2015;
Garc a-Cumbreras et al., 2016; Mart
nezCamara et al., 2017)</xref>
        .
      </p>
      <p>
        STOMPOL (corpus of Spanish Tweets for
Opinion Mining at aspect level about
POLitics) is a corpus for the task of Aspect Based
Sentiment Analysis. The corpus is composed
of 1,284 tweets manually annotated by two
annotators, and a third one in case of
disagreement. The details of the corpus are
described in
        <xref ref-type="bibr" rid="ref19 ref28 ref7">(Villena-Roman et al., 2015;
Garc a-Cumbreras et al., 2016; Mart
nezCamara et al., 2017)</xref>
        .
      </p>
    </sec>
    <sec id="sec-6">
      <title>2.2.2 Results</title>
      <p>
        Only the research group ELiRF
        <xref ref-type="bibr" rid="ref10 ref13 ref18 ref20 ref23 ref3 ref8 ref9">(Gonzalez,
Hurtado, and Pla, 2018c)</xref>
        participated in this
edition. They explored di erent approaches
based on Deep Learning. Speci cally, they
studied the behaviour of the CNN,
Attention Bidirectional Long Short Term Memory
(Att-BLSTM) and Deep Averaging Networks
(DAN), similar to the proposal of the team
for Task 1. In order to study the performance
of the di erent models, they carried out an
adjustment process. Tables 10 and 11 show
the results obtained in their experiments.
aspect-related metadata: the main category
of the aspect, and the polarity of the opinion
about the aspect. Systems had to classify
the opinion about the given aspect in 3
different polarity labels (positive, negative,
neutral).
      </p>
      <p>Participants were expected to submit up
to 3 experiments for each provided collection,
each in a plain text le with tweet identi
cation, aspect and polarity.</p>
      <p>
        For evaluation, exact match with a single
label combining \aspect-polarity" was used.
Similarly to Task 1, the macro-averaged
ver2.3 Task 3
NLP methods are increasingly being used to
mine knowledge from unstructured content of
health
        <xref ref-type="bibr" rid="ref11 ref17 ref5">(Liu et al., 2013; Doing-Harris and
Zeng-Treitler, 2011; Gonzalez-Hernandez et
al., 2017)</xref>
        and other domains
(EstevezVelarde et al., 2018). Over the years, many
eHealth challenges have taken place, such
as SemEval2, CLEF3 campaigns and
others
        <xref ref-type="bibr" rid="ref1">(Augenstein et al., 2017)</xref>
        . These tasks
have mainly dealt with identi cation,
classi cation, extraction and linking of
knowledge. The Task 3: eHealth Knowledge
Discovery (eHealth-KD) proposes modelling the
human language in a scenario in which
Spanish electronic health documents could be
machine readable from a semantic point of view.
This task is designed to encourage the
development of software technologies to
automatically extract a large variety of knowledge
from eHealth documents written in the
Spanish language.
      </p>
      <p>In order to capture the semantics of a
broad range of health related text,
eHealthKD proposes the identi cation of two types
of elements: Concepts and Actions.
Concepts are key phrases that represent actors or
entities which are relevant in a domain, while
Actions represent how these Concepts
interact with each other. Actions and Concepts
can be linked by two types of relations:
subject and target, which describe the main
roles that a Concept can perform. Also, four
speci c semantic relations between Concepts
are de ned: is-a, part-of, property-of and
same-as. Figure 1 provides an example.</p>
      <p>To simplify and normalise the extraction
process, the overall task is divided into three
subtasks:</p>
      <p>Subtask A is concerned with the
extraction of the relevant key phrases.</p>
      <p>Subtask B is concerned with the
classication of the key phrases identi ed in
2International Workshop on Semantic Evaluation
3Conference and Labs of the Evaluation Forum</p>
      <p>Subtask A as either Concept or
Action.</p>
      <p>Subtask C is concerned with the
discovery of the semantic relations between
pairs of entities.</p>
      <p>To compute the evaluation metrics for
each subtask, we de ne the following sets for
comparing the annotations between both the
expected output (gold standard) and the
actual output in each subtask:</p>
    </sec>
    <sec id="sec-7">
      <title>Correct matches (C): in all subtasks,</title>
      <p>when one gold and one given annotation
exactly match.</p>
    </sec>
    <sec id="sec-8">
      <title>Partial matches (P ): in subtask A, when</title>
      <p>two key phrases have a non-empty
intersection.</p>
    </sec>
    <sec id="sec-9">
      <title>Missing matches (M ): in subtasks A and</title>
      <p>C, when an annotation in the gold
output is not provided by the system.</p>
    </sec>
    <sec id="sec-10">
      <title>Spurious matches (S): in subtasks A and</title>
      <p>C, when an annotation given by the
system does not appear in the gold output.</p>
    </sec>
    <sec id="sec-11">
      <title>Incorrect matches (I): in subtask</title>
      <p>when one assigned label is incorrect.
B,</p>
      <p>To measure the individual subtasks results
as well as overall results, the eHealth-KD
challenge proposes three evaluation
scenarios.</p>
      <p>Scenario 1. The rst scenario requires all
subtasks (i.e. A, B and C) to be performed
sequentially. The input in this scenario
consists of plain text (100 sentences), and
participants must submit the three output les
corresponding to subtasks A, B and C. In this
scenario the overall quality of the participant
systems is evaluated. So, a combined micro
F1 metric was de ned, taking into account
results of the three tasks:</p>
      <p>F1ABC</p>
      <sec id="sec-11-1">
        <title>PABC</title>
      </sec>
      <sec id="sec-11-2">
        <title>RABC</title>
      </sec>
      <sec id="sec-11-3">
        <title>TABC</title>
        <p>=
=
=
=</p>
      </sec>
      <sec id="sec-11-4">
        <title>2 PABC RABC</title>
        <p>PABC + RABC</p>
        <p>TABC + 12 PA
TABC + PA + MA + IB + MC</p>
        <p>TABC + 12 PA
TABC + PA + SA + IB + SC
CA + CB + CC
(1)
(2)
(3)
(4)
Scenario 2. In the second scenario only
subtasks B and C are performed. Hence,
participants receive plain text inputs and the
corresponding outputs for subtask A (a
different subset of 100 sentences). This
scenario allows participants to focus on the key
phrases classi cation, without being a ected
by errors related to the extraction of key
phrases. Like Scenario 1, a combined micro
F1 is de ned which takes into account the
results for subtasks B and C:</p>
        <p>F1BC
PBC
RBC
TBC
=
=
=
=
2 PBC RBC
PBC + RBC</p>
        <p>TBC
TBC + IB + MC</p>
        <p>TBC
TBC + IB + SC</p>
        <p>CB + CC
Scenario 3. Finally, the third scenario
evaluates only subtask C. Participants are
provided with plain text inputs and the
corresponding output of subtasks A and B (a
nal subset of another 100 sentences). In this
scenario, the following metric is de ned for
evaluation:</p>
        <p>F1C
PC
RC
= 2
=
=</p>
        <p>PC RecC
PC + RC</p>
        <p>CC
CC + SC</p>
        <p>CC</p>
        <p>CC + MC</p>
        <p>For competition purposes, the best system
is de ned as the submission that maximises
the macro-average F1 across all three
scenarios:</p>
        <p>F1
=</p>
        <p>F1ABC + F1BC + F1C
3</p>
      </sec>
    </sec>
    <sec id="sec-12">
      <title>2.3.1 Corpora</title>
      <p>For evaluation purposes, a corpus of
healthrelated sentences in Spanish was manually
built and tagged. The corpus consists of a
selection of articles collected from the
MedlinePlus4 website. These les contain
several entries related to health and medicine
topics, and environmental topics strongly
related to health care. Spanish language items
were converted to a plain text document,
processed, and manually tagged using the Brat
Files
Sentences
Annotations</p>
    </sec>
    <sec id="sec-13">
      <title>Entities</title>
      <p>- Concepts
- Actions</p>
    </sec>
    <sec id="sec-14">
      <title>Roles</title>
      <p>- subject
- target</p>
    </sec>
    <sec id="sec-15">
      <title>Relations</title>
      <p>- is-a
- part-of
- property-of
- same-as</p>
    </sec>
    <sec id="sec-16">
      <title>Train Dev.</title>
      <p>annotation tool5 by 15 human annotators
divided into seven groups. The nal 1; 173
tagged sentences were organised in three
collections: training, development and test.
Table 12 summarises the main statistics of the
corpus.
2.3.2 Analysis of the Results
eHealth-KD challenge attracted the attention
of a total 31 registered teams of which six
of then successfully concluded their
participation. Their results are summarised in
Table 13. The following tag labels are designed
to provide an overview of the main
characteristics of each participant system:
S: Uses shallow supervised models such as
CRF, logistic regression, SVM, decision
trees, etc.</p>
      <p>D: Uses deep learning models, such as LSTM
or convolutional networks.</p>
      <p>E: Uses word embeddings or other
embedding models trained with external
corpora.</p>
      <p>K: Uses external knowledge bases, either
explicitly or implicitly (i.e, through
thirdparty tools).</p>
      <p>R: Uses hand crafted rules based on domain
expertise.</p>
      <p>N: Uses natural language processing
techniques or features, i.e., POS-tagging,
dependency parsing, etc.</p>
    </sec>
    <sec id="sec-17">
      <title>Baseline description: A baseline,</title>
      <p>trained on the training corpus, was de ned.
This strategy consists of a dummy approach
based solely on the text of key phrases. This
technique collects all training data and stores
three maps: (1) key phrases associated with
their most common class (either Concept or
Action); (2) pairs of concepts associated with
their most common relation; and (3) tuples
of &lt;Action,Concept&gt; associated with their
most common role. At prediction time, these
maps are used to select a key phrase, decide
its class, and predict relations and roles.</p>
      <p>Once the shared subtask ended, the o cial
results were published. However, some
participants noticed that their systems provided
duplicated outputs on some occasions. These
duplicated outputs, even if correct, were
being counted as spurious after the rst match.
To account for this duplication, the
evaluation script was modi ed to remove
duplicated outputs from the participants
submissions prior to calculating the evaluation
metrics. Table 14 shows this second version of the
metrics, where variations in scores are
highlighted in bold text. This proved not to be
a signi cant problem, since only two
participants were a ected, and even though their
metrics improved marginally, the overall
results or the main conclusions of the shared
subtask did not change.</p>
      <p>The results of this task, eHealth-KD, show
that a variety of approaches, on the whole,
deal e ectively with the health knowledge
discovery problem. However, issues still need
to be resolved to obtain highly
competitive systems. The best performing
submissions include classic supervised learning, deep
learning and knowledge-based techniques. In
subtask A, the best approach (UC3M) is based
on a CRF model with pre-trained
embeddings as features. This approach obtains
similar scores in subtask B. In general, subtask
B appears to be easier than the rest, which
is understandable given that there are only
two classes and there is a large correlation
between word lemmas and their classes (as
shown by the relatively high performance of
the baseline).</p>
      <p>Subtask C, in concordance with Scenario
3, does not exceed 45% in F-score. This
reinforces the belief that this task is di cult
7This extracts lexical and syntactic features for
each token. Afterwards, it applies a set of handcrafted
heuristics for each subtask.
to deal with, even after having applied novel
approaches (i.e. TALP and LaBDA) based on
convolutional neural networks.</p>
      <p>The best-performing systems in each
scenario highly coincide with all three task
results. For Scenario 1, the top performing
strategy belongs to UC3M, which achieves the
best scores in subtasks A and B, and the
overall best result in the shared subtask
(averaged across all three scenarios), pretty close
to SINAI. Likewise, the best strategy in
Scenario 3 corresponds to TALP, which achieves
the best score for subtask C. However, for
the overall results, other participants such as
SINAI and UPF-UPC achieve higher average
scores, even though their performance in
subtask C and Scenario 3 is practically
negligible. In contrast, these teams obtain relatively
high scores in subtasks A and B.</p>
      <p>The diverse nature and complexity of the
three subtasks make it di cult to design a
single fair evaluation metric. For this
reason, we consider that each system submission
gets more accurate results related to the
speci c sub-problems that it tackles. Although
generalisation across the three subtasks is a
desirable characteristic, advances in any
particular subtask are also very valuable.</p>
      <p>In general, the most competitive
approaches in individual subtasks are
dominated by state-of-the-art machine learning.
In the particular case of subtask C, where
modern deep learning approaches seem to
outperform classic techniques. In
addition, incorporating domain-speci c
knowledge provides a signi cant boost to the
performance. Most participants use NLP
features, either explicitly, or implicitly captured
in word embeddings and other
representations. An interesting phenomenon is that
the best systems in subtask A do not
correlate with the best systems in subtask C.
This suggests that the optimal approach for
either subtask is di erent, giving rise to an
interesting research line that would explore
integrated approaches to simultaneously
solving these three subtasks. The overall results
show that general purpose knowledge
discovery in domain-speci c documents is
potentially a proli c research area, particularly for
the Spanish language. We expect similar
future initiatives to provide fruitful evaluation
scenarios where researchers can deploy
techniques from several domains, and compete in
friendly contests to improve the
state-of-the</p>
    </sec>
    <sec id="sec-18">
      <title>Tags</title>
    </sec>
    <sec id="sec-19">
      <title>Subtask A</title>
    </sec>
    <sec id="sec-20">
      <title>Subtask B</title>
    </sec>
    <sec id="sec-21">
      <title>Subtask C</title>
    </sec>
    <sec id="sec-22">
      <title>Average</title>
    </sec>
    <sec id="sec-23">
      <title>Scenario 1</title>
    </sec>
    <sec id="sec-24">
      <title>Scenario 2</title>
    </sec>
    <sec id="sec-25">
      <title>Scenario 3</title>
    </sec>
    <sec id="sec-26">
      <title>Average</title>
    </sec>
    <sec id="sec-27">
      <title>Tags</title>
    </sec>
    <sec id="sec-28">
      <title>Subtask A</title>
    </sec>
    <sec id="sec-29">
      <title>Subtask B</title>
    </sec>
    <sec id="sec-30">
      <title>Subtask C</title>
    </sec>
    <sec id="sec-31">
      <title>Average</title>
    </sec>
    <sec id="sec-32">
      <title>Scenario 1</title>
    </sec>
    <sec id="sec-33">
      <title>Scenario 2</title>
    </sec>
    <sec id="sec-34">
      <title>Scenario 3</title>
    </sec>
    <sec id="sec-35">
      <title>Average</title>
      <p>UC3My
SDEN
When news are about natural disasters,
readers usually feel negative emotions (sadness,
for instance), whereas when those news are
about the last championship won by your
favourite team, readers feel positive emotions
like happiness. Moreover, it is commonly
assumed in marketing that emotions aroused
in the reader by news articles have an
impact in the perception of the advertisements
displayed along with those articles. Thus,
from that marketing perspective, if a
company wants to promote their brand, the ads
should better be associated to (i.e., shown
with) news that arouse positive emotions.</p>
      <p>The objective of Task-4 is to encourage the
development of systems that can classify a
news article into safe (positive emotions, so
safe for ads) or unsafe (negative emotions,
so better avoid ads). This task could be
considered as a kind of stance classi cation, on
the positioning of readers of news contents.
The task is a strong challenge because it has
to deal with the polarity of feeling (safe vs.
unsafe) and to work in combination with a
(pseudo) thematic classi cation to be able to
determine the meaning of the news. For
example, the reduction of tra c accidents has
a negative feeling because of the accidents,
but the context of reducing the numbers of
accidents makes those bad news good news,
hence safe news.</p>
    </sec>
    <sec id="sec-36">
      <title>2.4.1 Corpora</title>
      <p>The Spanish brANd Safe Emotion corpus
(SANSE) corpus was speci cally built for this
task. RSS feeds of di erent online
newspapers written in di erent varieties of Spanish
(Argentina, Chile, Colombia, Cuba, Spain,
USA, Mexico, Peru and Venezuela) were
collected for over a month. Finally 15,152
articles were captured, containing the URL,
the publication date and the headline. News
summaries were also collected for several
sources, but nally they were discarded to
make the dataset consistent and
homogeneous.</p>
      <p>Then 2,000 articles (L1 subset) were
randomly selected and were manually annotated
into an emotional categorisation of SAFE or
UNSAFE, from the point of view of the
general public of each corresponding country.
The other 13,152 articles (L2 subset) were
not annotated.</p>
    </sec>
    <sec id="sec-37">
      <title>Subset</title>
      <sec id="sec-37-1">
        <title>Training Development Test</title>
      </sec>
    </sec>
    <sec id="sec-38">
      <title>Subset</title>
      <sec id="sec-38-1">
        <title>Training (Spain)</title>
        <p>Dev. (Spain)
Test (Mexico)
Test (Cuba)
Test (Chile)
Test (Colombia)
Test (Argentina)
Test (Venezuela)
Test (Peru)
Test (USA)</p>
        <p>As the datasets were annotated with two
levels of safety: SAFE and UNSAFE, the
task can be considered as a binary classi
cation task.</p>
        <p>The annotation was carried out by two
human annotators (the two organisers of the
task), and, for those cases with no agreement
between the two annotators, a third
annotator undid the tie. A safe headline of a news
was de ned as an utterance that arises a
positive or neutral emotion in the reader and is
not related to a controversial topics: religion,
extreme wing political topics, or
controversial topics (those that arise strong positive
emotions to some readers but strong negative
emotions to other ones). An unsafe headline
was de ned as an utterance that arises
negative emotions on the reader.</p>
        <p>Some examples in Spanish:
As sera el nuevo pan integral en
Espan~a, segun una nueva ley en
marcha. ! SAFE
This will be the new integral bread in
Spain, according to a new law underway.
Casi 300 municipios de Colombia en
riesgo electoral. ! UNSAFE
Almost 300 municipalities in Colombia at
electoral risk.</p>
        <p>
          The agreement of the annotation was 0.58
according to
          <xref ref-type="bibr" rid="ref26">(Scott, 1955)</xref>
          and k (Cohen,
1960), which may consider moderate
according to
          <xref ref-type="bibr" rid="ref16">Landis and Koch (1977)</xref>
          . Although
the agreement is moderate, it is close to be
considered substantial, and we have also to
take into account that it is a new classi
cation task that works with a strong subjective
content. We will work in making the
annotations guidelines more precise in order to
improve the agreement of the annotators.
Besides, we hope that the participants will give
us insights with the aim of improving the
annotation of the data.
        </p>
        <p>The L1 subset was then again divided in
three subsets, speci cally: training,
development and test. The statistics of the three
subsets are in Table 15.
2.4.2</p>
      </sec>
    </sec>
    <sec id="sec-39">
      <title>Tasks</title>
      <p>Two subtasks were proposed. Subtask 1 (S1)
consists of the classi cation of headlines into
safe or unsafe for incorporating an ad of
a brand. The evaluation of the systems does
not take into account the cultural varieties of</p>
    </sec>
    <sec id="sec-40">
      <title>Size</title>
      <p>the Spanish language, it thus a monolingual
evaluation. In this task, datasets are
composed of headlines of news written in di
erent version of the Spanish language, but the
country of the text is not relevant for this
task.</p>
      <p>Participants were provided with the
training and development subsets of L1 SANSE
corpus for building the systems, and two test
sets for the evaluation: the test subset of L1
SANSE corpus and the L2 SANSE corpus.</p>
      <p>The systems presented were evaluated
using the measures of Macro-Precision (M. P.),
Macro-Recall (M. R.), Macro-F1 (M. F1) and
Accuracy (Acc.).</p>
      <p>
        Subtask 2 (S2) is similar to S1, but in this
case the aim is to evaluate the generalisation
capacity of the submitted systems. For
training their systems, participants were provided
with SANSE subsets with headlines written
only in the Spanish language spoken in Spain.
The test set was composed of headlines
written in the Spanish language spoken in di
erent countries of America. The statistics of
SENSE corpus for S2 are shown in the Table
16.
Task 4 attracted the attention of seven teams,
and most of them participated in both
levels of evaluation of the S1 and in S2. Table
2.4.3 shows the participation of the teams in
each Subtask. Five groups of the seven ones
submitted a system description paper, whose
main features will be detailed as what follows.
INGEOTEC.
        <xref ref-type="bibr" rid="ref21 ref22">Moctezuma et al. (2018)</xref>
        propose an ensemble classi cation system
(EvoMSA), which is composed of several and
heterogeneous base systems and a genetic
programming system (EvoDAG,
        <xref ref-type="bibr" rid="ref12">(Gra et al.,
2017)</xref>
        ) that optimises the contribution of each
base system in the nal classi cation. The
authors combined supervised and
unsupervised system as base classi cation systems.
The supervised ones are based on the use
of the algorithm SVM with di erent
representations of the input text, namely TF-IDF
and pre-trained word vectors. The system
reached the best results in the monolingual
and the multilingual evaluations, however the
performance of the system dropped a bit in
S1 L2. Since the annotation test set of S1
L2 was conducted by a voting system of the
all the submitted systems, the lower
performance in S1 L2 may be caused by a di erent
error distribution between INGEOTEC and
the systems submitted by the other groups.
ELiRF UPV. Gonzalez, Hurtado, and
Pla (2018a) propose a deep neural network,
speci cally the model Deep Averaging
Networks (DAN)
        <xref ref-type="bibr" rid="ref15">(Iyyer et al., 2015)</xref>
        . The
authors used a set of pre-trained word
embeddings for representing the news headlines.
The set of pre-trained word embeddings were
prepared by the authors and built upon a
corpus of tweets
        <xref ref-type="bibr" rid="ref11 ref14">(Hurtado, Pla, and Gonzalez,
2017)</xref>
        . The high performance reached by
a set of pre-trained word embeddings built
upon tweets with news headlines stands out,
because the genre of news headlines and
tweets are di erent. However, it may mean
that the use of language in tweets and news
headlines is similar.
      </p>
      <p>S1 L1</p>
      <p>S1 L2</p>
    </sec>
    <sec id="sec-41">
      <title>Team</title>
      <sec id="sec-41-1">
        <title>INGEOTECy</title>
        <p>ELiRF-UPVy
rbnUGRy
MeaningCloudy
SINAIy
lone wolf
TNT-UA-WFU</p>
        <p>X
X
X
X
X
X
X</p>
        <p>
          X
X
X
X
X
X
X
S2
X
X
X
X
rbnUGR. Rodr guez Barroso, Mart
nezCamara, and
          <xref ref-type="bibr" rid="ref13">Herrera (2018)</xref>
          submitted three
systems grounded in deep learning. Although
the three systems are based on Long
ShortTerm Memory (LSTM) Recurrent Neural
Network (RNN), they have several di
erences:
Run 1. It uses a LSTM layer as encoding
layer, and its output is the last vector
state of the LSTM layer.
        </p>
        <p>Run 2. It uses a BiLSTM8 layer as encoding
layer, and its output is the concatenation
of the last vector state of the two LSTM
layers.</p>
        <p>Run 3. It uses a LSTM layer as encoding
layer, and its output is the concatenation
of the corresponding output state vector
of each input token.</p>
        <p>The results show that the systems based
on one single LSTM layer perform better
than the one based on BiLSTM. Regarding
the di erent results in S1 and S2 indicate
that the use the entire output of the
encoding layer allow to improve the generalisation
capacity of the model, because the
multilingual evaluation requires a higher
generalisation capacity.</p>
        <p>
          MeaningCloud.
          <xref ref-type="bibr" rid="ref13">Herrera-Planells and
Villena-Roman (2018)</xref>
          propose three
supervised systems, two of them are lineal
classi cation systems and the other one a
non-lineal classi cation system. The linear
classi cation systems use XGBoost
          <xref ref-type="bibr" rid="ref2">(Chen
and Guestrin, 2016)</xref>
          as classi cation system.
They di er in the set of features used to
represent the news headlines, which are
mainly built using the public APIs of the
text analytic platform of MeaningCloud.
The non lineal classi cation system is a
neural network based on the use of a CNN
layer. The proposal that reached higher
results was the one grounded in a CNN
(Run 3).
        </p>
        <p>
          SINAI.
          <xref ref-type="bibr" rid="ref24">Plaza del Arco et al. (2018</xref>
          )
propose to represent the news headlines as a
vector of unigrams weighted with TF-IDF, and
the number of positive and negative words
according to three list of opinion bearing words.
The authors used SVM as classi cation
algorithm.
        </p>
        <p>The evaluation measures in the two
Subtasks were accuracy and the macro-average
8A BiLSTM is an elaboration of two LSTM layers.
System
INGEOTEC run1 0.794
ELiRF UPV run2 0.787
ELiRF UPV run1 0.795
rbnUGR run1 0.784
MEANING- 0.767
CLOUD run3
rbnUGR run3 0.763
rbnUGR run2 0.774
SINAI 0.733
MEANING- 0.723
CLOUD run2
MEANINGCLOUD run1
of precision, recall and F1, and the systems
were ranked according to the value of
macroF1. Table 18 show the results reached by each
group that submitted the description of their
systems in S1 L1, S1 L2 and S2 respectively.
3</p>
        <p>Conclusions
The edition of TASS 2018 has attracted the
participation of 16 systems, and the
submission of 15 system description papers.
Moreover, we have proposed two new challenges to
the international reserach community, which
are in line to the requirements of the
Industry.</p>
        <p>The submitted systems are in the line of
the state-of-the-art in other similar
workshops, and most of them are grounded in
Deep Learning and the use of hand-crafted
linguistic features. Therefore, TASS may be
considered as a reference forum for setting
up the state-of-the-art in semantic analysis
in Spanish.</p>
        <p>As future work, we plan to enlarge the
coverage of the Spanish language of the corpus
InterTASS, as well as consolidating the new
challenges (Task 3 and Task 4). Moreover,
we will keep working in the development of
new corpora and linguistic resources for the
research community.</p>
        <p>Acknowledgments
This work has been partially supported by a
grant from the Fondo Europeo de Desarrollo
Regional (FEDER), the projects REDES
(TIN2015-65136-C2-1-R,
TIN2015-65136C2-2-R) and SMART-DASCI
(TIN2017</p>
        <p>Workshop on Semantic Analysis at
SEPLN (TASS 2018).</p>
        <p>Workshop Proceedings, Sevilla, Spain,
September. CEUR-WS.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Augenstein</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>M. Das</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Riedel</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Vikraman</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>McCallum</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Semeval 2017 task 10: Scienceie-extracting keyphrases and relations from scienti c publications</article-title>
          .
          <source>arXiv preprint arXiv:1704</source>
          .
          <fpage>02853</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Chen</surname>
            , T. and
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Guestrin</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Xgboost: A scalable tree boosting system</article-title>
          .
          <source>In Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '16</source>
          , pages
          <fpage>785</fpage>
          {
          <fpage>794</fpage>
          , New York, NY, USA. ACM.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Chiruzzo</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Rosa</surname>
          </string-name>
          .
          <year>2018</year>
          . Retuytinco at tass 2018:
          <article-title>Sentiment analysis in spanish variants using neural networks and svm</article-title>
          . In E. Mart nezCamara,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Almeida Cruz</surname>
          </string-name>
          , M. C.
          <article-title>D az-</article-title>
          <string-name>
            <surname>Galiano</surname>
            ,
            <given-names>S. Estevez</given-names>
          </string-name>
          <string-name>
            <surname>Velarde</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
            a-Cumbreras,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
            a-Vega,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Gutierrez Vazquez</surname>
            ,
            <given-names>A. Montejo</given-names>
          </string-name>
          <string-name>
            <surname>Raez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Montoyo</surname>
            <given-names>Guijarro</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mun</surname>
          </string-name>
          <article-title>~oz Guillena, A. Piad Mor s, and</article-title>
          J. Villena-Roman, editors,
          <source>Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          ), volume
          <volume>2172</volume>
          of CEUR Cohen,
          <string-name>
            <surname>J.</surname>
          </string-name>
          <year>1960</year>
          .
          <article-title>A coe cient of agreement for nominal scales</article-title>
          .
          <source>Educational and Psychological Measurement</source>
          ,
          <volume>20</volume>
          (
          <issue>1</issue>
          ):
          <volume>37</volume>
          {
          <fpage>46</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Denecke</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <year>2015</year>
          . Health Web Science. Springer International Publishing.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Doing-Harris</surname>
            ,
            <given-names>K. M.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>Q.</given-names>
            <surname>Zeng-Treitler</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Computer-assisted update of a consumer health vocabulary through mining of social network data</article-title>
          .
          <source>Journal of medical Internet research</source>
          ,
          <volume>13</volume>
          (
          <issue>2</issue>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Estevez-Velarde</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Gutierrez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Montoyo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Piad-Mor</surname>
          </string-name>
          <string-name>
            <surname>s</surname>
          </string-name>
          , R. M. noz, and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Almeida-Cruz</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Gathering object interactions as semantic knowledge</article-title>
          .
          <source>In Proceedings of the 2018 International Conference on Arti cial Intelligence (ICAI'18).</source>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Garc</surname>
            a-Cumbreras,
            <given-names>M. A.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Villena-Roman</surname>
          </string-name>
          ,
          <string-name>
            <surname>E.</surname>
          </string-name>
          <article-title>Mart nez-</article-title>
          <string-name>
            <surname>Camara</surname>
            ,
            <given-names>M. C.</given-names>
          </string-name>
          <article-title>D az-</article-title>
          <string-name>
            <surname>Galiano</surname>
            ,
            <given-names>M. T.</given-names>
          </string-name>
          <string-name>
            <surname>Mart</surname>
            n-Valdivia, and
            <given-names>L. A.</given-names>
          </string-name>
          <string-name>
            <surname>Uren</surname>
          </string-name>
          ~a Lopez.
          <year>2016</year>
          .
          <article-title>Overview of tass 2016</article-title>
          .
          <source>In TASS 2016: Workshop on Sentiment Analysis at SEPLN</source>
          , pages
          <volume>13</volume>
          {
          <fpage>21</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Gonzalez</surname>
            ,
            <given-names>J.-A.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>L.-F.</given-names>
            <surname>Hurtado</surname>
          </string-name>
          , and
          <string-name>
            <given-names>F.</given-names>
            <surname>Pla</surname>
          </string-name>
          . 2018a.
          <article-title>ELiRF-UPV en TASS 2018: Categorizacio emocional de noticias</article-title>
          .
          <source>In Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          ), volume
          <volume>2172</volume>
          ,
          <string-name>
            <surname>September</surname>
          </string-name>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Gonzalez</surname>
            ,
            <given-names>J.-A.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>L.-F.</given-names>
            <surname>Hurtado</surname>
          </string-name>
          , and
          <string-name>
            <given-names>F.</given-names>
            <surname>Pla</surname>
          </string-name>
          . 2018b.
          <article-title>Elirf-upv en tass 2018: Analisis de sentimientos en twitter basado en aprendizaje profundo</article-title>
          . In E. Mart nezCamara,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Almeida Cruz</surname>
          </string-name>
          , M. C.
          <article-title>D az-</article-title>
          <string-name>
            <surname>Galiano</surname>
            ,
            <given-names>S. Estevez</given-names>
          </string-name>
          <string-name>
            <surname>Velarde</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
            a-Cumbreras,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
            a-Vega,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Gutierrez Vazquez</surname>
            ,
            <given-names>A. Montejo</given-names>
          </string-name>
          <string-name>
            <surname>Raez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Montoyo</surname>
            <given-names>Guijarro</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mun</surname>
          </string-name>
          <article-title>~oz Guillena, A. Piad Mor s, and</article-title>
          J. Villena-Roman, editors,
          <source>Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          ), volume
          <volume>2172</volume>
          <source>of CEUR Workshop Proceedings</source>
          , Sevilla, Spain, September. CEUR-WS.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Gonzalez</surname>
            ,
            <given-names>J.-A.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>L.-F.</given-names>
            <surname>Hurtado</surname>
          </string-name>
          , and
          <string-name>
            <given-names>F.</given-names>
            <surname>Pla</surname>
          </string-name>
          . 2018c.
          <article-title>Elirf-upv en tass 2018: Analisis de sentimientos en twitter basado en aprendizaje profundo</article-title>
          .
          <source>In Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Gonzalez-Hernandez</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sarker</surname>
          </string-name>
          ,
          <string-name>
            <surname>K. O'Connor</surname>
            , and
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Savova</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Capturing the patient's perspective: a review of advances in natural language processing of health-related text</article-title>
          .
          <source>Yearbook of medical informatics</source>
          ,
          <volume>26</volume>
          (
          <issue>01</issue>
          ):
          <volume>214</volume>
          {
          <fpage>227</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Gra</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>E. S.</given-names>
            <surname>Tellez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. Jair</given-names>
            <surname>Escalante</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Miranda-Jimenez</surname>
          </string-name>
          ,
          <year>2017</year>
          .
          <article-title>Semantic Genetic Programming for Sentiment Analysis</article-title>
          , pages
          <volume>43</volume>
          {
          <fpage>65</fpage>
          . Springer International Publishing, Cham.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Herrera-Planells</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Villena-Roman</surname>
          </string-name>
          .
          <year>2018</year>
          . MeaningCloud at TASS 2018:
          <article-title>News headlines categorization for brand safety assessment</article-title>
          .
          <source>In Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          ), volume
          <volume>2172</volume>
          ,
          <string-name>
            <surname>September</surname>
          </string-name>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Hurtado</surname>
            , L.-
            <given-names>F.</given-names>
            , F.
          </string-name>
          <string-name>
            <surname>Pla</surname>
            , and
            <given-names>J.-A.</given-names>
          </string-name>
          <string-name>
            <surname>Gonzalez</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Elirf-upv en tass 2017: Analisis de sentimientos en twitter basado en aprendizaje profundo</article-title>
          .
          <source>In Proceedings of TASS 2017: Workshop on Sentiment Analysis at SEPLN co-located with 33nd SEPLN Conference (SEPLN</source>
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Iyyer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Manjunatha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Boyd-Graber</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>H. Daume</given-names>
            <surname>III</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Deep unordered composition rivals syntactic methods for text classi cation</article-title>
          .
          <source>In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)</source>
          , pages
          <fpage>1681</fpage>
          {
          <fpage>1691</fpage>
          .
          <article-title>Association for Computational Linguistics</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Landis</surname>
            ,
            <given-names>J. R.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>G. G.</given-names>
            <surname>Koch</surname>
          </string-name>
          .
          <year>1977</year>
          .
          <article-title>The measurement of observer agreement for categorical data</article-title>
          .
          <source>biometrics</source>
          , pages
          <volume>159</volume>
          {
          <fpage>174</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Liu</surname>
            , H.,
            <given-names>S. J.</given-names>
          </string-name>
          <string-name>
            <surname>Bielinski</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Sohn</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Murphy</surname>
          </string-name>
          ,
          <string-name>
            <surname>K. B. Wagholikar</surname>
            ,
            <given-names>S. R.</given-names>
          </string-name>
          <string-name>
            <surname>Jonnalagadda</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Ravikumar</surname>
            ,
            <given-names>S. T.</given-names>
          </string-name>
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>I. J.</given-names>
          </string-name>
          <string-name>
            <surname>Kullo</surname>
            , and
            <given-names>C. G.</given-names>
          </string-name>
          <string-name>
            <surname>Chute</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>An information extraction framework for cohort identi cation using electronic health records</article-title>
          .
          <source>AMIA Summits on Translational Science Proceedings</source>
          ,
          <year>2013</year>
          :
          <fpage>149</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Lopez-Ubeda</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>M. C.</surname>
          </string-name>
          <article-title>D az-</article-title>
          <string-name>
            <surname>Galiano</surname>
            ,
            <given-names>M. T.</given-names>
          </string-name>
          <string-name>
            <surname>Mart</surname>
            n-Valdivia, and
            <given-names>L. A.</given-names>
          </string-name>
          <string-name>
            <surname>Urena-Lopez</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Sinai en tass 2018 task 3. clasi - cando acciones y conceptos con umls en medline</article-title>
          .
          <source>In Proceedings of TASS</source>
          <year>2018</year>
          : Luque,
          <string-name>
            <surname>F. M.</surname>
          </string-name>
          and
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Perez</surname>
          </string-name>
          .
          <year>2018</year>
          . Atalaya at tass 2018:
          <article-title>Sentiment analysis with tweet embeddings and data augmentation</article-title>
          . In E. Mart nezCamara,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Almeida Cruz</surname>
          </string-name>
          , M. C.
          <article-title>D az-</article-title>
          <string-name>
            <surname>Galiano</surname>
            ,
            <given-names>S. Estevez</given-names>
          </string-name>
          <string-name>
            <surname>Velarde</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
            a-Cumbreras,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
            a-Vega,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Gutierrez Vazquez</surname>
            ,
            <given-names>A. Montejo</given-names>
          </string-name>
          <string-name>
            <surname>Raez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Montoyo</surname>
            <given-names>Guijarro</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mun</surname>
          </string-name>
          <article-title>~oz Guillena, A. Piad Mor s, and</article-title>
          J. Villena-Roman, editors,
          <source>Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          ), volume
          <volume>2172</volume>
          <source>of CEUR Workshop Proceedings</source>
          , Sevilla, Spain, September. CEUR-WS.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Mart</surname>
            nez-Camara,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>M. C.</surname>
          </string-name>
          <article-title>D az-</article-title>
          <string-name>
            <surname>Galiano</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
            a-Cumbreras,
            <given-names>M.</given-names>
          </string-name>
          <article-title>Garc aVega, and</article-title>
          <string-name>
            <given-names>J.</given-names>
            <surname>Villena-Roman</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Overview of TASS 2017</article-title>
          . In E. Mart nezCamara, M. C.
          <article-title>D az-</article-title>
          <string-name>
            <surname>Galiano</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
            a-Cumbreras,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
          </string-name>
          <article-title>a-Vega, and</article-title>
          <string-name>
            <surname>J</surname>
          </string-name>
          . Villena-Roman, editors,
          <source>Proceedings of TASS 2017: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2017</year>
          ), volume
          <volume>1896</volume>
          <source>of CEUR Workshop Proceedings</source>
          , Murcia, Spain, September. CEUR-WS.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Medina</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Turmo</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Joint classi cation of key-phrases and relations in electronic health documents</article-title>
          .
          <source>In Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>Moctezuma</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ortiz-Bejar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. S.</given-names>
            <surname>Tellez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Miranda-Jimenez</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Gra</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Ingeotec solution for task 4 in tass'18 competition</article-title>
          .
          <source>In Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          ), volume
          <volume>2172</volume>
          ,
          <string-name>
            <surname>September</surname>
          </string-name>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <surname>Moctezuma1</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ortiz-Bejar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. S.</given-names>
            <surname>Tellez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Miranda-Jimenez</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Gra</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Ingeotec solution for task 1 in tass'18 competition</article-title>
          . In E. Mart nezCamara,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Almeida Cruz</surname>
          </string-name>
          , M. C.
          <article-title>D az-</article-title>
          <string-name>
            <surname>Galiano</surname>
            ,
            <given-names>S. Estevez</given-names>
          </string-name>
          <string-name>
            <surname>Velarde</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
            a-Cumbreras,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
            a-Vega,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Gutierrez Vazquez</surname>
            ,
            <given-names>A. Montejo</given-names>
          </string-name>
          <string-name>
            <surname>Raez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Montoyo</surname>
            <given-names>Guijarro</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mun</surname>
          </string-name>
          <article-title>~oz Guillena, A. Piad Mor s, and</article-title>
          J. Villena-Roman, editors,
          <source>Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          ), volume
          <volume>2172</volume>
          of CEUR Montanes,
          <string-name>
            <surname>R.</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Aznar</surname>
          </string-name>
          , and
          <source>R. del Hoyo</source>
          .
          <year>2018</year>
          .
          <article-title>Aplicacion de un modelo h brido de aprendizaje profundo para el analisis de sentimiento en twitter</article-title>
          . In E. Mart nezCamara,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Almeida Cruz</surname>
          </string-name>
          , M. C.
          <article-title>D az-</article-title>
          <string-name>
            <surname>Galiano</surname>
            ,
            <given-names>S. Estevez</given-names>
          </string-name>
          <string-name>
            <surname>Velarde</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
            a-Cumbreras,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
            a-Vega,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Gutierrez Vazquez</surname>
            ,
            <given-names>A. Montejo</given-names>
          </string-name>
          <string-name>
            <surname>Raez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Montoyo</surname>
            <given-names>Guijarro</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mun</surname>
          </string-name>
          <article-title>~oz Guillena, A. Piad Mor s, and</article-title>
          J. Villena-Roman, editors,
          <source>Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          ), volume
          <volume>2172</volume>
          <source>of CEUR Workshop Proceedings</source>
          , Sevilla, Spain, September. CEUR-WS.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>Palatresi</surname>
            ,
            <given-names>J. V.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>H. R.</given-names>
            <surname>Hontoria</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Tass2018: Medical knowledge discovery by combining terminology extraction techniques with machine learning classi cation</article-title>
          .
          <source>In Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <surname>Plaza del Arco</surname>
            ,
            <given-names>F. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>E.</surname>
          </string-name>
          <article-title>Mart nez-</article-title>
          <string-name>
            <surname>Camara</surname>
            ,
            <given-names>M. T.</given-names>
          </string-name>
          <article-title>Mart n Valdivia, and</article-title>
          <string-name>
            <surname>A</surname>
          </string-name>
          .
          <source>Uren~a Lopez</source>
          .
          <year>2018</year>
          .
          <article-title>SINAI en TASS 2018: Insercion de conocimiento emocional externo a un clasi cador lineal de emociones</article-title>
          .
          <source>In Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          ), volume
          <volume>2172</volume>
          ,
          <string-name>
            <surname>September</surname>
          </string-name>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <surname>Rodr guez Barroso</surname>
            , N.,
            <given-names>E.</given-names>
          </string-name>
          <article-title>Mart nez-Camara, and</article-title>
          <string-name>
            <given-names>F.</given-names>
            <surname>Herrera</surname>
          </string-name>
          .
          <year>2018</year>
          . SCI2S at TASS 2018:
          <article-title>Emotion classi cation with recurrent neural networks</article-title>
          .
          <source>In Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          ), volume
          <volume>2172</volume>
          ,
          <string-name>
            <surname>September</surname>
          </string-name>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <surname>Scott</surname>
            ,
            <given-names>W. A.</given-names>
          </string-name>
          <year>1955</year>
          .
          <article-title>Reliability of content analysis: The case of nominal scale coding</article-title>
          .
          <source>Public opinion quarterly</source>
          , pages
          <volume>321</volume>
          {
          <fpage>325</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <string-name>
            <surname>Suarez-Paniagua</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Segura-Bedmar</surname>
          </string-name>
          , and P. Mart nez.
          <year>2018</year>
          .
          <article-title>Labda at tass-2018 task 3: Convolutional neural networks for relation classi cation in spanish ehealth documents</article-title>
          .
          <source>In Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          <string-name>
            <surname>Villena-Roman</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Garc</surname>
          </string-name>
          a-Morera,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Garc</surname>
          </string-name>
          a-Cumbreras,
          <string-name>
            <surname>E.</surname>
          </string-name>
          <article-title>Mart nez-</article-title>
          <string-name>
            <surname>Camara</surname>
            ,
            <given-names>M. T.</given-names>
          </string-name>
          <string-name>
            <surname>Mart</surname>
            n-Valdivia, and
            <given-names>L. A.</given-names>
          </string-name>
          <string-name>
            <surname>Uren</surname>
          </string-name>
          ~a Lopez.
          <year>2015</year>
          . Overview of tass
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          <source>In TASS 2015: Workshop on Sentiment Analysis at SEPLN</source>
          , pages
          <volume>13</volume>
          {
          <fpage>21</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          <string-name>
            <surname>Zavala</surname>
            ,
            <given-names>R. M. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>P.</surname>
          </string-name>
          <article-title>Mart nez, and</article-title>
          <string-name>
            <surname>I. SeguraBedmar.</surname>
          </string-name>
          <year>2018</year>
          .
          <article-title>A hybrid bi-lstm-crf model for knowledge recognition from ehealth documents</article-title>
          .
          <source>In Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>