<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>NLP UNED at eHealth-KD Challenge 2019</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Hermenegildo Fabregat</string-name>
          <email>fgildo.fabregat@lsi</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andres Duque</string-name>
          <email>aduque@scc</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Juan Martinez-Romo</string-name>
          <email>juaner@lsi</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lourdes Araujo</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Departamento de Sistemas de Comunicacion y Control</institution>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Instituto Mixto de Investigacion - Escuela Nacional de Sanidad (IMIENS)</institution>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>NLP &amp; IR Group</institution>
          ,
          <addr-line>Dpto. Lenguajes y Sistemas Informaticos</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <fpage>67</fpage>
      <lpage>77</lpage>
      <abstract>
        <p>This paper describes the approach presented by the NLP UNED team in the eHealth Knowledge Discovery challenge of the IberLEF 2019 competition. Our proposal is based on the use of deep neural networks for performing keyphrase detection and attention-based networks for extracting relationships between those keyphrases. Our experiments show promising results especially in the Relation Extraction subtask, o ering the second best results among all participant systems.</p>
      </abstract>
      <kwd-group>
        <kwd>Deep learning</kwd>
        <kwd>Neural networks</kwd>
        <kwd>Attention systems</kwd>
        <kwd>Biomed- ical domain</kwd>
        <kwd>Named Entity Recognition</kwd>
        <kwd>Relation Extraction</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>The biomedical domain is one of the most important research elds at the
moment when it comes to Natural Language Processing (NLP). Successful identi
cation and extraction of valuable data in an automatic way is a crucial process
considering the huge amount of information available in this particular domain.
Clinical notes, medical reports or biomedical research papers are just some of
the multiple types of documents in which medical information can be found.</p>
      <p>
        In that context, the eHealth Knowledge Discovery Challenge [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], carried out
within the Iberian Languages Evaluation Forum (IberLEF 2019), o ers an
opportunity for the development of NLP systems that may help in this search for
useful data in biomedical information. The task aims for the correct identi
cation and classi cation of keyphrases in sentences extracted from biomedical
documents written in the Spanish language, and the search for meaningful
relationships between those keyphrases.
      </p>
      <p>In this paper, we present the deep learning-based system DeepNER+ARE,
designed for considering the challenge as a sequential pipeline divided into two
phases. In the rst phase, a Named Entity Recognizer (NER) is used for detecting
and classifying keyphrases, while the relationships between them are extracted in
a second step through the use of an Attention-based Relation Extractor (ARE).</p>
      <p>The paper is structured as follows: Section 2 o ers an overview on systems
developed for similar tasks. Section 3 brie y describes the characteristics of the
task. The proposed system is presented in Section 4, and the results obtained in
the challenge are shown in Section 5. Finally, some conclusions and future lines
of work are discussed in Section 6.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Background</title>
      <p>
        Many approaches can be found in the literature addressing the identi cation
and classi cation of keyphrases and the extraction of relationships between
them, both regarding general NLP [
        <xref ref-type="bibr" rid="ref12 ref14 ref19">19,14,12</xref>
        ] and its application in the
biomedical domain. The proposed challenge itself is closely related to task 3 (eHealth
Knowledge Discovery ) of TASS 2018 [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], in which systems addressed 3
subtasks: keyphrase identi cation, keyphrase classi cation and relation extraction.
The two best performing systems were based on deep learning solutions: one
of them made use of bidirectional Long Short-Term Memory (Bi-LSTM) layers
combined with Conditional Random Field (CRF) classi ers, although it only
o ered results for the detection and classi cation of keyphrases [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. The other
system considered the task as a whole and developed a strategy for jointly
classifying keyphrases and relationships, through the use of Convolutional Neural
Networks (CNN) [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Convolutional layers were also used by another system,
although only for the relation extraction subtask [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
      </p>
      <p>
        Other approaches have also been considered by participant teams in the task:
in [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], morphological analysis and the biomedical knowledge base Uni ed Medical
Language System (UMLS) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] are combined, only for the keyphrase extraction
subtask. A classic statistical NLP pipeline is combined in another system with
machine learning techniques such as CRF classi ers and logistic regression model
for o ering results for all the subtasks [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
      </p>
      <p>
        In a similar way to that presented in [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], our system also proposes the
use of Bi-LSTM layers, but is able to o er results for both phases of the task.
Moreover, it incorporates the use of attention layers [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] in the second step,
which help the model to focus on speci c parts of the input for improving the
relationship detection.
      </p>
      <p>
        Beyond this task, the use of deep learning techniques for entity and relation
extraction is being widely explored in the biomedical domain during the last
years. Di erent works can be found in the literature that exploit these techniques
for either extracting general entities and relationships [
        <xref ref-type="bibr" rid="ref4 ref9">4,9</xref>
        ] or addressing speci c
types such as drugs and adverse e ects [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] or rare diseases and dissabilities [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Task Proposal</title>
      <p>
        In this section the most important characteristics of the eHealth Knowledge
Discovery challenge are presented. Further details about the task can be found
in [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ].
3.1
      </p>
      <sec id="sec-3-1">
        <title>Subtask A: Identi cation and Classi cation of Keyphrases</title>
        <p>The rst subtask aims for nding and classifying relevant pieces of information
(words or sequences of words) within a sentence extracted from a biomedical
document. Once the span of those keyphrases have been detected, the systems
must classify them as belonging to one of the following classes: Concept (a general
term or idea), Action (a term that process or modi es other concepts), Predicate
(a term that represents a function or lter over a set of elements) or Reference
(a term that refers to a concept).
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Subtask B: Detection of Semantic Relations</title>
        <p>The second subtask proposes the classi cation of possible relationships between
the keyphrases. Types of relationships are categorized as general, conceptual
(both of them involving the four di erent keyphrase classes), action roles
(involving only actions) and predicate roles (involving only predicates). Each category
contains a set of classes, for a total of 13 di erent relationships.
3.3</p>
      </sec>
      <sec id="sec-3-3">
        <title>Datasets</title>
        <p>The eHealth-KD corpus has been published by the organizers of the task,
containing 700 (600 training and 100 development) di erent biomedical-related
sentences. The whole test dataset contains 8,800 sentences.
3.4</p>
      </sec>
      <sec id="sec-3-4">
        <title>Evaluation</title>
        <p>Three di erent scenarios are proposed for the evaluation of the participant
systems, which is carried out in terms of precision, recall and F1-measure. Metrics
are computed in terms of correct, incorrect and partial matches for both the
classi cation of keyphrases and relationships, and also missing and spurious matches
are considered in the classi cation of keyphrases.</p>
        <p>Scenario 1 considers the whole pipeline of the task, and hence can be seen as
the main evaluation, in which systems receive raw sentences as input and must
output the detected keyphrases and their assigned labels, as well as the
relationships between keyphrases and their labels. Scenario 2 only evaluates keyphrase
detection and classi cation from raw sentences, and Scenario 3 only evaluates
relationship detection and classi cation considering raw sentences and labelled
keyphrases as input.</p>
        <p>The complete test dataset composed of 8,800 sentences is used in Scenario
1, and participants are asked to provide their solution for all the sentences,
although only 100 of them have been used for the actual evaluation of the systems.
Regarding Scenarios 2 and 3, the test datasets contain 100 sentences each.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>System Description</title>
      <p>The DeepNER+ARE system is divided into two separate sequential subsystems
for addressing each of the subtasks of the challenge pipeline.
4.1</p>
      <sec id="sec-4-1">
        <title>Keyphrase recognition and classi cation</title>
        <p>The keyphrase detection phase has been addressed by using a subsystem which
consists of a pre-processing phase, where input data is adapted and prepared, a
supervised deep learning model, and a post-processing step for solving systematic
errors through hand-crafted rules.</p>
        <p>
          Pre-processing The corpus has been pre-processed and re-annotated following
the BILOU annotation scheme [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. Some simpli cation has been applied for
avoiding hops and overlappings that can be found in the keyphrases of the corpus.
Finally, in order to avoid con icts with the o set of the di erent annotations, a
tokenization process based on blank space splitting has been applied. A total of
14 classes resulting from all the possible combinations of the initial entity types
and the BILOU annotations have been generated.
        </p>
        <p>
          Features In this section we present the di erent attributes considered to be the
input of the deep learning stack:
{ Words: A representation based on pre-trained word embeddings has been
used. The word vectors presented in [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] have been selected due to the richness
of the sources from which they were generated and to their high recall. These
vectors have a total of 300 dimensions and gather around 1,000,653 unique
tokens.
{ Part-of-speech: This feature has been considered due to its importance in
general NLP tasks, and particularly to its connection and similarity with
the proposed classes to which each keyphrase should be related. The
PoSTagging model used was the one provided by the CoreNLP [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] library for
Spanish. An embedding representation of this feature is learned during
training, resulting in 25-dimensional vectors.
{ Casing: This feature satis es the need to minimize the impact of the
simpli cation process applied to complex expressions found in the di erent
instances. This is achieved by modeling each term with an additional 8-position
one-hot vector which represents di erent cases: term ending in comma or in
dot, uppercased rst letter or uppercased term, digits within the term, etc.
Deep Learning model The model implemented for keyphrase detection, as
shown in Figure 1, consists of a Bi-LSTM layer followed by two Dense layers.
Inputs of the architecture are represented by vectors Cx, Px and Wx, which
represent casing information, POS-tag embedding and word embedding, respectively.
The last dense layer corresponds to the output layer.
        </p>
        <p>Cx</p>
        <p>Px</p>
        <p>Wx</p>
        <p>Rules Two types of rules have been applied to the output of the deep learning
architecture in order to perform systematic error correction. The rst set of rules
is oriented to correcting frequent errors by extending or reducing the scope of
a detected keyphrase, or modifying its type, according to casing and POS-tag
information. On the other hand, the second set of rules aims to ensure that the
nal output of the system correctly follows the output BILOU format.</p>
        <p>Equations 1 and 2 show examples of rules that can be applied in each of the
aforementioned cases, respectively:</p>
        <p>T1(O) T2(BjAction) T3(LjConcept) ) T1(O) T2(BjAction) T3(LjAction) (1)
T1(O) T2(I) T3(L) ) T1(O) T2(B) T3(L)
(2)</p>
        <p>Equation 1 shows term T1, labeled as not belonging to an entity (O), term
T2, labeled as the beginning term (B) of an Action entity, and term T3 labeled
as the last term (L) of a Concept entity. In this case, the applied rule transforms
the last entity type from Concept to Action, for it to match the type of entity
beginning in T2.</p>
        <p>Equation 2, on the other hand, adapts the output to the expected BILOU
format: term T2 is labelled as intermediate term (I) of an entity, while the
previous term T1 is neither an intermediate nor a beginning term. The following term
T3 is the last term of the entity. Hence, for the output to make sense term T2
must be relabeled as beginning term, so the nal entity is composed of T2 and
T3.
4.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Attentive Relation Extraction</title>
        <p>A di erent deep learning stack has been developed for the second subtask,
devoted to the extraction meaningful relationships between keyphrases. This stack
is also based on a Bi-LSTM layer but is enriched by the addition of an attention
layer. Pre-processing is also needed in this step for correctly preparing the input
data.</p>
        <p>
          Features Some supplementary information has been added to the input features
of the model, apart from those features already mentioned in the rst subtask
(word embeddings, casing and POS-tagging):
{ Entities: In order to represent the entities that form part of each
relationship, the four di erent entity types (action, concept, predicate and reference)
have been represented using embeddings generated during the training phase.
The resulting vectors have 25 dimensions and encode both the BILOU
annotation and the type of each term of an entity.
{ Dependency graph: Considering that the relationships to be identi ed are
of a semantic nature (for instance is-a, causes or domain), the information
obtained by performing semantic parsing over the sentence and extracting
its dependency graph could be very valuable for the main aim of the
subtask. This dependency graph has also been generated using the CoreNLP
library. Speci cally, the information modeled represents the lowest level of
relationship that is o ered by the graph. Both the direction of the
relationship (related term) and the type of relationship are modeled. Both the
related term and the type of dependency are mapped using One-hot vectors.
Deep Learning model The proposed model is based on the architecture
presented in [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ]. This model extends the original approach by including the
previously mentioned NLP-based features. As Figure 2 shows, the model makes use
of a Bi-LSTM layer followed by an attention layer which considers the output of
the previous layer as a whole and merges word information into a higher level
vector that attempts to represent the attention relationships at sentence level.
        </p>
        <p>The output of the attention layer is then directed to an output dense layer
with all the possible classes a relationship between two terms can classi ed into.
The input is modeled using Wx, Px and Ex embeddings on one hand, representing
word, POS-tag and entity information respectively, and Dx, Tx and Cx One-hot
vectors on the other hand, which contain information regarding dependency
between terms, type of dependency and casing information respectively.</p>
        <p>Cx</p>
        <p>Dx</p>
        <p>Tx</p>
        <p>Ex</p>
        <p>Px</p>
        <p>Wx
In this section we show results obtained in the eHealth-KD competition, as
well as some results concerning di erent con gurations of the DeepNER+ARE
system. Tables 1, 2 and 3 show the task results for all the participating teams
in each of the proposed scenarios.</p>
        <p>Team</p>
        <p>F1</p>
        <p>P</p>
        <p>R</p>
        <p>These results clearly imply that the proposed system DeepNER+ARE is
able to obtain really promising results particularly in the subtask that aims for
the detection and classi cation of relationships between keyphrases previously
found. The system ranks in second place for this phase (subtask B, scenario</p>
        <p>P</p>
        <p>R</p>
        <p>P</p>
        <p>R
3). Regarding the whole evaluation, the DeepNER+ARE system obtains the
fourth position in terms of F1-Measure, while in scenario 2 (subtask A), ranks
in seventh place. However, in terms of precision our system is also able to o er
the second best performance (over 80%) in this subtask.</p>
        <p>The performance of our system is consistent with the complexity of the
networks used in each of the phases: for the rst subtask, the neural network
contains just a Bi-LSTM layer for accurately processing sequential textual
information. On the other hand the network used for the second step of the pipeline
adds an attention-based layer which is able to improve precision and raise up the
F1 measure. This attention mechanism allows the network to merge word-level
features into a sentence-level feature vector, which eventually helps the model to
focus on speci c parts of the input. Furthermore, the use of the graph extracted
from the dependency parsing over the input sentence also adds valuable prior
information to the network about the possible semantic relationships that can
be found in the sentence.</p>
        <p>In order to illustrate this behaviour, Table 4 shows the results obtained by
our system on the development set provided by organizers in subtask B
(relation extraction), as we added dependency parsing information and the attention
layer to the original base con guration. This base con guration only considered
embeddings, POS-tagging and letter case information as input, as well as the
output of subtask A (detected keyphrases and their classes).</p>
        <p>System</p>
        <p>F1</p>
        <p>P</p>
        <p>R</p>
        <p>As we can observe, F1-measure globally increases as we add both dependency
parsing information and the attention layer to the system. In particular, when the
dependency graph is also considered as input, both precision and recall increase,
which indicates that this additional information allows the system to nd more
meaningful relationships. The use of an attention layer, despite causing a small
decrease in recall, achieves a higher precision increase (less but more accurate
relationships are found) which results in a better F1 measure.
6</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusions and Future Work</title>
      <p>In this paper we have described our system DeepNER+ARE, and its performance
in the eHealth Knowledge Discovery challenge of the IberLEF 2019 competition.
The proposed system is divided into two phases which make use of deep neural
networks for addressing the two subtasks of the challenge: detection and classi
cation of keyphrases in biomedical texts, and relation extraction between those
keyphrases. The main contribution of our method is the combined use of
semantic parsing information and attention-based techniques in the network that
performs relation extraction. This contribution is re ected particularly in the
results that the system obtains in the relation extraction subtask, in which it is
ranked in second position out of 10 participant teams.</p>
      <p>We plan to address improvements in the keyphrase extraction as a future
lines of work, especially studying how more valuable syntactic and semantic
information can be added to the network that performs keyphrase identi cation,
and also how systematic post-processing rules can be automatically extracted
from the obtained results. The detection of multi-span and nested keyphrases
is also an interesting research line which may lead to increasing the number of
keyphrases correctly detected. Regarding relation extraction, more work should
be done on modeling the input of the network, in order to feed it with additional
and more complex information obtained from the dependency graph, and also
on the improvement of the attention mechanisms.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>Funding: This work has been partially supported by the Spanish Ministry of
Science and Innovation within the projects PROSA-MED
(TIN2016-77820-C32-R) and EXTRAE (IMIENS 2017).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bodenreider</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>The uni ed medical language system (umls): integrating biomedical terminology</article-title>
          .
          <source>Nucleic acids research 32(suppl 1)</source>
          ,
          <source>D267{D270</source>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Cardellino</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <source>Spanish Billion Words Corpus and Embeddings (March</source>
          <year>2016</year>
          ), https://crscardellino.github.io/SBWCE/
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Fabregat</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Araujo</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martinez-Romo</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Deep neural models for extracting entities and relationships in the new rdd corpus relating disabilities and rare diseases</article-title>
          .
          <source>Computer methods and programs in biomedicine 164</source>
          ,
          <volume>121</volume>
          {
          <fpage>129</fpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Gligic</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kormilitzin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goldberg</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nevado-Holgado</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Named entity recognition in electronic health records using transfer learning bootstrapped neural networks</article-title>
          .
          <source>arXiv preprint arXiv:1901</source>
          .
          <volume>01592</volume>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Hahnloser</surname>
            ,
            <given-names>R.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sarpeshkar</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mahowald</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Douglas</surname>
            ,
            <given-names>R.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Seung</surname>
            ,
            <given-names>H.S.:</given-names>
          </string-name>
          <article-title>Digital selection and analogue ampli cation coexist in a cortex-inspired silicon circuit</article-title>
          .
          <source>Nature</source>
          <volume>405</volume>
          (
          <issue>6789</issue>
          ),
          <volume>947</volume>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Hochreiter</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schmidhuber</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>Long short-term memory</article-title>
          .
          <source>Neural computation 9(8)</source>
          ,
          <volume>1735</volume>
          {
          <fpage>1780</fpage>
          (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fu</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ji</surname>
            ,
            <given-names>D.:</given-names>
          </string-name>
          <article-title>A neural joint model for entity and relation extraction from biomedical text</article-title>
          .
          <source>BMC bioinformatics 18(1)</source>
          ,
          <volume>198</volume>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Lopez-Ubeda</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <article-title>D az-</article-title>
          <string-name>
            <surname>Galiano</surname>
            ,
            <given-names>M.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mart</surname>
            n-Valdivia,
            <given-names>M.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Urena-Lopez</surname>
            ,
            <given-names>L.A.</given-names>
          </string-name>
          :
          <article-title>Sinai en tass 2018 task 3. clasi cando acciones y conceptos con umls en medline</article-title>
          .
          <source>In: Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          )
          <article-title>co-located with 34nd SEPLN Conference (SEPLN</article-title>
          <year>2018</year>
          ): Sevilla, Spain,
          <year>September 18th</year>
          ,
          <year>2018</year>
          . vol.
          <volume>2172</volume>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Lv</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guan</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>Clinical relation extraction with deep learning</article-title>
          .
          <source>IJHIT 9</source>
          (
          <issue>7</issue>
          ),
          <volume>237</volume>
          {
          <fpage>248</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Manning</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Surdeanu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bauer</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Finkel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bethard</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McClosky</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>The stanford corenlp natural language processing toolkit</article-title>
          . In:
          <article-title>Proceedings of 52nd annual meeting of the association for computational linguistics: system demonstrations</article-title>
          . pp.
          <volume>55</volume>
          {
          <issue>60</issue>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Mart nez Camara</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Almeida Cruz</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          , D az Galiano,
          <string-name>
            <given-names>M.C.</given-names>
            ,
            <surname>Estevez-Velarde</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.</surname>
          </string-name>
          , Garc a Cumbreras,
          <string-name>
            <surname>M.A.</surname>
          </string-name>
          , Garc a Vega,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Gutierrez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            ,
            <surname>Montejo</surname>
          </string-name>
          <string-name>
            <surname>Raez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Montoyo</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          , Mun~oz, R., et al.:
          <article-title>Overview of tass 2018: Opinions, health and emotions (</article-title>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Martinez-Romo</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Araujo</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Duque</surname>
            <given-names>Fernandez</given-names>
          </string-name>
          ,
          <string-name>
            <surname>A.:</surname>
          </string-name>
          <article-title>S em g raph: Extracting keyphrases following a novel semantic graph-based approach</article-title>
          .
          <source>Journal of the Association for Information Science and Technology</source>
          <volume>67</volume>
          (
          <issue>1</issue>
          ),
          <volume>71</volume>
          {
          <fpage>82</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>Medina</given-names>
            <surname>Herrera</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Turmo</surname>
          </string-name>
          <string-name>
            <surname>Borras</surname>
          </string-name>
          , J.:
          <article-title>Joint classi cation of key-phrases and relations in electronic health documents</article-title>
          .
          <source>In: Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          )
          <article-title>co-located with 34nd SEPLN Conference (SEPLN</article-title>
          <year>2018</year>
          ): Sevilla, Spain,
          <year>September 18th</year>
          ,
          <year>2018</year>
          . pp.
          <volume>83</volume>
          {
          <fpage>88</fpage>
          .
          <string-name>
            <surname>CEUR-WS. org</surname>
          </string-name>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Mintz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bills</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Snow</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jurafsky</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Distant supervision for relation extraction without labeled data</article-title>
          .
          <source>In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2-</source>
          Volume 2. pp.
          <volume>1003</volume>
          {
          <fpage>1011</fpage>
          .
          <article-title>Association for Computational Linguistics (</article-title>
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Piad-Mor s</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gutierrez</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Consuegra-Ayala</surname>
            ,
            <given-names>J.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Estevez-Velarde</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Almeida-Cruz</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          , Mun~oz, R.,
          <string-name>
            <surname>Montoyo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Overview of the ehealth knowledge discovery challenge at iberlef</article-title>
          <year>2019</year>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Ratinov</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roth</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Design challenges and misconceptions in named entity recognition</article-title>
          .
          <source>In: Proceedings of the Thirteenth Conference on Computational Natural Language Learning</source>
          . pp.
          <volume>147</volume>
          {
          <fpage>155</fpage>
          . CoNLL '09,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computational Linguistics, Stroudsburg, PA, USA (
          <year>2009</year>
          ), http://dl.acm.org/citation.cfm?id=
          <volume>1596374</volume>
          .
          <fpage>1596399</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Suarez-Paniagua</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Segura-Bedmar</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          , Mart nez, P.: Labda at tass
          <article-title>-2018 task 3: Convolutional neural networks for relation classi cation in spanish ehealth documents</article-title>
          .
          <source>In: Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          )
          <article-title>co-located with 34nd SEPLN Conference (SEPLN</article-title>
          <year>2018</year>
          ): Sevilla, Spain,
          <year>September 18th</year>
          ,
          <year>2018</year>
          . vol.
          <volume>1510</volume>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <given-names>Vivaldi</given-names>
            <surname>Palatresi</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          , Rodr guez Hontoria, H.:
          <article-title>Tass2018: Medical knowledge discovery by combining terminology extraction techniques with machine learning classi - cation</article-title>
          .
          <source>In: Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          )
          <article-title>co-located with 34nd SEPLN Conference (SEPLN</article-title>
          <year>2018</year>
          ): Sevilla, Spain,
          <year>September 18th</year>
          ,
          <year>2018</year>
          . pp.
          <volume>89</volume>
          {
          <fpage>95</fpage>
          .
          <string-name>
            <surname>CEUR-WS. org</surname>
          </string-name>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Witten</surname>
            ,
            <given-names>I.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paynter</surname>
            ,
            <given-names>G.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Frank</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gutwin</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nevill-Manning</surname>
            ,
            <given-names>C.G.</given-names>
          </string-name>
          :
          <article-title>Kea: Practical automated keyphrase extraction</article-title>
          . In:
          <article-title>Design and Usability of Digital Libraries: Case Studies in the Asia Paci c</article-title>
          , pp.
          <volume>129</volume>
          {
          <fpage>152</fpage>
          .
          <string-name>
            <given-names>IGI</given-names>
            <surname>Global</surname>
          </string-name>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Zavala</surname>
            ,
            <given-names>R.M.R.</given-names>
          </string-name>
          , Mart nez, P.,
          <string-name>
            <surname>Segura-Bedmar</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>A hybrid bi-lstm-crf model for knowledge recognition from ehealth documents</article-title>
          .
          <source>In: Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          )
          <article-title>co-located with 34nd SEPLN Conference (SEPLN</article-title>
          <year>2018</year>
          ): Sevilla, Spain,
          <year>September 18th</year>
          ,
          <year>2018</year>
          . vol.
          <volume>2172</volume>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shi</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tian</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Qi</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hao</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Attention-based bidirectional long short-term memory networks for relation classi cation</article-title>
          .
          <source>In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume</source>
          <volume>2</volume>
          :
          <string-name>
            <given-names>Short</given-names>
            <surname>Papers</surname>
          </string-name>
          <article-title>)</article-title>
          .
          <source>vol. 2</source>
          , pp.
          <volume>207</volume>
          {
          <issue>212</issue>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>