<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>HapLap at eHealth-KD Challenge 2020</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sergio Santana</string-name>
          <email>ssantana005@ikasle.ehu.eus</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alicia Pérez</string-name>
          <email>alicia.perez@ehu.eus</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Arantza Casillas</string-name>
          <email>arantza.casillas@ehu.eus</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>HiTZ Center - Ixa, University of the Basque Country UPV/EHU</institution>
          ,
          <addr-line>Manuel Lardizabal 1, 20080 Donostia</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <fpage>136</fpage>
      <lpage>140</lpage>
      <abstract>
        <p>We present the work carried out by the HapLap group in the in the subtask B of the eHealth-KD 2020 competition. Relation extraction was addressed with a pipeline system that makes use of a Joint ABLSTM neuronal network together with a pre-process and a post-process phase. We obtained a result of 0.316 in Scenario 3. We present the work carried out by the HapLap group in the eHealth-KD 2020 task [1]. In this third edition the purpose of the task is to automatically extract knowledge, represented by means of thirteen semantic relations, from Spanish electronic health documents. We have taken part in the optional subtask B: the input is a plain text with entity annotations in a BRAT file and the output is the previous BRAT file with both the entities and relations. To address this, we have implemented a pipeline system that makes use of a Joint AB-LSTM neuronal network together with a pre-process and a post-process phase.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Entity recognition</kwd>
        <kwd>Relation extraction</kwd>
        <kwd>Joint AB-LSTM neuronal network</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        In the last years various competitions related to relation extraction have been emerging such as:
Semeval 2018 task 7 [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] to extract relations from scientific texts; eHealthKD 2018 [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], eHealthKD
2019 [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] or BioNLP [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] to extract and classify clinical relations from clinical texts. So the relation
extraction problem is arousing interest in diferent areas and also in the clinical documentation
area. Since the resurgence of neural networks, diferent approaches have been implemented for
extracting clinical relations. DET-BLSTM system [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] makes use of a Bi-LSTM network. In [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]
the authors presented a combination of two diferent networks gated recurrent unit (GRU) and
convolutional neural network (CNN) to detect clinical relations. In [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] a convolutiona neural
network is also used to classify relations. In [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] an Joint AB-LSTM neuronal network is used to
extract adverse drug reaction relations. In this paper we present a Joint AB-LSTM neuronal, a
modification of the work presented in [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] network for the extraction of clinical relations in the
context of eHealthKD 2020 competition.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Materials and Methods</title>
      <p>For this work we have divided the system into three phases: First the pre-process, where we
adapt the data format to use with the Joint AB-LSTM. After that we have the training phase,
where we train and evaluate the neural network and we get the prediction. And after getting
the predictions we have the post-process, where we convert those predictions into the data
format that is used in the competition.</p>
      <sec id="sec-3-1">
        <title>3.1. Pre-process</title>
        <p>In the preprocess we do the following operations:
• Convert the input from the Brat standof format to the format used in the eHeathKD 2019
challenge.
• Convert the data in the eHealthKD 2019 format into the format used by the Joint AB-LSTM.
• Create the NO_RELATION relations.</p>
        <p>In the first part of the system we have pre-processed the input relations. We have converted
the Brat Standof input relation-format (also referred to as ann) to the format used in the
previous eHealthKD 2019 competition by means of the ann2txt scripts ( https://github.com/
knowledge-learning/ehealthkd-2019/blob/master/scripts/ann2txt.py) provided there. Next, we
needed to adapt it to what the Joint AB-LSTM requires. Three programs have been implemented
for the pre-processing and their code has been posted on GitHub (https://github.com/Porobu/
HAPLAP-MAL). These three programs load the instances that are in the eHealthKD 2019 data
format and they join them into a single file.</p>
        <p>In an attempt to enable the neural network learn to discriminate between positive and negative
relations (absence of relation), both types of instances should be provided in the inference stage.
To this end, in the pre-processing an auxiliary relation class, NO_RELATION, was also created.
A critical point, hence, is how to choose instances that contain pairs of entities that could be
related and, thus, are candidate relations and label them as negative instances. Both the selection
and the proportions might be crucial. We have used a simple way of choosing them, that only
creates negative (NO_RELATION) relations between entity pairs that have at least one positive
relation instance in the data set. To further reduce the negative relations, we have only created
these between entity pairs in the same sentence.</p>
        <p>At this stage we have a set of data with the candidates marked as either related or not-related.
At this point a multi-class approach enables us to predict whether a candidate pair is related
with some of the relation-classes available (including NO_RELATION). This was, indeed, our
approach-1: a pair of entities that could be related (are a relation-candidate) are directly
classified by means of the Joint AB-LSTM.</p>
        <p>Needless to say, in the aforementioned sample negative instances substantially exceed the
positive ones leading to skewed class distribution. In table 1 we can see the number of positive
and negative relations in our training and development data sets. We have to remember that in
our multi class classification approach (approach 1) the positive relation number contains all the
thirteen classes, further skewing the data. Inference tends to be biased towards majority class.
To cope with this we proposed to tackle the classification in two stages (our approach-2):
• In the first phase we have created the binary data set, and all the positive relations (target,
causes...) have been grouped in the RELATION class. In this phase we filter all the negative
relations, to reduce the imbalance.
• In the second phase we have now only the data set with the positive relations (arg, target,
subject...), and we train the system to predict the relation.</p>
        <p>Both approaches (and both phases in the second approach) were implemented by means of
the Joint AB-LSTM approach. Further details are given in the following section.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Joint AB-LSTM network</title>
        <p>After pre-processing the instances we load them into the Joint AB-LSTM neural network. The
Joint AB-LSTM neural network has been inplemented by using Tensorflow. The network also
does its own pre-processing. First all tokens are lower-cased.</p>
        <p>The network employed word-embeddings as the main feature. For this work we have used
pretrained embeddings from the clinical domain. The embeddings have been trained in corpora that
consists of EHRs (electronic health records) that are not publicly available due to confidentiality
issues. Other choices might have resulted more appropriate than ours since the amount and
type of data employed has a big impact on the resulting embeddings. Apart from the
wordembeddings, the network employs another powerful feature: the distance-embeddings. The
distance is simply computed as the number of tokens between each annotated word in the
sentence and the target word entity.</p>
        <p>Having the relations completely pre-processed, the neural network is trained. This network
combines two widely used neural networks in NLP: a Bi-LSTM with max pooling and an
attentive Bi-LSTM. The Joint AB-LSTM is fed with the pre-processed sentences, their entities
and relations between those, and the previously created distance embeddings.</p>
        <p>We have optimised two hyper-parameters of the neural network, the dropout and learning
rate to get the final model. We have trained the model with a mixture of the eHealthKD 2019
train+dev and the eHealthKD 2020 datasets, and we have used the eHealthKD 2020 dev dataset
as validation. Note that this optimisation has been done over the so called multiclass dataset
(approach 1), not over the binary dataset (approach 2). After doing the optimisation, we set
0.001 as the learning rate, and we used no dropout.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Postprocess</title>
        <p>After getting the predictions from the neural network, we postprocess them to get the output
relations in the Brat Standof format, respecting the IDs if the gold entities.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results</title>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions</title>
      <p>As described in section 3.1, we provided two diferent approaches. The results achieved with
each of them are given in table 2.</p>
      <p>Approach 1 outperforms Approach 2 in terms of precision but with the recall occurs the
opposite. Nevertheless, for both approaches the F1-measure has the same value.
Relation extraction was addressed with a neural approach, Joint AB-LSTM network. We applied
two simple pre-processing approaches to get both positive and negative instances. This stage
might result naive for the way in which the sampling was carried out and the proportions
selected. We explored two pre-processing approaches: a straight one, approach 1, which just
copes with multi-class problem; a filtered one (approach 2) that tried to get rid of negative
candidates prior to the multi-class stage. None of them surpassed the other significantly. For
future work, we should explore the embeddings provided to the network. Embeddings are the
main source of knowledge in this stage with limited training sets and was proven significantly
influential in related works.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This work was partially supported by the Spanish Ministry of Science and Technology
PADMED (PID2019-106942RB-C31) and by the Basque Government (IXA IT-1343-19 and a Grant for
the student Sergio Santana published in the 12/03/2020 BOPV).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Piad-Morfis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Gutiérrez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Cañizares-Diaz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Estevez-Velarde</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Almeida-Cruz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Muñoz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Montoyo</surname>
          </string-name>
          ,
          <article-title>Overview of the eHealth Knowledge Discovery Challenge at IberLEF 2020, in: Proceedings of the Iberian Languages Evaluation Forum co-located with 36th Conference of the Spanish Society for Natural Language Processing</article-title>
          ,
          <source>IberLEF@SEPLN</source>
          <year>2020</year>
          , Spain, September,
          <year>2020</year>
          .,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>K.</given-names>
            <surname>Gábor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Buscaldi</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.-K. Schumann</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>QasemiZadeh</surname>
          </string-name>
          , H. Zargayouna, T. Charnois, Semeval
          <article-title>-2018 task 7: Semantic relation extraction and classification in scientific papers</article-title>
          ,
          <source>in: Proceedings of The 12th International Workshop on Semantic Evaluation</source>
          ,
          <year>2018</year>
          , pp.
          <fpage>679</fpage>
          -
          <lpage>688</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>E.</given-names>
            <surname>Martínez Cámara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Almeida Cruz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Díaz Galiano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Estévez-Velarde</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. Á. García</given-names>
            <surname>Cumbreras</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. García</given-names>
            <surname>Vega</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Gutiérrez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Montejo</given-names>
            <surname>Ráez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Montoyo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Muñoz</surname>
          </string-name>
          , et al.,
          <source>Overview of tass</source>
          <year>2018</year>
          :
          <article-title>Opinions, health and emotions (</article-title>
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Piad-Morfis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Gutiérrez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Consuegra-Ayala</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Estevez-Velarde</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Almeida-Cruz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Munoz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Montoyo</surname>
          </string-name>
          ,
          <article-title>Overview of the ehealth knowledge discovery challenge at iberlef 2019, in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2019)</article-title>
          .
          <source>CEUR Workshop Proceedings, CEUR-WS. org</source>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D.</given-names>
            <surname>Demner-Fushman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. B.</given-names>
            <surname>Cohen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ananiadou</surname>
          </string-name>
          , J. Tsujii (Eds.),
          <source>Proceedings of the 18th BioNLP Workshop</source>
          and Shared Task, Association for Computational Linguistics, Florence, Italy,
          <year>2019</year>
          . URL: https://www.aclweb.org/anthology/W19-5000.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>L.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <article-title>Biomedical event extraction via long short term memory networks along dynamic extended tree</article-title>
          ,
          <source>in: Bioinformatics and Biomedicine (BIBM)</source>
          ,
          <source>2016 IEEE International Conference on, IEEE</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>739</fpage>
          -
          <lpage>742</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>B.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Guan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Dai</surname>
          </string-name>
          ,
          <article-title>Convolutional gated recurrent units for medical relation classification</article-title>
          ,
          <source>in: 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)</source>
          , IEEE,
          <year>2018</year>
          , pp.
          <fpage>646</fpage>
          -
          <lpage>650</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>S.</given-names>
            <surname>Medina Herrera</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. Turmo</given-names>
            <surname>Borras</surname>
          </string-name>
          ,
          <article-title>Joint classification of key-phrases and relations in electronic health documents</article-title>
          ,
          <source>in: Proceedings of TASS 2018: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2018</year>
          )
          <article-title>co-located with 34nd SEPLN Conference (SEPLN</article-title>
          <year>2018</year>
          ): Sevilla, Spain,
          <year>September 18th</year>
          ,
          <year>2018</year>
          , CEUR-WS. org,
          <year>2018</year>
          , pp.
          <fpage>83</fpage>
          -
          <lpage>88</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>S.</given-names>
            <surname>Santiso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Perez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Casillas</surname>
          </string-name>
          ,
          <article-title>Exploring joint ab-lstm with embedded lemmas for adverse drug reaction discovery</article-title>
          ,
          <source>IEEE journal of biomedical and health informatics</source>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>S.</given-names>
            <surname>Santiso</surname>
          </string-name>
          <string-name>
            <surname>González</surname>
          </string-name>
          ,
          <article-title>Adverse drug reaction extraction on electronic health records written in spanish (</article-title>
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>