<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>TextAI: Enhancing TextAE with Intelligent Annotation Support</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Maximilian Grundke</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Johannes Jasper</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mariya Perchyk</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jan Philipp Sachse</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ralf Krestel</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mariana Neves</string-name>
          <email>mariana.neves@hpi.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Hasso Plattner Institute Potsdam</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>We present TextAI, an extension to the annotation tool TextAE, that adds support for named-entity recognition and automated relation extraction based on machine learning techniques. Our learning approach is domain-independent and increases the quality of the detected relations with each added training document. We further aim at accelerating and facilitating the manual curation process for natural language documents by supporting simultaneous annotation by multiple users.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Faced with rapidly growing numbers of publicly
available natural language documents, it is
becoming increasingly difficult to extract the underlying
knowledge in a structured manner. Thus,
annotation of documents for the purpose of extracting
this information is an important task in many
research domains today. Creating these annotations
is mostly done manually, even though it is a very
time consuming work and requires deep
understanding and domain knowledge
        <xref ref-type="bibr" rid="ref6">(Hirschman et al.,
2012)</xref>
        .
      </p>
      <p>
        TextAI1 is a tool developed to support
annotators as a first step on the way to minimize the
effort of extracting information from written texts.
There exists already a wide range of annotation
editors
        <xref ref-type="bibr" rid="ref8">(Neves and Leser, 2012)</xref>
        . Thus, we chose
not to implement an additional standalone editor,
but to build our system around the TextAE2 tool,
which is an existing open source editor. We
extended TextAE by providing additional features
to manage documents of multiple users and
predicting entities and relations, which can then be
1https://github.com/LearningToNote
2http://textae.pubannotation.org/
adopted by users into their set of annotations. The
system learns from its users and improves
prediction of relations over time based on previously
annotated documents. We use the biomedical
domain as use case for our system and as basis for
evaluation. The developed approach, however, can
be used across different domains that require
manual annotation. With its automated annotation
suggestions and its multi-user support, TextAI
introduces capabilities that are conceptually different
from other annotation tools.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Automatic Annotation Suggestions. Various
studies on prediction of annotations confirm that
automated recommendations increase the speed
and improve the quality of annotations. Lingren
et al.
        <xref ref-type="bibr" rid="ref7">(Lingren et al., 2014)</xref>
        determined that
automatic annotation suggestions for named entities
result in 13.83% to 21.5% time saving without
reducing the inter-annotator agreement (IAA) or
qualitative annotator performance. Additionally,
Fort and Sagot
        <xref ref-type="bibr" rid="ref3">(Fort and Sagot, 2010)</xref>
        and South
et al.
        <xref ref-type="bibr" rid="ref9">(South et al., 2014)</xref>
        showed a significant
gain in quality. Hernandez et al.
        <xref ref-type="bibr" rid="ref4">(Hernandez et
al., 2014)</xref>
        reported an improvement of non-expert
annotator performance using automated
namedentity recognition. The WebAnno annotation tool
        <xref ref-type="bibr" rid="ref13">(Yimam et al., 2014)</xref>
        includes automatic
suggestions for three generic structures: spans,
relations, and chains. It integrates an external machine
learning tool, which requires users to configure the
features themselves. While this allows for domain
specific optimization, it excludes non-expert users
from using this functionality. By allowing
definition of custom annotation labels, WebAnno
increases its flexibility, however eliminates labels as
a suitable machine learning feature.
Frontend
      </p>
      <p>Middleware</p>
      <p>Database Server
Annotation Editor</p>
      <p>
        Docs
text
mining
machine
learning
Annotation Editors. TextAE is a browser-based
annotation tool which comes without a server
backend, but supports importing documents and
annotations based on a simple JSON format.
Being implemented using HTML and JavaScript,
TextAEs functionality can be easily expanded.
Furthermore, it supports both a wide range of
keyboard shortcuts and usability improvements that
aim to increase performance of its users. The brat
rapid annotation tool
        <xref ref-type="bibr" rid="ref10">(Stenetorp et al., 2012)</xref>
        features a client-server architecture, supports
annotation of documents by multiple users
simultaneously and allows comparison of different sets of
annotations for one document. We chose to adapt
several of these features and enhance TextAE to
support them, while leaving it a simple and easy
to use editor, as feature-rich tools, such as brat,
quickly become complicated to use as they require
complex configuration.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>System Architecture</title>
      <p>Annotating documents with entities and relations
traditionally involves manually highlighting
entities in the document and marking relations
between them. Our system employs machine
learning techniques to automate these steps by using
the workflow depicted in Figure 1. The user
interacts with a central front-end component, which is
used to manage annotation tasks, documents and
users, as well as importing documents from
multiple sources. Users also interact with the
annotation editor for editing and reviewing documents
and their annotations. In the back end, we use
an in-memory database (IMDB), which provides
document storage, text analysis features and
integration with machine learning algorithms. The
middleware layer mediates between the different
interfaces and the IMDB and deals with all logic
concerning user and document management.
3.1</p>
      <sec id="sec-3-1">
        <title>Front end</title>
        <p>
          The user interacts with a central front-end
component, which allows importing documents from
multiple sources, such as the local file system, e.g.,
files in the BioC file format
          <xref ref-type="bibr" rid="ref2">(Comeau et al., 2013)</xref>
          or by querying PubMed3, based on plain text.
After importing documents into TextAI, these can be
loaded from the database (cf. Back end).
Besides document storage, the IMDB further offers
basic text mining and analysis features that we use
and expand. Domain independence is achieved
through the introduction of another level of
hierarchy called ”tasks”, which can be used to
organize documents under one semantic group. This
has the technical implication that each task has its
own machine learning models, hence annotations
within one task have no impact on predictions in
another one.
3.2
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Annotation Editor</title>
        <p>Users can review and edit documents and their
annotations through the annotation editor. The later
also allows to trigger NER, relationship
prediction, and other methods aimed at improving
annotation quality and speed. We extended TextAE,
which is a powerful standalone annotation tool
based on JSON-formatted input, that allows
loading text and creating annotations and relations. It
allows adding multiple annotations to each
position in the document and displaying different
layers of annotations through color coding. Instead
of displaying different kinds of annotations, such
as POS tags and domain-specific information, e.g.,
medical terms, we use different colors for
annotations made by different users on the same
document. Because of the HTML span-tag-based
implementation for annotation rendering, TextAE
displays overlapping annotations in a stacked way
instead of inline.</p>
        <p>
          Users can create custom labels for their
annotations in TextAI. However, we ask the users to map
their custom labels to one of the UMLS semantic
types4, in order to improve the ability of our
system to learn based on annotations made by users
and to normalize the annotations made by
different users and tasks. For instance, when annotating
the DDI corpus
          <xref ref-type="bibr" rid="ref5">(Herrero-Zazo et al., 2013)</xref>
          , users
could create four labels, one for each drug type
        </p>
        <sec id="sec-3-2-1">
          <title>3http://www.ncbi.nlm.nih.gov/pubmed</title>
          <p>4https://metamap.nlm.nih.gov/
SemanticTypesAndGroups.shtml
included in this corpus, but they are asked map
every type to a UMLS semantic type, for instance,
”T2000 - Clinical Drug” or ”T121 -
Pharmacologic Substance”. Currently, we only allow
mapping a label to one single UMLS semantic type.</p>
          <p>Annotating large corpora is a task that can
involve multiple experts, which requires user
management to be available in a collaborative
annotation system. A problem that occurs when
multiple people do the same work is the creation of
annotations with similar, but not identical
meaning. Therefore, our system nudges users towards
better annotations by asking them to map their
labels to the UMLS semantic types. Further, during
annotation, users can choose to hide their
annotations from both other users and the machine
learning part of our system, while still being able to use
all of its features.</p>
          <p>Users can correct the predictions made by the
system by selectively adding them to their own
annotation sets and editing them if necessary. At any
point in the process, users can also manually add,
remove, and alter entities and relations to their
own set.
3.3</p>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>Back end</title>
        <p>We use Rserve5, which provides an interface to
the statistical computing language R with its
extensive text mining and machine learning
capabilities. Further, our system relies on the text
analysis functionalities of an IMDB for entity and
relation predictions. Besides the documents,
taskspecific metadata, such as trained models, domain
specific stopword lists and NER dictionaries are
stored in the database, allowing fast access to
im</p>
        <sec id="sec-3-3-1">
          <title>5https://rforge.net/Rserve/</title>
          <p>portant domain-dependent information.</p>
        </sec>
      </sec>
      <sec id="sec-3-4">
        <title>3.4 Information Extraction</title>
        <p>The automated suggestion of annotations involves
two major steps, NER and relation prediction. For
NER we apply a simple dictionary-based approach
using the UMLS dictionary and part-of-speech
(POS) tagging of the documents.</p>
        <p>
          We train two support vector machines6 to
detect relations between two entities: one to
determine whether a relation exists between two
entities, and in case of positive, one to classify the
type of relation. The relations are represented by
n-dimensional feature vectors. We get
descriptive characteristics of the relation of two entities
by combining lexical and syntactical features In
the pre-processing step, the document text is split
in sentences and every word is tokenized,
lemmatized and POS-tagged. Similar to the approach
described in
          <xref ref-type="bibr" rid="ref1">(Bui et al., 2014)</xref>
          , we divided the
sentence into three groups by adding the prefix ”b”
to each token appearing before the first entity, the
prefix ”i” to the words in between the two entities
and the prefix ”a” to the words after the second
entity. While we remove stop words and ignore
entities in the context, we do not filter out punctuation,
i.e. comma, colon and semicolon, since their
appearance between two entities can be a strong
indicator that there is no relation between these
entities. We only consider three tokens on either side
of each entity in order to emphasize the near
context around the annotations. The annotated types
of both entities are considered as a feature as well.
We also include the distance, i.e. the number of
6e1071 (https://cran.r-project.org/
package=e1071), an R interface to LIBSVM with
its default RBF kernel
words and the number of characters between two
entities, as a feature. We avoid using trigger words
as proposed by
          <xref ref-type="bibr" rid="ref1">(Bui et al., 2014)</xref>
          , since this would
contradict with our domain-independence
objective.
        </p>
        <p>Every newly annotated document is used to
retrain the models for relationship extraction, thus
learning over time to improve its performance.
Further, using the UMLS predefined set of types
also improves the learning capabilities of our
system, as entities can now be chosen from a finite
set and the entity type becomes a stronger feature
for relation prediction. Since training these
models is only possible and reasonable given a certain
amount of information, TextAI needs a set of
preannotated documents, either by importing or
manually annotating a few documents of the corpus in
advance.
3.5</p>
      </sec>
      <sec id="sec-3-5">
        <title>Middleware</title>
        <p>
          As both database and front-end components have
highly independent schemata and interfaces,
transforming data between them is a key role of
the middleware. The middleware implements a
RESTful interface representing users, tasks,
documents, user-documents and their content to pass
on information about data objects to the
browserbased front end. Additionally, user management
and access rights management are handled here.
An integrated user model ensures that every
action taken is properly authenticated and
authorized. Our annotation editor can load and export
data provided in JSON format, thus the
middleware is also responsible for transforming
information between the different representations in our
system. In addition, the middleware provides an
interface to import and export documents.
Internally, we use the BioC format
          <xref ref-type="bibr" rid="ref2">(Comeau et al.,
2013)</xref>
          , as it is general enough to be used across
different annotation domains.
4
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Experiments</title>
      <p>
        We focused on the medical domain and used the
DDI corpus
        <xref ref-type="bibr" rid="ref5">(Herrero-Zazo et al., 2013)</xref>
        to
evaluate the performance of our NER and relation
extraction procedures.
      </p>
      <p>NER. In our evaluation, we consider not only
exact matches of the gold-standard and the
predicted entities, but also overlapping entity label
markers are accepted. Our average F-1 score was
77.3% with a recall of 85.62% and a precision of</p>
      <p>
        Relation Extraction. In total, the DDI corpus
names five different relation types: non-relation,
mechanism, effect, advise and a general
interaction. We achieved a precision of 72.95% on the
test set of the DDI corpus, averaged over 10
iterations. Other researchers who performed relation
prediction on the DDI corpus achieve comparable
performance results of 60.9% and 62.99% macro
averaged F-score
        <xref ref-type="bibr" rid="ref11 ref12">(Thomas et al., 2013a)</xref>
        .
Training Set Size. When users annotate new
relations, the systems performance increases since
the SVM models are retrained with new user
input. We measure the impact of this input on the
efficiency of our classifier with a setup as follows.
Starting with only one annotated document, we
train both SVM classifiers on the given data and
evaluate on 20 randomly selected documents. We
then select 10 more documents for training and
again test on 20 test documents. This cycle repeats
until 500 documents are included in the training
set. Figure 3 illustrates the F-score for each of the
DDI interaction types averaged over 10 runs with
varying training documents.
      </p>
      <p>
        In comparison to our performance for other
relations, the ”interaction” type obtained lower
results and these have oscillated over our
experiments. This relation type is under-represented in
the DDI corpus as it constitute only 6% of the
annotated DDI relations. As discussed in previous
work
        <xref ref-type="bibr" rid="ref11 ref12">(Thomas et al., 2013b)</xref>
        , this resulted in lower
performance of the systems for this relation type
on the test set. We believe that the oscillation on
the results occurs for those test sets which
contained more or less instances of the ”interaction”
relation type that could not be correctly detected
by our system.
5
      </p>
    </sec>
    <sec id="sec-5">
      <title>Conclusions and Future Work</title>
      <p>We have presented a prototype that extends the
annotation editor TextAE with multi-user
functionality and annotation prediction. This was achieved
by creating a concept of per-user annotation sets
and tasks, as well as a NER framework and
relation prediction algorithm. Our system provides
users with functionality for annotations prediction
without interfering with their day-to-day
annotation work.</p>
      <p>As future work, we plan on conducting a user
study on annotation speed and quality. Further,
we also want to explore NER algorithm based on
machine learning and on the labels which are
normalized to the UMLS semantic types and not only
on the current dictionary-based approach. Finally,
semi-supervised learning approaches, such as
active learning to leverage user feedback, could
improve NER and relation extraction even further.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Quoc-Chinh</surname>
            <given-names>Bui</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peter M.A. Sloot</surname>
          </string-name>
          ,
          <string-name>
            <surname>Erik M. van Mulligen</surname>
          </string-name>
          , and
          <string-name>
            <surname>Jan</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Kors</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>A novel featurebased approach to extract drugdrug interactions from biomedical text</article-title>
          .
          <source>Bioinformatics.</source>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Donald C.</given-names>
            <surname>Comeau</surname>
          </string-name>
          , Rezarta Islamaj Doan, Paolo Ciccarese, Kevin Bretonnel Cohen, Martin Krallinger, Florian Leitner, Zhiyong Lu, Yifan Peng, Fabio Rinaldi, Manabu Torii, Alfonso Valencia, Karin Verspoor, Thomas C. Wiegers,
          <string-name>
            <surname>Cathy H. Wu</surname>
            , and
            <given-names>W. John</given-names>
          </string-name>
          <string-name>
            <surname>Wilbur</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Bioc: a minimalist approach to interoperability for biomedical text processing</article-title>
          .
          <source>Database</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <source>Kare¨n Fort and Benoˆıt Sagot</source>
          .
          <year>2010</year>
          .
          <article-title>Influence of pre-annotation on pos-tagged corpus development</article-title>
          .
          <source>In Proceedings of the Fourth Linguistic Annotation Workshop</source>
          , LAW IV '
          <volume>10</volume>
          , pages
          <fpage>56</fpage>
          -
          <lpage>63</lpage>
          , Stroudsburg, PA, USA. Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Andres M Hernandez</surname>
            , Harry S Hochheiser, John R Horn, Rebecca S Crowley, and
            <given-names>Richard D</given-names>
          </string-name>
          <string-name>
            <surname>Boyce</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Testing pre-annotation to help non-experts identify drug-drug interactions mentioned in drug product labeling</article-title>
          .
          <source>In Second AAAI Conference on Human Computation and Crowdsourcing.</source>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <article-title>Mar´ıa Herrero-Zazo, Isabel Segura-Bedmar, Paloma Mart´ınez, and</article-title>
          <string-name>
            <given-names>Thierry</given-names>
            <surname>Declerck</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>The ddi corpus: An annotated corpus with pharmacological substances and drug-drug interactions</article-title>
          .
          <source>Journal of Biomedical Informatics (JBI)</source>
          ,
          <volume>45</volume>
          (
          <issue>5</issue>
          ):
          <fpage>914</fpage>
          -
          <lpage>920</lpage>
          ,
          <fpage>10</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Lynette</given-names>
            <surname>Hirschman</surname>
          </string-name>
          ,
          <string-name>
            <surname>Gully A. P. C Burns</surname>
          </string-name>
          ,
          <string-name>
            <surname>Martin</surname>
            <given-names>Krallinger</given-names>
          </string-name>
          , Cecilia Arighi,
          <string-name>
            <given-names>K.</given-names>
            <surname>Bretonnel Cohen</surname>
          </string-name>
          , Alfonso Valencia,
          <string-name>
            <surname>Cathy H. Wu</surname>
          </string-name>
          , Andrew ChatrAryamontri, Karen G. Dowell, Eva Huala, Anlia Loureno, Robert Nash,
          <string-name>
            <surname>Anne-Lise</surname>
            <given-names>Veuthey</given-names>
          </string-name>
          , Thomas Wiegers, and
          <string-name>
            <given-names>Andrew G.</given-names>
            <surname>Winter</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Text mining for the biocuration workflow</article-title>
          .
          <source>Database</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Todd</given-names>
            <surname>Lingren</surname>
          </string-name>
          , Louise Deleger, Katalin Molnar, Haijun Zhai, Jareen Meinzen-Derr, Megan Kaiser, Laura Stoutenborough,
          <string-name>
            <given-names>Qi</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and Imre</given-names>
            <surname>Solti</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Evaluating the impact of pre-annotation on annotation speed and potential bias: natural language processing gold standard development for clinical named entity recognition in clinical trial announcements</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          ,
          <volume>21</volume>
          (
          <issue>3</issue>
          ):
          <fpage>406</fpage>
          -
          <lpage>413</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Mariana</given-names>
            <surname>Neves</surname>
          </string-name>
          and
          <string-name>
            <given-names>Ulf</given-names>
            <surname>Leser</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>A survey on annotation tools for the biomedical literature</article-title>
          . Briefings in bioinformatics, page bbs084.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Brett R. South</surname>
          </string-name>
          , Danielle Mowery, Ying Suo, Jianwei Leng, scar Ferrndez,
          <string-name>
            <surname>Stephane M. Meystre</surname>
          </string-name>
          , and
          <string-name>
            <surname>Wendy</surname>
            <given-names>W.</given-names>
          </string-name>
          <string-name>
            <surname>Chapman</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Evaluating the effects of machine pre-annotation and an interactive annotation interface on manual de-identification of clinical text</article-title>
          .
          <source>Journal of Biomedical Informatics</source>
          ,
          <volume>50</volume>
          :
          <fpage>162</fpage>
          -
          <lpage>172</lpage>
          . Special Issue on Informatics Methods in Medical Privacy.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Pontus</given-names>
            <surname>Stenetorp</surname>
          </string-name>
          , Sampo Pyysalo, Goran Topic´,
          <string-name>
            <surname>Tomoko</surname>
            <given-names>Ohta</given-names>
          </string-name>
          , Sophia Ananiadou, and
          <string-name>
            <surname>Jun'ichi Tsujii</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>brat: a web-based tool for nlp-assisted text annotation</article-title>
          .
          <source>In Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics</source>
          , pages
          <fpage>102</fpage>
          -
          <lpage>107</lpage>
          , Avignon, France, April. Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Philippe</given-names>
            <surname>Thomas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Mariana</given-names>
            <surname>Neves</surname>
          </string-name>
          , Tim Rockta¨schel, and Ulf Leser. 2013a.
          <article-title>Wbi-ddi: Drug-drug interaction extraction using majority voting</article-title>
          .
          <source>In Second Joint Conference on Lexical and Computational Semantics (*SEM)</source>
          , Volume
          <volume>2</volume>
          :
          <source>Proceedings of the Seventh International Workshop on Semantic Evaluation (SemEval</source>
          <year>2013</year>
          ), pages
          <fpage>628</fpage>
          -
          <lpage>635</lpage>
          , Atlanta, Georgia, USA, June. Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Philippe</given-names>
            <surname>Thomas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Mariana</given-names>
            <surname>Neves</surname>
          </string-name>
          , Tim Rockta¨schel, and Ulf Leser. 2013b.
          <article-title>Wbi-ddi: drug-drug interaction extraction using majority voting</article-title>
          .
          <source>In Second Joint Conference on Lexical and Computational Semantics (* SEM)</source>
          , volume
          <volume>2</volume>
          , pages
          <fpage>628</fpage>
          -
          <lpage>635</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>Seid</given-names>
            <surname>Muhie</surname>
          </string-name>
          <string-name>
            <surname>Yimam</surname>
          </string-name>
          , Chris Biemann, Richard Eckart de Castilho, and
          <string-name>
            <given-names>Iryna</given-names>
            <surname>Gurevych</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Automatic annotation suggestions and custom annotation layers in webanno</article-title>
          .
          <source>In Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations</source>
          , pages
          <fpage>91</fpage>
          -
          <lpage>96</lpage>
          . Association for Computational Linguistics.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>