<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>E. Galeota and M. Pelizzola, “Ontology-based annotations and
semantic relations in large-scale (epi)genomics data,” Briefings in
Bioinformatics</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.1093/bib/bbw036</article-id>
      <title-group>
        <article-title>Comparison of Natural Language Processing Tools for Automatic Gene Ontology Annotation of Scientific Literature</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Lucas Beasley</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Prashanti Manda</string-name>
          <email>manda@uncg.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, University of North Carolina at Greensboro</institution>
          ,
          <addr-line>NC</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <volume>18</volume>
      <issue>3</issue>
      <fpage>7</fpage>
      <lpage>10</lpage>
      <abstract>
        <p>-Manual curation of scientific literature for ontologybased knowledge representation has proven infeasible and unscalable to the large and growing volume of scientific literature. Automated annotation solutions that leverage text mining and Natural Language Processing (NLP) have been developed to ameliorate the problem of literature curation. These NLP approaches use parsing, syntactical, and lexical analysis of text to recognize and annotate pieces of text with ontology concepts. Here, we conduct a comparison of four state of the art NLP tools at the task of recognizing Gene Ontology concepts from biomedical literature using the Colorado Richly Annotated Full-Text (CRAFT) corpus as a gold standard reference. We demonstrate the use of semantic similarity metrics to compare NLP tool annotations to the gold standard.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        There has been a rapid increase in the number of scientific
articles published each year [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. However, the majority of
information in these scientific articles remains in the form of
free text, and is therefore, opaque to computational analyses
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In several data intensive fields such as Biology, ontologies
have been adopted as the de-facto mode of data representation
to enable data integration, sharing, and, to make data
computationally amenable [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        While ontologies have helped transform information from
free-text to a machine readable form, the process of this
data transformation involves a huge bottleneck. The majority
of ontology-based data annotation is performed via manual
curation of scientific literature - the process of reading and
annotating parts of text with one or more ontology concepts
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Manual curation is tedious, time consuming, and highly
unscalable to the growing body of scientific literature.
      </p>
      <p>To counter these difficulties, there has been a push to
develop automated solutions based on Natural Language
Processing (NLP) that can automatically read literature and identify
ontology concepts from text, thereby performing automated
annotation of literature.</p>
      <p>
        One of the primary tasks for these NLP methods is Named
Entity Recognition (NER) - identifying entities such as genes,
proteins, or ontology concepts from text [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. NER is an
important component of information extraction and annotation
for a wide range of domains such as biomedical research,
biology, etc [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. In other applications, NER is one of the crucial
preliminary steps for subsequent creation of complex
ontologybased expressions [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. For example, the Entity Quality (EQ)
annotation format is widely used to describe clinical and
evolutionary phenotypes [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Curators identify appropriate
ontology concepts to represent the affected Entity and the
Quality in a phenotype, and then combine the concepts to
create an EQ expression. In more complex annotations, the
Entity component might comprise multiple concepts that are
combined using relational/spatial terms. In all of these
scenarios, the first and crucial step is recognizing individual ontology
concepts from text before building complex expressions.
      </p>
      <p>
        Here, we focus on ontology-based Named Entity
Recognition and conduct a formal comparison of methods and tools
for recognizing ontology concepts from scientific literature in
an automated manner. We present a comparison of four state
of the art concept recognition tools (MetaMap [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], NCBO
Annotator [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], Textpresso [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], and SciGraph [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]). We use
the Colorado Richly Annotated Full-Text (CRAFT) corpus
[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] as a Gold Standard reference to compare and assess the
performance of these four NLP tools.
      </p>
      <p>
        The CRAFT corpus contains 67 open access, full length
biomedical articles annotated with concepts from several
ontologies (such as Gene Ontology [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], Protein Ontology [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ],
Sequence Ontology [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], etc.). In our experiment, we
employed each of the four NLP tools to annotate the articles
in the CRAFT corpus with Gene Ontology concepts and
subsequently compared the resulting annotations to CRAFT’s
GO annotations in order to assess the tool’s annotation
performance.
      </p>
      <p>Precision and Recall are the most widely used metrics to
assess the performance of any information retrieval system.
However, these traditional metrics of performance don’t take
into account the notion of partial information retrieval. For
example, in a traditional database, information is either
retrieved or not retrieved by a search system, leading to a boolean
characterization of performance. However, when aiming to
“retrieve” an appropriate ontology concept for a piece of
text, a tool might partially retrieve the concept as compared
to a Gold Standard. For example, the Gold Standard might
annotate a piece of text with the Gene Ontology concept
“apoptotic process” while an NLP tool might annotate the
same text with the concept “programmed cell death”. From
the perspective of Precision and Recall, this is an instance of
information not being retrieved or incorrect retrieval. However,
subsumption reasoning within the Gene Ontology indicates
that the two concepts are closely related since “apoptotic
process” is a direct sub-class of “programmed cell death”.
Thus it can be argued the NLP tool retrieves a semantically
similar, albeit, a slightly more general version of the Gold
Standard’s annotation.</p>
      <p>
        We propose the use of semantic similarity metrics to capture
the degree of relatedness between an NLP tool’s performance
as compared to the Gold Standard to account for the notion of
partial retrieval for ontology-based systems. Semantic
similarity is defined as the degree of relatedness between ontology
concepts or objects based on their ontology annotations [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ].
The use of other metrics that utilize the ontology hierarchy to
evaluate NLP annotations can also be seen in BioCreative IV
[
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
      </p>
      <p>II.</p>
      <p>RELATED WORK</p>
      <p>
        There have been several studies that provide an overview of
text mining and NLP techniques as applied to
biological/biomedical literature [
        <xref ref-type="bibr" rid="ref18 ref19 ref20 ref21">18–21</xref>
        ].
      </p>
      <p>
        One of the recent studies focusing on comparison of
annotation tools by Funk et al. [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] conducts a comparison of three
tools - MetaMap, NCBO Annotator, and ConceptMapper [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ].
Performance of these tools is evaluated on eight biomedical
ontologies using the CRAFT corpus as a gold standard. The
paper explores different parameter setting for the tools to
identify the most optimal combinations.
      </p>
      <p>
        Most notable among these studies is from Kelley et al. [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ],
[
        <xref ref-type="bibr" rid="ref25">25</xref>
        ] as they investigate strategies to build the National Center
for Biomedical Ontology Annotator (NCBO Annotator).
Kelley et al. [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ] compared two concept recognition tools, Mgrep
[26] and MetaMap [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] using multiple data sets and
vocabularies/ontologies of different sizes to evaluate the scalability
and performance of each tool. Their results show that Mgrep
consistently recognized fewer unique concepts than MetaMap;
which led to a higher precision score in every evaluation with
the exception of one. Kelley et al. use Precision and Recall to
quantify the performance of the two tools therefore missing
closely related but not exactly matching annotations retrieved
by the tools. In contrast, we use semantic similarity metrics
that are capable of quantifying exact matches and matches
of varying degrees of relatedness to provide a more accurate
assessment of a tool’s annotation performance.
      </p>
      <p>
        Similarly, Galeota and Pelizzola. [27] performed a
comparison of MetaMap [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and ConceptMapper. In their comparison
they used a public repository (Sequence Read Archive) of
raw experiment data and its corresponding metadata, three
ontologies for annotating, and knowledge-based semantic
similarity measures such as ‘path finding’ (shortest path between
concepts) and Information Content based metrics. Their
findings show that ConceptMapper outperformed MetaMap in
precision and recall, however MetaMap performed better than
ConceptMapper when presented with entire sentences.
      </p>
      <p>III.</p>
      <p>METHODS</p>
      <p>Articles from the CRAFT corpus v2.0 were used as input for
each NLP tool. The CRAFT corpus [28] contains 67 publicly
available articles that have been manually annotated on six
ontologies including the Gene Ontology.</p>
      <p>The annotation performance of four tools - MetaMap,
NCBO Annotator, SciGraph Annotator, and Textpresso was
evaluated at the task of annotating the 67 articles in the CRAFT
corpus with GO concepts. Below, we provide methodological
details, execution configurations, and parameters of the four
NLP tools.</p>
    </sec>
    <sec id="sec-2">
      <title>A. NLP Tools</title>
      <p>
        1) MetaMap: MetaMap is a program designed to
recognize and annotate biomedical text to concepts in the Unified
Medical Language System (UMLS) Metathesaurus [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. The
program operates via four primary steps. First, text is parsed
to tag parts of speech such as nouns to provide a syntactic
analysis. Parsed phrases are then used to generate variants
that account for acronyms, abbreviations, synonyms, etc. Next,
candidate strings/concepts within the UMLS Metathesaurus
that match one or more of the above variants are identified
to build a candidate set of annotations. The candidate set is
further evaluated against the input text using several metrics
to assess the strength of the annotations. The highest scoring
candidate is used to form the final mapping for the phrase.
      </p>
      <p>MetaMap was run using the scheduled batch processor with
the following parameters:
-V USAbase: Selects the USA base data version
-L 17: Selects the 2017 version of the SPECIALIST
lexicon
-Z 1718: Selects the 2017AB UMLS Metathesaurus as
the knowledge source
-E: Indicates the end of a citation with the ’EOT’ flag
-T: Tagger output - lines up the tagged output on the
lines below the input.
-I: Displays the UMLS IDs for each tagged concept
-R GO: Restricts annotations to the Gene Ontology
UMLS concepts identified by MetaMap were mapped to GO
identifiers (GO release date 04/28/2017). In some instances,
MetaMap generates multiple annotations corresponding to a
piece of text (phrase). To determine the precise piece of text
being annotated to a concept, we verified if the concept name
of the annotation was a sub-string of the phrase. If yes, we
recorded the annotation as being tagged to the sub-string and
not to the larger phrase. If not, the annotation was associated to
the larger phrase. For example, if the phrase “several thousand
liver gene expression quantitative trait loci” was annotated
with the GO concept “gene expression”, the annotation would
be tagged to “gene expression” in text because it is a
substring of the original phrase. However, if the phrase “on the
regulation of many physiological traits” was annotated to GO
concept “regulation of biological process” then the annotation
would be mapped to the full phrase “on the regulation of many
physiological traits” rather than a specific sub-string within the
phrase.</p>
      <p>
        2) NCBO Annotator: The National Center for Biomedical
Ontology (NCBO) Annotator [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] was developed to recognize
ontology concepts from biological and biomedical text. NCBO
Annotator is implemented to provide annotations to 207
different ontologies.
      </p>
      <p>First, the text to be annotated is provided as input along with
a dictionary containing concepts from one or more desired
ontologies. An ontology concept recognizer named Mgrep
([26]) is used to match input text to concepts in the dictionary.
These matches called direct annotations are later expanded
to create semantic expansions using ontology semantics and
relationships.</p>
      <p>For example, the is a relationship can be used to identify
subsumer concepts of direct annotations. The ontology concept
recognizer also uses one-to-one cross mappings between
ontology terms to identify new annotations. For example, direct
annotations to a concept in one ontology can be expanded
to gather new annotations from another ontology based on
pre-existing bridge mappings between the two ontologies.
NCBO also proposes the use of semantic similarity metrics to
identify related concepts for direct annotations to create new
annotations.</p>
      <p>
        Using their REST API, NCBO was run with the following
parameters on the Gene Ontology (release date 10/25/2017).
ontologies=GO: Restricts the annotator to the Gene
Ontology
text=: Passes the UTF-8 encoded text to the annotator
3) Textpresso: Textpresso [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] uses the Textpresso ontology
that contains concepts from 33 parent ontologies such as gene,
cellular component, nucleic acid, organism, phenotype, drugs
including the Gene Ontology. These parent categories may be
further classified into one or more sub-categories.
      </p>
      <p>
        Textpresso breaks free text into sentences and words and
uses a text-to-XML converter that marks the text through and
generates XML documents [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Users can opt to retrieve
annotations from one or more parent/sub-categories along with
searching specific portions of the text such as the abstract/body,
etc. Textpresso was run using the default command with
no additional parameters specified. The Textpresso ontology
(release date 07/2011) at the time of execution used Gene
Ontology version GO v1.934.
      </p>
      <p>
        4) SciGraph Annotator: The Monarch annotation service
[
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] provides the SciGraph annotator that annotates user
provided free text with ontology concepts and biological entities.
SciGraph annotator marks the text with concepts from the
Monarch knowledge graph that includes gene ontology terms,
genes, diseases, and phenotypes. SciGraph also allows the user
the flexibility of selecting any desired ontology for annotation.
SciGraph was run with the ann parameter on the Monarch
knowledge graph (as of 04/09/2018) that calls the annotation
service.
      </p>
    </sec>
    <sec id="sec-3">
      <title>B. Evaluation of Tool Annotation Performance</title>
      <p>Annotations generated by the four NLP tools were compared
to the CRAFT corpus for evaluating their annotation
performance. Annotations generated by the four tools and those in
the CRAFT corpus were represented in a consistent
locationbased format that indicated annotations corresponding to the
piece of text between a starting character index and an ending
character index. This representation enabled the comparison of
annotation output across sources in a consistent manner.</p>
      <p>Consider a piece of text with starting character index i and
ending index j. We denote a CRAFT annotation corresponding
to this text as Ci;j and a tool annotation for the same text
as Ni;j where N represents one of the four tools. NLP tool
annotations might be compared to those from CRAFT in one
of the following cases:</p>
      <p>Exact Match: Ci;j 6= , Ni;j 6= , and Ci;j = Ni;j
An exact annotation match occurs when a piece of text
is annotated with the same ontology concept in CRAFT
and a tool. For example, both the tool and CRAFT
corpus annotate the text “development” (i = 191, j
= 202) with the GO concept “developmental process”
(GO:0032502)
Partial Match: Ci;j 6= and Ni;j 6= , Ci;j 6= Ni;j
A partial annotation match occurs when a piece of
text is annotated both by CRAFT and a tool but the
ontology concepts used vary between the two sources.
For example, both the tool and CRAFT corpus annotate
the text “antibody” (i = 5964, j = 5972). However,
CRAFT used the GO concept “immunoglobulin
complex” (GO:0019814) while the tool used “B cell receptor
complex” (GO:0019815).</p>
      <p>False Positive: Ci;j = , Ni;j 6=
A false positive annotation occurs when a tool generates
an annotation for a piece of text that does not contain
an annotation in CRAFT. For example, a tool annotated
“homeostasis” (i = 233, j = 244) with “homeostatic
process” (GO:0042592), but there was no annotation in
CRAFT for the same text.</p>
      <p>False Negative: Ci;j 6= , Ni;j =
A false negative annotation occurs when a tool fails
to annotate a piece of text that has been annotated in
CRAFT. For example, the CRAFT corpus annotates the
text “immune function” (i = 204, j = 219) with “immune
system process” (GO:0002376), but the tool did not
annotate that piece of text.</p>
    </sec>
    <sec id="sec-4">
      <title>C. Semantic Similarity</title>
      <p>
        The accuracy of a tool’s annotation matches (exact and
partial) in comparison to CRAFT was assessed using Jaccard
semantic similarity. The Jaccard similarity (J ) of two ontology
concepts/classes (A, B) in an ontology is defined as the ratio
of the number of classes in the intersection of their subsumers
over the number of classes in their union of their subsumers
[
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], [29].
      </p>
      <p>J (A; B) = jS(A) \ S(B)j</p>
      <p>jS(A) [ S(B)j
where S(A) is the set of classes that subsume A. Jaccard
similarity ranges from 0 (no similarity) to 1 (exact match).
Jaccard similarity is designed to measure similarity between
ontology concepts based on their proximity to each other in
the ontology - the farther two concepts are from each other,
the less similar they are.</p>
      <p>IV.</p>
      <p>RESULTS AND DISCUSSION</p>
      <p>The CRAFT corpus contains 67 open access, full-length
papers annotated with concepts from 6 ontologies. The corpus
contains 91,446 total annotations and 3,242 unique annotations
spanning 6 ontologies. Table I shows the total and unique
number of annotations categorized by ontology. CRAFT contains a
total of 32,416 GO annotations (35.4% of all ontology
annotations) with an average of 416 annotations per paper. These GO
annotations are distributed across three sub-ontologies Cellular
Component (CC), Molecular Function (MF), and Biological
Process (BP) (Figure 1). Results in Figure 1 indicate that
a large portion of GO annotations in the CRAFT belong to
the Biological process sub-ontology with Molecular Function
accounting for the least number of annotations.</p>
      <p>DISTRIBUTION OF ANNOTATIONS IN THE CRAFT CORPUS</p>
      <p>BY ONTOLOGY
Number of total
annotations
32,416
22,090
15,594
8,137
tations. We observe striking similarities in the distributions
of annotations across GO depths for CRAFT and the NLP
tools except Textpresso (Figure 2). The majority of Textpresso
annotations belong at level 1 (direct children of the root) with
a distinct lack of annotations at deeper levels. This trend is
in stark contrast to trends seen with CRAFT and the other
three tools where the number of unique terms increases with
increasing depth peaking at level 6 and declining after. This
result is unsurprising given the low number of unique GO
terms retrieved by Textpresso (Table II).
Biological
Process</p>
      <p>Cellular
Component</p>
      <p>Molecular
Function</p>
      <p>Four NLP tools (MetaMap, NCBO Annotator, Textpresso,
and SciGraph) were used to annotate the 67 CRAFT articles
with GO concepts. First, we observed the performance of the
four tools using the total number of unique and non-unique
annotations generated by each tool as compared to the CRAFT
Corpus (Table II). Here, we see that Textpresso retrieves the
most number of annotations among the four tools, surprisingly,
about 24% more than the CRAFT corpus. However, Textpresso
annotations contain a very low number of unique GO terms
indicating that these annotations do not span across the Gene
Ontology well. MetaMap retrieves about 82% of the CRAFT
total annotation count and 94% of CRAFT’s unique annotation
count. NCBO Annotator and SciGraph generate a substantially
lower number of annotations as compared to MetaMap and
Textpresso.</p>
      <p>We also examined if the distributions of NLP annotations
at different depths of the GO were similar to CRAFT
anno0
2</p>
      <p>4 6 8 10
Depth in the Gene Ontology
12
14</p>
      <p>Next, we examined each tool’s performance with respect
to the proportion of false positives and negatives (Table III).
These results indicate that a staggering proportion of
annotations generated by MetaMap and Textpresso, the two tools
generating the most annotations (Table II) are false positives or
false negatives. In comparison, SciGraph and NCBO Annotator
generate lower false positives.</p>
      <p>We hypothesize that the large number of false positives and
false negatives from MetaMap might be an artifact of our
interpretation of MetaMap’s results. Without a clear mapping
between the exact piece of text being annotated to a concept,
we used a fuzzy matching between the tagged phrase and
the annotation which might have contributed to these false
positives.</p>
      <p>We suspect that a proportion of false positives generated by
the tools might be because CRAFT annotations were generated
MetaMap</p>
      <p>SciGraph NCBO</p>
      <p>NLP Tool</p>
      <p>Textpresso
by the NLP tools here is to analyze and match text lexically
to concept names, we hypothesized that it might be easier to
recognize short concept names as compared to longer ones. To
explore this, we plotted the semantic similarity of annotation
matches categorized by the number of words in the concept
names. Interestingly, we do not see any consistent decreases
in semantic similarity as the number of words in the GO
concept name increases (Figure 4). This indicates that the ease
of annotation does not decline as ontology concept names get
longer and more complex.</p>
      <sec id="sec-4-1">
        <title>MetaMap</title>
      </sec>
      <sec id="sec-4-2">
        <title>SciGraph</title>
      </sec>
      <sec id="sec-4-3">
        <title>NCBO</title>
      </sec>
      <sec id="sec-4-4">
        <title>Textpresso</title>
        <p>using an outdated version of the GO whereas the tools use
newer versions of the GO with a more complete set of ontology
concepts.</p>
        <p>Next, we focused on exact and partial matches between the
tools and CRAFT. Surprisingly, we notice that most of the
tools generate appreciably more exact matches as compared
to partial matches (Table IV). These numbers also indicate
that a rather small proportion (2-18%) of total annotations
generated by a tool are exact or partial matches to CRAFT.
We see that NCBO Annotator produces the highest number of
exact matches followed by SciGraph. Surprisingly, despite it’s
high number of generated annotations, MetaMap produces just
2.7% exact matches and 0.4% partial matches.</p>
        <p>The four tools generated a total of 15,809 exact matches and
14,352 partial matches to CRAFT. We analyzed the overall
quality and accuracy of these matched annotations by
comparing them to corresponding CRAFT annotations using Jaccard
semantic similarity. Here, we see that MetaMap has a very high
similarity to CRAFT ( 90%) followed by SciGraph and NCBO.
This result further reinforces our hypothesis that the MetaMap
false positives are likely a result of our fuzzy interpretation
of its annotation output. It is important to note that although
MetaMap results in the highest semantic similarity score, the
pool of annotations being analyzed is substantially smaller as
compared to the other tools. Textpresso annotations show the
lowest similarity scores. Given the discrepancy in the depth of
distribution between Textpresso’s annotations and the CRAFT
corpus, it is unsurprising to see the low semantic similarity
score between Textpresso and CRAFT. As in the above tests,
SciGraph and NCBO perform well (75-81% similarity) with
relatively few false positives and false negatives.</p>
        <p>Another factor that could affect the semantic similarity
scoring and the performance of the tools is the difference in
ontology versions used for annotation between CRAFT and the
various tools. Some of the NLP tools compared in this study
do not support user specified ontology versions for annotation.</p>
        <p>We examined if it is easier for NLP tools to annotate
GO concepts with simple names (as measured by number of
words in the concept name) as compared to concepts with
multiple word names. Since the primary process employed
V.</p>
        <p>CONCLUSIONS</p>
        <p>Here, we conducted a comparison of MetaMap, NCBO
Annotator, Textpresso, and SciGraph at the task of annotating
scientific literature with Gene Ontology concepts. We found
that while Textpresso generated the most amount of
annotations, a large proportion of the annotations were false positives
or false negatives. Also, Textpresso annotations featured a
disproportionate number of high level GO terms as compared
to CRAFT and other tools. MetaMap generated the largest
number of false positives and false negatives but had the
highest similarity to CRAFT based on partial and exact matches.
Although NCBO and Scigraph retrieve lower annotations as
compared to the other tools, their performance in terms of
false positives and false negatives is better in comparison
to the other tools. The semantic similarity of NCBO and
SciGraph is very comparable to the top performer, MetaMap.
As compared to MetaMap and Textpresso which show extreme
results, SciGraph and NCBO perform well on all evaluation
categories indicating potential utility for scientific applications.
Overall, the proportion of false positives and false negatives
across all the tools indicates that there is substantial room for
improvement in NLP tools for ontology-based Named Entity
Recognition.</p>
        <p>VI.</p>
        <p>DATA AND SOFTWARE</p>
        <p>The CRAFT data source can be found at http://
bionlp-corpora.sourceforge.net/CRAFT/. Code for results and
analysis can be found at https://github.com/lucasbeasley/
average-jaccard.</p>
        <p>VII.</p>
        <p>ACKNOWLEDGMENTS</p>
        <p>This work was supported by the University of North
Carolina at Greensboro Giant Steps initiative funding to Manda.
The authors acknowledge valuable technical assistance from
MetaMap’s development team.
building the open biomedical annotator,” BMC Bioinformatics,
vol. 10, no. 9, p. S14, Sep 2009. [Online]. Available: https:
//doi.org/10.1186/1471-2105-10-S9-S14
M. Dai, N. Shah, W. Xuan et al., “An efficient solution for mapping free
text to ontology terms. amia summit on translational bioinformatics,”
San Francisco CA, 2008.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>C.</given-names>
            <surname>Jonquet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Shah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. H</given-names>
            <surname>Youn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Musen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Callendar</surname>
          </string-name>
          , and M.-A. Storey, “
          <article-title>Ncbo annotator: Semantic annotation of biomedical data</article-title>
          ,”
          <volume>01</volume>
          2009.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Balhoff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. M.</given-names>
            <surname>Dahdul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. R.</given-names>
            <surname>Kothari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lapp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. G.</given-names>
            <surname>Lundberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mabee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. E.</given-names>
            <surname>Midford</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Westerfield</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T. J.</given-names>
            <surname>Vision</surname>
          </string-name>
          , “
          <article-title>Phenex: ontological annotation of phenotypic diversity,” PLoS One</article-title>
          , vol.
          <volume>5</volume>
          , no.
          <issue>5</issue>
          , p.
          <fpage>e10500</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Ashburner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Ball</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Blake</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Botstein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Butler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Cherry</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. P.</given-names>
            <surname>Davis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Dolinski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. S.</given-names>
            <surname>Dwight</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. T.</given-names>
            <surname>Eppig</surname>
          </string-name>
          et al., “
          <article-title>Gene ontology: tool for the unification of biology,” Nature genetics</article-title>
          , vol.
          <volume>25</volume>
          , no.
          <issue>1</issue>
          , p.
          <fpage>25</fpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>W.</given-names>
            <surname>Dahdul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. A.</given-names>
            <surname>Dececchi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ibrahim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lapp</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Mabee</surname>
          </string-name>
          , “
          <article-title>Moving the mountain: analysis of the effort required to transform comparative anatomy into computable anatomy</article-title>
          ,
          <source>” Database</source>
          , vol.
          <year>2015</year>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>I.</given-names>
            <surname>Spasic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ananiadou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>McNaught</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Kumar</surname>
          </string-name>
          , “
          <article-title>Text mining and ontologies in biomedicine: making sense of raw text,” Briefings in bioinformatics</article-title>
          , vol.
          <volume>6</volume>
          , no.
          <issue>3</issue>
          , pp.
          <fpage>239</fpage>
          -
          <lpage>251</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>U.</given-names>
            <surname>Yasavur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Amini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. L.</given-names>
            <surname>Lisetti</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N.</given-names>
            <surname>Rishe</surname>
          </string-name>
          , “
          <article-title>Ontology-based named entity recognizer for behavioral health</article-title>
          .”
          <source>in FLAIRS Conference</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>H.</given-names>
            <surname>Cui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Dahdul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. T.</given-names>
            <surname>Dececchi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ibrahim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mabee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Balhoff</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Gopalakrishnan</surname>
          </string-name>
          , “
          <article-title>Charaparser+ eq: Performance evaluation without gold standard</article-title>
          ,
          <source>” Proceedings of the Association for Information Science and Technology</source>
          , vol.
          <volume>52</volume>
          , no.
          <issue>1</issue>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>G. V.</given-names>
            <surname>Gkoutos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Mungall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Dolken</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ashburner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hancock</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Schofield</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kohler</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P. N.</given-names>
            <surname>Robinson</surname>
          </string-name>
          , “
          <article-title>Entity/qualitybased logical definitions for the human skeletal phenome using pato,” in Engineering in Medicine</article-title>
          and
          <source>Biology Society</source>
          ,
          <year>2009</year>
          .
          <article-title>EMBC 2009</article-title>
          .
          <article-title>Annual International Conference of the IEEE</article-title>
          . IEEE,
          <year>2009</year>
          , pp.
          <fpage>7069</fpage>
          -
          <lpage>7072</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>P.</given-names>
            <surname>Manda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Balhoff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lapp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mabee</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T. J.</given-names>
            <surname>Vision</surname>
          </string-name>
          , “
          <article-title>Using the phenoscape knowledgebase to relate genetic perturbations to phenotypic evolution,” genesis</article-title>
          , vol.
          <volume>53</volume>
          , no.
          <issue>8</issue>
          , pp.
          <fpage>561</fpage>
          -
          <lpage>571</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>A. R.</given-names>
            <surname>Aronson</surname>
          </string-name>
          , “
          <article-title>Effective mapping of biomedical text to the umls metathesaurus: the metamap program</article-title>
          .”
          <source>in Proceedings of the AMIA Symposium. American Medical Informatics Association</source>
          ,
          <year>2001</year>
          , p.
          <fpage>17</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>H.-M. Mller</surname>
            ,
            <given-names>E. E.</given-names>
          </string-name>
          <string-name>
            <surname>Kenny</surname>
            , and
            <given-names>P. W.</given-names>
          </string-name>
          <string-name>
            <surname>Sternberg</surname>
          </string-name>
          , “
          <article-title>Textpresso: An ontology-based information retrieval and extraction system for biological literature</article-title>
          ,
          <source>” PLOS Biology</source>
          , vol.
          <volume>2</volume>
          , no.
          <volume>11</volume>
          , 09
          <year>2004</year>
          . [Online]. Available: https://doi.org/10.1371/journal.pbio.0020309
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Mungall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>McMurry</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.</surname>
          </string-name>
          <article-title>Ko¨hler</article-title>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Balhoff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Borromeo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Brush</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Carbon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Conlin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Dunn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Engelstad</surname>
          </string-name>
          et al., “
          <article-title>The monarch initiative: an integrative data and analytic platform connecting phenotypes to genotypes across species</article-title>
          ,”
          <source>Nucleic acids research</source>
          , vol.
          <volume>45</volume>
          , no.
          <issue>D1</issue>
          , pp.
          <fpage>D712</fpage>
          -
          <lpage>D722</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M.</given-names>
            <surname>Bada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Eckert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Evans</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Garcia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Shipley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Sitnikov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. A.</given-names>
            <surname>Baumgartner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. B.</given-names>
            <surname>Cohen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Verspoor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Blake</surname>
          </string-name>
          et al., “
          <article-title>Concept annotation in the craft corpus,” BMC bioinformatics</article-title>
          , vol.
          <volume>13</volume>
          , no.
          <issue>1</issue>
          , p.
          <fpage>161</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>D. A.</given-names>
            <surname>Natale</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. N.</given-names>
            <surname>Arighi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. C.</given-names>
            <surname>Barker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Blake</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Bult</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Caudy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. J.</given-names>
            <surname>Drabkin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Deustachio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. V.</given-names>
            <surname>Evsikov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Huang</surname>
          </string-name>
          et al., “
          <article-title>The protein ontology: a structured representation of protein forms and complexes</article-title>
          ,”
          <source>Nucleic acids research</source>
          , vol.
          <volume>39</volume>
          , no.
          <source>suppl 1</source>
          , pp.
          <fpage>D539</fpage>
          -
          <lpage>D545</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>K.</given-names>
            <surname>Eilbeck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. E.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Mungall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Yandell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Stein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Durbin</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Ashburner</surname>
          </string-name>
          , “
          <article-title>The sequence ontology: a tool for the unification of genome annotations,” Genome biology</article-title>
          , vol.
          <volume>6</volume>
          , no.
          <issue>5</issue>
          , p.
          <fpage>R44</fpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>C.</given-names>
            <surname>Pesquita</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Faria</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. O.</given-names>
            <surname>Falcao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Lord</surname>
          </string-name>
          , and
          <string-name>
            <given-names>F. M.</given-names>
            <surname>Couto</surname>
          </string-name>
          , “
          <article-title>Semantic similarity in biomedical ontologies,” PLoS computational biology</article-title>
          , vol.
          <volume>5</volume>
          , no.
          <issue>7</issue>
          , p.
          <fpage>e1000443</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Mao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Van Auken</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. N.</given-names>
            <surname>Arighi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>McQuilton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. T.</given-names>
            <surname>Hayman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tweedie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. L.</given-names>
            <surname>Schaeffer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. J.</given-names>
            <surname>Laulederkind</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.-J.</given-names>
            <surname>Wang</surname>
          </string-name>
          et al.,
          <article-title>“Overview of the gene ontology task at biocreative iv</article-title>
          ,
          <source>” Database</source>
          , vol.
          <year>2014</year>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>N.</given-names>
            <surname>Tiffin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. F.</given-names>
            <surname>Kelso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. R.</given-names>
            <surname>Powell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. B.</given-names>
            <surname>Bajic</surname>
          </string-name>
          , and
          <string-name>
            <given-names>W. A.</given-names>
            <surname>Hide</surname>
          </string-name>
          , “
          <article-title>Integration of text-and data-mining using ontologies successfully selects disease gene candidates</article-title>
          ,
          <source>” Nucleic acids research</source>
          , vol.
          <volume>33</volume>
          , no.
          <issue>5</issue>
          , pp.
          <fpage>1544</fpage>
          -
          <lpage>1552</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Cohen</surname>
          </string-name>
          and
          <string-name>
            <given-names>W. R.</given-names>
            <surname>Hersh</surname>
          </string-name>
          , “
          <article-title>A survey of current work in biomedical text mining,” Briefings in bioinformatics</article-title>
          , vol.
          <volume>6</volume>
          , no.
          <issue>1</issue>
          , pp.
          <fpage>57</fpage>
          -
          <lpage>71</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>M.</given-names>
            <surname>Krallinger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Morgan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Leitner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Tanabe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wilbur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Hirschman</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Valencia</surname>
          </string-name>
          , “
          <article-title>Evaluation of text-mining systems for biology: overview of the second biocreative community challenge,” Genome biology</article-title>
          , vol.
          <volume>9</volume>
          , no.
          <issue>2</issue>
          , p.
          <fpage>S1</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>M.</given-names>
            <surname>Krallinger</surname>
          </string-name>
          , R. A.
          <article-title>-A. Erhardt, and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Valencia</surname>
          </string-name>
          , “
          <article-title>Text-mining approaches in molecular biology and biomedicine,” Drug discovery today</article-title>
          , vol.
          <volume>10</volume>
          , no.
          <issue>6</issue>
          , pp.
          <fpage>439</fpage>
          -
          <lpage>445</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>C.</given-names>
            <surname>Funk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Baumgartner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Garcia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Roeder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Bada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. B.</given-names>
            <surname>Cohen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. E.</given-names>
            <surname>Hunter</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Verspoor</surname>
          </string-name>
          , “
          <article-title>Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters,” BMC bioinformatics</article-title>
          , vol.
          <volume>15</volume>
          , no.
          <issue>1</issue>
          , p.
          <fpage>59</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Tanenblatt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Coden</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and I. L.</given-names>
            <surname>Sominsky</surname>
          </string-name>
          , “
          <article-title>The conceptmapper approach to named entity recognition</article-title>
          .”
          <string-name>
            <surname>in</surname>
            <given-names>LREC</given-names>
          </string-name>
          ,
          <year>2010</year>
          , pp.
          <fpage>546</fpage>
          -
          <lpage>51</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>N.</given-names>
            <surname>Bhatia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. H.</given-names>
            <surname>Shah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. L.</given-names>
            <surname>Rubin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. P.</given-names>
            <surname>Chiang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Musen</surname>
          </string-name>
          , “
          <article-title>Comparing concept recognizers for ontology-based indexing: Mgrep vs</article-title>
          . metamap,”
          <article-title>Paper accepted to the AMIA Summit on Translational Bioinformatics</article-title>
          , San Francisco,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>N. H.</given-names>
            <surname>Shah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Bhatia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Jonquet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Rubin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. P.</given-names>
            <surname>Chiang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Musen</surname>
          </string-name>
          , “Comparison of concept recognizers for [27]
          <string-name>
            <given-names>M.</given-names>
            <surname>Mistry</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Pavlidis</surname>
          </string-name>
          , “
          <article-title>Gene ontology term overlap as a measure of gene functional similarity,” BMC bioinformatics</article-title>
          , vol.
          <volume>9</volume>
          , no.
          <issue>1</issue>
          , p.
          <fpage>327</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>