<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>2016b) for
expert user curation. We have also successfully integrated DIVE
with the publication pipeline for two internationally recognized
Plant Biology Journals (namely</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Domain Informational Vocabulary Extraction Experiences with Publication Pipeline Integration and Ontology Curation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Amit Gupta</string-name>
          <email>agupta@tacc.utexas.edu</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Weijia Xu</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>y Pankaj Jaiswal</string-name>
          <email>zjaiswalp@science.oregonstate.edu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>z Crispin Taylor</string-name>
          <email>xctaylor@aspb.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>x Jennifer Regala</string-name>
          <email>jregala@aspb.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>American Society of Plant Biologists</institution>
          ,
          <addr-line>Rockville, Maryland</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Botany and Plant Pathology, Oregon State University</institution>
          ,
          <addr-line>Corvallis, Oregon</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Texas Advanced Computing Center, University of Texas at Austin</institution>
          ,
          <addr-line>Austin, Texas</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <fpage>7</fpage>
      <lpage>10</lpage>
      <abstract>
        <p>We will present updates on an ongoing project DIVE (Domain Informational Vocabulary Extraction), a system designed for extracting domain information from scientific publications. DIVE implements an ensemble of text mining methods for biological entity extraction from article text. DIVE also attempts use the co-occurrence patterns of these entities to establish probable relationships between them. DIVE also features an improved web interface for expert user curation of extracted information, thereby providing a means for a constantly growing and expert curated body of domain information for an article corpus. We also discuss our experiences from successful integration of DIVE with the publishing pipeline for two prominent Plant Biology Journals (The Plant Cell and Plant Physiology ) from ASPB (American Society of Plant Biologists). The extracted results are embedded at the end of the final proof of the published article to enhance its accessibility and discoverability. Furthermore, DIVE tracks expert user curation actions on its web interface for future training and improvement of the entity detection algorithm.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>Synthesizing information from a large corpus of journal articles
or technical documents requires a great deal of time, non-trivial
effort to understand and digest the contents and also demands
significant expertise from the reader. Furthermore, journal articles
are often the first textual appearance of new terms, concepts,
ideas and discoveries that are without precedence. This furthermore
exacerbates the already difficult task of extracting and preserving
information contained in these articles. Over various scientific
domains, these articles amount in the millions and with continuous
publication they continue to grow at an astonishing rate. To
keep up with the constant influx and volume of new information,
computationally analyzing these growing corpora of technical
documents seems like a natural solution.</p>
      <p>
        To address this problem, we have designed and implemented a
system called DIVE (Domain Informational Vocabulary Extraction)
        <xref ref-type="bibr" rid="ref12 ref13">Weijia Xu (2016</xref>
        a). DIVE employs text mining methods for entity
extraction, and utilizes cyberinfrastructure for online processing
and service support. To detect entities of interest, DIVE uses an
      </p>
    </sec>
    <sec id="sec-2">
      <title>DIVE ARCHITECTURE</title>
      <p>Entity Candidate Extraction: Here a few rules are employed for
extract entity candidates from text tokens.</p>
      <p>Regular Expressions - where Entity candidates can be
selected based on patterns indicated by Regular Expression
rules. Furthermore, new rules can be added on the fly to
DIVE as and when required.</p>
      <p>Keyword Dictionary - where tokens can be matched against
a list of known words known to be entities or to even
eliminate known words that are not to be considered as
entities.</p>
      <p>
        Publishing Standards - where rules from publishing
conventions are used to identify possible entity candidates.
For example if something important is italicized, bold or
quotes. Ontology, where Entity candidates are cross-checked
against known ontology rules. In the case of Plant Biology
articles, DIVE uses GRAMENE
        <xref ref-type="bibr" rid="ref11">(Tello-Ruiz et al., 2018)</xref>
        ,
Arabidopsis Information Portal
        <xref ref-type="bibr" rid="ref1">(Araport, 2018)</xref>
        , CHeBI
        <xref ref-type="bibr" rid="ref9">(Hastings et al., 2016)</xref>
        and Plant Ontology
        <xref ref-type="bibr" rid="ref6">Foundry (2018)</xref>
        .
Entity Assessment: The detection techniques above provide
varying degrees of accuracy. Once a list of entity candidates
is identified, they have to be assessed for accuracy. We use
previously validated results and co-location with other verified
entities as methods to validate entities. The primary means of
verification is the expert user verifying the entities (via the
DIVE web interface, mentioned below).
      </p>
      <p>
        All of the above is stored in a relational database that can be
analyzed with complex queries later. DIVE also features a web
interface, implemented in the Django Web Framework
        <xref ref-type="bibr" rid="ref5">(Django,
2018)</xref>
        , for expert user curation of extracted entity information.
Here the user has the opportunity to Edit, Add and Delete entity
information related to the article. The website itself is backed by the
relational database with the entity information.
      </p>
      <p>These design features allows for 2 things:</p>
      <p>As DIVE processes more articles from a specific scientific
domain and its corpus size increases, the web interface
provides means to create a constantly growing and expert
curated body of domain information represented in that article
corpus.</p>
      <p>Besides tracking the entity information itself, the backend
database can track expert user actions on the web curation
interface. This forms a perfect testbed to test and develop
automated machine learning algorithms to both improve the
entity recognition and to recommend curation changes to
authors based on historical data of such changes.
3</p>
    </sec>
    <sec id="sec-3">
      <title>ASSOCIATION ANALYSIS</title>
      <p>
        We employ Association Analysis, a Machine Learning method,
to detect possible relationships between detected entities based on
their co-occurrence patterns in the text. We use the popular
FPGrowth Algorithm
        <xref ref-type="bibr" rid="ref8">(Han et al., 2000)</xref>
        implemented in R for this
analysis
        <xref ref-type="bibr" rid="ref7">(Hahsler et al., 2011)</xref>
        . This algorithm basically infers the
likelihood of 2 groups of entities occurring together based on their
co-occurence patterns in the article text. Using such occurrence
relationships between entity groups, frequently mirror or atleast
serve as a hint towards the deeper interaction relationship that
is often being explained in the article text. We also present a
visualization of the top rules inferred by the algorithm to the user for
their edification in the web interface as seen in Figure (2). As this
figure demonstrates, such pairs of these entity co-occurrence pattern
can collectively show interesting patterns of possible interaction,
worth analyzing by the expert user.
The algorithm may be scoped down narrow to the sentence level
or scoped globally to the corpus level. Each level of granularity
may reveal different entity relationships. In DIVE, these have been
mostly scoped to the article level as shown in Figure (2). However,
as the corpus being curated by DIVE grows larger, a global level
association analysis might also prove insightful for the consumption
of domain curation experts. As an exemplar, we can see a global
level association analysis for roughly 2000 Plant Biology article
corpus in Figure (3).
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>PUBLICATION PIPELINE INTEGRATION AND</title>
    </sec>
    <sec id="sec-5">
      <title>WEB SERVICE DESIGN</title>
      <p>
        DIVE has been integrated into the publication pipeline of two Plant
Biology Journals, namely The Plant Cell
        <xref ref-type="bibr" rid="ref2 ref3 ref4">(ASPB, 2018b)</xref>
        and Plant
Physiology
        <xref ref-type="bibr" rid="ref2 ref3 ref4">(ASPB, 2018c)</xref>
        from ASPB
        <xref ref-type="bibr" rid="ref2 ref3 ref4">(ASPB, 2018a)</xref>
        . A company
named Sheridan Journal Services develops and runs the publication
and proofing software for these journals. The architecture of the
integration is as shown in Figure (4). To enable this integration,
Short Version of Title
DIVE functionality was exposed to the publication software as a
web service with 2 endpoints.
      </p>
      <sec id="sec-5-1">
        <title>Article Endpoint</title>
        <p>This endpoint receives two HTTP POST requests related to the
article.</p>
        <p>Article Push request: This request is to push a new article
into DIVE. An article may also be pushed multiple times to
DIVE during the publication process to incorporate proofing
edits and corrections. This request contains the location of
the cloud storage service from where this article may be
retrieved. This request also contains metadata information
about the article file being pushed for verification purposes.
Pull Curation request: This request is to pull a summary
of curation information from DIVE about the article. It
is usually done at the end of the proofing process and is
embedded into the final proof of the article.</p>
      </sec>
      <sec id="sec-5-2">
        <title>Article Landing Page Endpoint</title>
        <p>This endpoint is the landing page of the article. It is where
the extracted entities for the article may be viewed and
curated. This is where the authors are directed during the
proofing process of their article to curate the terms. This
page contains instructions of author curation actions and the
list of entities extracted with metainformation. The authors
may either verify its accuracy or do curation actions of edits,
additions, deletions to this information. The extracted result is
appended at the end of the final proof version of the publication
with cross references to other known ontologies, to improve
its accessibility and discoverability. These contributions are
also tracked by the DIVE backend database and can serve
to improve the information quality and improve future entity
detection for DIVE.
5</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>ENTITY RANKING</title>
      <p>
        When expert users arrive at their articles landing page, there is only
a limited space to display information for curation. Furthermore,
our experience was that too much information tends to confuse
users and defeat the purpose of the curation interface. We therefore
made a design decision to only display the most important entities
(we display the 10 most important) to the users to curate, with a
paginated option for the interested users to curate more entities. We
designed a scoring function in DIVE to rank entities based on their
type, whether or not they have a cross-reference to a mature external
ontology like Gramene
        <xref ref-type="bibr" rid="ref11">(Tello-Ruiz et al., 2018)</xref>
        .
      </p>
      <p>Our three tiers of prioritization are:</p>
      <sec id="sec-6-1">
        <title>Genes</title>
      </sec>
      <sec id="sec-6-2">
        <title>Proteins</title>
      </sec>
      <sec id="sec-6-3">
        <title>Other entities like Plant Anatomies etc.</title>
        <p>Within each tier an entity that is confirmed by an external
ontology recieves higher priority and frequency of occurrence.
6</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>SEARCH INTERFACE</title>
      <p>Expert users may also use our search interface to search for other
articles within the corpus that contain an entity they are interested
in. This can further help the articles discoverability amongst other
users with overlapping domain expertise and interest. An example
of the search interface is as shown in Figure (5). The articles are
listed with their Title, Journal Name and other metadata like a doi
link which leads to a site hosting a copy of the article.
7</p>
    </sec>
    <sec id="sec-8">
      <title>FUTURE WORK</title>
      <p>DIVE is under ongoing development and we have a few exciting
features planned to be rolled out in the near future that would
immensely benefit expert user curators from scientific domains.
Some of them are summarize below:</p>
      <p>Based on the entities in the users article and/or the
entities they have chosen to curate, DIVE will make
article recommendations to the author by ranking articles
on similarity. Such recommendations can also incorporate
relationships discovered by Association Analysis.</p>
      <p>Based on historical curation action information for entities,
appropriate recommendations will be made to users when they
arrive on their articles landing page.</p>
      <p>We plan to incorporate full text indexing into search results to
make it more comprehensive.
8</p>
    </sec>
    <sec id="sec-9">
      <title>CONCLUSION AND FUTURE WORK</title>
      <p>Our early experience with deploying this solution in production with
two internationally recognized Plant Biology Journals from ASPB
has been promising. We are seeing enthusiastic participation by
expert users and at present see about 10 curation actions per article
in our corpus. Furthermore, although DIVE was developed for the
use case of Plant Biology Journal Articles, it has been designed to
be versatile and is quite readily adapted to document collections of
any domain. We are presently investigating a use cases for corpora
in other domains as well (ex. Aerospace Engineering) and will
continue to expand in this area.</p>
      <p>DIVE is under ongoing development and we have a few exciting
features planned to be rolled out in the near future that would
immensely benefit expert user curators from scientific domains. We
are working on improving the search features to incorporate full
text search, relationships uncovered by Association Analysis and are
also investigating improvements to our entity detection algorithms.
Other planned enhancements for expert users of DIVE include
article recommendations and curation action recommendations. We
aim to continue to build DIVE and deploy it important use case
scenarios from many domains to benefit scientists and researchers
at large make sense of their large document corpora.</p>
    </sec>
    <sec id="sec-10">
      <title>ACKNOWLEDGEMENTS</title>
      <p>DIVE is partially supported by CyVerse (NSF awards DBI0735191,
DBI1265383) and by Gramene, A Comparative Plant Genomics
Database (NSF award IOS 1127112).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Araport</surname>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Arabidopsis information portal</article-title>
          . https://www.araport.org/.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>ASPB</surname>
          </string-name>
          (
          <year>2018a</year>
          ).
          <article-title>American society of plant biologists</article-title>
          . https://aspb.org/.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>ASPB</surname>
          </string-name>
          (
          <year>2018b</year>
          ).
          <article-title>The plant cell</article-title>
          . http://www.plantcell.org/.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>ASPB</surname>
          </string-name>
          (
          <year>2018c</year>
          ).
          <article-title>Plant physiology</article-title>
          . http://www.plantphysiol.org/.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Django</surname>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Django web framework</article-title>
          . https://djangoproject.com.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Foundry</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Plant ontology</article-title>
          . http://www.obofoundry.org/ ontology/po.html.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Hahsler</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chelluboina</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hornik</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Buchta</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>The arules r-package ecosystem: Analyzing interesting patterns from large transaction datasets</article-title>
          .
          <source>Journal of Machine Learning Research</source>
          ,
          <volume>12</volume>
          ,
          <fpage>1977</fpage>
          -
          <lpage>1981</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Han</surname>
            ,
            <given-names>J</given-names>
          </string-name>
          .,
          <string-name>
            <surname>Pei</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Yin</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          (
          <year>2000</year>
          ).
          <article-title>Mining frequent patterns without candidate generation</article-title>
          .
          <source>SIGMOD Rec</source>
          .,
          <volume>29</volume>
          (
          <issue>2</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Hastings</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Owen</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dekker</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ennis</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kale</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Muthukrishnan</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Turner</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Swainston</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mendes</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Steinbeck</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (
          <year>2016</year>
          ). Chebi in 2016:
          <article-title>Improved services and an expanding collection of metabolites</article-title>
          .
          <source>Nucleic Acids Research</source>
          ,
          <volume>44</volume>
          (
          <issue>D1</issue>
          ),
          <fpage>D1214</fpage>
          -
          <lpage>D1219</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>NIH</surname>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Journal article tag suite</article-title>
          . https://jats.nlm.nih.gov/.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Tello-Ruiz</surname>
            ,
            <given-names>M. K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Naithani</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stein</surname>
            ,
            <given-names>J. C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gupta</surname>
          </string-name>
          , P.,
          <string-name>
            <surname>Campbell</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Olson</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wei</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Preece</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Geniza</surname>
            ,
            <given-names>M. J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jiao</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>Y. K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mulvaney</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chougule</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Elser</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Al-Bader</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kumari</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thomason</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kumar</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bolser</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Naamati</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tapanari</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fonseca</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huerta</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Iqbal</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Keays</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>MunozPomerFuentes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tang</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fabregat</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , DEustachio,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Weiser</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Stein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. D.</given-names>
            ,
            <surname>Petryszak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Papatheodorou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Kersey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. J.</given-names>
            ,
            <surname>Lockhart</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Taylor</surname>
          </string-name>
          , C.,
          <string-name>
            <surname>Jaiswal</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Ware</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Gramene 2018: unifying comparative genomics and pathway resources for plant research</article-title>
          .
          <source>Nucleic Acids Research</source>
          ,
          <volume>46</volume>
          (
          <issue>D1</issue>
          ),
          <fpage>D1181</fpage>
          -
          <lpage>D1189</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Weijia</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Amit</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <surname>P. J. C. T. P. L.</surname>
          </string-name>
          (
          <year>2016a</year>
          ).
          <article-title>Enhancing information accessibility of publications with text mining and ontology</article-title>
          .
          <source>International Conference on Biological Ontology.</source>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>Weijia</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Amit</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <surname>P. J. C. T. P. L.</surname>
          </string-name>
          (
          <year>2016b</year>
          ).
          <article-title>A web application for extracting key domain information for scientific publications using ontology</article-title>
          .
          <source>International Conference on Biological Ontology.</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>