<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Extension of the Interaction Network Ontology for literature mining of gene-gene interaction networks from sentences with multiple interaction keywords</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Arzucan Özgür</string-name>
          <email>arzucan.ozgur@boun.edu.tr</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Junguk Hur</string-name>
          <email>junguk.hur@med.und.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yongqun He</string-name>
          <email>yongqunh@med.umich.edu</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Basic Sciences, University of North Dakota, School of Medicine and Health Sciences</institution>
          ,
          <addr-line>Grand Forks, North Dakota 58202</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Computer Engineering, Bogazici University</institution>
          ,
          <addr-line>34342 Istanbul</addr-line>
          ,
          <country country="TR">Turkey</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Michigan Medical School</institution>
          ,
          <addr-line>Ann Arbor, MI 48109</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The Interaction Network Ontology (INO) has been demonstrated to be valuable in providing a structured ontological vocabulary for literature mining of gene-gene interactions from biomedical literature. Our analysis of the Learning Logic in Language (LLL) challenge and vaccine datasets showed that many interactions are signaled with 2 or more interaction keywords used in combination. In this paper, we extend the INO by adding combinatory patterns of two or more literature mining keywords to related INO interaction classes. An INO-based literature mining pipeline was further developed based on SPARQL queries and SciMiner, an in-house literature mining program. The majority of the gene interaction sentences from the LLL and vaccine datasets were found to use multiple keywords to represent interaction types. A comprehensive analysis of the LLL dataset identified 27 gene regulation interaction types each associated with multiple keywords. Special patterns were discovered from the hierarchical structure of these 27 INO types.</p>
      </abstract>
      <kwd-group>
        <kwd>Interaction Network Ontology</kwd>
        <kwd>Literature mining</kwd>
        <kwd>Gene-gene interaction</kwd>
        <kwd>SciMiner</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Literature mining methods for extracting interactions among biomedical entities
including genes and proteins typically formulate the problem as a binary classification
task, where the goal is to identify the pairs of entities that are stated to interact with
each other in text [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. Several different methods have been proposed to tackle this
problem ranging from relatively simpler co-occurrence based methods [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] to more
complex methods that make use of the syntactic analysis of the sentences [
        <xref ref-type="bibr" rid="ref4 ref5 ref6">4-6</xref>
        ],
mostly in conjunction with machine learning methods [
        <xref ref-type="bibr" rid="ref7 ref8 ref9">7-9</xref>
        ].
      </p>
      <p>
        Besides, extracting the existence of interactions among biomolecules, identifying
the types of these interactions are vital for a better understanding of the underlying
biological processes and for the creation of more detailed and structured models of
interactions such as biological pathways. In order to improve the performance of
extracting biomolecular events and entities with varying roles (e.g. theme, causes, and
etc.), the literature mining community has established collaborative but competitive
challenges such as the BioNLP Shared Tasks on Event Extraction [
        <xref ref-type="bibr" rid="ref10 ref11">10, 11</xref>
        ].
      </p>
      <p>
        The types of interactions (or events) among biomolecules are in general signaled
with specific interaction keywords (trigger words). For example, the interaction
keyword “up-regulates” signals an interaction of type positive regulation, whereas the
keyword “inhibits” signals an interaction of type negative regulation. We have
previously collected over 800 interaction keywords, which we used with support
vector machines (SVM) [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] to classify pairs of genes or proteins as interacting or not
[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. We have also shown that the usage of ontologies, such as the Vaccine Ontology
(VO), can enhance the mining of gene-gene interactions under a specific domain, for
example, the vaccine domain [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] or vaccine induced fever domain [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. The over
800 interaction-associated keywords provide us tags for mining interactive relation
between two genes/proteins.
      </p>
      <p>
        However, this is basically a binary result of an interaction between two molecules
or entities. To extend from the binary yes/no results, we further hypothesized that the
ontological classification of these and more keywords would allow us to further
identify and classify the types of interactions (e.g., regulation of transcription). Based
on this hypothesis, we ontologically classified these interaction-related keywords in
the Interaction Network Ontology (INO), a community-driven ontology of biological
interactions, pathways, and networks [
        <xref ref-type="bibr" rid="ref13 ref15">13, 15</xref>
        ]. INO classifies and represents different
levels of interaction keywords used for literature mining of genetic interaction
networks. Its development follows the Open Biological/Biomedical Ontology (OBO)
Foundry ontology development principles (e.g., openness and collaboration) [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. We
also showed the utility of using INO and a modified Fisher's exact test to analyze
significantly over- and under-represented enriched gene-gene interaction types among
the vaccine-associated gene-gene interactions extracted using all PubMed abstracts
[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Our study showed that INO would provide a new platform for efficient mining
and analysis of topic-specific gene interaction networks.
      </p>
      <p>
        Nevertheless, there still exist two more challenges in regards to the INO-based
classification method. The first is that the INO-based data standardization is not easy
for tool developers to deploy. The second is that current INO-based classification
focuses on the classification of interaction types signaled with one keyword in a
sentence. However, it is quite frequent that two or more interaction-related keywords
collectively signal an interaction type in a sentence. Such combinations of keywords
were discussed in the Discussion section of our previous paper without further
exploration [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. In this article, we report our effort to address these two challenges,
including the further development and standardization of INO-based classification
method and INO-based classification of multiple interaction keywords representing
interaction types in sentences. We have also applied these in two use case studies.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Methods</title>
      <sec id="sec-2-1">
        <title>INO ontology modeling and editing</title>
        <p>
          INO was formatted using the Description Logic (DL) version of the Web Ontology
Language (OWL2) [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. The Protégé OWL Editor [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] was used to add and edit INO
specific terms. To identify INO interaction types containing two or more keywords
used for literature mining of gene-gene interactions, we manually annotated sentences
from selected PubMed abstracts as described later and ontologically modeled each
interaction types in INO.
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2 SPARQL query of the INO subset of interaction keywords used for literature mining of gene-gene interactions</title>
        <p>
          The Ontobee SPARQL endpoint (http://www.ontobee.org/sparql) was used to obtain
the literature mining keywords by querying the INO ontology content stored in the He
Group RDF triple store [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. This triple store was developed based on the Virtuoso
system. The data in the triple store can be queried using the standard Virtuoso
SPARQL queries.
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3 OntoFox extraction of an INO subset of interaction terms that can be classified by two or more keywords in one sentence</title>
        <p>
          All the INO terms containing literature mining keywords composed of multiple words
were identified, and a subset of INO containing these terms and related terms was
extracted using the OntoFox tool [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ].
2.4
        </p>
      </sec>
      <sec id="sec-2-4">
        <title>Gold standard LLL data analysis</title>
        <p>
          In order to analyze the characteristics of interactions which are signaled with more
than one keywords, we manually annotated the gene/protein interaction data set from
the Learning Logic in Language (LLL) Challenge [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ] for the interaction types and
the keywords that signal them. Two experts reviewed the output of the single-word
interaction keywords identified by SciMiner, then carefully examined for
multikeyword interactions. Discrepancy was resolved by agreement between two experts.
2.5
        </p>
      </sec>
      <sec id="sec-2-5">
        <title>Vaccine gene-gene interaction literature mining use case</title>
        <p>
          In our previous paper, we used ontology-based SciMiner [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ] to extract and analyze
gene-gene interactions in the vaccine domain using all PubMed abstracts [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. In this
paper, we further annotated those sentences including two or more interaction-related
keywords for annotating gene-gene interactions. The results were then systematically
analyzed.
3
3.1
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <sec id="sec-3-1">
        <title>INO representation of interaction terms and literature mining keywords</title>
        <p>
          As defined previously, INO is aligned with the upper level Basic Formal Ontology
(BFO) [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. In INO, a biological interaction is defined as a processual entity that has
two or more participants (i.e., interactors) that have an effect upon one another. To
support ontology reuse and data integration, INO imports many terms from existing
ontologies [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ], such as the Gene Ontology (GO) [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ], and PSI Molecular
Interactions (PSI-MI) [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ]. As of August 12th, 2015 INO has 571 terms including 153
terms with INO prefix and 418 terms imported from 10 other ontologies
(http://www.ontobee.org/ontostat.php?ontology=INO).
        </p>
        <p>In the present study, we focused on the branch of gene-gene regulation,
particularly gene expression regulation. For the INO term ‘gene expression
regulation’, the input interactor is a gene, the output interactor is a gene product
including a RNA or protein, and the regulator is typically a protein. There exist
different subtypes of ‘gene expression regulation’, for example, positive or negative
regulation of gene expression, and regulation of transcription or translation.</p>
        <p>Fig. 1 shows an example of how INO defines the term ‘regulation of
transcription’. In addition to its text definition, INO also generates many logic
axioms. An equivalent class definition of the term is defined: regulates some ‘gene
transcription’, where ‘regulates’ is an object property (or called relation) and ‘gene
transcription’ is a gene expression process that transcribes a gene to RNA. In addition
to asserted axioms, many axioms are also inherited for the parents of the term
‘regulation of transcription’ (Fig. 1).</p>
        <p>
          Various subtypes of ‘regulation of transcription’ exist. For example, there are
different subtypes of positive or negative regulation of transcription. One commonly
seen subtype of regulation of transcription is via a promoter. A promoter is a region of
DNA located near the transcription start site of a gene, and the binding between a
promoter sequence and a transcription factor is required to initiate a transcription. The
phrase of a sentence “sigmaB- and sigmaF-dependent promoters of katX” [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ]
indicates that sigmaB and sigmaF regulate katX through the katX promoters.
        </p>
        <p>
          Some interactions are characterized with a single interaction keyword. For
example, in the sentence “Dephosphorylation of SpoIIAA-P by SpoIIE is strictly
dependent on the presence of the bivalent metal ions Mn2+ or Mg2+” [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ], the type of
interaction between SpoIIAA-P and SpoIIE is dephosphorylation reaction, which is
characterized with the interaction keyword “dephosphorylation”.
        </p>
        <p>
          On the other hand, there are also more complex interactions that are characterized
with two or more interaction keywords. Consider the sentence “In the mother cell
compartment of sporulating cells, expression of the sigE gene, encoding the
earlieracting sigma factor, sigmaE, is negatively regulated by the later-acting sigma factor,
sigmaK” [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ]. The relation between the SigE and SigmaK genes is characterized with
the interaction keywords “expression” and “negatively regulated”. The type of
relation is negative regulation of gene expression. SigmaK negatively regulates the
expression of SigE. Such relations are represented as complex events in the Genia
event corpus [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ] used in the BioNLP Shared Tasks, where the expression of SigE is
considered as the first event and the negative regulation of this event by the SigmaK
gene is considered as the second event. In contrast, INO represents such complex
events using a different strategy as described below.
        </p>
        <p>BFO upper
level terms</p>
      </sec>
      <sec id="sec-3-2">
        <title>INO-based standardization of literature mining of gene-gene interactions</title>
        <p>
          As shown in Fig. 1, the literature mining keywords for an INO term are defined as an
annotation using the annotation property ‘has literature mining keywords’. To provide
a reproducible strategy of representing the literature mining keywords, we used the
sign “//” to separate two keywords, which indicates that these two keywords do not
have to be next to each other in a sentence (Fig. 1). For example, many keywords are
added for the INO term ‘regulation of transcription’ (INO_0000032), including
“transcription // dependent, regulated // transcription, requires // transcription”.
These terms mean that the two keywords such as “requires” and “transcription” can
be separate in one sentence, for example, “sspG transcription also requires the DNA
binding protein GerE” [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ].
        </p>
        <p>Different ways can be used to get the information of keywords. One way is to
query INO using SPARQL. To show how we can quickly obtain the INO literature
mining keywords, we have shown the usage of a SPARQL query to automatically
generate the INO subset for literature mining (Fig. 2).</p>
        <p>
          Before the SPARQL can be executed, the INO ontology content should be first
deposited in RDF triple store. Indeed, the INO is included in the Hegroup RDF Triple
Store [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ], which is the default RDF triple store for the ontologies in the Open
Biological and Biomedical Ontologies (OBO) library (http://www.obofoundry.org/).
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>Incorporation of INO literature mining system to a software program</title>
        <p>
          SciMiner [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ] is our in-house literature mining software program for identifying
interactions among genes/proteins/vaccines and analyzing their biological
significance. We recently incorporated INO into SciMiner and demonstrated its
successful application to the identification of specific interaction types significantly
associated with gene-gene interactions in the context of vaccine [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. SciMiner can
also be utilized in identifying and modeling two interaction keywords, which will be
eventually used to improve the final literature-mined interaction network.
        </p>
        <p>Fig. 3 illustrates the overall workflow of INO modeling and its application in
literature mining for gene-interaction analysis. Briefly, the INO modeling procedure
aims at identifying and classifying the interaction patterns of two INO keywords.
Sentences with potential multiple interaction keywords (from gold standard sets) are
first scanned to identify individual single-word INO keywords and biological entities.
For any sentences with two or more interaction keywords identified, combinations of
two keywords are queried against the dictionary of keywords associated with existing
INO interaction classes. For any two keyword patterns that are not included in the
current dictionary, INO experts manually examine the sentences and two-keyword
patterns to confirm their valid interactions, update the INO annotations accordingly
with new entries, and upload the updated INO to a RDF triple store. Then, SPARQL
can be used to create new INO keyword dictionary for literature mining.</p>
        <p>Once INO-interaction keyword dictionary is established, it can be applied to
constructing interaction networks of biological entities from any set of biomedical
literature using SciMiner (as shown in the right part of Fig. 3). Briefly, SciMiner
accepts PubMed abstracts or sentences as input. After internal preprocessing of the
abstracts/sentences, SciMiner identifies biological entities such as gene/protein or any
ontologies (e.g. vaccine ontology) as well as single-word level INO terms. From the
sentences with at least two identified entities and one or more INO terms are used in
the interaction modeling. Sentences with two interaction keywords will further go
through multi-keyword interaction modeling, and a final interaction network will be
generated and subjected to down-stream functional analysis. A standalone
commandline based SciMiner, rather than the web version, was used in the current study and
the complete standalone pipeline will be available upon completion of the
development.</p>
        <p>Fig 3. INO modeling and application workflow.
3.4</p>
      </sec>
      <sec id="sec-3-4">
        <title>Annotation of the LLL data set for interaction types</title>
        <p>
          The LLL data set contains gene/protein interactions in Bacillus subtilis, which is a
model bacterium [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. The data set contains 77 sentences and 164 pairs of
genes/proteins that are described as interacting in these sentences. As an example,
consider the sample sentence “Transcriptional studies showed that nadE is strongly
induced in response to heat, ethanol and salt stress or after starvation for glucose in a
sigma B-dependent manner.” [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ] from the LLL data set. The interacting protein/gene
pairs (i.e., nadE and sigma B) have already been annotated in the data set. Given the
sentence and the interacting pair of proteins/genes, we annotated the type of relation
between them and the interaction keywords signaling this relation. The type of
interaction between nadE and Sigma B is “positive regulation of gene transcription”,
in other words Sigma B positively regulates the transcription of nadE. The relevant
interaction keywords are “transcriptional”, “induced”, and “dependent”. Our
interaction type and keyword annotation of the data set will be made publicly
available for future studies.
        </p>
        <p>Our annotation of the LLL data set for interaction types showed that many
regulatory relations between gene/protein pairs are represented with multiple
keywords. While the interactions among 43 pairs of genes/proteins were represented
with a single keyword, the interactions among 116 pairs were signaled using multiple
keywords. These interactions correspond to 27 different classes of regulation in INO.
Fig. 4 shows the hierarchical structure of these 27 classes, their related classes, and
the number of gene/protein pairs in the sentences identified for each class.
1
1
1
23
2
3
2
1
2
2
2
12
6
1
9
5
11
1
4
3
3
1</p>
        <p>Our study of the LLL dataset indicated that the majority of the sentences are
related to the gene expression regulation, especially in the area of transcriptional
regulation. More sentences describe positive regulation rather than negative
regulation. An interesting observation is the presence of many sentences focusing on
the domain of promoter-based regulation of transcription (Fig. 1). In addition to gene
expression regulation, this data set also includes other types of gene regulation, for
example, regulation of protein location, regulation of gene activation, and regulation
of protein activity. It is noted that protein activity is different from gene expression.
Protein activity depends on many factors other than expression, such as correct
folding of the protein and the presence of any required cofactors.</p>
        <p>
          Our analysis showed that most multi-keyword interactions are represented with
two keywords. Consider the interaction between KinC and Spo0A~P in the sentence
“KinC and KinD were responsible for Spo0A~P production during the exponential
phase of growth in the absence of KinA and KinB” [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ]. This sentence states that
KinC is responsible for Spo0A~P production. The interaction type between these
genes is classified as “regulation of translation” in INO. The two keywords signaling
this interaction are “responsible” and “production”. The keyword “responsible”
signals that this is an interaction of type “regulation”, whereas the keyword
“production” signals that this is a specific type of regulation, namely “regulation of
translation”. We can consider “responsible” as the main type signaling keyword and
“production” as the secondary (sub) type signaling keyword.
        </p>
        <p>
          There are also more complex interactions, which are represented with more than
two keywords. For example, in the sentence “A low concentration of GerE activated
cotB transcription by final sigma(K) RNA polymerase, whereas a higher
concentration was needed to activate transcription of cotX or cotC.” [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ], the
interaction between GerE and cotB is signaled with the three keywords “low
concentration”, “activated”, and “transcription”. The type of interaction corresponds
to the INO class “activation of gene transcription by low level protein”. In another
sentence “sigmaH-dependent promoter is responsible for yvyD transcription” [
          <xref ref-type="bibr" rid="ref33">33</xref>
          ],
four keywords are used: “dependent”, “promoter”, “responsible”, and “transcription”.
Such a complex interaction is labeled as “promoter-based regulation of transcription”
in INO.
3.5
        </p>
      </sec>
      <sec id="sec-3-5">
        <title>Analysis of vaccine-based gene-gene interaction literature mining results</title>
        <p>
          Our previous INO-based literature mining study used an INO-based SciMiner
program to identify many gene-gene interactions in the vaccine domain using all
PubMed abstracts [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. A statistical method based on the results was also developed
to classify significantly over- and under-represented interaction types. Our manual
examination of randomly selected 50 sentences identified by SciMiner, a small
portion of the whole vaccine corpus, suggested that similar to the LLL data set, over
50% of sentences use two or more keywords to represent specific gene-gene
interaction types.
4
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Discussion</title>
      <p>In this paper, we investigated the interaction types that are characterized with
multiple keywords used in combination. The main contributions are: (1) Extending
INO by modeling interaction types (classes) each signaled with multiple keywords in
literature sentences and adding many new terms by analyzing the LLL and vaccine
data sets, (2) Standardizing INO-based literature mining for easy use and testing by
future studies. (3) Characterizing and demonstrating multi-keyword interaction type
ontology modeling of literature sentences by analyzing the LLL and vaccine-gene
interaction data sets.</p>
      <p>
        Multi-keyword interactions have been represented as complex events in the Genia
corpus [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ], which has also been used in the BioNLP Shared Tasks on Event
Extraction. In this representation, in order to identify the complex events, first the
simple events (e.g. gene expression, regulation) signaled with individual keywords
need to be identified. Next, the simple events are combined to form a complex event.
For instance, given a sentence that states that gene A regulates the expression of gene
B, the expression of gene B is represented as Event 1 (i.e., expression of gene B), and
Event 2 is a complex event where gene A regulates Event 1. Therefore, we could infer
a possible relation between gene A and gene B, by the association of Event 1 – gene
B – Event 2 – gene A. Such recognition of the gene A-B interaction is indirect, and
may become even more complex when multiple events (with multiple keywords) are
applied. Compared to the Genie approach, INO provides a more fine-grained and
direct classification of interaction types and can directly model the relation between
two biomolecules (e.g., genes or proteins). For instance, the interaction between gene
A and gene B in the above example is directly modeled as the interaction type
“regulation of gene expression” in INO.
      </p>
      <p>
        The Gene Regulation Ontology (GRO) [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ] models complex gene regulatory
events similarly to INO. GRO has recently been used in the Corpus Annotation with
Gene Regulation Ontology Task in the 2013 edition of BioNLP Shared Task [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ].
The domains of GRO and INO differ. GRO focuses on only gene regulations.
However, INO targets the broader scope of interactions and interaction networks.
Similar to INO, GRO is also aligned with the Basic Formal Ontology (BFO) and
many other ontologies such as the Gene Ontology (GO). However, for the ontology
alignments, GRO uses its own identifiers and references back to the original
ontologies; in contrast, INO directly imports related terms from other ontologies.
Technical representations of entities in INO and GRO also differ in many aspects.
Compared to GRO, one of the main advantages of INO is that the interaction types
and sub-types are associated with manually compiled comprehensive lists of literature
mining keywords. These keywords can be incorporated in dictionary-based or
statistical taggers for tagging the interaction keywords in text, which can then be used
to map the interactions to their corresponding types in INO.
      </p>
      <p>Future work includes automatic identification and modeling of novel two keyword
interactions by SciMiner, and a new notation of multi-keyword interactions using
regular expressions to be more systematic rather than the current ‘//’-based strategy.
In this paper we demonstrated our strategy of integrating INO with the SciMiner
tagger for ontology-based literature mining. Currently, the integrated INO-SciMiner
works as a standalone package, and it can be easily incorporated into other literature
mining pipelines, if desired. The current SciMiner system can identify gene/protein
and vaccine, but is being upgraded to be able to identify other entities such as drug,
tissue, and etc., thus, the future version of INO-integrated SciMiner can be applied to
not only the typical gene-gene interaction, but also other interactions such as
genedrug interaction, drug-chemical, drug-tissue and various types of interaction.
Acknowledgments. This research was supported by grant R01AI081062 from the US
NIH National Institute of Allergy and Infectious Diseases (to YH) and Marie Curie
FP7-Reintegration-Grants within the 7th European Community Framework
Programme (to AO).</p>
    </sec>
    <sec id="sec-5">
      <title>References</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Arighi</surname>
            ,
            <given-names>C.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krallinger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cohen</surname>
            ,
            <given-names>K.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wilbur</surname>
            ,
            <given-names>W.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valencia</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hirschman</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>C.H.</given-names>
          </string-name>
          :
          <article-title>Overview of the BioCreative III Workshop</article-title>
          . BMC Bioinformatics.
          <volume>12</volume>
          <issue>Suppl 8</issue>
          ,
          <issue>S1</issue>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Krallinger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leitner</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodriguez-Penagos</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valencia</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Overview of the protein-protein interaction annotation extraction task of BioCreative II</article-title>
          .
          <source>Genome Biol</source>
          .
          <volume>9</volume>
          <issue>Suppl 2</issue>
          ,
          <issue>S4</issue>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Jelier</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jenster</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dorssers</surname>
          </string-name>
          , L.C.,
          <string-name>
            <surname>van der Eijk</surname>
          </string-name>
          , C.C.,
          <string-name>
            <surname>van Mulligen</surname>
            ,
            <given-names>E.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mons</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kors</surname>
            ,
            <given-names>J.A.:</given-names>
          </string-name>
          <article-title>Co-occurrence based meta-analysis of scientific texts: retrieving biological relationships between genes</article-title>
          .
          <source>Bioinformatics</source>
          .
          <volume>21</volume>
          ,
          <fpage>2049</fpage>
          -
          <lpage>2058</lpage>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Fundel</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuffner</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zimmer</surname>
          </string-name>
          , R.: RelEx-
          <article-title>-relation extraction using dependency parse trees</article-title>
          .
          <source>Bioinformatics</source>
          .
          <volume>23</volume>
          ,
          <fpage>365</fpage>
          -
          <lpage>371</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Daraselia</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yuryev</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Egorov</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Novichkova</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nikitin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mazo</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Extracting human protein interactions from MEDLINE using a full-sentence parser</article-title>
          .
          <source>Bioinformatics</source>
          .
          <volume>20</volume>
          ,
          <fpage>604</fpage>
          -
          <lpage>611</lpage>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Temkin</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gilder</surname>
            ,
            <given-names>M.R.</given-names>
          </string-name>
          :
          <article-title>Extraction of protein interaction information from unstructured text using a context-free grammar</article-title>
          .
          <source>Bioinformatics</source>
          .
          <volume>19</volume>
          ,
          <fpage>2046</fpage>
          -
          <lpage>2053</lpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Airola</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pyysalo</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bjorne</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pahikkala</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ginter</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Salakoski</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>All-paths graph kernel for protein-protein interaction extraction with evaluation of cross-corpus learning</article-title>
          .
          <source>BMC Bioinformatics. 9 Suppl</source>
          <volume>11</volume>
          ,
          <issue>S2</issue>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Tikk</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , Thomas,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Palaga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Hakenberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Leser</surname>
          </string-name>
          ,
          <string-name>
            <surname>U.</surname>
          </string-name>
          :
          <article-title>A comprehensive benchmark of kernel methods to extract protein-protein interactions from literature</article-title>
          .
          <source>PLoS Comput. Biol</source>
          .
          <volume>6</volume>
          ,
          <issue>e1000837</issue>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Erkan</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Özgür</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Radev</surname>
            ,
            <given-names>D.R.</given-names>
          </string-name>
          :
          <article-title>Semi-supervised classification for extracting protein interaction sentences using dependency parsing</article-title>
          .
          <source>In: EMNLP-CoNLL</source>
          , pp.
          <fpage>228</fpage>
          -
          <lpage>237</lpage>
          . (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>J.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ohta</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pyysalo</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kano</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tsujii</surname>
            ,
            <given-names>J.i.</given-names>
          </string-name>
          :
          <article-title>Overview of BioNLP'09 shared task on event extraction</article-title>
          .
          <source>In: Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task</source>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>9</lpage>
          . Association for Computational Linguistics, (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>J.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nguyen</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tsujii</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Takagi</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yonezawa</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>The Genia Event and Protein Coreference tasks of the BioNLP Shared Task 2011</article-title>
          .
          <source>BMC Bioinformatics</source>
          .
          <volume>13</volume>
          <issue>Suppl 11</issue>
          ,
          <issue>S1</issue>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Joachims</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Making large-scale support vector machine learning practical</article-title>
          . In: B.
          <string-name>
            <surname>Schölkopf</surname>
            ,
            <given-names>C.J.B.</given-names>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A. J.</given-names>
            <surname>Smola</surname>
          </string-name>
          , Eds. (ed.)
          <source>Advances in Kernel Methods: Support Vector Learning</source>
          , pp.
          <fpage>169</fpage>
          -
          <lpage>184</lpage>
          . MIT Press, Cambridge, MA. (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Ozgur</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xiang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Radev</surname>
            ,
            <given-names>D.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Mining of vaccine-associated IFN-gamma gene interaction networks using the Vaccine Ontology</article-title>
          .
          <source>Journal of Biomedical Semantics</source>
          .
          <volume>2</volume>
          <issue>Suppl 2</issue>
          ,
          <issue>S8</issue>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Hur</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ozgur</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xiang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Identification of fever and vaccine-associated gene interaction networks using ontology-based literature mining</article-title>
          .
          <source>Journal of Biomedical Semantics</source>
          .
          <volume>3</volume>
          ,
          <issue>18</issue>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Hur</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ozgur</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xiang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Development and application of an interaction network ontology for literature mining of vaccine-associated gene-gene interactions</article-title>
          .
          <source>Journal of Biomedical Semantics. 6</source>
          ,
          <issue>2</issue>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Grenon</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>SNAP and SPAN: Towards Dynamic Spatial Ontology</article-title>
          .
          <source>Spatial Cognition and Computation</source>
          .
          <volume>4</volume>
          ,
          <fpage>69</fpage>
          -
          <lpage>103</lpage>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17. http://www.w3.org/TR/2009/REC-owl2
          <string-name>
            <surname>-</surname>
          </string-name>
          overview-20091027/
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>18. http://protege.stanford.edu/</mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Xiang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mungall</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruttenberg</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Ontobee: A linked data server and browser for ontology terms</article-title>
          .
          <source>In: The 2nd International Conference on Biomedical Ontologies (ICBO)</source>
          , pp.
          <fpage>279</fpage>
          -
          <lpage>281</lpage>
          . CEUR Workshop Proceedings, (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Xiang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Courtot</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brinkman</surname>
            ,
            <given-names>R.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruttenberg</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>OntoFox: web-based support for ontology reuse</article-title>
          .
          <source>BMC Research Notes</source>
          .
          <volume>3</volume>
          :
          <issue>175</issue>
          ,
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Nedellec</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Learning language in logic-genic interaction extraction challenge</article-title>
          .
          <source>In: Proceedings of the 4th Learning Language in Logic Workshop (LLL05)</source>
          . (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Hur</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schuyler</surname>
            ,
            <given-names>A.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>States</surname>
            ,
            <given-names>D.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Feldman</surname>
            ,
            <given-names>E.L.</given-names>
          </string-name>
          :
          <article-title>SciMiner: web-based literature mining tool for target identification and functional enrichment analysis</article-title>
          .
          <source>Bioinformatics</source>
          .
          <volume>25</volume>
          ,
          <fpage>838</fpage>
          -
          <lpage>840</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Ashburner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ball</surname>
            ,
            <given-names>C.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blake</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Botstein</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Butler</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cherry</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davis</surname>
            ,
            <given-names>A.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dolinski</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dwight</surname>
            ,
            <given-names>S.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eppig</surname>
            ,
            <given-names>J.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Harris</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hill</surname>
            ,
            <given-names>D.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Issel-Tarver</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kasarskis</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lewis</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Matese</surname>
            ,
            <given-names>J.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Richardson</surname>
            ,
            <given-names>J.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ringwald</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rubin</surname>
            ,
            <given-names>G.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sherlock</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Gene ontology: tool for the unification of biology. The Gene Ontology Consortium</article-title>
          .
          <source>Nature Genetics</source>
          .
          <volume>25</volume>
          ,
          <fpage>25</fpage>
          -
          <lpage>29</lpage>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Isserlin</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>El-Badrawi</surname>
            ,
            <given-names>R.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bader</surname>
            ,
            <given-names>G.D.</given-names>
          </string-name>
          :
          <article-title>The Biomolecular Interaction Network Database in PSI-MI 2.5</article-title>
          .
          <string-name>
            <surname>Database</surname>
          </string-name>
          (Oxford).
          <year>2011</year>
          , baq037 (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Petersohn</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Engelmann</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Setlow</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hecker</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The katX gene of Bacillus subtilis is under dual control of sigmaB and sigmaF</article-title>
          .
          <source>Molecular Genetics and Genomics</source>
          .
          <volume>262</volume>
          ,
          <fpage>173</fpage>
          -
          <lpage>179</lpage>
          (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Schroeter</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schlisio</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lucet</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yudkin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Borriss</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>The Bacillus subtilis regulator protein SpoIIE shares functional and structural similarities with eukaryotic protein phosphatases 2C</article-title>
          .
          <source>FEMS Microbiology Letters</source>
          .
          <volume>174</volume>
          ,
          <fpage>117</fpage>
          -
          <lpage>123</lpage>
          (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Struffi</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kroos</surname>
          </string-name>
          , L.:
          <article-title>sigmaK can negatively regulate sigE expression by two different mechanisms during sporulation of Bacillus subtilis</article-title>
          .
          <source>Journal of Bacteriology</source>
          .
          <volume>181</volume>
          ,
          <fpage>4081</fpage>
          -
          <lpage>4088</lpage>
          (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>J.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ohta</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tsujii</surname>
          </string-name>
          , J.:
          <article-title>Corpus annotation for mining biomedical events from literature</article-title>
          .
          <source>BMC Bioinformatics</source>
          .
          <volume>9</volume>
          ,
          <issue>10</issue>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Bagyan</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Setlow</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Setlow</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>New small, acid-soluble proteins unique to spores of Bacillus subtilis: identification of the coding genes and regulation and function of two of these genes</article-title>
          .
          <source>Journal of Bacteriology</source>
          .
          <volume>180</volume>
          ,
          <fpage>6704</fpage>
          -
          <lpage>6712</lpage>
          (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Antelmann</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schmid</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hecker</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The NAD synthetase NadE (OutB) of Bacillus subtilis is a sigma B-dependent general stress protein</article-title>
          .
          <source>FEMS Microbiology Letters</source>
          .
          <volume>153</volume>
          ,
          <fpage>405</fpage>
          -
          <lpage>409</lpage>
          (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <surname>Jiang</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shao</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Perego</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hoch</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          :
          <article-title>Multiple histidine kinases regulate entry into stationary phase and sporulation in Bacillus subtilis</article-title>
          .
          <source>Molecular Microbiology</source>
          .
          <volume>38</volume>
          ,
          <fpage>535</fpage>
          -
          <lpage>542</lpage>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32.
          <string-name>
            <surname>Ichikawa</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kroos</surname>
            ,
            <given-names>L.:</given-names>
          </string-name>
          <article-title>Combined action of two transcription factors regulates genes encoding spore coat proteins of Bacillus subtilis</article-title>
          .
          <source>Journal of Biological Chemistry</source>
          .
          <volume>275</volume>
          ,
          <fpage>13849</fpage>
          -
          <lpage>13855</lpage>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          33.
          <string-name>
            <surname>Drzewiecki</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eymann</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mittenhuber</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hecker</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The yvyD gene of Bacillus subtilis is under dual control of sigmaB and sigmaH</article-title>
          .
          <source>Journal of Bacteriology</source>
          .
          <volume>180</volume>
          ,
          <fpage>6674</fpage>
          -
          <lpage>6680</lpage>
          (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          34.
          <string-name>
            <surname>Beisswanger</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>J.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rebholz-Schuhmann</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Splendiani</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dameron</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schulz</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hahn</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Gene Regulation Ontology (GRO): design principles and use cases</article-title>
          .
          <source>Stud Health Technol Inform</source>
          .
          <volume>136</volume>
          ,
          <fpage>9</fpage>
          -
          <lpage>14</lpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          35.
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>J.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>J.J.</given-names>
          </string-name>
          , Han,
          <string-name>
            <given-names>X.</given-names>
            ,
            <surname>Rebholz-Schuhmann</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.</surname>
          </string-name>
          :
          <article-title>Extending the evaluation of Genia Event task toward knowledge base construction and comparison to Gene Regulation Ontology task</article-title>
          .
          <source>BMC Bioinformatics</source>
          .
          <volume>16</volume>
          ,
          <issue>S3</issue>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>