<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Biomedical Ontology in Action"
November</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>“Lmo-2 interacts with Elf-2 ” On the Meaning of Common Statements in Biomedical Literature</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Stefan Schulz</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ludger Jansen</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Medical Informatics, Freiburg University Hospital</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Philosophy, University of Rostock</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2006</year>
      </pub-date>
      <volume>8</volume>
      <issue>2006</issue>
      <fpage>37</fpage>
      <lpage>45</lpage>
      <abstract>
        <p>Statements about the behavior of biological entities, e.g. about the interaction between two proteins, abound in the literature on molecular biology and are increasingly becoming the targets of information extraction and text mining techniques. We show that an accurate analysis of the semantics of such statements reveals a number of ambiguities that is necessary to take into account in the practice of biomedical ontology engineering. Several concurring formalizations are proposed. Emphasis is laid on the discussion of biological dispositions.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>The study of so-called protein-protein interactions
is essential for a better understanding of biological
processes, from replication and expression of genes
to the morphogenesis of organisms. Statements
such as “Lmo-2 interacts with Elf-2 ” – with
Lmo2 and Elf-2 being proteins – occur in biomedical
literature abstracts with a very high frequency and
represent, in many cases, the core message of a
scientific paper.</p>
      <p>There are several kinds of biomolecular
interactions, e.g. binding, inhibition, activation, and
transport. They all involve (1) at least two
biomolecules and (2) the spatial vicinity of these,
which leads to (3) a causal influence that they
exert on each other.</p>
      <p>Text mining, i.e the process of extracting
structured knowledge from unstructured text,
primarily targets statements such as these, and there is
a major interest by the text mining community in
obtaining ontological support for their information
and knowledge extraction activities. This is one
of the reasons why so-called bio-ontologies have
emerged, and the use of formal ontological criteria
has been repeatedly advocated in order to
facilitate the process of automatic processing of domain
information.</p>
      <p>
        Much work in this area has already been done in
the form of ontological investigations on
material continuants, such as organs, cells, molecules
[
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1, 2, 3</xref>
        ]. However, there has been much less
emphasis on biologically relevant functions and
processes. Furthermore, biomedical ontology
engineering has been mainly committed to traditions
of semantic networks, lexical semantics, and
cognitive science. Thus rather than being construed as
describing real world entities by means of logical
expressions, ontology has been understood as
relating concepts (i.e. representations of word
meanings) by means of conceptual relations. On this
assumption, “Lmo-2 interacts with Elf-2 ” would
simply signify that there is some plausible
linkage between the concepts (conceived of as
mental representations) “Interaction”, “Lmo-2”, and
“Elf-2”. As much as this approach might be
adequate for communicating knowledge about the
world by means of natural language or some kind
of abstraction (e.g. semantic networks), it fails
where exact statements and reasoning about
biological entities such as molecules, functions, or
pathways are required.
      </p>
      <p>Interestingly enough, scientists and other human
agents are perfectly able to communicate by means
of such sentences, although there is only a vague
consensus about the referents (the entities in the
world) which are denoted by these linguistic
expressions. Because of the ambiguities of natural
language, a natural language statement like
“Lmo2 interacts with Elf-2 ” may have more than one
possible interpretation and thus more than one
formalization in, say, first order predicate logic. In
formally representing the meaning of such
statements, we have thus to make explicit the
ontological assumptions intended by the speakers or
authors of that sentence.</p>
      <p>In this paper we will demonstrate that even the
formalization of an apparently simple but
prototypical statement about protein interaction like
“Lmo-2 interacts with Elf-2 ” can yield totally
different ontological assumptions.</p>
    </sec>
    <sec id="sec-2">
      <title>Basic Ontological Assumptions</title>
      <p>
        It is widely recognized that the construction of
biomedical ontologies should obey strict logical and
ontological criteria. To this end, several top-level
ontologies have been devised, such as DOLCE [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ],
BFO [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], and GOL [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. These ontologies mainly
coincide in their fundamental division between
continuants (endurants, e.g. material objects) and
occurrents (perdurants, e.g. events, processes).
The distinction is that occurrents have temporal
parts (they are never fully present at a given time)
and they are existentially dependent on
continuants. Continuants are split into independent and
dependent ones. Examples of independent
continuants are material objects and spaces. Dependent
continuants, on the other hand, are entities which
inhere in something and are thus ontologically
dependent on their bearer. Examples of dependent
continuants are masses, colors, and tendencies: A
particular mass may inhere in a particular
molecule, a particular color may inhere in a particular
flower. The tendency to divide may inhere in a cell
and the tendency to relieve headache may inhere
in an aspirin tablet. Tendencies are related to
occurrents by the relation of realization. They are
special kinds of dependent entities, in that they
need not be realized in order to exist. There are
cells which never divide, and aspirin tablets that
never relieve a headache.
      </p>
      <p>
        Our ontological framework for describing
molecular interaction patterns includes entities of all
these kinds. For instance, a protein molecule,
which is a material continuant, has a disposition
to perform a certain function, e.g. binding, which
is a dependent continuant, and an actual
realization of this disposition, viz. the process of binding
a protein molecule, which is an occurrent.
Aware of the need to comply with existing
standards for ontologies, especially in the light of the
Semantic Web and the various specifications of
Description Logic (DL) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], we keep our logic simple.
So we refrain from higher-order logics, as well as
temporal or modal logics. We also use a
parsimonious set of relations, following the OBO (Open
Biological Ontologies) recommendation [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. As a
primitive formal relation we introduce the
irreflexive, non-transitive and asymmetric instantiation
relation inst which relates particular entities to
their universal properties. In addition, we need
a formal relation for class subsumption between
universals, expressing scientific findings about
relations between the kinds of things that are in the
world. As scientific laws are meant to range not
only over all present instances of a given kind,
but also over all past and future instances and,
moreover, also over merely possible instances [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ],
such a relation is not easily defined. We will here
follow the OBO standard and introduce, to this
end, the taxonomic subsumption relation Is-a by
means of the inst 1 relation [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. We will neglect
the time parameter, which is not important for
present purposes. On this basis, we define Is-a as
a reflexive, transitive, and antisymmetric relation
between universals A and B, as follows:
      </p>
      <p>
        Is-a(A, B) =def
∀x : (inst(x, A) → inst(x, B))
(1)
Furthermore, we make the following ontological
subdivision: When we deal with things of a certain
kind, we have to distinguish between individuals
belonging to this kind and collectives of
individuals that belong to the same kind [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. This very
natural ontological distinction, which is mirrored
by the singular / plural division in most natural
languages, must be addressed wherever collectives
or pluralities of individual objects occur. However,
this distinction is often obscured when referring
to mass entities (e.g. water vs. water molecules).
Given the atomicity of material continuants, we
do not admit material mass entities in our present
framework, but consider them as collectives of
particles instead.
      </p>
      <p>1We use capitalized initial letters for the names of
relations between universals as well as for the names
of universals.
A collective is given, e.g., by all Lmo-2 molecules
involved in an experiment, as opposed to exactly
one individual Lmo-2 molecule. In the
following, we will use the subscript ”coll” to refer to
collectives. Thus, for each universal X we
pricipally admit the existence of a corresponding
collective XCOLL, the class of collections of instances
of X . For instance, “P roteinM oleculeCOLL”
denotes the class of collectives of protein molecules
as well as “Lmo-2 COLL” the class of collectives
of Lmo-2 molecules. We also admit collectives of
occurrents, such as InteractionCOLL.</p>
    </sec>
    <sec id="sec-3">
      <title>Concurrent interpretations I:</title>
    </sec>
    <sec id="sec-4">
      <title>Event Readings</title>
      <p>Let us come back to our example: “Lmo-2
interacts with Elf-2 ”. Such statements are generally
formulated by researchers who collect scientific
evidence by empirical observations. These
observations are commonly made in an indirect way, since
the objects under scrutiny are below the threshold
of visibility. For this reason measurement
procedures of varying degrees of sophistication are
applied, the results of which can be used to draw
conclusions about the significance of an experiment.
These conclusions may vary in their degrees of
certainty. This certainty is affected by measurement
errors as well as by errors in the design of the
experiment which then may lead to false conclusions.
If a statement like “Lmo-2 interacts with Elf-2 ”
is being uttered in a laboratory or written in a
scientific paper or textbook, the minimal thing that
can be inferred is that there are molecules of type
Lmo-2 and Elf-2. By way of contrast, this
inference is not possible if such a sentence appears
in a science fiction novel. As we are interested
here in the scientific context only, we assume in
what follows that there are universals Lmo-2 and
Elf-2 that are kinds of protein molecules. These
types of protein molecules do, of course, belong to
the genus of protein molecules, which in turn are
molecules, which are a kind of continuants. If one
has an Aristotelian theory of universals, universals
only exist if they are instantiated; that is, the
existence of the universals Lmo-2 and Elf-2 implies
that there are individual molecules that
instantiate these universals. (Whoever has a different
theory of universals may have to add this as a further
assumption.) All the possible readings of “Lmo-2
interacts with Elf-2 ” to be discussed in the
remainder of this paper have thus the following as a
common ground:</p>
      <sec id="sec-4-1">
        <title>Is-a(Lmo-2, P roteinM olecule) ∧</title>
      </sec>
      <sec id="sec-4-2">
        <title>Is-a(Elf -2, P roteinM olecule) ∧</title>
      </sec>
      <sec id="sec-4-3">
        <title>Is-a(P roteinM olecule, M olecule) ∧</title>
      </sec>
      <sec id="sec-4-4">
        <title>Is-a(M olecule, Continuant) ∧ ∃l, e : inst(l, Lmo-2) ∧ inst(e, Elf -2)</title>
        <p>(2)
Despite this common ground, the sentence remains
highly ambiguous even within a scientific context.
First we will discuss interpretations of “Lmo-2
interacts with Elf-2 ” that interpret it as a report of
events. Here are some possible interpretations of
the sample statement that belong to this group:
1. One individual Lmo-2 molecule interacts with
one individual Elf-2 molecule.
2. A collection of Lmo-2 molecules interacts with
one individual Elf-2 molecule.
3. One individual Lmo-2 molecule interacts with
a collection of Elf-2 molecules.
4. A collection of Lmo-2 molecules interacts with
a collection of Elf-2 molecules.</p>
        <p>Our sample statement appears to describe the fact
that exactly one such interaction happened.
Alternatively, it can describe the fact that a multitude
of such interactions (as described in 1-4) happens,
which would be the normal thing in many
biochemical contexts. This adds up to eight different
interpretations. But any of these interpretations
is still ambiguous in a very important respect.
With each of these interpretations, the speaker
may mean either that such interaction(s) did
actually happen, or the speaker may mean that the
molecules in question have the disposition or the
tendency to interact in such a way. This gives
way to even more possible interpretations. Thus,
“Lmo-2 interacts with Elf-2 ” turns out to be a
highly ambiguous sentence. We will now discuss
the different possible interpretations of this
sentence in turn and suggest methods for representing
them formally.</p>
        <sec id="sec-4-4-1">
          <title>Occurrents involving individual continuants</title>
          <p>On the first interpretation, “Lmo-2 interacts with
Elf-2 ” describes the fact that an individual Lmo-2
molecule interacts with an individual Elf-2
molecule. A standard way to render such a situation
formally would be the use of the existence
quantifier of first order predicate logic:
∃l, e : inst(l, Lmo-2)∧
(3)
inst(e, Elf -2) ∧ interacts(l, e)
This formalization ensures that there is at least
one individual Lmo-2 molecule which interacts
with at least one individual Elf-2 molecule at at
least one instant. This interpretation can now be
modified in various ways. We could, e.g., add
exclusivity postulates like in (4) that ensure that
exactly one individual molecule of each kind are
interacting with each other. Though such a solitary
event might be rarely observed in experiments,
there may be contexts where this is the intended
meaning:
∃l, e : inst(l, Lmo-2) ∧ inst(e, Elf -2)∧
(4)
interacts(l, e) ∧
∀l∗, e∗ : (inst(l∗, Lmo-2) ∧ inst(e∗, Elf -2) ∧
interacts(l∗, e∗)) → (l∗ = l ∧ e∗ = e)
Normally, however, this formalization will be
much too strong an interpretation of our sample
statement. For any statement of this form will be
false, if at any other time another Lmo-2 molecule
interacts with an Elf-2 molecule – or if at the very
same time another Lmo-2 molecule interacts with
an Elf-2 molecule at any other place. Therefore,
we do not consider it as a useful interpretation of
our sample sentence. We will, however, refer back
to this formula and the exclusivity clauses used in
it in the following discussion.</p>
          <p>
            In Formulae 3 and 4 we have expressed the
interaction event by means of a binary relation interacts
between individual continuants. This relation on
the level of instances is irreflexive (nothing ever
interacts with itself), symmetric, and non-transitive.
The OBO (Open Biological Ontologies) relation
ontologies, however, recommends to restrict
ourselves to a parsimonious array of basic relations.
Therefore, we will eliminate the interacts relation,
using the technique introduced by Davidson [
            <xref ref-type="bibr" rid="ref10">10</xref>
            ] to
quantify over events. This means that we
represent the interaction process as an occurrent entity
in its own right rather than by the relation
interacts as in Formulae 3 or 4. This move is made
possible through our admission of occurrent
entities, and it corresponds to common practice in
biomedical ontologies. The relation between the
particular process and the participating
particular continuants is then given by the relation
hasparticipant [
            <xref ref-type="bibr" rid="ref8">8</xref>
            ]. The has-participant relation is a
relation between a particular occurrent and a
particular continuant, in this order. It is irreflexive,
asymmetric, and non-transitive. For nothing
participates in a continuant and no occurrent
participates in anything. Again, we dispense with a time
index for sake of simplicity. Within a fully-fledged
implementation, a time index should be included,
as an occurrent may have different participants at
different stages.
∃l, e, i : inst(l, Lmo-2) ∧ inst(e, Elf -2)∧
(5)
has-participant(i, l) ∧ has-participant(i, e)
This formalization makes it easier to represent
occurrents with more than two participants, as with
the representation in Formula 3, where we would
have to deal with n-ary relations for n
participants. According to this formal representation,
“Lmo-2 interacts with Elf-2 ” is to be understood
as stating that there is at least one interaction
process, in which at least one protein molecule of
the given kinds is involved. It does not exclude
that other molecules are involved in this very
interaction process. If we want to secure that Lmo-2
and Elf-2 are the only participants of the
molecular interaction, we have to employ exclusivity
conditions similar to 4:
∃l, e, i : inst(l, Lmo-2) ∧ inst(e, Elf -2)∧
(6)
If we want to keep the requirement of pairwise
interaction, if have to add uniqueness conditions
in the fashion of Formula 4 for this purpose:
∃l, e, i : inst(l, Lmo-2) ∧ inst(e, Elf -2)∧
(7)
has-participant(i, l∗) ∧ has-participant(i, e∗))
→ (e∗ = e ∧ l∗ = l))
In contrast to Formula 4, such a formalization that
quantifies over events is still much more realistic,
because its truth is compatible with more than one
interaction process of the same kind happening at
the same time or at other times.
          </p>
        </sec>
        <sec id="sec-4-4-2">
          <title>Occurrents involving collectives of continuants</title>
          <p>
            As mentioned above, it is important to distinguish
between individuals of a kind and collectives of
individuals of that kind. Rector and Bittner [
            <xref ref-type="bibr" rid="ref1">1</xref>
            ]
have accounted for this by introducing the formal
relation has-grain which relates a collective c to
each of its constituents e. In [
            <xref ref-type="bibr" rid="ref3">3</xref>
            ] this account has
been further developed by introducing a collective
universal X COLL whose instances are constituted
by two or more constituents which are instances
of X :
∀c : inst(c, XCOLL) → ∃e1, e2, ..., en, n &gt; 1 : (8)
n
^ inst(eν , X ) ∧ has-grain(c, eν )
ν=1
Note that has-grain is a subrelation of has-part.
As a consequence, we identify a collection as a
mereological sum of its constituents (regardless of
their spatiotemporal arrangement), and not as a
mathematical set. The reason for rejecting the set
approach is two-fold. Firstly, because
mathematical sets are extensional and therefore not robust
with regard to the gain and loss of constituents.
Secondly, because collectives should be of the same
ontological category as their constituents: A
collective of material objects should be a material
object, and a collective of events should be an event.
Sets, however, are abstract objects that do neither
exist in space nor in time. We do not use the
haspart relation, because participants in interactions
may have parts that do not themselves participate
in the interaction. A Lmo-2 molecule, e.g., may
participate in an interaction without every of its
electrons being a participant in this interaction.
Whereas has-part is transitive, has-grain is not. It
is a irreflexive, asymmetric, and intransitive
relation that holds between particular collectives and
individuals.
          </p>
          <p>We therefore modify our formalism substituting
individuals by collectives:
∃l, e, i : inst(l, Lmo-2COLL)∧
(9)
inst(e, Elf -2COLL) ∧ inst(i, Interaction) ∧
has-participant(i, l) ∧ has-participant(i, e)</p>
        </sec>
        <sec id="sec-4-4-3">
          <title>Collectives of occurrents</title>
          <p>Formalism 7 and 9 use the same occurrent type
Interaction for different scenarios: In the first case, a
particular interaction has individual protein
molecules as participants, in the second case collectives
of molecules. This ambiguity may be acceptable
when we talk about such a generic process as
interaction. It would not be tolerable in the case of a
more specific one, such as binding. A binding can
only happen between two individual molecules,
not between two collectives of molecules. Thus, if
we encounter a plurality of bindings within a
plurality of molecules, it would not be admissible to
describe this as a binding between two collectives
of molecules but rather a collective of bindings
between pairs instances of the kinds of molecules in
question2. Thus we have to deal with a collective
of processes rather than with collectives of
continuants.</p>
          <p>In order to represent such a situation, let us
first introduce the collective interaction
universal I COLL which is constituted by individual
constituents which are instances of I, analogously to
Formula 8. Then we have to determine how each
of the grain interactions look like. If they are
pairwise interactions between an Lmo-2 molecule and
an Elf-2 molecule, each of these interactions fits
2A counterexample is the interaction between
solutes and solvents in a solution which necessarily
involves colletives of both solvents and solutes.
Formula 7. Combining Formulae 7 and 8, we get:
∃p, i1, i2, ...in, n &gt; 1 :
n
^ (inst(iν, I) ∧ has-grain(p, iν) ∧
ν=1
∃lν, eν : inst(lν, Lmo-2) ∧ inst(eν, Elf -2) ∧
(10)
has-participant(iν, lν) ∧
has-participant(iν, eν) ∧
∀x : (has-participant(iν, x) →</p>
          <p>inst(x, Lmo-2) ∨ inst(x, Elf -2)) ∧
∀lν∗, eν : ((inst(lν∗, Lmo-2) ∧</p>
          <p>∗
inst(eν∗, Elf -2) ∧ has-participant(iν, lν∗) ∧
has-participant(iν, eν∗)) → (eν∗ = eν ∧ lν∗ = lν)))</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Concurrent Interpretations II:</title>
    </sec>
    <sec id="sec-6">
      <title>Dispositional Readings</title>
      <p>
        The above interpretations stated the existence of
one or more interaction events. However, messages
of the style “Lmo-2 interacts with Elf-2 ” very
often do not focus on the accidental occurrence of
an event but are rather meant to express some
inherent property of the objects under
investigation. On the one hand it is likely that a biologist
would mean “An interaction between Lmo-2 and
Elf-2 happened” while describing the outcome of
a specific experiment. On the other hand a
biology textbook would rather want to communicate
something like “Lmo-2 molecules have the
disposition or tendency to interact with Elf-2 molecules”.
This ambiguity, of course, matches Aristotle‘s
famous distinction between act and potency, and
Aristotle himself observed that “potency” is in
itself an ambiguous term [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Thus the
ambiguity of our sample statement increases even more,
because the dispositional reading of our sample
sentence is ambiguous in itself. Obviously, such
a reading of “Lmo-2 interacts with Elf-2 ” is
intended to ascribe some causal or statistical
property, a disposition or tendency. But even if this
is the common ground of the dispositional
reading, three questions remain open and have to be
answered:
1. Which event is it exactly that the property in
question is meant to cause?
2. What is thought to be the bearer of this
property?
3. Which kind of property is in fact intended to be
ascribed?
The first question can be answered by pointing to
one of the many event readings we discussed (and
formalized) thus far. Our answer to the second
question will at least in part depend on our
response to question 1. Are all instances of a given
universal bearers of the disposition in question?
Or only some of the instances? Are the individual
molecules the bearers of the disposition, or rather
collectives of such molecules? The third question,
however, leads us in to the middle of the lively
debate going on in philosophy on the ontology of
disposition [
        <xref ref-type="bibr" rid="ref12 ref13 ref14">12, 13, 14</xref>
        ]. The dispositional
properties most often discussed in the literature are
socalled surefire dispositions: dispositions to react
invariably in a certain way under specific
circumstances. They are one candidate for an answer to
question 3. From the point of view of knowledge
representation, however, there are some problems
connected with surefire dispositions. First, things
may react differently in different circumstances.
Thus to say that Lmo-2 molecules have the
disposition to interact with Elf-2 molecules still leaves
it open under which circumstances such an
interaction will occur. We could account for this by
explicitly mentioning the conditions of realization
for each disposition. We may, of course, not know
all these conditions, but this is an epistemic
problem only. A more significant problem is that there
may be infinitely many causally relevant
conditions that have to be taken into account, and such
an infinite list would be impossible for principled
reason. We could try to circumvent this problem
by adding (implicitly or explicitly) quantification
phrases like “In all circumstances” or “In some
circumstances”. The all -phrase, however, will not
do. For if a certain disposition would be realized
under all circumstances, it will never be
unrealized. Such cases may exist, but normally a
disposition will only be realized under certain
circumstances and not realized under others. When
we use the some-phrase, on the other hand, many
statements about dispositions for molecule
interactions will become trivial, since nearly any
molecule may interact with any other molecule in some
peculiar way under certain (possibly very extreme)
conditions. A usual way to deal with this
problem is to introduce a set of standard or normal
conditions [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. In biology, this could mean that
the “disposition to interact with Elf-2 molecules”
is only ascribed to Lmo-2 if the interaction
commonly occurs under biological conditions, such as
physiological pH and temperature intervals. But
the problem is not solved by referring to normal
conditions. For, first, the problem that infinitely
many conditions cannot be described in
necessarily finite lists recurs with normal conditions. And,
second, biomedical knowledge may also include
the behavior of molecules in non-normal or even
extreme circumstances, like low or high
temperatures, exposure to intensive sunlight or atomic
radiations. One option at this point would be
to choose a different answer to question 3.
Instead of ascribing surefire dispositions we could
ascribe probabilistic dispositions, i.e. dispositions to
do something (under certain circumstances) with
a certain probability [
        <xref ref-type="bibr" rid="ref17">16</xref>
        ]. Such causal
properties are also sometimes called “tendencies” [
        <xref ref-type="bibr" rid="ref18">17</xref>
        ]
or “propensities” [
        <xref ref-type="bibr" rid="ref19">18</xref>
        ]. While with surefire
dispositions a certain event will happen invariably
in given circumstances, the event in question will
only happen with a certain probability when a
tendency is ascribed. It will, of course, be crucial to
know with which probability the event will
happen. Following standard procedures in
mathematical probability theory, we can represent the
quantities of the probabilities in question by real
numbers between 0 and 1 satisfying the Kolmogorov
axioms. In biomedical experiments, the observed
result is often such a probability. Tendencies are
thus of vital importance for the representation
of biomedical knowledge [
        <xref ref-type="bibr" rid="ref18">17</xref>
        ]. There can,
however, be several ontological groundings for such a
probability. Suppose that we observed a hundred
instances of a given universal U in situations in
which all conditions necessary for the realization
R of a certain disposition were present, but that in
only fifty cases R happened, i.e. in only 50 % of all
cases the disposition realized itself. There are
several ontological scenarios that would explain this
result. Here are two of them:
• (A) Every instance of U has a tendency to R
with a probability of 0.5.
• (B) Every second instance of U has a surefire
disposition to R; the other instances of U do
not have any disposition to R.
      </p>
      <p>Both of these scenarios would explain the assumed
observations. Which of these scenarios we choose
for our account of the observation will depend on
other observations and causal assumptions. If we,
e.g. knew that nearly always the same instances
of U display R and nearly always the same
instances of U do not display R, this would prima
facie count as a reason to embrace (B). If, on the
other hand, we know that the same instances of U
sometimes do display R and sometimes do not
display R, this would prima facie count as a reason
to embrace (A). For such reasoning, however, we
need background assumptions about the stability
of the causal properties in question: how they can
be stable over time, how (if at all) they can be
acquired and how (if at all) they can get lost. Last
but not least, (B) can indicate that the instances
of the universal U differ in certain features, which
are crucial to the ability to display R. An
important example of this is the observation of modified
proteins produced by mutated genes opposed to
the observation of normal (wild-type) proteins.
Considering all this, there is quite a long and
complex list of enitities that we implicitly refer to when
ascribing a disposition or tendency to a molecule:
• (independent) continuants (i.e. the bearer of the
disposition),
• dependent continuants or occurrents (i.e. the
realization),
• quantities (of probabilities), and
• state of affairs (of realization conditions).</p>
    </sec>
    <sec id="sec-7">
      <title>Conclusion</title>
      <p>
        Our deliberations shed light on the need for a more
principled account of dispositions and processes in
biomedical ontologies. Machine supported
information extraction and knowledge acquisition
techniques from scientific texts have become a
cornerstone in molecular biology and genomics due to
the increasing scientific productivity in this field.
The necessity of logic based ontologies for this
purpose has been controversially discussed [
        <xref ref-type="bibr" rid="ref20">19</xref>
        ]. If we
subscribe to a formally principled account as a
basis for the semantic representation of the content
of scientific texts then we have to take into
account that the most common type of statements
that are of interest in texts describing biochemical
regularities do not have a clear and unambiguous
meaning. Assertions of the type “A interacts with
B” are generally more than accounts of a single
event. Rather they refer to a plurality of events
of the same kind, or an event involving pluralities
(collectives) of participants. A universal
interpretation such as “For each instance of A there is an
interaction with some B” can easily be discarded.
The need for universal quantifications can be
satisfied by introducing dispositions: “Every A has
the disposition to interact with some B.”
However, not every occurrence of the participation of
some continuant in some process is proof of the
existence of a related disposition.
      </p>
      <p>
        Since interaction is a very general term, it is
difficult to express a clear preference in favor of any
of the proposed approaches without analyzing the
nature of interaction on a molecular level, as well
as the study of the “normal” behavior of
biomolecules. The question when to ascribe a disposition
or tendency – and which one – can not be discussed
here (but cf. [
        <xref ref-type="bibr" rid="ref17">16</xref>
        ] on this).
      </p>
      <p>We demonstrated that sentences like ”A interacts
with B” exhibit indeed a wide range of
ambiguity. We offered several possible analyses to
formally represent the different meanings of sentences
of this type. Now, which one should we choose?
One strategy would be to say: Which strategy
you choose depends on the intended meaning of
the particular occurrence of the sentence you deal
with. For text mining purposes, however, that
have to digest large amounts of texts in short
periods of time and with as much automatization as
possible, this strategy would be scarcely feasible.
To cope with this situation, several strategies are
conceivable. One strategy would be to choose the
highest common factor of all interpretation – that
what is included in all. Another strategy would
be to set as a standard interpretation that is most
likely the intended meaning. In order to determine
which interpretation is the best candidate,
empirical work on relevant text corpora may be helpful.
This, however, is already beyond the scope of the
present paper.</p>
      <sec id="sec-7-1">
        <title>Acknowledgments:</title>
        <p>This work was supported by the EU Network
of Excellence Semantic Interoperability and Data
Mining in Biomedicine (NoE 507505), the project
Forms of Life sponsored by the Volkswagen
Foundation, and the Wolfgang Paul Award of the
Alexander-von-Humboldt-Foundation. We are
indebted to Andrew D. Spear and to the anonymous
referees of KR-MED for valuable comments.</p>
      </sec>
      <sec id="sec-7-2">
        <title>Address for Correspondence:</title>
        <p>Stefan Schulz, Department of Medical
Informatics, Freiburg University Hospital,
Stefan-MeierStr. 26, 79104 Freiburg (Germany), phone: +49
761 203 6702, e-mail: stschulz@uni-freiburg.de</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Alan</given-names>
            <surname>Rector</surname>
          </string-name>
          , Jeremy Rogers, and Thomas Bittner.
          <article-title>Granularity scale and collectivity: When size does and doesn't matter</article-title>
          .
          <source>Journal of Biomedical Informatics</source>
          ,
          <volume>38</volume>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Stefan</given-names>
            <surname>Schulz</surname>
          </string-name>
          and
          <string-name>
            <given-names>Anand</given-names>
            <surname>Kumar</surname>
          </string-name>
          .
          <article-title>Biomedical ontologies: What part-of is and isn't</article-title>
          .
          <source>Journal of Biomedical Informatics</source>
          ,
          <volume>38</volume>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Stefan</given-names>
            <surname>Schulz</surname>
          </string-name>
          , Elena Beisswanger, Udo Hahn, Joachim Wermter, Anand Kumar, and
          <string-name>
            <given-names>Holger</given-names>
            <surname>Stenzhorn</surname>
          </string-name>
          .
          <article-title>From GENIA to BioTop towards a top-level ontology for biology</article-title>
          .
          <source>FOIS 2006 - International Conference on Formal Ontology in Information Systems</source>
          ,
          <year>2006</year>
          .
          <article-title>Accepted for publication</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Aldo</given-names>
            <surname>Gangemi</surname>
          </string-name>
          , Nicola Guarino, Claudio Masolo, and
          <string-name>
            <given-names>Alessandro</given-names>
            <surname>Oltramari</surname>
          </string-name>
          .
          <article-title>Sweetening ontologies with Dolce</article-title>
          . In Asunci´on G´
          <article-title>omezP´erez</article-title>
          and V. Richard Benjamins, editors,
          <source>Proceedings of the 13th International Conference - EKAW</source>
          <year>2002</year>
          , volume
          <volume>2473</volume>
          <source>of Lecture Notes in Artificial Intelligence</source>
          , pages
          <fpage>166</fpage>
          -
          <lpage>181</lpage>
          . Berlin: Springer,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Barry</given-names>
            <surname>Smith</surname>
          </string-name>
          and
          <string-name>
            <given-names>Pierre</given-names>
            <surname>Grenon</surname>
          </string-name>
          .
          <article-title>The cornucopia of formal-ontological relations</article-title>
          .
          <source>Dialectica</source>
          ,
          <volume>58</volume>
          (
          <issue>3</issue>
          ):
          <fpage>279</fpage>
          -
          <lpage>296</lpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Barbara</given-names>
            <surname>Heller</surname>
          </string-name>
          and
          <string-name>
            <given-names>Heinrich</given-names>
            <surname>Herre</surname>
          </string-name>
          .
          <source>Ontological categories in GOL. Axiomathes</source>
          ,
          <volume>14</volume>
          (
          <issue>1</issue>
          ):
          <fpage>57</fpage>
          -
          <lpage>76</lpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Franz</given-names>
            <surname>Baader</surname>
          </string-name>
          , Diego Calvanese, Deborah L.
          <string-name>
            <surname>McGuinness</surname>
          </string-name>
          ,
          <string-name>
            <surname>Daniele Nardi</surname>
          </string-name>
          , and
          <string-name>
            <surname>Peter F.</surname>
          </string-name>
          Patel-Schneider, editors.
          <source>The Description Logic Handbook. Theory</source>
          , Implementation, and
          <string-name>
            <surname>Applications</surname>
          </string-name>
          . Cambridge, U.K.: Cambridge University Press,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Barry</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Werner</given-names>
            <surname>Ceusters</surname>
          </string-name>
          , Bert Klagges,
          <article-title>Jacob K¨ohler</article-title>
          , Anand Kumar, Jane Lomax, Chris Mungall, Fabian Neuhaus, Alan L.
          <string-name>
            <surname>Rector</surname>
            , and
            <given-names>Cornelius</given-names>
          </string-name>
          <string-name>
            <surname>Rosse</surname>
          </string-name>
          .
          <article-title>Relations in biomedical ontologies</article-title>
          .
          <source>Genome Biology</source>
          ,
          <volume>6</volume>
          (
          <issue>5</issue>
          ),
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Karl. R.</given-names>
            <surname>Popper</surname>
          </string-name>
          . Logic of Scientific Discovery. Hutchinson, London,
          <year>1959</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Donald</given-names>
            <surname>Davidson</surname>
          </string-name>
          .
          <article-title>The logical form of action sentences</article-title>
          . In N. Rescher, editor,
          <source>The Logic of Decision and Action</source>
          , pages
          <fpage>81</fpage>
          -
          <lpage>95</lpage>
          . Pittsburgh, PA: University of Pittsburgh Press,
          <year>1967</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>Ludger</given-names>
            <surname>Jansen</surname>
          </string-name>
          .
          <article-title>Tun und K¨onnen</article-title>
          .
          <article-title>Ein systematische Kommentar zu Aristoteles' Theorie dispositionaler Eigenschaften im neunten Buch der Metaphysik</article-title>
          . H¨
          <string-name>
            <surname>ansel-Hohenhausen</surname>
          </string-name>
          ,
          <source>Frankfurt am Main</source>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>Raimo</given-names>
            <surname>Tuomela</surname>
          </string-name>
          . Dispositions. Reidel, Dordrecht,
          <year>1978</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>Stephen</given-names>
            <surname>Mumford</surname>
          </string-name>
          .
          <source>Dispositions</source>
          . Oxford University Press, Oxford,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>Bruno</given-names>
            <surname>Gnassounou</surname>
          </string-name>
          and
          <string-name>
            <given-names>Max</given-names>
            <surname>Kistler</surname>
          </string-name>
          .
          <article-title>Les dispositions en philosophie et en sciences</article-title>
          .
          <source>CNRS Editions</source>
          , Paris,
          <year>2006</year>
          .
          <article-title>An English translation is to be published with Ashgate Publishers</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>Wolfgang</given-names>
            <surname>Spohn</surname>
          </string-name>
          .
          <article-title>Begru¨ndungen a priori - oder: ein frischer Blick auf Dispositionspr¨adikate</article-title>
          . In Wolfgang Lenzen, editor,
          <source>Das weite Spektrum der analytischen Philosophie.</source>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <source>Festschrift Franz von Kutschera</source>
          , pages
          <fpage>89</fpage>
          -
          <lpage>106</lpage>
          . de Gruyter, Berlin/New York,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Ludger</given-names>
            <surname>Jansen</surname>
          </string-name>
          .
          <article-title>Attribuer des dispositions</article-title>
          .
          <source>In Bruno Gnassounou and Max Kistler</source>
          , editors,
          <source>Les dispositions en philosophie et en sciences</source>
          , pages
          <fpage>89</fpage>
          -
          <lpage>106</lpage>
          . CNRS Editions, Paris,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>Ludger</given-names>
            <surname>Jansen</surname>
          </string-name>
          .
          <article-title>The ontology of tendencies and medical information sciences</article-title>
          .
          <source>In Ingvar Johansson</source>
          , Bertin Klein, and
          <string-name>
            <surname>Thomas</surname>
          </string-name>
          Roth-Berghofer, editors,
          <source>WSPI</source>
          <year>2006</year>
          :
          <article-title>Contributions to the</article-title>
          <source>Third International Workshop on Philosophy and Informatics</source>
          , volume
          <volume>14</volume>
          <source>of IFOMIS Reports</source>
          , pages
          <fpage>89</fpage>
          -
          <lpage>106</lpage>
          . IFOMIS, Saarbru¨cken,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>Karl. R.</given-names>
            <surname>Popper</surname>
          </string-name>
          . A World of Propensities. Thoemmes, Bristol,
          <year>1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>Sophia</given-names>
            <surname>Ananiadou</surname>
          </string-name>
          and
          <article-title>Jun'ichi Tsujii. Thesaurus or logical ontology, which one do we need for text mining? Language Resources</article-title>
          and Evaluation, Springer Science and Business Media B.V.,
          <volume>39</volume>
          (
          <issue>1</issue>
          ):
          <fpage>77</fpage>
          -
          <lpage>90</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>