<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Semantic Rendering of Data Tables: Multivalued Information Systems Revisited</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Marcin Wolski</string-name>
          <email>marcin.wolski@umcs.lublin.pl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anna Gomolin´ ska</string-name>
          <email>anna.gom@math.uwb.edu.pl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Maria Curie-Skłodowska University, Department of Logic and Cognitive Science</institution>
          ,
          <addr-line>Pl. Marii Curie-Skłodowskiej 4, 20-031 Lublin</addr-line>
          ,
          <country country="PL">Poland</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Białystok, Faculty of Mathematics and Informatics</institution>
          ,
          <addr-line>Konstantego Ciołkowskiego 1M, 15-245 Białystok</addr-line>
          ,
          <country country="PL">Poland</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Data tables provide a convenient means of representation of descriptive information about objects. They serve also as standard input for data analysis tools or theories. In this paper we focus our attention upon the special class of data tables, namely multivalued information systems introduced by Z. Pawlak and E. Orłowska in the early 80s. The main idea presented in the paper is to interpret multivalued information systems as semantically processed single valued data tables. This interpretation allows us to describe classical rough set theory, dominance-based rough set theory, and formal concept analysis within the framework of multivalued information systems.</p>
      </abstract>
      <kwd-group>
        <kwd>information system</kwd>
        <kwd>rough set</kwd>
        <kwd>dominance relation</kwd>
        <kwd>formal concept</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Data tables provide a simple and effective means of representation of collected pieces of
information about a given set of objects. They serve also as standard input for data
analysis tools or theories, output of which is often referred to as knowledge. In the present
paper we shall focus our attention upon the special class of data tables, namely
multivalued/approximate/nondeterministic information systems, introduced by Z. Pawlak
and E. Orłowska in the early 80s [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ]. These systems are generalisations of the
standard data tables/information systems to the case in which for an object x and an attribute
A we are given (as the entry in the table) a set VA of attribute values instead of a single
value. The formal definitions of both multivalued and approximate information
systems are actually the same (see [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]); it is the interpretation/semantics of the entries of
these tables which makes the difference: the first interpretation is the object x has all
values from VA for the attribute A (multivalued systems), whereas the second reads
as the object x has a single value from the set VA for the attribute A (approximate
systems). These systems equipped with a generalised semantics (which reads for the
object x and the attribute A the set VA provides some possible values) are called by
      </p>
    </sec>
    <sec id="sec-2">
      <title>Z. Pawlak and E. Orłowska nondeterministic information systems [3]. Of course, the semantics of the entries in the table determines how information is further processed; in other words, which relations between objects are used to construct information granules</title>
      <p>
        being the building blocks of knowledge. The main concern of [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] is the informational
indiscerniblity between objects (an equivalence relation), whereas the main focus of
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] are the information inclusion and the informational connection (a preorder and a
tolerance relation, respectively).
      </p>
    </sec>
    <sec id="sec-3">
      <title>The main novelty of the present paper consists in taking multivalued information systems as semantically enriched single valued data tables; this idea can be summarised by the following equation:</title>
      <p>data table + semantics = multivalued information system.</p>
    </sec>
    <sec id="sec-4">
      <title>Thus multivalued information systems represent semantically processed data. We start</title>
      <p>
        our study with some standard data tables used in the leading theories of data analysis:
single valued information systems from rough set theory (RST) [
        <xref ref-type="bibr" rid="ref4 ref5 ref6">4–6</xref>
        ], single valued
information systems enriched by dominance relations taken from dominance based rough
set theory (DRS) [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ], and formal contexts from formal concept analysis (FCA) [
        <xref ref-type="bibr" rid="ref1 ref9">1, 9</xref>
        ].
      </p>
    </sec>
    <sec id="sec-5">
      <title>Then we provide these structures with some specific interpretation/semantics. To this</title>
      <p>end we use scales from FCA, which are tools to convert a multivalued formal context
(which is actually a standard information system) into a (single valued) formal
context. We are going to employ these scales in order to obtain multivalued information
systems. Then we focus upon informational relations of indiscerniblity and inclusion.</p>
    </sec>
    <sec id="sec-6">
      <title>These steps allow us to consider RST, DRS, and FCA within a single conceptual framework of multivalued information systems and to emphasise how these theories differ semantically. Finally, we shall discuss different interpretations of RST, DRS, and FCA based operators in the context of John Stuart Mill inductive reasoning [2].</title>
      <p>2</p>
      <sec id="sec-6-1">
        <title>Data Tables and Semantics</title>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>In the present section we discuss different forms of data tables considered in the lead</title>
      <p>
        ing theories of data analysis: rough set theory (RST) [
        <xref ref-type="bibr" rid="ref4 ref5 ref6">4–6</xref>
        ], dominance based rough sets
(DRS) [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ], and formal concept analysis (FCA) [
        <xref ref-type="bibr" rid="ref1 ref9">1, 9</xref>
        ]. Of course, apart from data tables
(input), each theory provides special tools to process the tables and produce some
meaningful output. However, in the present section we shall discuss only tables, whereas the
ways they may be further processed will be presented in the next section.
Definition 1 (Formal Context [
        <xref ref-type="bibr" rid="ref1 ref9">1, 9</xref>
        ]). A formal context is a triple (U; Att; R), where
U is a set of objects, Att a set of binary attributes and R U Att.
      </p>
      <sec id="sec-7-1">
        <title>If an object x stands in the relation R to A, then we mark it in the data table by 8.</title>
        <p>Table depicted by Fig. 1 presents a very simple context; Bob is a good mathematician
whereas Agnes is not; on the bright side, she is rich.</p>
        <p>
          Definition 2 (Information System [
          <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
          ]). A quadruple I = (U; Att; V al; f ) is called
an information system, where:
– U is a nonempty finite set of objects,
– A is a nonempty finite set of attributes,
Steven
Bob
Agnes
good mathematician rich
8
8
– V = SA2Att V alA, where V alA is the value–domain of the attribute A, and
V alA \ V alB = ;, for all A; B 2 Att (the last condition is not necessary, yet
it is mathematically convenient),
– f : U Att ! V al is an information function, such that for
        </p>
        <p>all A 2 Att and x 2 U it holds that f (x; A) 2 V alA.</p>
        <p>If f is a partial function then the information system I is called incomplete. If the
codomain of f is the powerset of V al, then the system is called multivalued.</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>In what follows we shall confine our attention to the complete information (a simple example is depicted by Fig. 2) systems and their multivalued version being the result of scaling.</title>
      <p>
        Steven
Bob
Agnes
Definition 3 (Dominance-Based Data Table [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ]). A dominance-based data table is
a system I = (U; Att; V al; f; D), where (U; Att; V al; f ) is an information system
such that all attribute values are real numbers. Let us define: x A y iff f (A; x)
f (A; y), for x; y 2 U . Then x D y iff x A y for all A 2 Att.
      </p>
    </sec>
    <sec id="sec-9">
      <title>Thus, information systems allow one to be more specific about the meaning of at</title>
      <p>tributes. In the case of dominance-based data tables, we additionally assume that the
attribute values are comparable with respect to some complete order. If f (A; x)
f (A; y), then we say that y is at least as good as x with respect to A. If y is at least as
good as x with respect to all attributes then we say that y dominates x.</p>
    </sec>
    <sec id="sec-10">
      <title>Of course, any information system may be converted into a formal context. The</title>
      <p>easiest way of doing this is called in FCA a nominal scaling. Formally, a scale for</p>
      <sec id="sec-10-1">
        <title>A is a formal context (V alA; V alA; RA) having V alA as both the set of objects and</title>
        <p>the set of attributes. We also assume that the identity idV alA is included in RA. After
scaling each pair attribute-value (A; v) is regarded as a separate attribute of the new
context CI = (U; f(A; v)gA2Att v2V alA ; R). For the fundamental relation in rough set
theory is the indistinguishability, the values in (V alA; V alA; RA) are compared by the
equality relation: RA is =, for every A; and in consequence 1 is interpreted as 1, 2 as 2,
and so on. That is why in the scale depicted in Fig. 3 only the diagonal is marked by 8.
This type of scaling is called nominal. However, since all values in our exemplary data
1 2 3 4 5 6
1 8
2 8
3 8
4 8
5 8
6 8
$ 0.1
$ 0.8
$ 1
table are real numbers, we can compare them with respect to instead of =. Actually,
when we order these values according to we do change the scaling, or better still, the
semantics (interpretation) of attribute values. Now, e.g., Bob’s score in physics 5 and
Agnes’s score 2 means that Bob is a better mathematician than Agnes. In other words,
whatever Agnes can solve, Bob can as well. Following DRS, we may say that Bob is at
least as good as Agnes with respect to the attribute mathematician. Formally, the higher
value (e.g., 5) with respect to implies the lower value (e.g., 2). Such scales are called
ordinal scales. Thus this time the mark 8 on 5 may mean that Bob has solved at least</p>
      </sec>
      <sec id="sec-10-2">
        <title>5 problems. Under this reading Bob has solved at least 4 problems too. Therefore the</title>
        <p>scaling representing this interpretation (semantics) may be defined as depicted in Fig. 4.
Please note that the values $ 0.1 or $ 0.8 are interpreted in the same fashion. Thus 8
on $ 0.8 means that Bob has at least $ 0.8 on his bank account, and in consequence
he has at least $ 0.1 too. Following DRS, we may say that Bob dominates Steven. In
consequence, both RST and DRS start with the same date table but use different scales
(interpretations). Of course, there are also (many) other possibilities. We conclude this
part with a nontrivial interpretation offered by B. Ganter during his seminar at Warsaw
University (a few years ago). He suggested to read the exam scores in a natural language
and take the “natural” scale. Under this reading, someone who has done a very good
1 may be interpreted as
2 may be interpreted as
3 may be interpreted as
4 may be interpreted as
5 may be interpreted as
6 may be interpreted as</p>
        <p>bad
unsatisfactory
satisfactory</p>
        <p>good
very good
excellent
job has done also a good job. The job of course is also satisfactory. However, someone
who has done a bad job has also done an unsatisfactory job, but not vice versa. In
consequence we obtain yet another scale, as depicted by Fig. 5. In this scale we use two
orderings, and this type of scaling is therefore called bi-ordinal. Thus, starting from the
information system depicted by Fig. 2 we can obtain a number of different multivalued
information systems, depending on which scale is applied to the original system (see
Fig. 6).</p>
        <p>Nominal scaling</p>
        <p>)
Ordinal scaling
)</p>
        <p>Steven
Bob
Agnes
mathematician (1-6)
f5g
f4g
f2g</p>
        <p>rich
f$0:1 million g
f$0:8 million g
f$1 million g
Steven
Bob
Agnes
mathematician (1-6)
f1; 2; 3; 4; 5g
f1; 2; 3; 4g
f1; 2g</p>
        <p>rich
f$0:1 million g
f$0:1 million ; $0:8 million g
f$0:1 million ; $0:8 million $1 million g</p>
      </sec>
    </sec>
    <sec id="sec-11">
      <title>More formally, different interpretations/semantics of attribute values lead to different formal contexts. When one starts with a multivalued context (an information system)</title>
      <p>I = (U; Att; V al; f ) and a set of scales S = f(V alA; V alA; RA) : A 2 Attg (as
discussed above), then every pair (A; v), where A 2 Att and v 2 V alA, is regarded as the
attribute in the induced formal context CI = (U; f(A; v)gA2Att v2V alA ; R), where R
is defined by</p>
      <p>
        R = f(x; (A; v)) : v 2 fs(x; A)g;
fs(x; A) = fvi 2 V alA : f (x; A) = v &amp; (v; vi) 2 RAg:
Of course every (multivalued) context I = (U; Att; V al; f ) may also be converted
into a multivalued information system IS = (U; Att; V al; fs). The whole process of
providing I with semantics given by S is depicted by Fig.6.
3 Information Processing: Approximation Operators
In the previous section we have discussed information systems (collected pieces of data
about objects) which must be further processed to give some meaningful output
(knowledge). The information processing in rough set theory [
        <xref ref-type="bibr" rid="ref4 ref5 ref6">4–6</xref>
        ] and dominance based rough
set theory (DRS) [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ] is done by means of binary relations between objects. However,
formal concept analysis (FCA) [
        <xref ref-type="bibr" rid="ref1 ref9">1, 9</xref>
        ] is based upon a relation between objects and
attributes; in some special cases this relation may be reduced to the relation between
objects, but it is not always possible. In multivalued information systems, in contrast to
the classical information systems where there is considered a single relation, there are
three important relations between objects [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ].
      </p>
      <p>Definition 4. Let (U; Att; V al; f ) be a multivalued information system; then one can
define:
– Informational Indiscerniblity: x Ind y iff f (x; A) = f (y; A),
– Informational Connectivity (Similarity): x Sim y iff f (x; A) \ f (y; A) 6= ;,
– Informational Inclusion: x Incl y iff f (x; A) f (y; A),
for all A 2 Att and x; y 2 U .</p>
      <p>Usually, the output of data analysis is related to some aspect of reality represented
by a decision attribute. For example, we could take the subjective quality of life as
the decision attribute. Then our aim would be to “express” the subjective quality of
life in terms of the attributes mathematician and rich, which are (in this case) called
conditional attributes (we would like to obtain knowledge how the subjective quality of
life “depends” on wealth and education in mathematics). Information systems having a
single attribute distinguished as a decision attribute are called decision tables. In such a
case, all informational relations (indiscerniblity, connectivity, and inclusion) are defined
with respect to (only) conditional attributes.</p>
    </sec>
    <sec id="sec-12">
      <title>As earlier, we shall start discussion in this section with FCA (and its operators).</title>
      <p>Definition 5 (Derivation Operators). For a formal context C = (U; Att; R), define:
R0(X) = fA 2 Att : 8x 2 X ((x; A) 2 R)g;</p>
      <p>R0(A) = fx 2 U : 8A 2 A ((x; A) 2 R)g;
for all X</p>
      <p>U and A</p>
      <p>Att.
Steven
Bob
Agnes</p>
      <p>Let us start with a complete information system I = (U; Att; V al; f ) and a semantics
S, that is, a family of scales RA, for all A 2 Att. As one can easily observe, for every
object x in CI = (U; f(A; v)gA2Att v2V alA ; R), a pair (R0(R0(fxg)); R0(fxg)) is a
concept, and y 2 R0(R0(fxg)) provided that x I ncl y in the corresponding multivalued
information system IS = (U; Att; V al; fs):</p>
      <p>R0(R0(fxg)) = fy 2 U : x I ncl yg:</p>
    </sec>
    <sec id="sec-13">
      <title>However, it usually happens that</title>
    </sec>
    <sec id="sec-14">
      <title>But it always holds that</title>
      <p>R0(R0(X )) 6= fy 2 U : 9x (x I ncl y &amp; x 2 X )g:
X
fy 2 U : 9x (x I ncl y &amp; x 2 X )g</p>
      <p>R0(R0(X )):
However, after conversion of an information system I = (U; Att; V al; f ) to the formal
context CI = (U; f(A; v)gA2Att v2V alA ; R) we lose the contact with original attributes
(we shall discuss this issue in detail later in the paper).</p>
      <p>Definition 7 (Lower and Upper Approximations). A pair (U; E), where E is an
equivalence relation, is called an approximation space. Define after Z. Pawlak:
LowE (X ) = fx 2 U : [x]E</p>
      <p>X g;</p>
      <p>U ppE (X ) = fx 2 U : [x]E \ X 6= ;g:
LowE (X ) is called the lower approximation of X , whereas U ppE (X ) is called the
upper approximation of X .</p>
      <p>Coming back to information systems, every set of attributes A Att of an information
system I = (U; Att; V al; f ) induces an approximation space (U; EA), where EA =
f(x; y) : f (x; A) = f (y; A) for all A 2 Ag. In order to simplify the notation, we
shall write LowA(X ) and U ppA(X ) for LowEA (X ) and U ppEA (X ), respectively. In
the case when A = Att, we shall leave E without any subscript.</p>
      <p>Every information system I = (U; Att; V al; f ) (together with a family of scales S)
induces also a multivalued information system IS = (U; Att; V al; fs) and another
approximation space (U; I nd). Due to scaling, I nd and E may be two different relations.
For any A Att of (U; Att; V al; fs) the corresponding indiscernibility relation will
be denoted by IndA. This notational convention will also be used for other relations.</p>
      <sec id="sec-14-1">
        <title>As usual, we can generalise E to any reflexive relation P (e.g. Sim or Incl) and</title>
        <p>obtain generalised approximation operators. Let [x]P = fy 2 U : (x; y) 2 P g and
define:</p>
        <p>LowP (X) = fx 2 U : [x]P</p>
        <p>Xg;</p>
        <p>U ppP (X) = fx 2 U : [x]P \ X 6= ;g:</p>
      </sec>
    </sec>
    <sec id="sec-15">
      <title>As one can note, it holds that</title>
      <p>U ppIncl(x) = R0(R0(fxg)) = fy 2 U : x Incl yg;
and</p>
      <p>U ppIncl(X) = fy 2 U : 9x (x Incl y &amp; x 2 X)g:</p>
      <sec id="sec-15-1">
        <title>However, R0(R0(X)) is much more complex in the settings of information systems</title>
        <p>than it might seem at the first sight. It is worth noting that R0(X), for X U , is a set
consisting of pairs (A; v). So, in order to go back to the level of information systems
we need a method of retrieving the original attributes from this set, so as it would act as
A Att. Let Atex (attribute extraction) be defined by Atex(H) = fA : (A; v) 2 Hg
for H Att V al. Obviously, this a projection operation on the first coordinate and
it makes sense only for a family of regular scales. Consider the following example. Let
V al = f1; 2; 3; 4; 5; 6g be a set of values of some attribute A. Assume that a scaling
converts 1 to f1; 2g, 2 to f2; 3g, and all other values to f3; 4; 5; 6g. So after scaling A has
three value sets: f1; 2g, f2; 3g, f3; 4; 5; 6g. Let f (x; A) = f1; 2g and f (y; A) = f2; 3g.
Now, let us start with (U; fAg; V al; f ), then go to the corresponding CI , and compute
R0(fx; yg), which is f(A; 2)g – but this item does not make sense in our semantics S:
f2g is the meaning of neither element of V al. Thus, using set intersection \ we may
produce a new non-empty value set, which is not present in IS = (U; Att; V al; fs). A
scale is regular if that is not possible. Nominal and ordinal scales are regular.</p>
      </sec>
    </sec>
    <sec id="sec-16">
      <title>Only for regular scales we are able to define the concepts of the formal context on</title>
      <p>the level of attributes of information systems. Let IS = (U; Att; V al; fs) be a
multivalued information system obtained from an information system I = (U; Att; V al; f )
by means of regular scales S; then</p>
      <p>R0(R0(X)) = fy 2 U : 8A 2 Atex(R0(X)) 9x 2 X (x InclA y)g:</p>
    </sec>
    <sec id="sec-17">
      <title>If the scale is not regular, then the following inclusion holds only:</title>
      <p>R00(X) = fy 2 U : 8A 2 Atex(R0(X)) 9x 2 X (x InclA y)g
R0(R0(X)):</p>
      <sec id="sec-17-1">
        <title>Therefore, in such a case we need a new name R00 for this operator.</title>
        <p>Let us now consider a decision table IG = (U; Att; V al; G; f ), that is, an
information system I = (U; Att; V al; f ) equipped with a decision (goal) attribute G 62 Att
and f being defined on Att [ fGg. The semantics S for IG needs now to include a
scale RG = (V alG; V alG; RG) for the decision attribute. The main goal is to
approximate a given pair (G; [v]RG ), where v 2 V alG is a specific distinguished value. More
precisely, we want to approximate the set X = fx 2 U : (v; f (x; G)) 2 RGg.</p>
      </sec>
      <sec id="sec-17-2">
        <title>Let us take two scaling methods for the decision attribute subjective quality of life:</title>
        <p>nominal scale N om: RG = f(i; j) : i; j 2 f1; 2; 3g &amp; i = jg;
ordinal scale Ord: RG = f(i; j) : i; j 2 f1; 2; 3g &amp; j ig.</p>
        <p>The nominal scale N omG interprets 1 as low quality of life, 2 as average quality of life,
and 3 as high quality of life. As expected, the ordinal scale OrdG interprets 1; 2, and 3
as: at least low quality of life, at least average quality of life, and at least high quality
of life, respectively. Now, let us take the value 3 as the distinguished value of the
decision attribute. Then under N omG we are going to approximate the set fBob; Steveng,
however, under OrdG the set to be approximated is fAgnes; Bob; Steveng.</p>
        <p>
          The dominance-based rough set approach (DRS) [
          <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
          ] is actually a kind of rough
set theory rendered according to the above ideas and the ordinal scaling method. An
information system is equipped with a dominance relation D, that is, we consider
the system I = (U; Att; V al; f; D) (see Section 2). This system induces a
multivalued information system IS = (U; Att; V al; fs), where S consists of ordinal scales
(V alA; V alA; RA) for every A 2 Att. Before we recall the definitions of the
approximation operators, we need a few auxiliary concepts:
– D+(x) = fy 2 U : x D yg (a set of objects dominating x, or better than x);
– D (x) = fy 2 U : y D xg (a set of objects dominated by x, or worse than x);
– Decision attribute G, VG = T , Clt = fx 2 U : f (x; G) = tg,
where t; s 2 T . It is additionally assumed that
        </p>
        <p>Clt =
[ Cls and Clt =</p>
        <p>[ Cls;
s Gt</p>
        <p>t Gs
Cls \ Clt = ; for s 6= t and
[ Cls = U:
s2T</p>
      </sec>
      <sec id="sec-17-3">
        <title>Classification patterns to be discovered are functions representing granules Cl t and</title>
        <p>Cl t by means of granules D+(x) and D (x). It is worth emphasising that due to
the preference order, the sets to be approximated are not the particular Clt (for some
t 2 VG), but the upward and downward unions.</p>
      </sec>
    </sec>
    <sec id="sec-18">
      <title>As said in the previous section, DRS may be represented in terms of ordinal scaling.</title>
    </sec>
    <sec id="sec-19">
      <title>In what follows we would like to make this scaling explicit in DRS and use relations</title>
      <sec id="sec-19-1">
        <title>Ind and Incl (from multivalued information systems) rather than the dominance re</title>
        <p>lation D. Let us consider a multivalued information system IS = (U; Att; V al; fs)
obtained from I = (U; Att; V al; f; D) by means of a scaling set S. Please note that
due to the ordinal scaling of all attributes of I, it holds that:</p>
        <p>D+(x) = fy 2 U : x Incl yg = R0(R0(fxg));
D (x) = fy 2 U : x Incl 1 yg = fy 2 U : y Incl xg;</p>
        <p>Clt = fy 2 U : 9x (x InclfGgy &amp; x 2 Clt)g;</p>
        <p>Clt = fy 2 U : 9x (x InclfG1gy &amp; x 2 Clt)g:</p>
      </sec>
    </sec>
    <sec id="sec-20">
      <title>The specific interpretation of ordinal scaling in DRS makes a new type of inconsistency in data tables possible:</title>
      <p>(a) an object x belongs to Cl t (that is, it belongs to Clt or a class better than Clt),
but it is dominated by some objects y 62 Cl t (it is dominated by some object from
a worse class),
(b) an object x belongs to the class Cl t (that is, it belongs Clt or a class worse than
Clt), but it dominates some object y 62 Cl t (it dominates some object from a
better class).</p>
    </sec>
    <sec id="sec-21">
      <title>These objects are regarded as borderline cases: they might or might not belong to a given class. In consequence, in DRS we consider the following approximations:</title>
      <p>Clt = fx 2 U : D (x)
Clt g;
Clt = fx 2 U : D+(x) \ Clt 6= ;g;
Clt = fx 2 U : D+(x)
Clt g;
Clt = fx 2 U : D (x) \ Clt 6= ;g:</p>
      <sec id="sec-21-1">
        <title>As earlier, our aim is to express these approximation operators by means of Incl. Thus,</title>
        <p>let be given an information system I = (U; Att; V al; f; D) and its corresponding
multivalued information system (U; Att; V al; fs), obtained by means of the ordinal
scaling method Ord. Then</p>
        <p>D (x) = fy 2 U : 8x 2 U (x Incl y ) x 2 Clt )g;
D (x) = fy 2 U : 9x 2 U (x Incl 1 y &amp; x 2 Clt )g;</p>
        <p>D+(x) = fy 2 U : 8x 2 U (x Incl 1 y ) x 2 Clt )g;
Clt =</p>
        <p>Clt =
Clt =</p>
        <p>[
D (x) Clt</p>
        <p>[
x2Clt</p>
        <p>[
D+(x) Clt</p>
        <p>[
x2Clt
Clt =</p>
        <p>D+(x) = fy 2 U : 9x 2 U (x Incl y &amp; x 2 Clt )g:</p>
      </sec>
      <sec id="sec-21-2">
        <title>So, we are able to transfer FCA, RST, and DRS, along with explicitly given semantics S</title>
        <p>(a family of scales RA for every attribute A 2 Att) into the framework of multivalued
information systems. The connections between the operators discussed above are as
follows:</p>
        <p>Clt = LowIncl 1 (Clt );</p>
        <p>Clt = U ppIncl(Clt );</p>
        <p>Clt = LowIncl(Clt );
Clt = U ppIncl 1 (Clt )</p>
        <p>R0(R0(Clt )):</p>
      </sec>
      <sec id="sec-21-3">
        <title>Two important comments are needed. As long as we regard all above operators as approximation operators, RST and DRS based results are better than that coming from</title>
      </sec>
    </sec>
    <sec id="sec-22">
      <title>FCA. However, when we change the context, then the FCA operator may be more</title>
      <p>preferable. Secondly, in DRS, x Incl y is read as y is better than x. However, there are
other readings possible, e.g., we have more pieces of information about y than about x.</p>
      <p>
        Let us consider an example which brings new meanings for relations and
operators we have discussed so far. Let w be a serious disease which we have an antibiotic
working against. Let us assume that we have a test checking whether someone is ill,
but the antibiotic should be given to a patient before the disease develops. So, our aim
is to select people who will be given the medicine. One very expensive solution is to
give the medicine to all people (that is, all elements of the universe U ). On the other
extreme, we could give the medicine only to people who test positive for w. Of course,
both solutions are not good and we need to find another method of selection. We are
going to employ the John Stuart Mill inductive reasoning [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] here, which was designed
to solve problems of this type, but under complete knowledge. We shall focus our
attention on the very basic cannon, namely the direct method of agreement (Fig. 8). We are
not going to use the pure form of this cannon, but rather its rough set based rendering.
A B C D
A E F G
occur together with w, v
occur together with w, z
      </p>
      <p>Therefore A is the cause of w</p>
      <p>Let be given an information system I = (U; Att; V al; f ) representing our (medical)
knowledge about people. At first, we employ a semantics S consisting only of ordinal
scales such that for each attribute A of IS = (U; Att; V al; fS ), the relation x InclA y
means that y is in a worse medical condition than x with respect to A and w. In our
settings, Mill’s inductive reasoning is modelled in the following way. The concept w
is actually a set fx 2 U : f (x; w) = &gt;), where V alw = f&gt;; ?g (truth and false,
respectively). The scale Rw is given by f(&gt;; &gt;); (&gt;; ?); (?; ?)g, so if f (x; w) = &gt;,
then fS (x; w) = f&gt;; ?g, and if f (x; w) = ?, then fS (x; w) = f?g. Thus, as required,
if x Inclw y, then y is in the same or worse medical condition than x, so if x is ill, then
y must be ill as well. Let Cl&gt; be a set of positive examples of w (direct method of
agreement). The term “in common” (Fig. 8) is beyond the expressive power of pure</p>
    </sec>
    <sec id="sec-23">
      <title>RST and DRS. However, in our case we can retrieve common attributes by means of</title>
      <p>Atex(R0(Cl&gt;)). Now we can compute possible solutions to our problem:
Cl&gt;</p>
      <p>U ppInd(Cl&gt;)</p>
      <p>U ppIn1cl(Cl&gt;) = Cl&gt;</p>
      <p>R0(R0(Cl&gt;):
As said Cl&gt; is an extreme solution, another one may be given by U ppInd(Cl&gt;) (that is,
we give the antibiotic to all people with exactly the same medical description in terms of
conditional attributes as some patients having positive test for w). It seems reasonable,
however the next solution U ppIn1cl(Cl&gt;) = Cl&gt; is much better: we give the medicine
to all people with the same or worse medical condition than some patients who have
positively tested for w. Better still, we may apply the direct method of agreement and
give medicine to all people in R0(R0(Cl&gt;) having medically worse results only for
attributes which seem to be relevant to w. It is worth emphasising that regarded as
an approximation of Cl&gt;, the set R0(R0(Cl&gt;) is the worst candidate, but in this very
settings it is the best solution for the problem at issue. Consider a non-regular scale
now and assume that all people who positively tested on w have problems with blood
pressure A. So scaling of A shows how unstable is the pressure. Some patients may
have value fnormal; highg, some flow; highg, and some flow; normal; highg, but
none of them has fnormalg. This time R0(R0(Cl&gt;) is a very bad solution, because it
may include people with a normal blood pressure. However, we can still use the direct
method of agreement in a modified version, and take R00(Cl&gt;) as the solution.
4</p>
      <sec id="sec-23-1">
        <title>Conclusions</title>
        <p>
          In the paper we have investigated the implicit semantics used in some leading
theories of data analysis: rough set theory (RST) [
          <xref ref-type="bibr" rid="ref4 ref5 ref6">4–6</xref>
          ], dominance based rough set theory
(DRS) [
          <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
          ], and formal contexts from formal concept analysis (FCA) [
          <xref ref-type="bibr" rid="ref1 ref9">1, 9</xref>
          ]. We have
presented all theories within the unifying framework of multivalued information
systems [
          <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
          ], enriched with the scaling methods from FCA. We have also discussed the
relations between the operators coming from these theories, and presented their
different interpretations in the context of John Stuart Mill inductive reasoning [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
        </p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Ganter</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wille</surname>
          </string-name>
          , R.:
          <article-title>Formal Concept Analysis</article-title>
          .
          <source>Mathematical Foundations</source>
          , Springer Verlag (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Mill</surname>
            ,
            <given-names>J.S.:</given-names>
          </string-name>
          <article-title>A System of Logic</article-title>
          . Vol.
          <volume>1</volume>
          .
          <string-name>
            <surname>London</surname>
          </string-name>
          (
          <year>1843</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Orłowska</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pawlak</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>Representation of nondeterministic information</article-title>
          .
          <source>Theoretical Computer Science 29</source>
          , pp.
          <fpage>27</fpage>
          -
          <lpage>39</lpage>
          (
          <year>1984</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Pawlak</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>Systemy informacyjne</article-title>
          .
          <source>Podstawy teoretyczne</source>
          . Warszawa, WNT (
          <year>1983</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Pawlak</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <source>Rough Sets: Theoretical Aspects of Reasoning about Data</source>
          . Kluwer Academic Publisher (
          <year>1991</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Pawlak</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>Wiedza z perspektywy zbiorów przybliz˙onych</article-title>
          .
          <source>Institute of Computer Science Report</source>
          <volume>23</volume>
          , (
          <year>1992</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Słowinski</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Greco</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Matarazzo</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Dominance-based rough set approach to reasoning about ordinal data</article-title>
          .
          <source>Lecture Notes in Artificial Intelligence</source>
          <volume>4585</volume>
          ,
          <fpage>5</fpage>
          -
          <lpage>11</lpage>
          , (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8. Słowin´ski, R.,
          <string-name>
            <surname>Greco</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Matarazzo</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Rough sets in decision making</article-title>
          . In: R.A.Meyer y (Ed.),
          <source>Encyclopedia of Complexity and Systems Science</source>
          , Springer, NY,
          <fpage>7753</fpage>
          -
          <lpage>7786</lpage>
          , (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Wille</surname>
          </string-name>
          , R.:
          <article-title>Concept lattices and conceptual knowledge systems</article-title>
          .
          <source>Computers &amp; Mathematics with Applications</source>
          <volume>23</volume>
          ,
          <fpage>493</fpage>
          -
          <lpage>515</lpage>
          , (
          <year>1992</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>