<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Representing Vague Places : Determining a Suitable Method</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute for Geoinformatics, University of Muenster</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <fpage>19</fpage>
      <lpage>25</lpage>
      <abstract>
        <p>The representation of places with vague or ill-de ned boundaries continues being an issue for information systems. Despite the presence of multiple representation methods, it is still unclear how to determine which approach is best suited for a particular task. This paper proposes a set of characteristics based on the application domain, conceptual, and logical levels and di erentiates the approaches according to these characteristics. We demonstrate how they are matched with the task requirements and in uence the choice of representation method.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Representing `place' poses challenges, more so when the extents or boundaries
cannot be well-de ned. Although humans are capable of interpreting what is
being referenced in such cases, handling these in information systems is more
complex. Some of the associated problems are discussed in literature under
discipline of spatial vagueness and uncertainty, especially their philosophical
and representational aspects. The former aspects address whether vagueness is
intrinsic to the real world or just a feature of language[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], the di erent kinds
of vagueness [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and how to handle imperfection in geographic information [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
The latter suggest models and theories to handle spatial vagueness, each with
its distinct assumptions and properties. This has resulted in the development of
various representation methods such as probabilistic [
        <xref ref-type="bibr" rid="ref4 ref5">4,5</xref>
        ], fuzzy-set based [
        <xref ref-type="bibr" rid="ref6 ref7">6,7</xref>
        ],
egg-yolk model [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], rough-sets [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], and supervaluation[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], among others. We
propose a methodology to distinguish between di erent representation methods
based on their characteristics, which may then be matched with the application
requirements in order to determine a suitable method.
      </p>
      <p>
        No single representation can claim to be applicable for all cases. The methods
di er in the way assumptions are made about space, the underlying formal models,
applicability of data models and the kinds of reasoning they allow. Selecting the
right one for a given task is a matter of tness for purpose and requires that
the method's capabilities are matched to the requirements. Requirements vary
and can be speci ed in numerous ways. A consistent way of specifying these
requirements is needed. Our approach is to specify these in terms of the model
characteristics. The characteristics themselves may be de ned at di erent levels
similar to the levels of data abstraction in an information system [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
1. At the application domain level, a subset of the reality to be represented
is chosen with respect to a particular domain. We also specify what kind
of reasoning is to be performed on a representation. It is also important to
decide here whether vagueness is perceived to be intrinsic to the entity, or if
di erent possible interpretations should be supported.
2. The conceptual level is the next and deals with how the vague place is
conceptualized in an implementation independent fashion. Important concerns
here are, how the vague referent can be individuated ? (e.g. through use of
objecti able parameters), how it is demarcated ? and its identity (e.g. temporal
changes).
3. The logical level is the next and deals with more detailed speci cs such as
the data model of the data sources, or how the extents should be modelled.
In this paper, we identify a criteria set to determine suitable representation
methods for vague places. Section 2 analyzes the requirements of an application task.
Based on these requirements, section 3 develops a criteria set and di erentiates
vague representations in terms of these criteria. Section 4 gives an example how
to choose the right representation method for a given task based on using our
criteria set.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Analyzing Requirements for a Use Case</title>
      <p>Lake Carnegie in Australia is ephemeral. Depending on the amount of
precipitation the lake may or may not be lled with water (Fig. 1). Though a lake in
vernacular terms, in dry seasons it is reduced to a muddy marsh1. This presents
a problem, since it is now unclear where exactly the boundaries of the lake lie.</p>
      <p>Suppose a user needs a representation of Lake Carnegie. We examine a few
questions that need to be answered to arrive at a clear understanding of what
needs to be represented.
1. What is a `lake' ? First, the semantics of the term `lake' need to be clear. Is
it a single contiguous body of water or does it include smaller scattered pools
in the vicinity as well? Do the requirements dictate that water be present in
the lake all year round? This is treated as the rst step towards arriving at a
solution.
2. What is the purpose of representation? Requirements for an ecologist di er
from that of a cartographer. An ecologist is likely to be interested in the
variation of the lake over time; a fuzzy spatial extent rather than precisely
de ned boundaries being of importance. A cartographer is more interested
in the lake as a crisp object.
3. What data sources are available? The choice of representation is in uenced
by the data sources. A di erent method is needed for a representation built
up from satellite imagery, than another which uses water level observations
from sensors scattered through the lake.
1 http://www.nasa.gov/multimedia/imagegallery/image_feature_817.html
(a) Apr 2011</p>
      <p>(b) Sep 2011</p>
      <p>As one can infer, varying user needs and requirements must be met with a
suitable method to represent the same place. From the myriad of possibilities,
one needs a way to identify the right representation approach. Next, we brie y
propose certain characteristics and explain how these characteristics can be used
to distinguish among representation methods as the rst step to this end.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Methods of Representation</title>
      <p>From the di erent levels of abstraction, we identify characteristics which will
serve as the criteria to di erentiate between methods. Some commonly used
methods for representing vague places are then brie y analyzed based on these.
3.1</p>
      <sec id="sec-3-1">
        <title>Criteria for Di erentiation</title>
        <p>
          Starting from the di erent levels of abstraction, we propose the following
characteristics for use in deciding upon the correct method to employ for representation
of vague entities.
1. Conceptualization of space - The adopted perspective of vagueness (whether
it is linguistic or ontic) a ects the choice of the representational and semantic
framework [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. How the phenomenon is treated by the method forms a useful
basis for di erentiation. This also has implications on the kind of boundary
of the phenomenon (crisp, graduated, indeterminate etc.).
2. Formal model - This di erentiates between the methods based on whether
the underlying model is stochastic, fuzzy set based, three-valued logic or other.
3. Data model - Certain methods handle only regions (egg-yolk and
supervaluation) whereas others are well suited for points or grid based data structures
(fuzzy sets for instance). Since sources of data di er according to the data
model they use (raster versus vector data), it is important to consider how a
method behaves with respect to it. This also has implications on the kinds of
boundaries that can be de ned, e.g. how is a crisp boundary generated in
the raster data model?
4. Reasoning - This characteristic determines what kinds of reasoning can
be performed with the representation methods. Reasoning covers metric,
directional and topological operations performed on vague places. This is
particularly important from the perspective of a task, since it limits what
kind of analysis can be done on the vague place. Some representations provide
a well-de ned framework for reasoning, whereas others do not.
3.2
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Analysis of Representation Methods</title>
        <p>Base representations - We coin this term to refer to those methods which
abstract or crisp the vague place. Possible ways are to de ne the feature a priori
according to some metric, or reduce it to a simple feature type (point, line or
polygon), a minimum bounding rectangle (MBR) which covers the entire extent
of the space where the entity is located, or through tessellations of space. These
are usually in the form of vector data. Examples may be seen in VGI where a real
world feature is outlined by contributors (from GPS tracks or tracing from aerial
imagery), or in gazetteers where a feature is simply located by a representative
point. Here vagueness is not preserved, and they are generally not classi ed under
methods for vagueness representation. They are included here for the sake of
completeness since they are often applied and prove adequate in some cases. The
methods themselves do not provide any theory for reasoning.</p>
        <p>
          Probabilistic methods - These methods derive the membership value of an
individual in a set through a statistically de ned probability function. These are
used mainly to handle uncertainty. The underlying stochastic model assumes
that phenomena are crisp and knowable, with the result that no measurable way
for metrics such as precision in the case of vagueness exist [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. These methods
are best suited for phenomena with measurable objective properties such as
ow, temperature, or water level. Probabilistic interpretations have also been
employed to determine where city centres lie, based on probability of sample
points computed from trials using participant studies [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. These are generally
suited for point or eld based data and allow for a variety of statistical reasoning
techniques to be performed.
        </p>
        <p>
          Fuzzy-set methods - These are based on Fuzzy-set theory and ideal for
modelling objects which have graduating or indeterminate boundaries [
          <xref ref-type="bibr" rid="ref14 ref6 ref7">6,7,14</xref>
          ]. The
membership value ( 0 1), of a point in the region is highest at the core
of the region and decreases gradually as the boundary is approached.
Determination of membership value itself is subjective and may not relate directly to
the phenomenon itself. The model also allows for obtaining a crisp boundary by
means of -cuts which are a way of obtaining crisp sets from a fuzzy set. This
method is applicable in the case of both raster and vector data, where a feature
or cell may be assigned a membership. Reasoning using fuzzy set operations such
as intersection, union, complement etc. can also be performed [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].
Egg-yolk method - A vague region is considered analogous to an egg { the yolk
corresponds to the minimal extent, the white being the indeterminate region and
its maximal extent. Any acceptable crisping must lie between these inner and
outer subregions. This method allows performing qualitative spatial reasoning
between vague regions or between a vague region and a crisp region under the
framework of the regional connection calculus (RCC-5) [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. Reasoning is possible
on di erent possible con gurations of two regions represented this way. However,
the theory itself does not make any assertion as to how the crisp regions are
obtained. Egg-yolk models are ideal when topological reasoning on vector data
in the form of regions is to be performed.
        </p>
        <p>
          Rough sets - The basis for rough sets is the indiscernibility relation { where a
collection of elements is indiscernible from another. Rough sets use a three-valued
logic (true; f alse; maybe) to determine the membership of a point to a region as
opposed to the binary notion of membership (true; f alse) in classical set theory.
Similar to the egg-yolk model, a region may be represented by its determinate
lower approximation and an indeterminate upper approximation [
          <xref ref-type="bibr" rid="ref3 ref9">9,3</xref>
          ]. Rough
sets are ideal for reasoning on multi-resolution raster data, where a change in
resolution results in indiscernibility.
        </p>
        <p>
          Supervaluation - The idea behind supervaluation is to account for the di erent
possible interpretations of a vague predicate when multiple interpretations for a
vague region exist. The positive extension is where all interpretations are true.
Its inverse the negative extension is the region where no interpretation is true.
The remaining regions constitute the penumbra [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. Supervaluation enables
use of classical logic to reason about vagueness, but computational applications
are hampered by the fact that all admissible interpretations must be explicitly
speci ed, which is di cult in practice [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. These are applicable in data models
where regions are primitives and allow for reasoning on vague regions where
several boundaries may be associated with an object.
4
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Determining a Suitable Representation - An Example</title>
      <p>We take the example of Lake Carnegie and consider two di erent sets of
requirements for representations.</p>
      <p>{ The cartographer imagines the lake to be a single contiguous body of water
with a crisp boundary, though the reality is di erent. Satellite imagery is
used as source of data. No reasoning needs to be performed.
{ The ecologist views the lake as a non-crisp object de ned by level of water.</p>
      <p>Water level observation data from sensors is available. The need is to generate
a surface exhibiting water presence over a period of time.</p>
      <p>In the rst case, the space is conceptualized as a crisp body. Data from
satellite imagery is a raster, from which the boundary needs to be derived. One
obvious solution is to simply trace the outline from the image, resulting in a base
representation. The suggested approach in this case is however to use fuzzy-set
modelling with -cuts. By varying , di erent boundaries (each of which is crisp)
can be obtained. One such representation by using fuzzy membership of pixel
values and obtaining a crisp boundary is seen in Fig. 2a.</p>
      <p>In the second case, the lake is conceptualized depending on its objective
property (water level). Spatial distribution of the data source, sensors which
provide observations, can be thought of as consisting of points (vector). Since
the user needs to obtain interpolated values in order to obtain a lake surface,
application of probabilistic methods is suitable here. A probabilistic representation
simulated from randomly distributed sensors with arbitrary observations is shown
in Fig. 2b, with outline of the lake from OpenStreetMap 2 for reference.</p>
      <p>This is a trivial example, but the same principles apply in other cases as
well. For example, in the Tell Us Where3 project dataset, it would be possible
to obtain a representation of places with noncrisp boundaries. This however has
not been attempted here owing to sampled locations in the current dataset being
insu cient in number to demonstrate our cause.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>Various representation methods have been proposed in literature for the
representation of places with vague boundaries. It is important to enable decision makers
to adopt the right method based on their needs. The approach taken here uses
multiple levels of abstraction to specify the requirements in a consistent manner.
The levels allow identi cation of characteristics with which di erent methods can
be analyzed. Di ering requirements in the modelling of a vague region such as a
lake can lead to di erent possible solutions as presented in the lake use case.
6</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgements</title>
      <p>This research was carried out under the International Research Training Group
on Semantic Integration of Geospatial Information (IRTG-SIGI) and is funded
by the DFG (German Research Foundation), GRK 1498.
2 http://www.openstreetmap.org
3 http://telluswhere.net</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Varzi</surname>
            ,
            <given-names>A.C.</given-names>
          </string-name>
          :
          <article-title>Vagueness in geography</article-title>
          .
          <source>Philosophy &amp; Geography</source>
          <volume>4</volume>
          (
          <issue>1</issue>
          ) (
          <year>2001</year>
          )
          <volume>49</volume>
          {
          <fpage>65</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bennett</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Modes of concept de nition and varieties of vagueness</article-title>
          .
          <source>Applied Ontology</source>
          <volume>1</volume>
          (
          <issue>1</issue>
          ) (
          <year>2005</year>
          )
          <volume>17</volume>
          {
          <fpage>26</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Duckham</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mason</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stell</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Worboys</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>A formal approach to imperfection in geographic information</article-title>
          .
          <source>Computers, Environment and Urban Systems</source>
          <volume>25</volume>
          (
          <issue>1</issue>
          ) (
          <year>2001</year>
          )
          <volume>89</volume>
          {
          <fpage>103</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Leung</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yan</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>A locational error model for spatial features</article-title>
          .
          <source>International Journal of Geographical Information Science</source>
          <volume>12</volume>
          (
          <issue>6</issue>
          ) (
          <year>1998</year>
          )
          <volume>607</volume>
          {
          <fpage>620</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Montello</surname>
            ,
            <given-names>D.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goodchild</surname>
            ,
            <given-names>M.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gottsegen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fohl</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Where's downtown?: Behavioral methods for determining referents of vague spatial queries</article-title>
          .
          <source>Spatial Cognition &amp; Computation</source>
          <volume>3</volume>
          (
          <issue>2</issue>
          ) (
          <year>2003</year>
          )
          <volume>185</volume>
          {
          <fpage>204</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Usery</surname>
            ,
            <given-names>E.L.:</given-names>
          </string-name>
          <article-title>A conceptual framework and fuzzy set implementation for geographic features</article-title>
          . In Burrough, P.A.,
          <string-name>
            <surname>Frank</surname>
          </string-name>
          , A.U., eds.:
          <article-title>Geographic Objects with Indeterminate Boundaries</article-title>
          . Taylor and Francis, London (
          <year>1996</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hall</surname>
          </string-name>
          , G.B.:
          <article-title>Fuzzy representation of geographical boundaries in gis</article-title>
          .
          <source>International journal of geographical information systems 10(5)</source>
          (
          <year>1996</year>
          )
          <volume>573</volume>
          {
          <fpage>590</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Cohn</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gotts</surname>
            ,
            <given-names>N.:</given-names>
          </string-name>
          <article-title>The `egg-yolk' representation of regions with indeterminate boundaries</article-title>
          .
          <source>Geographic Objects With Indeterminate Boundaries</source>
          <volume>2</volume>
          (
          <year>1996</year>
          )
          <volume>171</volume>
          {
          <fpage>187</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Worboys</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>Imprecision in nite resolution spatial data</article-title>
          .
          <source>GeoInformatica</source>
          <volume>2</volume>
          (
          <issue>3</issue>
          ) (
          <year>1998</year>
          )
          <volume>257</volume>
          {
          <fpage>279</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Kulik</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>A geometric theory of vague boundaries based on supervaluation</article-title>
          . In Montello, D., ed.:
          <source>Spatial Information Theory. Volume 2205 of Lecture Notes in Computer Science</source>
          . Springer Berlin / Heidelberg (
          <year>2001</year>
          )
          <volume>44</volume>
          {
          <fpage>59</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Worboys</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Duckham</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>GIS: A Computing Perspective. 2nd edn</article-title>
          . CRC Press (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Bennett</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Spatial vagueness</article-title>
          . In Jeansoulin, R.,
          <string-name>
            <surname>Papini</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Prade</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schockaert</surname>
          </string-name>
          , S., eds.:
          <source>Methods for Handling Imperfect Spatial Information</source>
          . Springer (
          <year>2011</year>
          )
          <volume>15</volume>
          {
          <fpage>47</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Duckham</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sharp</surname>
          </string-name>
          , J.:
          <article-title>Uncertainty and geographic information: Computational and critical convergence</article-title>
          . In:
          <article-title>Re-presenting GIS</article-title>
          . John Wiley &amp; Sons Ltd. (
          <year>2005</year>
          )
          <volume>113</volume>
          {
          <fpage>124</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14. Fisher,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Wood</surname>
          </string-name>
          , J., Cheng, T.:
          <article-title>Where is Helvellyn? Fuzziness of multi-scale landscape morphometry</article-title>
          .
          <source>Transactions of the Institute of British Geographers</source>
          <volume>29</volume>
          (
          <issue>1</issue>
          ) (
          <year>2004</year>
          )
          <volume>106</volume>
          {
          <fpage>128</fpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>