<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Automating geological mapping: A constraint-based approach</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Azimjon Sayidov</string-name>
          <email>azimjon.sayidov@geo.uzh.ch</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Robert Weibel</string-name>
          <email>robert.weibel@geo.uzh.ch</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Zurich</institution>
          ,
          <addr-line>Winterthurerstrasse 190, Zurich</addr-line>
          ,
          <country country="CH">Switzerland</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <abstract>
        <p>Cartographic generalization in geological mapping is receiving increasing interest, though only few reliable automated generalization tools are available for this purpose today. Thus, improvements to methods for the generalization of categorical data, such as geological or soil maps are in demand. We advocate a constraint-based approach for geological map generalization, which could be implemented by integrating vector and raster based generalization methods. The research is divided into three parts: conceptual development, process modelling and data processing, and vector and raster based geological map generalization. In the first part, we develop the general methodology of the research, including identification and classification of constraints for geological map generalization, while the second part is dedicated to process modelling and its implementation. The third part of the research evaluates the results of generalization while comparing advantages and drawbacks of vector-based generalization against raster-based generalization. Below we give a short summary of the overall research idea highlighting the gaps found, methods used and some initial results.</p>
      </abstract>
      <kwd-group>
        <kwd>Geological mapping</kwd>
        <kwd>map generalization</kwd>
        <kwd>constraint-based</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Map generalization is both a central and complex process if
map-making. This process is responsible for producing legible
and useful maps, by making choices about what to display,
simplify, aggregate or even emphasize for specific map
purpose. Due to the importance of map generalization, its
automation has been an active area of research for several
decades [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Most research on map generalization, however,
has focused on topographic maps, which are the most common
map type used (e.g. national maps, Google maps etc.). Specific
thematic maps, such as geological map, which have specific
geometrical and topological demands, have been largely
neglected by generalization research [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Moreover, applying
the same strategies and processes used for topographic map
generalization to categorical mapping would not render a
proper solution as requirements and procedures for geological
map generalization are quite different from topographic
mapping.
      </p>
      <p>Geological maps are among the most complex thematic
maps, with various elaborate shapes and structures, rendering
the generalization process more demanding and require
indepth analysis of these structures prior to the generalization.
One of the key properties of geological maps is that the entire
map space is covered by polygons, with no overlaps or gaps.</p>
      <p>Geological maps contains big, small, long and narrow,
concave and convex, round and rectangular and etc. shapes of
polygons and generalization of such complex fabrics requires
making multiple interrelated and possibly conflicting
generalization decisions. Such situations can be best formalized
and controlled by using constraints.</p>
      <p>
        The constraint-based approach to automating map
generalization has emerged as the leading paradigm over the
past two decades [
        <xref ref-type="bibr" rid="ref14 ref3">3, 14</xref>
        ]. In this approach, constraints are
understood as design specifications and graphical condition
that a valid map should adhere to. For instance, map objects
should be sufficiently large to remain visible and legible on a
reduced scale map; or map objects should be separated by
sufficient space to remain visually separable when the map
scale is reduced. In these two simple examples, a constraint
would be defined for the minimum size, and a second one for
the minimum separation distance. If any of these constraints are
violated, a conflict resolution action is triggered, such as in the
first case, when a map object becomes too small, it may be
either removed or enlarged, depending on whether it is
considered unimportant or important. The definition of
constraints has the advantage of formulating the map
generalization in a modular fashion, and formulating it as an
optimization problem [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>The overall objective of the research is to develop a
methodology to automatically generalize geological maps
using a constraint-based approach. The methodology considers
the generalization of individual polygons as well as group of
polygons. This papers presents a methodology that deals with
the individual polygons in the geological maps. Next, step of
the research however, is dedicated to a procedure to detect
meaningful groups of polygons as a precursor to generalizing
these polygon groups.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Background</title>
      <p>
        Generalization of categorical maps can be carried out in raster
as well as in vector environments, depending on the demand on
the output. Thus, researches are divided in two parts. Early
research aiming at generalization in a raster environment was
carried out by [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] or [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. In vector representations [
        <xref ref-type="bibr" rid="ref1 ref12 ref2 ref6 ref7">7, 1, 2, 12,
6</xref>
        ] provide examples. The integration of methods for both
representations was addressed by [
        <xref ref-type="bibr" rid="ref11 ref8">8, 11</xref>
        ]. The approach of [
        <xref ref-type="bibr" rid="ref11 ref13">11,
13</xref>
        ] is confined to raster-based generalization, i.e. to maps that
exist in raster form, where it works relatively well. In terms of
available software tools for geological map generalization, the
work by [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] still defines the state of the art. However, the
approach is not able to explicitly consider cartographic
properties of features such as the size of polygons or the
distance between them.
      </p>
      <p>Moreover, since most geological maps are stored in vector
format, data will have to be converted to raster format in order
to execute the generalization step, and subsequently back to
vector format again. These two conversion steps cause a loss of
data accuracy, which is a further drawback of the approach.
Thus, the conceptual approach used in this paper aims to
improve existing methods for the generalization of geological
maps by firstly identifying constraints for geological map
generalization and modelling them for integrated vector and
raster approaches, which are at the same time able to provide
quality control for the target map.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Methodology and initial results</title>
      <p>Our conceptual framework is based on defining constraints,
defining corresponding measures, modelling the generalization
process and finally executing the process, while monitoring
quality evaluation. Moreover, it may also be regarded as a
dynamic generalization model guided by constraints, where
decisions depend on the semantic and geometrical
characteristics of an object or set of objects, requiring the
existence of procedural knowledge in order to appropriately
select map generalization operators and algorithms.
In categorical maps typically the entire surface of the map is
covered with contiguous polygons or areal features, with no
holes nor overlaps. Such maps can equally be modelled as a
vector or raster data representation, respectively.</p>
      <p>
        Raster generalization is seen by some authors as the preferred
choice and ideal for geological mapping at all scales [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], using
classification, reclassification, majority filters, or low and high
pass filters. However, it is generally not recommended to use
raster generalization, unless there is a good reason, such as if
the source map is in raster format or if only raster operators can
handle a particular task. Otherwise, converting vector data to
raster causes loss of information as well as positional accuracy
of the features in the map.
      </p>
      <p>The vector representation lends itself better to geometrical
transformations of vertices, such as shifting the position of
individual vertices, or removing vertices or polygons
altogether. Also, since geological units are modelled as entire
polygons rather than simply as a collection of pixels, spatial
relations between polygons can be explicitly modelled,
enabling better contextual operations, such as contextual
aggregation of sub-categories to a unique category.
The next main steps of the framework consist in defining the
generalization constraints, and in defining the measures that
can implement the previously defined constraints and thus
assess whether any constraints are violated.</p>
      <p>Constraints dictate the decisions, limit the search space of the
generalization process and reduce the content of the map, while
generalizing it. They can be defined conceptually regardless of
the spatial data model used, vector or raster, however their
implementation may differ. For instance, if the pixel size of a
raster is already larger than the minimum visual separation
limit, the associated constraints (minimum size, minimum
separation distance) will not apply. Similarly, the measures
used to implement the constraints will differ between the two
spatial data models. For instance, distances are measured
differently in vector or raster data.</p>
      <p>
        In the generalization process constraints have the following
functions (Figure 1): conflict detection - to identify areas that
have to be generalized, for example by evaluating the quantity
and severity of constraint violations; and conflict resolution
to guide the choice of operators according to constraints
priorities [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>conflict detection</p>
      <p>conflict resolution
value</p>
      <sec id="sec-3-1">
        <title>Severity</title>
        <p>Method</p>
      </sec>
      <sec id="sec-3-2">
        <title>Evaluation</title>
        <p>method</p>
      </sec>
      <sec id="sec-3-3">
        <title>Measure(s)</title>
        <p>value</p>
      </sec>
      <sec id="sec-3-4">
        <title>Goal value</title>
        <p>value</p>
      </sec>
      <sec id="sec-3-5">
        <title>List of plans</title>
        <p>value</p>
      </sec>
      <sec id="sec-3-6">
        <title>Importance</title>
        <p>value</p>
      </sec>
      <sec id="sec-3-7">
        <title>Priority</title>
        <p>
          Graphical constraints, also referred to as size constraints, are
related to the readability of the map features, such as size, width
and differentiation of the objects. They are detected by
graphical legibility limits and are handled in the first part of the
research. Six size constraints as well as associated measures
have been identified (Figure 2): 1. The number of polygons in
the source and target scale should correspond to the number
which identified by Radical Law [
          <xref ref-type="bibr" rid="ref15 ref16">15, 16</xref>
          ] (1).
        </p>
        <p>2
2. The minimum area of polygons should not be less than
1250 m2 (for the example of a transition from 1:25k to 1:50k);
if there are polygons less then this limit they are either
removed, enlarged based on their geological importance, or
aggregated based on their similarities with neighbouring
polygons. 3. The distance between polygons should not be less
than 25 meters, and if so, they are either aggregated (again
based on the geological properties) or displaced to the
minimum distance. 4. and 5. The distance between consecutive
vertices and the outline granularity may be handled by a
bandwidth simplification algorithm and smoothing
respectively, removing vertices that are very close and giving
the shape a smoother look, respectively. 6. The distance
between interior boundaries of a polygon should be larger than
15 meters. If not, the polygon is grown by a certain value, until
its width reaches the corresponding graphical limit (Figure 2).
We have recently developed a workflow-based methodology
that implements the above size constraints (Sayidov &amp; Weibel,
in prep.). The methodology starts by detecting polygons that
are too small. Depending on their geological importance, they
are then either enlarged or removed. Proximity conflicts that
may have been caused by the enlargement of polygons then
trigger a series of aggregation and displacement operations, and
finally the remaining size constraints are dealt with.
So far, in the first stage of this research, we have only
considered constraints that deal mostly with single polygons or
groups of polygons confined to their immediate
neighbourhood. The next, second stage will deal with groups of
polygons or polygon patterns, which could be regarded as
constraints on the level of the entire map. These include e.g.
‘number of categories’, ‘area ratios’, ‘group polygons
proximity’, ‘maintenance of overall shape of patches’. On the
other hand, these two stages, or levels, are closely connected
and it seems fit to always link them and iterate between the two
levels (i.e. individual polygons vs. groups of polygons). For
instance, reducing the number of polygons in reaction to the
minimum area constraint will directly affect the constraints
‘maintenance of overall shape of patches’, ‘group polygons
proximity’, and ‘area ratio between source and target map’
which belong to the group level and map level constraints.
The final stage of this PhD research will cover the comparison
of operators used in vector- and raster-based geological map
generalization to assess their corresponding advantages and
weaknesses in order to make further recommendations
regarding the integration of these two approaches.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>This PhD project departs from the hypothesis that automating
the generalization of geological maps can be made more
objective and flexible by integrating vector and raster-based
generalization techniques and by guiding and monitoring the
process with constraints that define cartographic requirements
and legibility principles. Defining constraints, taking into
account the properties and peculiarities of geological maps,
however, is a key point accompanied by logical and structural
integration of generalization algorithms. It does not only
require generalization algorithms, but also algorithms that
implement the measures needed to assess whether the
constraints are maintained.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Downs</surname>
            ,
            <given-names>T. C.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Mackaness</surname>
            ,
            <given-names>W. A.</given-names>
          </string-name>
          (
          <year>2002</year>
          ).
          <article-title>An Integrated Approach to the Generalization of Geological Maps</article-title>
          . In
          <source>Cartographic Journal, The</source>
          ,
          <volume>39</volume>
          (
          <issue>2</issue>
          ),
          <fpage>137</fpage>
          -
          <lpage>152</lpage>
          . http://doi.org/10.1179/000870402786962489.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Galanda</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2003</year>
          ).
          <article-title>Automated Polygon Generalization in a Multi Agent System</article-title>
          .
          <source>PhD thesis</source>
          , Department of Geography, University of Zurich,
          <volume>188</volume>
          pages. Retrieved from http://www.geo.uzh.ch/fileadmin/files/content/abteilunge n/gis/research/phd_theses/thesis_MartinGalanda_
          <year>2003</year>
          .p df.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Harrie</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Weibel</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <year>2007</year>
          .
          <article-title>Modelling the Overall Process of Generalisation</article-title>
          . In: Mackaness,
          <string-name>
            <given-names>W.A.</given-names>
            ,
            <surname>Ruas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            &amp;
            <surname>Sarjakoski</surname>
          </string-name>
          , L.T. (eds.).
          <source>Generalisation of Geographic Information: Cartographic Modelling and Applications. Elsevier Science</source>
          ,
          <volume>67</volume>
          -
          <fpage>87</fpage>
          . http://doi.org/10.1016/B978-0-
          <fpage>08</fpage>
          -045374-3.
          <fpage>X5000</fpage>
          -5.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Mackaness</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruas</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Sarjakoski</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <year>2007</year>
          .
          <article-title>Generalisation of Geographic Information</article-title>
          .
          <source>In Cartographic Modelling and Applications</source>
          . Oxford: Elsevier. http://doi.org/10.1016/B978-0-
          <fpage>08</fpage>
          -045374- 3.
          <fpage>X5000</fpage>
          -5.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Marjoribanks</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Geological methods in mineral exploration and mining</article-title>
          .
          <source>In Geological Methods in Mineral Exploration and Mining</source>
          . http://doi.org/10.1007/978-3-
          <fpage>540</fpage>
          -74375-0.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>McCabe</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>Vector Approaches to Generalizing Faults and</article-title>
          Polygons in 1:24,000 Geologic Maps: Santa Rosa, California, Case Study. https://www.geovista.psu.edu/publications/2008/CMcCa be_GeologicMapGeneralization.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Muller</surname>
            ,
            <given-names>J. C.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          (
          <year>1992</year>
          ).
          <article-title>Area-patch generalization for land use and land cover maps</article-title>
          .
          <source>In Cartographic Journal</source>
          ,
          <volume>29</volume>
          (
          <issue>2</issue>
          ),
          <fpage>137</fpage>
          -
          <lpage>144</lpage>
          , (
          <year>1992</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Peter</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Weibel</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          (
          <year>1999</year>
          ).
          <article-title>Using vector and rasterbased techniques in categorical map generalization</article-title>
          .
          <source>In Third ICA Workshop on Progress in Automated Map Generalization.</source>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Sayidov</surname>
            ,
            <given-names>A.K.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Weibel</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <article-title>(in prep</article-title>
          .):
          <article-title>Size constraints: Geological map generalization</article-title>
          .
          <source>In Cartography and Geographic Information Science.</source>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Schylberg</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          (
          <year>1993</year>
          ).
          <article-title>Computational Methods for Generalization of Cartographic Data in a Raster</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Smirnoff</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paradis</surname>
            ,
            <given-names>S. J.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Boivin</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>Generalizing surficial geological maps for scale change: ArcGIS tools vs. cellular automata model</article-title>
          .
          <source>In Computers and Geosciences</source>
          ,
          <volume>34</volume>
          (
          <issue>11</issue>
          ),
          <fpage>1550</fpage>
          -
          <lpage>1568</lpage>
          . http://doi.org/10.1016/j.cageo.
          <year>2007</year>
          .
          <volume>10</volume>
          .013.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Steiniger</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Weibel</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>A Conceptual Framework For Automated Generalization and its Application to Geologic and Soil Maps</article-title>
          .
          <source>In Proceedings 22nd International Cartographic Conference, La Coruña (Spain)</source>
          ,
          <fpage>11</fpage>
          -
          <issue>16</issue>
          <year>July 2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Steiniger</surname>
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weibel</surname>
            <given-names>R.</given-names>
          </string-name>
          ,
          <year>2007</year>
          .
          <article-title>Relations among map objects in cartographic generalization</article-title>
          .
          <source>In Cartography and Geographic Information Science</source>
          ,
          <volume>34</volume>
          (
          <issue>3</issue>
          ):
          <fpage>175</fpage>
          -
          <lpage>197</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Su</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          <string-name>
            <surname>Lodwick</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Müller</surname>
            ,
            <given-names>J. C.</given-names>
          </string-name>
          (
          <year>1997</year>
          ).
          <article-title>Algebraic models for the aggregation of area features based upon morphological operators</article-title>
          . In
          <source>International Journal of Geographical Information Science</source>
          ,
          <volume>11</volume>
          (
          <issue>3</issue>
          ):
          <fpage>233</fpage>
          -
          <lpage>246</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Töpfer</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          (
          <year>1974</year>
          ). Kartographishe Generalisierung,
          <string-name>
            <surname>VEB</surname>
          </string-name>
          , Hermann Haack, Leipzig.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Töpfer</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Pillewizer</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          (
          <year>1966</year>
          ). “
          <article-title>The principles of selection”</article-title>
          .
          <source>The Cartographic Journal</source>
          , Vol.
          <volume>3</volume>
          ,
          <issue>1966</issue>
          , pp.
          <fpage>10</fpage>
          -
          <lpage>16</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Touya</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bucher</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Falquet</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Jaara</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <year>2014</year>
          .
          <article-title>Modelling Geographic Relationships in Automated Environments</article-title>
          . In: Burghardt,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Duchêne</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            , &amp;
            <surname>Mackaness</surname>
          </string-name>
          , W. (eds.).
          <source>Abstracting Geographic Information in a Data Rich World</source>
          . Springer International. http://doi.org/10.1007/978-3-
          <fpage>319</fpage>
          -00203-3.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>