=Paper= {{Paper |id=None |storemode=property |title=OWL Based Formalisation of Geographic Databases Specifications |pdfUrl=https://ceur-ws.org/Vol-674/Paper115.pdf |volume=Vol-674 |dblpUrl=https://dblp.org/rec/conf/ekaw/AbadieMM10 }} ==OWL Based Formalisation of Geographic Databases Specifications== https://ceur-ws.org/Vol-674/Paper115.pdf
         OWL-based formalisation of geographic databases
                         specifications
            Nathalie Abadie                               Ammar Mechouche                             Sébastien Mustière
     Institut Géographique National,                 Institut Géographique National,              Institut Géographique National,
            Laboratoire COGIT                               Laboratoire COGIT                            Laboratoire COGIT
            73 Avenue de Paris                              73 Avenue de Paris                           73 Avenue de Paris
       94160 Saint-Mandé, France                       94160 Saint-Mandé, France                    94160 Saint-Mandé, France
             +33 1 43 98 80 03                          +33 1 43 98 80 00 + 71 25                         +33 1 43 98 81 49
      nathalie-f.abadie@ign.fr                      ammar.mechouche@ign.fr                       sebastien.mustiere@ign.fr



ABSTRACT                                                                aim at helping a user in retrieving geo-data that represent a
                                                                        specific geographic concept, such as „buildings’, even if feature
The ability to share and combine geographic data from different         class names of available datasets are totally different. In the latter
information sources in a consistent way is a key issue for enabling     cases, recent approaches [4][5][6] provide geo-databases experts
successful implementation of Spatial Data Infrastructures (SDIs).       with a graphical interface to help them in manually describing
This can only be done through a deep understanding of databases         their schemas and specifying mappings between source and target
structure and content. In this poster, we propose to do that            schemas.
through the elicitation and formalisation of geographic database        However, each geo-data producer has its own rules for data
specifications, relying on OWL ontologies, as recommended in            capture, and its own point of view about the geographic real world
the semantic Web community. We thus propose a general                   [7]. As an example, if a feature class is named „Building‟, it may
ontology for eliciting key concepts manipulated by data                 actually designate only permanent buildings, or include precarious
specifications, and rules to build local ontologies representing        buildings, such as cabins, or huts. Besides, a geographic database
knowledge contained in specific data specifications.                    is produced at a specific scale of analysis and geographic features
                                                                        are then captured in the database consistently with this specific
Categories and Subject Descriptors                                      level of detail. For example, only buildings of area greater than 50
H.2.8 [Database Management]: Databases Applications –                   m2 may be captured. Furthermore, the geometric representation of
Spatial databases and GIS.                                              a given geographic feature may vary: a building may be
                                                                        represented by a polygon representing its perimeter or by a point
General Terms                                                           captured at its centre.
Management, Standardization.                                            All these selection and representation criteria are stored in specific
                                                                        textual documents, used as guideline for data capture, namely the
                                                                        database specifications. They are a very rich source of knowledge
Keywords                                                                about geo-data semantics and their use in a schema matching
Geographic Database        Specification,   Ontologies,    Geo-data     process could help in identifying and solving complex
Semantics, OWL                                                          heterogeneities. Let us consider two different databases covering
                                                                        the same geographical space. The first one has a feature class
1. WHY FORMALISING                                                      named „Building‟ which represents only “buildings of area greater
                                                                        than 20 m2”, while the second one has a feature class named
SPECIFICATIONS?                                                         „Built-up area‟ which represents “buildings of area greater than 50
In the last decades, the increase of geographic data acquisition        m2”. Comparing these feature classes‟ specifications enables to
campaigns has resulted in a huge amount of diverse,                     find the following mapping rule: „Building‟ instances of area
heterogeneous and distributed geographic data sources. However,         greater than 50 m2 represent the same real world buildings as
even if these data represent the same geographic real world, there      „Built-up area‟ instances. Providing a schema matching
is a great heterogeneity between them. Consequently, the ability to     application with formal specifications would therefore enable to
share and combine geographic data from different sources in a           automatically find such complex mapping rules between
consistent way is a key issue for enabling their efficient usability.   heterogeneous geo-databases.
Previous geo-data integration efforts mainly focused on syntactic
heterogeneities through the development of standards. Semantic          2. THE SPECIFICATIONS ONTOLOGY
interoperability, which addresses more complex problems, is still       Several formal models for geographic database specifications
investigated. Actually, recent works mainly focused either on geo-      have already been proposed [8][9]. As formalisation of data
data discovery and retrieval or on transformation of geo-data           specifications in SDIs is a kind of elicitation of data semantics in
schema. In the former case, most of the proposed approaches             a Web environment, we propose to rely on semantic Web
[1][2][3] use a global domain ontology to specify the precise           standards to do so: our approach is based on ontologies developed
meaning of geo-data, either by renaming feature classes with            with the Ontology Web Language (OWL 2 [10]).
ontology labels, or thanks to semantic annotations. They rather
A first step to formalise specifications is to define unambiguously      specification is described in a specification application ontology
key concepts commonly used in geo-database specifications. In            (LSO) which uses SO‟s concepts. A tool enabling automatic
other words, we define a domain ontology, named “Specifications          comparison of formal specifications is being implemented. It aims
Ontology” (SO, see Figure 1). This ontology SO only contains             at providing expressive schemas mappings between geographic
concepts specific to geographic data specifications. It relies in        heterogeneous databases, for schema translation or schema
turn on more general ontologies, for example for defining basic          integration purposes.
geometric types [11]. For example, this domain ontology SO
formalises the concepts of data source and centreline, which are         5. ACKNOWLEDGMENTS
commonly used in many data specifications.                               This work is partly funded by the French Research Agency
                                                                         through the GeOnto project ANR-O7-MDCO-005 on “creation,
                                                                         alignment, comparison and use of geographic ontologies”
                                                                         (http://geonto.lri.fr/).

                                                                         6. REFERENCES
                                                                         [1] Paul, M., and Ghosh S.K. 2006. An Approach for Service
                                                                             Oriented Discovery and Retrieval of Spatial Data, In
                                                                             Proceedings of International Workshop on Service Oriented
                                                                             Software Engineering, Shangay, China, 84-94.
                                                                         [2] Nambiar, U., Ludscher Ludäscher, B., Lin K., and Baru, C.
                                                                             2006. The GEON portal: accelerating knowledge discovery
     Figure 1. The Specifications Ontology's main classes                    in the geosciences, Procedings In Proceedings of the 8th
                                                                             ACM International Workshop on Web Information and Data
3. HOW TO FORMALISE A GEO-                                                   Management, Aarlington, Virginia, USA, 83-90, (2006).
DATABASE SPECIFICATION?                                                  [3] Klien E.M. 2008.Semantic Annotation of Geographic
Besides we propose to formalise each database specification by               Information. Phd thesis, Institute for Geoinformatics,
means of an application ontology, named “local specification                 University of Muenster. Muenster, Germany.
ontology” (LSO). This ontology imports SO and extends it to              [4] Balley, S. 2007. Aide à la restructuration de données
describe real world geographic concepts and database classes.                géographiques sur le Web - Vers la diffusion à la carte
In order to clearly separate database concepts from real world               d'information géographique. PhD Thesis, University of
concepts encountered in specifications, two main classes from SO             Marne-La-Vallée, France.
are used: GeographicEntity and Feature. On the one hand,                 [5] Schade, S. 2009. Translation of Geospatial Data, Challenges,
GeographicEntity‟s subclasses are concepts imported from a                   Solution and Vision, In Proceedings of the 12th
domain ontology of topographic concepts. They may be created                 International Conference on Geographic Information
from the specifications text thanks to natural language processing           Science (AGILE'09), Pre-Conference Workshop “Challenges
tools [12]. On the other hand, Feature‟s subclasses are concepts             in Spatial Data Harmonisation”, Hannover, Germany.
directly derived from the corresponding database schema. They
                                                                         [6] Reitz, T., de Vries, M., and Fitzner, D. 2009. A5.2-
contains information such as selection criteria used to populate
                                                                             D3[3.3]Conceptual Schema Specification and Mapping,
the database, such as the fact that “only habitation buildings of
                                                                             HUMBOLDT Technical Report.
area greater than 20 m2 are represented in the feature class
„Building‟ of the database”. We thus require feature classes such        [7] Fonseca, F., Clodoveu, D., and Camara, G. 2003. Bridging
as „Building‟ to be modelled as “classes” in the OWL language,               Ontologies and Conceptual Schemas in Geographic
and selection constraints to be modelled as “axioms” including               Information Integration. In GeoInformatica, 7(4), 355-378.
rules that restrict the possible interpretations for the defined term,   [8] Gesbert N. 2005. Etude de la formalisation des
those axioms being defined by means of concepts and relations                spécifications de bases de données géographiques en vue de
defined in SO and LSO. Considering the example above, the                    leur intégration. Phd thesis, University of Marne-la-Vallée.
feature class „Building‟ of this geo-database will be defined in
LSO as follows:                                                          [9] Christensen, J. V. 2006. Formalizing Specifications for
                                                                             Geographic Information. In Proceedings of the 9th AGILE
Class: lso:db_Building                                                       Conference on Geographic Information Science. College of
          EquivalentTo: so:represents some                                   Geoinformatics, University of West Hungary, 186-194.
          (lso:Habitation and so:area some
          double[>20.0])                                                 [10] W3C. 2009. OWL 2 Web Ontology Language, Primer, W3C
                                                                              Recommendation 27 October 2009.
4. CONCLUSION                                                                 http://www.w3.org/TR/owl2-primer/
In this poster we proposed an OWL 2 based model for geographic           [11] Lieberman J., Singh R., Goad C. 2007. W3C geospatial
database specification formalisation, which aims at eliciting                 ontologies, W3C incubator group report, 23 october 2007.
geographic databases semantics by describing the link between
                                                                         [12] Kamel M., Aussenac-Gilles N. 2009. Ontology Learning by
data and what they represent. Key concepts used in data
                                                                              Analysing XML Document Structure and Content, In
specifications are specified in a specifications domain ontology
                                                                              Proceedings of Knowledge Engineering and Ontology
(SO), whereas knowledge contained in one given database
                                                                              Development, Madère, Portugal.