=Paper=
{{Paper
|id=None
|storemode=property
|title=OWL Based Formalisation of Geographic Databases Specifications
|pdfUrl=https://ceur-ws.org/Vol-674/Paper115.pdf
|volume=Vol-674
|dblpUrl=https://dblp.org/rec/conf/ekaw/AbadieMM10
}}
==OWL Based Formalisation of Geographic Databases Specifications==
OWL-based formalisation of geographic databases
specifications
Nathalie Abadie Ammar Mechouche Sébastien Mustière
Institut Géographique National, Institut Géographique National, Institut Géographique National,
Laboratoire COGIT Laboratoire COGIT Laboratoire COGIT
73 Avenue de Paris 73 Avenue de Paris 73 Avenue de Paris
94160 Saint-Mandé, France 94160 Saint-Mandé, France 94160 Saint-Mandé, France
+33 1 43 98 80 03 +33 1 43 98 80 00 + 71 25 +33 1 43 98 81 49
nathalie-f.abadie@ign.fr ammar.mechouche@ign.fr sebastien.mustiere@ign.fr
ABSTRACT aim at helping a user in retrieving geo-data that represent a
specific geographic concept, such as „buildings’, even if feature
The ability to share and combine geographic data from different class names of available datasets are totally different. In the latter
information sources in a consistent way is a key issue for enabling cases, recent approaches [4][5][6] provide geo-databases experts
successful implementation of Spatial Data Infrastructures (SDIs). with a graphical interface to help them in manually describing
This can only be done through a deep understanding of databases their schemas and specifying mappings between source and target
structure and content. In this poster, we propose to do that schemas.
through the elicitation and formalisation of geographic database However, each geo-data producer has its own rules for data
specifications, relying on OWL ontologies, as recommended in capture, and its own point of view about the geographic real world
the semantic Web community. We thus propose a general [7]. As an example, if a feature class is named „Building‟, it may
ontology for eliciting key concepts manipulated by data actually designate only permanent buildings, or include precarious
specifications, and rules to build local ontologies representing buildings, such as cabins, or huts. Besides, a geographic database
knowledge contained in specific data specifications. is produced at a specific scale of analysis and geographic features
are then captured in the database consistently with this specific
Categories and Subject Descriptors level of detail. For example, only buildings of area greater than 50
H.2.8 [Database Management]: Databases Applications – m2 may be captured. Furthermore, the geometric representation of
Spatial databases and GIS. a given geographic feature may vary: a building may be
represented by a polygon representing its perimeter or by a point
General Terms captured at its centre.
Management, Standardization. All these selection and representation criteria are stored in specific
textual documents, used as guideline for data capture, namely the
database specifications. They are a very rich source of knowledge
Keywords about geo-data semantics and their use in a schema matching
Geographic Database Specification, Ontologies, Geo-data process could help in identifying and solving complex
Semantics, OWL heterogeneities. Let us consider two different databases covering
the same geographical space. The first one has a feature class
1. WHY FORMALISING named „Building‟ which represents only “buildings of area greater
than 20 m2”, while the second one has a feature class named
SPECIFICATIONS? „Built-up area‟ which represents “buildings of area greater than 50
In the last decades, the increase of geographic data acquisition m2”. Comparing these feature classes‟ specifications enables to
campaigns has resulted in a huge amount of diverse, find the following mapping rule: „Building‟ instances of area
heterogeneous and distributed geographic data sources. However, greater than 50 m2 represent the same real world buildings as
even if these data represent the same geographic real world, there „Built-up area‟ instances. Providing a schema matching
is a great heterogeneity between them. Consequently, the ability to application with formal specifications would therefore enable to
share and combine geographic data from different sources in a automatically find such complex mapping rules between
consistent way is a key issue for enabling their efficient usability. heterogeneous geo-databases.
Previous geo-data integration efforts mainly focused on syntactic
heterogeneities through the development of standards. Semantic 2. THE SPECIFICATIONS ONTOLOGY
interoperability, which addresses more complex problems, is still Several formal models for geographic database specifications
investigated. Actually, recent works mainly focused either on geo- have already been proposed [8][9]. As formalisation of data
data discovery and retrieval or on transformation of geo-data specifications in SDIs is a kind of elicitation of data semantics in
schema. In the former case, most of the proposed approaches a Web environment, we propose to rely on semantic Web
[1][2][3] use a global domain ontology to specify the precise standards to do so: our approach is based on ontologies developed
meaning of geo-data, either by renaming feature classes with with the Ontology Web Language (OWL 2 [10]).
ontology labels, or thanks to semantic annotations. They rather
A first step to formalise specifications is to define unambiguously specification is described in a specification application ontology
key concepts commonly used in geo-database specifications. In (LSO) which uses SO‟s concepts. A tool enabling automatic
other words, we define a domain ontology, named “Specifications comparison of formal specifications is being implemented. It aims
Ontology” (SO, see Figure 1). This ontology SO only contains at providing expressive schemas mappings between geographic
concepts specific to geographic data specifications. It relies in heterogeneous databases, for schema translation or schema
turn on more general ontologies, for example for defining basic integration purposes.
geometric types [11]. For example, this domain ontology SO
formalises the concepts of data source and centreline, which are 5. ACKNOWLEDGMENTS
commonly used in many data specifications. This work is partly funded by the French Research Agency
through the GeOnto project ANR-O7-MDCO-005 on “creation,
alignment, comparison and use of geographic ontologies”
(http://geonto.lri.fr/).
6. REFERENCES
[1] Paul, M., and Ghosh S.K. 2006. An Approach for Service
Oriented Discovery and Retrieval of Spatial Data, In
Proceedings of International Workshop on Service Oriented
Software Engineering, Shangay, China, 84-94.
[2] Nambiar, U., Ludscher Ludäscher, B., Lin K., and Baru, C.
2006. The GEON portal: accelerating knowledge discovery
Figure 1. The Specifications Ontology's main classes in the geosciences, Procedings In Proceedings of the 8th
ACM International Workshop on Web Information and Data
3. HOW TO FORMALISE A GEO- Management, Aarlington, Virginia, USA, 83-90, (2006).
DATABASE SPECIFICATION? [3] Klien E.M. 2008.Semantic Annotation of Geographic
Besides we propose to formalise each database specification by Information. Phd thesis, Institute for Geoinformatics,
means of an application ontology, named “local specification University of Muenster. Muenster, Germany.
ontology” (LSO). This ontology imports SO and extends it to [4] Balley, S. 2007. Aide à la restructuration de données
describe real world geographic concepts and database classes. géographiques sur le Web - Vers la diffusion à la carte
In order to clearly separate database concepts from real world d'information géographique. PhD Thesis, University of
concepts encountered in specifications, two main classes from SO Marne-La-Vallée, France.
are used: GeographicEntity and Feature. On the one hand, [5] Schade, S. 2009. Translation of Geospatial Data, Challenges,
GeographicEntity‟s subclasses are concepts imported from a Solution and Vision, In Proceedings of the 12th
domain ontology of topographic concepts. They may be created International Conference on Geographic Information
from the specifications text thanks to natural language processing Science (AGILE'09), Pre-Conference Workshop “Challenges
tools [12]. On the other hand, Feature‟s subclasses are concepts in Spatial Data Harmonisation”, Hannover, Germany.
directly derived from the corresponding database schema. They
[6] Reitz, T., de Vries, M., and Fitzner, D. 2009. A5.2-
contains information such as selection criteria used to populate
D3[3.3]Conceptual Schema Specification and Mapping,
the database, such as the fact that “only habitation buildings of
HUMBOLDT Technical Report.
area greater than 20 m2 are represented in the feature class
„Building‟ of the database”. We thus require feature classes such [7] Fonseca, F., Clodoveu, D., and Camara, G. 2003. Bridging
as „Building‟ to be modelled as “classes” in the OWL language, Ontologies and Conceptual Schemas in Geographic
and selection constraints to be modelled as “axioms” including Information Integration. In GeoInformatica, 7(4), 355-378.
rules that restrict the possible interpretations for the defined term, [8] Gesbert N. 2005. Etude de la formalisation des
those axioms being defined by means of concepts and relations spécifications de bases de données géographiques en vue de
defined in SO and LSO. Considering the example above, the leur intégration. Phd thesis, University of Marne-la-Vallée.
feature class „Building‟ of this geo-database will be defined in
LSO as follows: [9] Christensen, J. V. 2006. Formalizing Specifications for
Geographic Information. In Proceedings of the 9th AGILE
Class: lso:db_Building Conference on Geographic Information Science. College of
EquivalentTo: so:represents some Geoinformatics, University of West Hungary, 186-194.
(lso:Habitation and so:area some
double[>20.0]) [10] W3C. 2009. OWL 2 Web Ontology Language, Primer, W3C
Recommendation 27 October 2009.
4. CONCLUSION http://www.w3.org/TR/owl2-primer/
In this poster we proposed an OWL 2 based model for geographic [11] Lieberman J., Singh R., Goad C. 2007. W3C geospatial
database specification formalisation, which aims at eliciting ontologies, W3C incubator group report, 23 october 2007.
geographic databases semantics by describing the link between
[12] Kamel M., Aussenac-Gilles N. 2009. Ontology Learning by
data and what they represent. Key concepts used in data
Analysing XML Document Structure and Content, In
specifications are specified in a specifications domain ontology
Proceedings of Knowledge Engineering and Ontology
(SO), whereas knowledge contained in one given database
Development, Madère, Portugal.