<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Generic Multilevel Approach Designing Domain Ontologies based on XML Schemas</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Thomas</forename><surname>Bosch</surname></persName>
							<email>thomas.bosch@gesis.org</email>
							<affiliation key="aff0">
								<orgName type="institution">GESIS -Leibniz Institute for the Social Sciences</orgName>
								<address>
									<addrLine>Square B2, 1</addrLine>
									<postCode>68159</postCode>
									<settlement>Mannheim</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Brigitte</forename><surname>Mathiak</surname></persName>
							<email>brigitte.mathiak@gesis.org</email>
							<affiliation key="aff1">
								<orgName type="institution">GESIS -Leibniz Institute for the Social Sciences</orgName>
								<address>
									<addrLine>Lennéstr. 30</addrLine>
									<postCode>53113</postCode>
									<settlement>Bonn</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Generic Multilevel Approach Designing Domain Ontologies based on XML Schemas</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">2C9832DF21D7E55EF124C6ED25146436</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T02:25+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Semantic Web</term>
					<term>Ontology Design</term>
					<term>XML Schema</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Designing an ontology for a specific domain is a time-consuming process. In many cases, information sources like XML Schemas serve as a basis for ontology engineers to conceptualize the intended ontologies. The ontology design process is sped up significantly when XML Schemas are transformed automatically into generated ontologies. An XML Schema Metamodel Ontology has been designed to represent the components of the XML Schema abstract data model. The generated ontologies' classes are defined as sub classes of this ontology. The classes specified for the generated ontologies are intended to be further supplemented with additional semantic and domain specific information defined in domain ontologies. The resulting ontologies are as usable as ontologies that were constructed completely manual, but with a fraction of necessary effort.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>XML has reached wide acceptance as a data exchange format in e-business. Data and metadata structured by ontologies can be published in the increasingly popular LOD cloud to get linked with a huge number of other RDF datasets <ref type="bibr" target="#b0">[1]</ref>. As RDF is an established standard there is a plethora of tools which can be used to interoperate with data and metadata represented in RDF. An effective and efficient cooperation between e-business partners is only possible if they agree on a common syntax and have a common understanding of the domain classes. XML Schema and OWL support differing modeling goals. The data model of XML describes a node labeled tree <ref type="bibr" target="#b1">[2]</ref>, the syntactic structure of XML document instances. OWL, however, is based on the subject-predicate-object triples from RDF <ref type="bibr" target="#b2">[3]</ref>, based upon formal logic, and describes semantic information about domain classes as well as their relations and therefore allows the sharing of conceptualizations. XML represents a large set of information in many domains. This fact has driven the development of generalpurpose tools for converting XML Schemas to OWL ontologies. The direct mapping from XML and XML Schema to RDF and OWL is not sufficient, since it only transports information about the syntactic structure of XML document instances. Semantic information has to be added in a further step. The aim of this paper is to bridge the gap between XML and OWL by lifting the syntactic level of XML documents to the semantic level of OWL ontologies. The process of designing domain ontologies is extremely time-consuming. XML Schemas describing specific domains are often existent in early stages of the ontology design process. In this paper, the authors describe a generic multilevel approach which accelerates the process of designing domain ontologies from scratch based on already available XML Schemas. The intention is to create generated ontologies automatically based on any possible XML Schemas of an underlying domain data model using XSLT transformations. Initially defined generated ontologies are linked to an ontology of the appropriate domain used to specify supplementary semantic information not covered in the XML Schemas. Domain experts enrich the domain ontology with additional semantics needed for tasks typically performed in the particular domain.  The XML Schema Metamodel Ontology serves as a basis for this process. Domain ontologies are related to generated ontologies in order to append semantic information not expressed in the XML Schemas. Further, you may integrate external ontologies' semantics. The relationships between the separate levels of XML and between the distinct ontologies are delineated. You can derive generated ontologies and corresponding XML Schemas simultaneously, automatically and model-driven from the data model of the domain ontologies. The ensuing paragraphs present the different levels of XML, the individual ontologies and their relationships in more detail.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Designing Domain Ontologies based on XML Schemas</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">XML Schema and the XML Schema Metamodel Ontology</head><p>XML <ref type="bibr" target="#b3">[4]</ref> documents are commonly used to store and transfer information in distributed environments. XML documents may be instances of XML Schemas <ref type="bibr" target="#b4">[5]</ref> determining their terminology and syntactic structure. The W3C has defined XML Schema, the class of XML documents, recursively using the XML Schema language to describe the XML Schema language <ref type="bibr" target="#b5">[6]</ref>, just like XML Schema documents are XML documents describing XML documents. Generated ontologies are based on the components of the XML Schema abstract data model, the meta-model of XML Schema. Table <ref type="table" target="#tab_0">1</ref> outlines the mappings between the XML Schema meta-model and the XML Schema Metamodel Ontology. In order to visualize OWL language constructs, Description Logic syntax is used. The authors have mapped the meta-element information items corresponding to the XML Schema abstract data model components (e.g. 'element') directly to classes of the XML Schema Metamodel Ontology (e.g. 'Element'). Attributes of meta-element information items have been mapped to datatype properties '&lt;attribute&gt;_&lt;domain meta-element information item&gt;_String' (e.g. 'name_Element_String') with the classes representing the meta-element information items as domains and the built-in primitive datatype 'string' as range. Universal restrictions on datatype properties have been defined, since all range individuals of these datatype properties have to be of the primitive datatype 'string': &lt;domain meta-element information item&gt; ⊑ ∀ &lt;attribute&gt;_&lt;domain meta-element information item&gt;_String.String (e.g. Element ⊑ ∀ name_Element_String.String). As XML Schemas' components can not only have child elements as content, but also plain text, the datatype property 'valueXSD_&lt;domain meta-element information item&gt;_String' (e.g. 'valueXSD_Documentation_String') and the associated universal restriction &lt;domain meta-element information item&gt; ⊑ ∀ valueXSD_&lt;domain meta-element information item&gt;_String.String (e.g. Documentation ⊑ ∀ valueXSD_Documentation_String. String) have been added, since the class representing the meta-element information item is a sub-class of the anonymous super-class of all the individuals which have only relationships along this datatype property to individuals of the class 'String' corresponding to the built-in datatype 'string' or have no relationships along this datatype property. The XML Schema Metamodel Ontology includes the datatype property 'valueXML_&lt;domain meta-element information item&gt;_String' and the corresponding universal restriction &lt;domain meta-element information item&gt; ⊑ ∀ valueXML_&lt;domain meta-element information item&gt;_String.String, since XML document instances' components may contain text. Considering the OWL assertional knowledge, the XML document fragment &lt;VariableName ... lang="en"&gt;EF1 &lt;/VariableName&gt; is mapped to the property assertions valueXML_Attribute_String (Lang-Individual, 'en') and valueXML_Element_String (VariableName-Individual, 'EF1'). Attributes of meta-element information items such as 'ref' referring to metaelement information items, have been transferred to object properties '&lt;attribute&gt;_&lt;domain meta-element information item&gt;_&lt;range meta-element information item&gt;' with corresponding universal restrictions &lt;domain meta-element information item&gt; ⊑ ∀ &lt;attribute&gt;_&lt;domain meta-element information item&gt;_&lt;range meta-element information item&gt;.&lt;range meta-element information item&gt; (e.g. Attribute ⊑ ∀ ref_Attribute_Attribute.Attribute). Diverse meta-element information items include the attributes 'type' or 'base'. These attributes may have simple ur-type, simple type or complex type definitions as possible attribute values. According to the specified naming conventions, each attribute would be transformed into three different object properties with the ranges 'AnySimpleType', 'SimpleType' and 'ComplexType'. XSLT transformations, building generated ontologies automatically based on XML Schemas, would have to determine the range of the object properties belonging to specific 'type' and 'base' attributes at runtime. It is complicated and error-prone to determine if type references either point to simple or complex type definitions which are part of external XML Schemas' namespaces. During the transformation process, XML Schemas with appropriate target namespaces have to be available and it has to be iterated over each simple and complex type. Due to these reasons, the authors have decided to map the attributes 'type' and 'base' to the object properties 'type|base_&lt;domain meta-element information item&gt;_Type' with the class 'Type' as range. 'Type' represents the super-class of all three possible type definitions. As a consequence, each specific type definition can be in the range of the 'type' and 'base' object properties. Universal restrictions on these object properties have been specified as well: &lt;domain meta-element information item&gt; ⊑ ∀type|base_&lt;domain meta-element information item&gt;_Type.Type (e.g. Element ⊑ ∀ type_Element_Type.Type). Part-of relationships to child meta-element information items as content of meta-element information items have been transferred to object properties 'contains_&lt;domain meta-element information item&gt;_&lt;range meta-element information item&gt;'. Universal restrictions on each object property have been defined, because the range of relationships along these object properties is assumed as fixed: &lt;domain meta-element information item&gt; ⊑ ∀ contains_&lt;domain meta-element information item&gt;_&lt;range meta-element information item&gt;.&lt;range meta-element information item&gt; (e.g. ComplexType ⊑ ∀ contains_ComplexType_SimpleContent. SimpleContent).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Generated Ontologies</head><p>Executing an XSLT script, the declarations and definitions of any XML Schemas are transformed into classes of generated ontologies directly and automatically. As all components of the normative XML Schema for XML Schemas are included in the XML Schema Metamodel Ontology, this works with all valid XML Schemas. The mapping process takes seconds and requires no human interaction. The generated ontologies' classes are defined as sub-classes of the XML Schema Metamodel Ontology. Hence, all generated ontologies are based on the same reusable classes. Like heavyweight ontologies <ref type="bibr" target="#b6">[7]</ref>, the generated ontologies consist of a hierarchy of classes as well as relations with domains and ranges. Moreover, the generated ontologies include universal restrictions on object properties, hasValue restrictions on datatype properties and complex classes consisting of the union of multiple classes' individuals, if universal restrictions on object properties have more than one class in the range. Table <ref type="table" target="#tab_1">2</ref> depicts the mappings between XML Schemas and the generated ontologies. universal restrictions on XML Schema Metamodel Ontology's object properties: &lt;domain element information item&gt; ⊑ ∀type|base_&lt;domain meta-element information item&gt;_Type.&lt;range element information item&gt; element information items' part-of relationships universal restrictions on XML Schema Metamodel Ontology's object properties: &lt;domain element information item&gt; ⊑ ∀contains_&lt;domain meta-element information item&gt;_&lt;range meta-element information item&gt;.&lt;union of range element information items&gt; XML Schemas' element information items are mapped to sub-classes of the XML Schema Metamodel Ontology's classes: &lt;element information item&gt; ⊑ &lt;meta element information item&gt;. The element information item 'element' with the assigned name 'VariableName' (&lt;xs:element name="VariableName" ... /&gt;), for example, is tranferred to the class 'VariableName' with 'Element' as super-class (VariableName ⊑ Element), since all 'VariableName' individuals are also part of the 'Element' class extension. Values of element information items' attributes are transformed into hasValue restrictions on the XML Schema Metamodel Ontology's datatype properties &lt;element information item&gt; ⊑ ∃ &lt;attribute&gt;_&lt;domain meta-element information item&gt;_String.{&lt;String&gt;}, as the element information item is the sub-class of the anonymous super-class of all the individuals which have at least one relationship along the datatype property '&lt;attribute&gt;_&lt;domain meta-element information item&gt;_String' to the specified individual of the primitive datatype 'string'. For instance, the value of the attribute 'name' of the element information item 'element' (&lt;xs:element name="VariableName" ... /&gt;) is converted to the datatype property hasValue restriction VariableName ⊑ ∃ name_Element_String.{'VariableName'}, since each element 'VariableName' has at least one associated name, namely 'VariableName'. Texts contained in element information items are mapped to hasValue restrictions on the XML Schema Metamodel Ontology's datatype property &lt;element information item&gt; ⊑ ∃ valueXSD_&lt;domain meta-element information item&gt;_String.{&lt;String&gt;}. For example, the text included in the element information item 'documentation' (&lt;xs:documentation&gt;Indicates the language of content.&lt;/xs:documentation&gt;) is translated into the datatype property hasValue restriction Documentation1 ⊑ ∃ valueXSD_Documentation_String. {'Indicates the language of content.'}. As element information items may contain more than one element information item of the same meta-element information item, the contained element information items' identifiers are sequential (e.g. Documentation1). Values of element information items' attributes referring to other element information items are converted to universal restrictions on the XML Schema Metamodel Ontology's object properties &lt;domain element information item&gt; ⊑ ∀ &lt;attribute&gt;_&lt;domain meta-element information item&gt;_&lt;range meta-element information item&gt;.&lt;range element information item&gt;. The reference to the element information item 'attribute' called 'lang' (&lt;xs:attribute ref="lang"/&gt;) is transformed into the object property universal restriction Lang-Reference ⊑ ∀ ref_Attribute_Attribute.Lang. The values of the attributes 'type' and 'base' are transferred to universal restrictions on XML Schema Metamodel Ontology's object properties: &lt;domain element information item&gt; ⊑ ∀ type|base_&lt;domain meta-element information item&gt;_Type.&lt;range element information item&gt;. The attribute 'type' of the element information item 'element' named 'VariableName' (&lt;xs:element name="VariableName" type="NameType"/&gt;), for example, is converted to the object property's universal restriction VariableName ⊑ ∀ type_Element_Type.NameType. Element information items' part-of relationships are realized by universal restrictions on XML Schema Metamodel Ontology's object properties &lt;domain element information item&gt; ⊑ ∀ contains_&lt;domain meta-element information item&gt;_&lt;range meta-element information item&gt;.&lt;union of range element information items&gt;. The complex type definition 'InternationalStringType' includes only one 'simpleContent' element information item (&lt;xs:complexType name="InternationalStringType"&gt; ...&lt;xs:simpleContent&gt;...&lt;/xs:simpleContent&gt;&lt;/xs:complexType&gt;). As a consequence, the range of the object property can only consist of individuals of one class (InternationalStringType ⊑ ∀ contains_ComplexType_SimpleContent.Simple Content1). If element information items like 'extension' have more than one element information item as content (e.g. &lt;xs:extension...&gt;&lt;xs:attribute name="translated"...&gt; ...&lt;/xs:attribute&gt;&lt;xs:attribute name="translatable"..&gt;..&lt;/xs:attribute&gt;&lt;/xs:extension&gt;), the domain element information items can only have relationships along the object property to individuals of the complex class consisting of the union of individuals of multiple classes representing the contained range element information items (Extension1 ⊑ ∀ contains_Extension_Attribute.(Translated ⊔ Translatable)).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3">Domain Ontologies and Integration of Other Ontologies</head><p>In domain ontologies, the semantics of classes are specified as exactly as needed using formal logic <ref type="bibr" target="#b6">[7]</ref>. Each data model of a specific domain can be expressed in the form of a domain ontology. Classes of any number of generated ontologies of a given domain can be annotated as equivalent to classes of the domain ontology (&lt;domain ontology class&gt; ≡ &lt;generated ontology class&gt;). Thus, the information of a particular domain stored in generated ontologies and in corresponding XML Schemas can be reused during early stages of the domain ontology design process. Ontology engineers can add further domain specific semantic information to the domain ontology subsequently in a continuous way. You can perform queries on domain ontologies using the semantics of the particular domain without knowledge of complex XML Schemas' structures. Requests on domain ontologies are propagated to the underlying generated ontology or ontologies (if the domain data model consists of more than one XML Schema) via equivalence relationships. Hence, there is no need to query each associated generated ontology individually using different classes, object and datatype properties. Classes of domain ontologies can be annotated as being equivalent to existing similar and widely adopted classes of external ontologies (&lt;domain ontology class&gt; ≡ &lt;external ontology class&gt;; other types of relationships are also possible). Due to this, reasoners may use additional semantic information defined in external ontologies for deductions <ref type="bibr" target="#b7">[8]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Related Work</head><p>The XML Schema Metamodel Ontology, although much more complex, corresponds to the general database ontology designed by Kupfer et al. <ref type="bibr" target="#b7">[8]</ref>. These ontology engineers have defined a database schema-to-ontology mapping, which means that specific database ontologies are generated automatically from any database schemas. Kupfer et al. have specified the conceptual model of the general database ontology as follows: databases can consist of multiple tables and tables can comprise diverse attributes. The authors used the three classes 'Database', 'Table <ref type="table">'</ref>, and 'Attribute' as well as the object property 'consistsOf' to describe database schemas. The classes' identifiers serve as links to all tables and attributes of the underlying database schemas. Kupfer et al. depicted domain ontologies in the context of the developed general database ontology. Using domain ontologies, semantic information about specific domains is annotated and added supplementary to database ontologies. The relation between database ontologies' classes and classes of domain ontologies has been conceptualized using the object property 'containsDataAbout'. Several strategies lifting the syntactic level of XML documents to the semantic level of OWL ontologies can be distinguished. The authors have clustered appropriate tools implementing these transformations into three classes depending on the kind of conversion either at the instance, the conceptual, or both the instance and the conceptual level. At the instance level, Klein has developed the so-called RDF Schema mapping ontology enabling a one-way mapping of XML documents to RDF. Relevant XML documents' content can be identified <ref type="bibr" target="#b8">[9]</ref>. Extending this approach, Battle has introduced a bidirectional mapping of XML components to RDF <ref type="bibr" target="#b9">[10]</ref>. The WEESA system implements an automatic transformation from XML to RDF using an OWL ontology, manually created from corresponding XML Schemas and manually defined rules. XML document instances are not mapped to OWL equivalents <ref type="bibr" target="#b10">[11]</ref>. O'Connor and Das developed an approach transforming XML documents to individuals of an OWL ontology describing the serialization of the XML document. SWRL <ref type="bibr" target="#b11">[12]</ref> is used to map these instances to individuals of a domain ontology <ref type="bibr" target="#b12">[13]</ref>. At the conceptual level you can distinguish between approaches converting XML schema languages to RDFS or OWL. Several languages for writing schemas like DTD <ref type="bibr" target="#b3">[4]</ref>, XML Schema <ref type="bibr" target="#b4">[5]</ref>, DSD <ref type="bibr" target="#b13">[14]</ref> and Relax NG <ref type="bibr" target="#b14">[15]</ref> exist. The prototype OntoLiFT <ref type="bibr" target="#b15">[16]</ref> offers a generic means for converting arbitrary XML schema languages to RDFS ontologies semi-automatically. In a first step, XML schema languages are transformed into regular tree grammars consisting of non-terminals, terminals, start symbols and production rules <ref type="bibr" target="#b16">[17]</ref>. In a second step, non-terminals as well as terminals are converted to RDFS classes and production rules are mapped to RDF properties. In comparison with our approach, OntoLiFt converts any XML schema language and not just XML Schema to ontologies. Anicic et al. evolved an approach based on meta-models transforming between the different models of XML Schema and OWL <ref type="bibr" target="#b17">[18]</ref>. At the instance and the conceptual level, there are methods transforming XML to RDF and XML Schema to either RDFS or OWL. Within the EU-funded project called 'Harmonise' the interoperability of existing standards for the exchange of tourism data has been achieved by the transformation of XML documents and XML Schemas into RDF and RDFS ontologies which have been mapped to each other <ref type="bibr" target="#b18">[19]</ref>. Using the approach of O'Connor and Das <ref type="bibr" target="#b19">[20]</ref>, XML document instances are transformed to OWL ontologies even though associated XML Schemas not exist. As a consequence, unstructured contents can be mapped to OWL ontologies as well. XML Schemas can also be mapped to OWL ontologies, as XML Schema documents are represented in XML, too. New OWL ontologies can be generated from scratch and existing ones can be extended. O'Connor and Das evolved XML Master, a language describing OWL ontologies declaratively. XML Master combines the Manchester OWL Syntax <ref type="bibr" target="#b20">[21]</ref> and XPath <ref type="bibr" target="#b21">[22]</ref> to refer to XML content. O'Connor and Das criticize the limited and unsatisfactory number of OWL constructs supported by current tools converting XML Schemas to OWL ontologies. Thus, all OWL constructs are covered. One shortcoming associated with this method is that you have to write mapping language expressions manually and therefore you cannot transform XML documents and XML Schemas to OWL ontologies automatically. Another drawback is that ontology engineers have to be familiar with the Manchester OWL Syntax and XPath in order to express the mappings. Ferdinand et al. propose both mappings from XML to RDF and XML Schema to OWL which are independent of each other. This means, OWL individuals do not necessarily correspond to the OWL conceptual model, since XML documents' declarations and definitions may be transferred to differing OWL constructs <ref type="bibr" target="#b22">[23]</ref>. In addition, another system can be stated transferring XML Schema components to OWL language constructs at the terminological level and XML document instances to OWL individuals at the assertional level. XPath expressions are applied selecting XML documents' content <ref type="bibr" target="#b23">[24]</ref>. Besides that, the approach of Tous et al. is very similar to this method <ref type="bibr" target="#b24">[25]</ref>. The authors of <ref type="bibr" target="#b25">[26]</ref> devised a mapping between XML and RDF and between XML Schema and OWL .The authors assume that XML documents are structured like relational databases. Thus, XML documents' relational structures are discovered and represented in OWL. Relations correspond to classes, columns to properties, and rows to instances. XML data model elements are mapped automatically to components of the OWL data model. Named simple and complex types, for instance, are transferred to classes. Elements, containing other elements or having at least one attribute, are converted to classes and object properties between these classes. Both elements, including neither attributes nor sub-elements, and attributes, assumed to represent database columns, are transformed into datatype properties with the surrounding element as their domain. Also, XML cardinality contraints are transformed into equivalent OWL cardinality restrictions. Many approaches try to extrac semantics from XML Schemas. The suggested approach, in contrast, only gains information about the syntactic structure of XML document instances contained in XML Schemas. Generated ontologies are connected with domain ontologies which are enriched with semantic domain specific information in a further step. The majority of the tools attempt to convert either schemas to ontologies at the conceptual level or XML to RDF at the instance level. The method, presented in this paper, follows a complete approach transforming XML document instances' content to OWL individuals as well as XML Schemas to OWL. In comparison with our approach, many others transform XML to RDF and/or XML schema languages to ontologies in a manual or at most in a semi-automatic and not automatic manner. Furthermore, diverse existent methods generate RDFS ontologies and not the more expressive OWL ontologies.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Conclusion</head><p>The aim of this paper is to bridge the gap between XML and OWL. XML Schema and OWL ontologies follow differing modeling goals. While XML Schema describes the syntactic structure of XML document instances, OWL is based on formal logic and describes the semantics of data models. In this paper, the authors demonstrated a generic multilevel approach designing domain ontologies when XML Schemas are provided as input sources for the ontology design process. This process of designing ontologies from scratch requires considerable effort. The normal procedure of ontology engineers is specifying the domain ontologies' semantics in collaboration with domain experts already at the beginning of the process. Applying our approach, ontology engineers are allowed to pursue a different path. They can rely on existent information located in XML Schemas of a given domain data model. To realize this, generated ontologies are built in an automatic way based on already available XML Schemas. Therefore, time-consuming work is already done and can be reused by the ontology engineers who do not have to define the domain data model anew. The generated ontologies' classes are based on super-classes of the XML Schema Metamodel Ontology. This ontology consists of classes representing the components of the XML Schema abstract domain model and the corresponding element information items. The components of the XML Schema abstract data model are used to describe XML Schemas recursively using XML Schema language constructs. Based on interviews with domain experts, the information stored in the generated ontologies can be extended in a continuous manner. This supplemental semantic domain specific information is defined in domain ontologies whose classes are linked to the generated ontologies' classes via equivalence relationships. Domain ontologies' classes can be annotated as equivalent to classes of widely adopted external ontologies. As a consequence, reasoners may use additional semantics for deductions.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1</head><label>1</label><figDesc>Figure 1 sketches the devised underlying concept of the generic multilevel approach for designing domain ontologies based on XML Schemas.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Generic multilevel approach for designing domain ontologies based on XML Schemas XSLT transformations map any XML Schemas to generated ontologies automatically.The XML Schema Metamodel Ontology serves as a basis for this process. Domain ontologies are related to generated ontologies in order to append semantic information not expressed in the XML Schemas. Further, you may integrate external ontologies'</figDesc><graphic coords="2,175.86,396.66,243.54,213.78" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>Mapping of the XML Schema meta-model to the XML Schema Metamodel Ontology</figDesc><table><row><cell>XML Schema for XML Schemas</cell><cell>XML Schema Metamodel Ontology</cell></row><row><cell>meta-element information items</cell><cell>classes: &lt;meta-element information item&gt;</cell></row><row><cell>attributes of meta-element information</cell><cell>datatype properties and associated</cell></row><row><cell>items</cell><cell>universal restrictions: &lt;domain meta-</cell></row><row><cell></cell><cell>element information item&gt; ⊑</cell></row><row><cell></cell><cell>∀&lt;attribute&gt;_&lt;domain meta-element</cell></row><row><cell></cell><cell>information item&gt;_String.String</cell></row><row><cell>texts contained in meta-element</cell><cell>datatype property and associated</cell></row><row><cell>information items</cell><cell>universal restriction: &lt;domain meta-</cell></row><row><cell></cell><cell>element information item&gt; ⊑</cell></row><row><cell></cell><cell>∀valueXSD_&lt;domain meta-element</cell></row><row><cell></cell><cell>information item&gt;_String.String</cell></row><row><cell>texts contained in XML document</cell><cell>datatype property and associated</cell></row><row><cell>instances' components</cell><cell>universal restriction: &lt;domain meta-</cell></row><row><cell></cell><cell>element information item&gt; ⊑</cell></row><row><cell></cell><cell>∀valueXML_&lt;domain meta-element</cell></row><row><cell></cell><cell>information item&gt;_String.String</cell></row><row><cell>attributes of meta-element information</cell><cell>object properties and associated universal</cell></row><row><cell>items referring to meta-element</cell><cell>restrictions: &lt;domain meta-element</cell></row><row><cell>information items</cell><cell>information item&gt; ⊑</cell></row><row><cell></cell><cell>∀&lt;attribute&gt;_&lt;domain meta-element</cell></row><row><cell></cell><cell>information item&gt;_&lt;range meta-element</cell></row><row><cell></cell><cell>information item&gt;.&lt;range meta-element</cell></row><row><cell></cell><cell>information item&gt;</cell></row><row><cell>attributes 'type' and 'base'</cell><cell>object properties and associated universal</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 .</head><label>2</label><figDesc>Mapping of XML Schemas to generated ontologies</figDesc><table><row><cell>XML Schemas</cell><cell>Generated Ontologies</cell></row><row><cell>element information items</cell><cell>sub-classes of XML Schema Metamodel</cell></row><row><cell></cell><cell>Ontology's classes:</cell></row><row><cell></cell><cell>&lt;element information item&gt; ⊑</cell></row><row><cell></cell><cell>&lt;meta-element information item&gt;</cell></row><row><cell>values of element information items'</cell><cell>hasValue restrictions on XML Schema</cell></row><row><cell>attributes</cell><cell>Metamodel Ontology's datatype</cell></row><row><cell></cell><cell>properties: &lt;element information item&gt; ⊑</cell></row><row><cell></cell><cell>∃&lt;attribute&gt;_&lt;domain meta-element</cell></row><row><cell></cell><cell>information item&gt;_String.{&lt;String&gt;}</cell></row><row><cell>texts contained in element information</cell><cell>hasValue restrictions on XML Schema</cell></row><row><cell>items</cell><cell>Metamodel Ontology's datatype</cell></row><row><cell></cell><cell>properties: &lt;element information item&gt; ⊑</cell></row><row><cell></cell><cell>∃valueXSD_&lt;domain meta-element</cell></row><row><cell></cell><cell>information item&gt;_String.{&lt;String&gt;}</cell></row><row><cell>values of element information items'</cell><cell>universal restrictions on XML Schema</cell></row><row><cell>attributes referring to other element</cell><cell>Metamodel Ontology's object properties:</cell></row><row><cell>information items</cell><cell>&lt;domain element information item&gt; ⊑</cell></row><row><cell></cell><cell>∀&lt;attribute&gt;_&lt;domain meta-element</cell></row><row><cell></cell><cell>information item&gt;_&lt;range meta-element</cell></row><row><cell></cell><cell>information item&gt;.&lt;range element</cell></row><row><cell></cell><cell>information item&gt;</cell></row><row><cell>values of attributes 'type' and 'base'</cell><cell></cell></row></table></figure>
		</body>
		<back>
			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Future Work</head><p>A complete use case designing a specific domain ontology using the devised multilevel approach based on already existing XML Schemas will be described in detail. The underlying data model of the application domain is called the Data Documentation Initiative (DDI) <ref type="bibr" target="#b26">[27]</ref>. DDI in its current version 3 is an international standard for describing data from the social, behavioral and economic sciences. Furthermore, more use cases from different domains will be shown to prove the generality of the developed approach. The main benefit associated with this approach, saving time for ontology engineers in the process of designing domain ontologies from scratch, will be evaluated as well. The authors will develop an XSLT framework to implement a complete stylesheetdriven approach to generate OWL ontologies. So far, XSLT transformations build generated ontologies automatically based on arbitrary XML Schemas. Moreover, the authors will write XSLT transformations, converting XML documents without corresponding XML Schemas determining their syntactic structure to generated ontologies. The first step is creating suitable XML Schemas out of XML document instances automatically. These XML Schemas will then be converted to generated ontologies in a second step. Another XSLT stylesheet will convert XML document instances' data to OWL instances according to the generated ontologies. Generated ontologies and corresponding XML Schemas will be derived automaticly from designed domain ontologies using XSLT transformations. These scripts will be evolved realizing the model-driven development of generated ontologies and underlying XML Schemas associated with the domain ontologies.</p></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<ptr target="http://linkeddata.org" />
		<title level="m">Linked Data</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<ptr target="http://www.w3.org/XML/Datamodel.html" />
		<title level="m">The XML data model</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<ptr target="http://www.w3.org/TR/2002/WD-rdf-concepts-20021108/" />
		<title level="m">Resource Description Framework (RDF): concepts and abstract syntax</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">Extensible Markup Language (XML) 1.0 (</title>
		<ptr target="http://www.w3.org/TR/2008/REC-xml-20081126/" />
		<imprint>
			<date type="published" when="2008-11">November 2008</date>
		</imprint>
	</monogr>
	<note>fifth edition) -W3C recommendation 26</note>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<ptr target="http://www.w3.org/TR/2004/REC-xmlschema-0-20041028/" />
		<title level="m">XML Schema part 0: primer second edition -W3C recommendation 28</title>
				<imprint>
			<date type="published" when="2004-10">October 2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<ptr target="http://www.w3.org/TR/2004/REC-xmlschema-1-20041028/" />
		<title level="m">XML Schema part 1: structures second edition -W3C recommendation 28</title>
				<imprint>
			<date type="published" when="2004-10">October 2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<author>
			<persName><forename type="first">H</forename><surname>Stuckenschmidt</surname></persName>
		</author>
		<title level="m">Ontologien: Konzepte, Technologien und Anwendungen</title>
				<meeting><address><addrLine>Berlin Heidelberg</addrLine></address></meeting>
		<imprint>
			<publisher>Springer-Verlag</publisher>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Methods for a synchronised evolution of databases and associated ontologies</title>
		<author>
			<persName><forename type="first">A</forename><surname>Kupfer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Eckstein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Störmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Neumann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Mathiak</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceeding of the 2007 Conference on Databases and Information Systems IV</title>
				<meeting>eeding of the 2007 Conference on Databases and Information Systems IV</meeting>
		<imprint>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Interpreting XML documents via an RDF Schema ontology</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">C A</forename><surname>Klein</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">13th International Workshop on Database and Expert Systems Applications</title>
				<meeting><address><addrLine>Aix-en-Provence</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Gloze: XML to RDF and back again</title>
		<author>
			<persName><forename type="first">S</forename><surname>Battle</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">1st Jena User Conference</title>
				<meeting><address><addrLine>Bristol</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">WEESA -web engineering for Semantic Web applications</title>
		<author>
			<persName><forename type="first">G</forename><surname>Reif</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Gall</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Jazayeri</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">14th World Wide Web Conference</title>
				<meeting><address><addrLine>Chiba</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<ptr target="http://www.w3.org/Submission/SWRL/" />
		<title level="m">SWRL: a Semantic Web Rule Language combining OWL and RuleML</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Semantic reasoning with XML-based biomedical information models</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>O'connor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">K</forename><surname>Das</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">13th World Congress on Medical Informatics</title>
				<meeting><address><addrLine>Cape Town</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">DSD: a schema language for XML</title>
		<author>
			<persName><forename type="first">N</forename><surname>Karlund</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Moller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">I</forename><surname>Schwartzbach</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ACM SIGSOFT Workshop on Formal Methods in Software Practice</title>
				<imprint>
			<date type="published" when="2000">2000</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<title level="m" type="main">Information technology -document schema definition language (DSDL) -part 2: regular-grammar-based validation -RELAX NG</title>
		<author>
			<persName><forename type="first">J</forename><surname>Clark</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Cowan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Fitzgerald</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Kawaguchi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Lubell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Murata</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Walsh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Webber</surname></persName>
		</author>
		<idno>ISO/IEC 19757-2:2003</idno>
		<imprint>
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title level="m" type="main">OntoLiFT Prototype -WonderWeb: ontology infrastructure for the Semantic Web</title>
		<author>
			<persName><forename type="first">R</forename><surname>Volz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Oberle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Staab</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Studer</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2003">2003</date>
			<pubPlace>Karlsruhe</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Taxonomy of XML schema languages using formal language theory</title>
		<author>
			<persName><forename type="first">M</forename><surname>Murata</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Mani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Kawaguchi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Transactions on Internet Technology</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Mapping XML Schema to OWL</title>
		<author>
			<persName><forename type="first">N</forename><surname>Anicic</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Ivezic</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Marjanovic</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Enterprise Interoperability, Part V</title>
				<meeting><address><addrLine>Berlin</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2007">2007</date>
			<biblScope unit="page" from="243" to="252" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Harmonise: a solution for data interoperability</title>
		<author>
			<persName><forename type="first">M</forename><surname>Dell'erba</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Fodor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Ricci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Werthner</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2nd IFIP Conference on E-Commerce</title>
				<meeting>the 2nd IFIP Conference on E-Commerce</meeting>
		<imprint>
			<publisher>E-Business, E-Government I3E</publisher>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Acquiring OWL ontologies from XML documents</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>O'connor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">K</forename><surname>Das</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Sixth International Conference on Knowledge Capture</title>
				<meeting>the Sixth International Conference on Knowledge Capture<address><addrLine>New York</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<ptr target="http://www.w3.org/TR/owl2-manchester-syntax/" />
		<title level="m">OWL 2 Web Ontology Language Manchester Syntax</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<monogr>
		<ptr target="http://www.w3.org/TR/xpath20/" />
		<title level="m">XML Path Language (XPath) 2.0</title>
				<imprint/>
	</monogr>
	<note>second edition</note>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Lifting XML Schema to OWL</title>
		<author>
			<persName><forename type="first">M</forename><surname>Ferdinand</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Zirpins</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Trastour</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Web Engineering -4th International Conference</title>
				<meeting><address><addrLine>Munich</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Mapping XML to OWL for seamless information retrieval in context-aware environments</title>
		<author>
			<persName><forename type="first">N</forename><surname>Kobeissy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">G</forename><surname>Genet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Zeghlache</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Pervasive Services</title>
				<meeting><address><addrLine>Istanbul</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Architecture of a semantic XPath processor</title>
		<author>
			<persName><forename type="first">R</forename><surname>Tous</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Garcia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Rodriguez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Delgado</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Application to digital rights management</title>
				<meeting><address><addrLine>Copenhagen</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
	<note>6th E-Commerce and Web Technologies</note>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Mapping XML to OWL Ontologies</title>
		<author>
			<persName><forename type="first">H</forename><surname>Bohring</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Auer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="s">Leipziger Informatik Tage</title>
		<imprint>
			<biblScope unit="volume">72</biblScope>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<monogr>
		<ptr target="http://www.ddialliance.org" />
		<title level="m">Data Documentation Initiative</title>
				<imprint/>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
