<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Using a Linguistic Approach to Represent Terminology in an Ontology</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Amparo</forename><surname>Alcina</surname></persName>
							<email>alcina@uji.es</email>
							<affiliation key="aff0">
								<orgName type="institution">University Jaume I</orgName>
								<address>
									<addrLine>Avda. Sos Baynat s/n</addrLine>
									<postCode>12071</postCode>
									<settlement>Castelló</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="department">International Conference on &quot;Multilingual digital terminology today. Design</orgName>
								<orgName type="institution">representation formats and management systems&quot;</orgName>
								<address>
									<addrLine>June 16 -17</addrLine>
									<settlement>Padova</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Using a Linguistic Approach to Represent Terminology in an Ontology</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">DCD40AFEDD2FF900AE3291E8F22AC650</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T18:02+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Terminology</term>
					<term>dictionaries</term>
					<term>ontology</term>
					<term>natural language processing</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>text. Ontology models of lexical and terminological resources need to include the linguistic dimension in a more coherent and adequate way. In general, ontologies describe objects, but the linguistic approach consist of describe terms as words, not as codes or labels. In this work, we will present our ontology model, where terms are the 'individuals' that are the object of classification and linguistic concepts (whether grammatical, as a noun or verb, or morphological, as a full or derived form) constitute the 'classes' into which the terms are classified. This model of lexical representation can enhance the use of ontologies to make dictionaries and contribute to the Web Semantic and Linked Data Linguistics.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>The increased interest in having terminological resources based on the OWL-RDF language of the Semantic Web by experts in natural language processing or knowledge engineering contrasts with the fact that ontologies are not considered adequate repositories as lexical or terminological resources according to some authors <ref type="bibr" target="#b0">[1]</ref><ref type="bibr" target="#b1">[2]</ref><ref type="bibr" target="#b2">[3]</ref>.</p><p>On the one hand, these ontologies are valuable because, like lexical networks, they organize terminology according to semantic or conceptual relationships. This is evident in the case of termologists who work according to ISO principles and standards (ISO 1087 and 704). In the same case, we have the European projects that have developed models to incorporate linguistic information into ontologies (such as OntoLex, Lemon) and the thematic networks and communities, such as ELEXIS, NEXULINGUARUM and TOTh, which promote the work of lexicon and terminology based on lexical networks. and ontologies.</p><p>On the other hand, these initiatives encounter the problem of representing the lexicon and terminology in such a way that the onomasiological perspective (conceptual representation) and the semasiological perspective (representation of linguistic data) are adequately combined. These shortcomings dissuade many lexicographers and terminologists from using ontologies in the development of their resources.</p><p>In our opinion, if the formalization of the linguistic dimension and the conceptual dimension in ontologies are properly combined, their use in the field of linguistics could increase considerably, which would benefit the Semantic Web and Linked Data Paradigm. Therefore, the challenge is to represent the linguistic dimension of natural language terminology in an ontology, avoiding the problems it currently suffers from.</p><p>In the first part, we show the reasons why lexical networks or ontologies do not satisfy some of the expression needs of the linguistic dimension of terms until now. In the second part, we present our proposal to extend the ontological model with the linguistic dimension. In contrast to other models, our model allows to formalize the linguistic data as part of the ontology, not of the metadata, which avoids the weaknesses identified by linguists.</p><p>For the development of our proposal, we have followed the ontology creation methodology of Knowledge Engineering <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b4">5]</ref>. For the implementation of the model and the data we have used the Protégé ontology editor <ref type="bibr" target="#b5">[6]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Objects vs. Words Formalisation</head><p>A number of authors concluded that ontologies does not meet all the needed requirements to organize words. The main problems of using lexical networks or linguistic ontologies (such as WordNet, Cyc, SIMPLE) to represent the lexicon have been pointed out by Hirst <ref type="bibr" target="#b0">[1]</ref> and other authors <ref type="bibr" target="#b1">[2]</ref><ref type="bibr" target="#b2">[3]</ref><ref type="bibr" target="#b3">[4]</ref><ref type="bibr" target="#b4">[5]</ref><ref type="bibr" target="#b5">[6]</ref><ref type="bibr" target="#b7">[7]</ref><ref type="bibr" target="#b8">[8]</ref>. In short, the main difficulties presented by these models appear to be:</p><p>1. Deficiencies in representing the meaning of the lexicon 2. No correspondence between the ontological categories in an ontology and the semantic categories of the words 3. No correspondence between categories necessary for classifying the objects of the world and categories necessary for classifying the lexicon 4. No correspondence between the organization of words according to semantic categories, and the organization of words according to morphological, grammatical or other linguistic categories When we analize these problems, we realize that the real aim of ontologies, including lexical networks and so-called linguistic ontologies, is to structure the entities of the world. They follow a philosophical conception, whose aim is to describe objects. In these ontologies, terms are considered "codes" or "labels" which refer to these concepts (classes or set of objects), following an approach base on the referential semantics. That is evident in the following aspects:</p><p>1. the identification of concept and word: each concept in ontology is intended to describe real world entities (e.g. aeroplanes in aeronautics, plants in botany). 2. the exclusively referential (non-semantic) description of concepts. In these ontologies, there is no description or formalisation of the meaning of the concept, apart from the specification of a more general concept. Instead, they allude to a semantics by reference: concept is given by the set of the objects of the world to which it refers. 3. the different levels of language description, such as phonological, morphological, grammatical, syntactic, etc., are not represented. Some models, such as SKOS or Lemon, propose the insertion of this data on the periphery of the ontology, as tags in the metadata. 4. the aspiration to a universal ontology. Some ontologies seek/proclaim the universality of ontological representation, which they also extend to a linguistic universality, which is a contradiction. The existence of words is inextricably linked to the linguistic system in which they develop and are used. In Linguistics, however, the terms are not considered mere codes, but elements that interact with each other and with other elements within a complex system, the linguistic system. This complex system does not fit under the semantics by reference approach. A more complex model capable of modeling all the dimensions of the terms in natural language, not just the conceptual dimension, is needed.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">An ontological model to represent dictionaries: ONTODIC model</head><p>Ontologies, in knowledge engineering (descriptive logic), are applied to any field and have the objective of organizing objects (individuals) under concepts (in classes). So, we have organized a sample of terms including the ontological and linguistic dimensions following the ontology creation methodology and using the appropriate tools, starting from a linguistic perspective.</p><p>In our ontology model, words or terms are the individuals that are the object of classification and linguistic concepts (whether grammatical, as a noun or verb, or morphological, as a full or derived form) constitute the classes into which the terms are classified. We have thus, from a linguistic approach:</p><p>1. terms as elements of a linguistic system, which are represented as 'individuals' 2. linguistic concepts or 'classes' under which the terms are classified 3. linguistic relationships of various kinds that exist between the terms in a linguistic system formalized as 'object properties'</p><p>In this work, we will present this configuration by applying it to the terminology of the Spanish ceramics industry and analyse its morphological, grammatical and semantic aspects, with examples of the representation of cases of polysemy and synonymy and the representation of equivalences in other languages. We will examine the peculiarities of this configuration of elements in comparison to other configurations and in comparison to the methodology habitually used in knowledge engineering.</p><p>The linguistic approach we propose solves the main problems of using ontologies to represent natural language. This ontological model is based on the following principles:</p><p>1. it represents the lexicon and its conceptualization using the main elements of ontology (such as classes, relations and individuals, axioms, etc.), not tags or metadata 2. it formalizes the differences between form and meaning, allowing the representation of asymmetric relations such as polysemy or synonymy 3. it allows the different levels of language (phonology, morphology, grammar, semantics) to be represented, with the necessary and sufficient level of detail 4. it assumes that all languages have their own formal and conceptual configuration, and allows each of them to be represented independently, rather than assuming a universal or supposedly objective representation. We believe that this model of lexical representation will enhance the use of ontologies by linguists (lexicographers, translators and linguistics researchers) to make dictionaries, which will contribute to the increased availability of high-quality lexical resources in the OWL-RDF format on the Semantic Web.</p></div>		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Acknowledgements</head><p>I would like to thank the reviewers of a previous version of this paper for their valuable comments. This research is part of the project "PRO-ONTODIC: Protocolos para la creación de diccionarios terminológicos basados en ontologías (Modelo ONTODIC)" Ref. UJI-B2018-65, funded by the University Jaume I of Castellón, Spain.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Ontology and the Lexicon</title>
		<author>
			<persName><forename type="first">G</forename><surname>Hirst</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Handbook on Ontologies</title>
				<editor>
			<persName><forename type="first">S</forename><surname>Staab</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">R</forename><surname>Studer</surname></persName>
		</editor>
		<meeting><address><addrLine>Berlin</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2004">2004</date>
			<biblScope unit="page" from="209" to="229" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Near-Synonymy and Lexical Choice</title>
		<author>
			<persName><forename type="first">P</forename><surname>Edmonds</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Hirst</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computational Linguistics</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="105" to="144" />
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">A note on ontology localization</title>
		<author>
			<persName><forename type="first">P</forename><surname>Cimiano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Montiel-Ponsoda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Buitelaar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Espinoza</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Gómez-Pérez</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Applied Ontology</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="127" to="137" />
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">Multilingualism in Ontologies. Multilingual Lexico-Syntactic Patterns for Ontology Modeling and Linguistic Information Repository for Ontology Localization</title>
		<author>
			<persName><forename type="first">E</forename><surname>Montiel-Ponsoda</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2011">2011</date>
			<pubPlace>Madrid</pubPlace>
		</imprint>
		<respStmt>
			<orgName>Universidad Politécnica de Madrid</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Ph.D. thesis</note>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Terms as labels for concepts, terms as lexical units: A comparative analysis in ontologies and specialized dictionaries</title>
		<author>
			<persName><forename type="first">M.-C</forename><surname>L'homme</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Bernier-Colborne</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Applied Ontology</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="page" from="387" to="400" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title level="m" type="main">Towards Linguistically Grounded Ontologies</title>
		<author>
			<persName><forename type="first">P</forename><surname>Buitelaar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Cimiano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Haase</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Sintek</surname></persName>
		</author>
		<editor>L.</editor>
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">The Semantic Web: Research and Applications</title>
		<editor>Aroyo, P. Traverso, F. Ciravegna, P. Cimiano, T. Heath, E. Hyvönen, R. Mizoguchi, E. Oren, M. Sabou, E. Simperl</editor>
		<imprint>
			<date type="published" when="2009">2009</date>
			<publisher>Springer</publisher>
			<biblScope unit="page" from="111" to="125" />
			<pubPlace>Berlin Heidelberg</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<title level="m" type="main">Lexicon Model for Ontologies: Final Community Group Report 10</title>
		<author>
			<persName><forename type="first">P</forename><surname>Cimiano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Mccrae</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Buitelaar</surname></persName>
		</author>
		<ptr target="https://www.w3.org/2016/05/ontolex/" />
		<imprint>
			<date type="published" when="2016-05">May 2016. 2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Towards a Linguistic Ontology with an Emphasis on Reasoning and Knowledge Reuse</title>
		<author>
			<persName><forename type="first">A</forename><surname>Parvizi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kohl</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Gonzàlez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Saurí</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC16), European Language Resources Association (ELRA)</title>
				<meeting>the Tenth International Conference on Language Resources and Evaluation (LREC16), European Language Resources Association (ELRA)<address><addrLine>Portoro\vz, Slovenia</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="441" to="448" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
