<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">OntoLex and Onomasiological Ordering: Supporting Topical Thesauri</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Sander</forename><surname>Stolk</surname></persName>
							<email>s.s.stolk@umail.leidenuniv.nl</email>
							<affiliation key="aff0">
								<orgName type="institution">Leiden University</orgName>
								<address>
									<settlement>Leiden</settlement>
									<country key="NL">The Netherlands</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">OntoLex and Onomasiological Ordering: Supporting Topical Thesauri</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">47A561571A1A85C182175D5790D68232</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-23T23:23+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>OntoLex</term>
					<term>Lemon</term>
					<term>onomasiological ordering</term>
					<term>thesaurus</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The OntoLex vocabulary has been designed to capture lexicons and to add their lexicographical knowledge to ontologies in the Semantic Web. Although the specification of the vocabulary posits that OntoLex allows lexicons to be ordered onomasiologically, it does so for a very specific kind of onomasiological ordering only. As a consequence, the vocabulary is currently insufficient for capturing a large proportion of the existing topical thesauri. This paper demonstrates the current expressivity and this shortcoming of OntoLex through two case studies: The Historical Thesaurus of the Oxford English Dictionary and The Scots Thesaurus. In order for OntoLex to offer full support for topical thesauri and their ordering principles, this paper proposes the addition of a single property to the vocabulary: ontolex:isSenseIn.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>The Lexicon Model for Ontologies vocabulary has been designed to capture lexicons and to add their lexicographical knowledge to ontologies in the Semantic Web <ref type="bibr" target="#b0">[1]</ref>. The vocabulary has seen a number of updates, and was published as a W3C vocabulary by the OntoLex community group in May 2016 <ref type="bibr" target="#b1">[2]</ref>. This version, henceforth OntoLex, has since been picked up by a number of bodies, including the Global WordNet Association, to represent and link existing lexical resources on the Semantic Web <ref type="bibr" target="#b2">[3]</ref>.</p><p>The specification of OntoLex puts forward a manner in which "lexicons can be ordered onomasiologically, that is by meanings rather than by lemmas" <ref type="bibr" target="#b1">[2]</ref>. For publishers of topical thesauri, this is good news indeed. Such support is essential for these lexicographical works, which order their words by meaning instead of from a to z as is common in typical dictionaries. Yet the OntoLex vocabulary supports a very specific kind of onomasiological ordering only. As a consequence, the vocabulary is currently insufficient for capturing the knowledge from a large proportion of the existing topical thesauri. The current paper demonstrates this shortcoming of OntoLex and proposes a way forward for the vocabulary.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Methodology</head><p>In order to provide insight into the current support of OntoLex for the onomasiological ordering of topical thesauri, this paper presents two case studies. The first is based on the Historical Thesaurus of the Oxford English Dictionary <ref type="bibr" target="#b3">[4]</ref>; the second on The Scots Thesaurus <ref type="bibr" target="#b4">[5]</ref>. Both lexicographical works employ an onomasiological ordering for their lexicon. The first is considered a distinctive thesaurus and contains sets of synonyms. The second is not distinctive but cumulative and refrains from indicating synonymy <ref type="bibr" target="#b5">[6]</ref>.</p><p>This paper expresses samples from both thesauri in the OntoLex vocabulary. The manner in which OntoLex is applied is in line with the specification of the vocabulary <ref type="bibr" target="#b1">[2]</ref> and the approach outlined by the Global WordNet Association <ref type="bibr" target="#b2">[3]</ref>. This approach has been adopted by several projects, among them the Open Dutch Wordnet <ref type="bibr" target="#b6">[7]</ref>. Namespaces relevant for this paper are provided in Listing 1. The RDF snippets in subsequent listings are specified in the Turtle RDF syntax <ref type="bibr" target="#b7">[8]</ref>. Sample data from the case studies correspond with resources between angle brackets in the RDF snippets (that is to say, their namespace is left unspecified for the present purpose).</p><p>Listing 1. Namespaces
@prefix ontolex: &lt;http://www.w3.org/ns/lemon/ontolex#&gt; .
@prefix owl: &lt;http://www.w3.org/2002/07/owl#&gt; .
@prefix rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt; .
@prefix skos: &lt;http://www.w3.org/2004/02/skos/core#&gt; .
@prefix wn: &lt;http://wordnet-rdf.princeton.edu/ontology#&gt; .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Case study Historical Thesaurus of the OED</head><p>The first case study presented here is that of the Historical Thesaurus of the Oxford English Dictionary (HTOED). HTOED captures the English lexis that has existed throughout its 1300-year history, from Old English up to Modern English. This topical thesaurus groups together lexical items that are considered near-synonymous and provides insight into their use in time and place. HTOED was first published in print in 2009 <ref type="bibr" target="#b3">[4]</ref> and in the following year also electronically <ref type="bibr" target="#b8">[9]</ref>.</p><p>Figure <ref type="figure" target="#fig_0">1</ref> depicts a sample from HTOED. This sample contains six categories from the topical system of the thesaurus (here represented by circles), which are organized in a hierarchy. A category displayed below another category and connected to it by a line is subordinate to that category. On the right, a number of lexical senses are displayed (some of which are obsolete, as conveyed by a dagger sign). These senses are considered synonyms, or rather near-synonyms, in HTOED and are members of the "Freedom/liberty" category. Expressing categories of the topical system of HTOED in OntoLex is relatively straightforward. Each HTOED category corresponds with a lexical concept in OntoLex. The latter is defined as a "mental abstraction, concept or unit of thought that can be lexicalized by a given collection of senses" <ref type="bibr" target="#b1">[2]</ref>. This definition appears highly applicable to categories from topical thesauri. As lexical concepts are asserted to be specializations of SKOS concepts, it is possible to capture the hierarchy between categories using the broader/narrower relations from SKOS <ref type="bibr" target="#b9">[10]</ref>.
Listing 2 contains the RDF for expressing one of the HTOED categories in OntoLex, "Freedom/liberty", and the relation to its superordinate category "Lack of subjection".</p><p>Listing 2. HTOED category "Freedom/liberty" expressed in OntoLex
&lt;category-FreedomLiberty&gt; a ontolex:LexicalConcept ;
    skos:prefLabel "Freedom/liberty"@en ;
    skos:broader &lt;category-LackOfSubjection&gt; .</p><p>The OntoLex vocabulary also contains terminology to express lexical senses and the lexical entries to which they belong. In order to state that a given lexical sense from HTOED belongs to one of its categories, the property ontolex:isLexicalizedSenseOf can be used. This property relates a lexical sense to a lexical concept, stating that it "lexicalizes" that concept. According to the section on Lexical Nets in the OntoLex specification, lexical senses that lexicalize the same concept are considered synonymous <ref type="bibr" target="#b1">[2]</ref>. In other words, the relation of synonymy is not explicitly asserted in OntoLex, but can be inferred from the use of the ontolex:isLexicalizedSenseOf property. The resulting RDF for the sense of freedom from the HTOED sample and its relation to the "Freedom/liberty" category is provided in Listing 3.</p><p>Listing 3. HTOED sense of freedom expressed in OntoLex
&lt;sense-freedom-n-3&gt; a ontolex:LexicalSense ;
    skos:prefLabel "freedom n. (sense 3)"@en ;
    ontolex:isSenseOf &lt;entry-freedom-n&gt; ;
    ontolex:isLexicalizedSenseOf &lt;category-FreedomLiberty&gt; .</p><p>&lt;entry-freedom-n&gt; a ontolex:LexicalEntry ;
    skos:prefLabel "freedom, n."@en ;
    wn:partOfSpeech wn:noun .</p><p>As shown, capturing the onomasiological ordering of the HTOED lexicon presents no issues with the OntoLex vocabulary. The vocabulary enables one to express categories and their hierarchy, lexical senses and their relation to a lexical entry, and the relation between the senses from HTOED and the categories to which they belong.</p></div>
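<div xmlns="http://www.tei-c.org/ns/1.0"><p>The inference described in this section, namely that senses lexicalizing the same concept can be read as synonyms, may be sketched in a few lines of Python. This is a minimal illustration and not part of the OntoLex specification: triples are represented as plain tuples, the function name synonym_sets is hypothetical, and the identifier sense-liberty-n-1 is invented for the example (only sense-freedom-n-3 and the two categories occur in Listings 2 and 3).</p><p>
```python
from collections import defaultdict

LEXICALIZES = "ontolex:isLexicalizedSenseOf"

# Triples as (subject, predicate, object) tuples.
# "sense-liberty-n-1" is a hypothetical identifier added for illustration.
triples = [
    ("sense-freedom-n-3", LEXICALIZES, "category-FreedomLiberty"),
    ("sense-liberty-n-1", LEXICALIZES, "category-FreedomLiberty"),
    ("category-FreedomLiberty", "skos:broader", "category-LackOfSubjection"),
]


def synonym_sets(triples):
    """Group senses by the lexical concept they lexicalize."""
    groups = defaultdict(set)
    for subject, predicate, obj in triples:
        if predicate == LEXICALIZES:
            groups[obj].add(subject)
    return dict(groups)


sets = synonym_sets(triples)
print(sorted(sets["category-FreedomLiberty"]))
```
</p><p>Each resulting set holds the senses that share a lexical concept. The skos:broader triple is ignored on purpose, since the hierarchy between categories plays no role in this particular inference.</p></div>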
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Case study The Scots Thesaurus</head><p>The second case study in this paper concerns The Scots Thesaurus (ScT) <ref type="bibr" target="#b4">[5]</ref>. ScT captures the Lowland Scots lexis available throughout history, from its twelfth-century beginnings to the present. This thesaurus, published in 1990, categorizes its lexical items but does not indicate synonymy. Figure <ref type="figure" target="#fig_1">2</ref> depicts the sample taken from ScT, encompassing five categories and four lexical senses. Expressing categories from ScT is possible in a manner identical to that used for HTOED. The result for the "Sowing" category from ScT, including its relation to the superordinate category "Crops", is provided in Listing 4.</p><p>Listing 4. ScT category "Sowing" expressed in OntoLex
&lt;category-Sowing&gt; a ontolex:LexicalConcept ;
    skos:prefLabel "Sowing"@en ;
    skos:broader &lt;category-Crops&gt; .</p><p>As for the lexical senses from ScT, these too can be expressed in OntoLex in a manner comparable to that used for HTOED. There is, however, a notable difference. The property ontolex:isLexicalizedSenseOf is unsuitable for relating the senses of ScT to the categories to which they belong. The lexical senses in ScT are not necessarily lexicalizations of the category in question. Moreover, senses that belong to the same category are not necessarily considered synonymous. In fact, they rarely are. Cases in point are the senses of happer and miss from the sample. Both of these senses are members of the category "Sowing", and indeed belong to that semantic domain, but can hardly be said to be synonymous or even to lexicalize the category.</p><p>What is missing, then, from the OntoLex vocabulary is terminology to express a looser manner of onomasiological ordering with categories than ontolex:isLexicalizedSenseOf does.
The RDF snippet in Listing 5 contains the desired situation, where a tentative property isSenseIn is coined (see the line beginning with :isSenseIn) to express the relation between the sense of blander and the category to which it belongs.</p><p>Listing 5. ScT sense of blander expressed in OntoLex
&lt;sense-blander-v-disperseScantily&gt; a ontolex:LexicalSense ;
    skos:prefLabel "blander"@sco ;
    skos:definition "disperse scantily"@en ;
    ontolex:isSenseOf &lt;entry-blander-v&gt; ;
    :isSenseIn &lt;category-Sowing&gt; .</p><p>&lt;entry-blander-v&gt; a ontolex:LexicalEntry ;
    skos:prefLabel "blander, v."@sco ;
    wn:partOfSpeech wn:verb .</p><p>In short, OntoLex itself does not yet provide terminology to onomasiologically order the lexicographical content of ScT, and of other thesauri like it.</p></div>
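<div xmlns="http://www.tei-c.org/ns/1.0"><p>The difference between membership of a category and lexicalization of a category can be made concrete with a small Python sketch. It is an illustration under stated assumptions only: the tentative property is written here as ontolex:isSenseIn, the helper members is hypothetical, and the identifiers for the senses of happer and miss are invented (only the sense of blander is spelled out in Listing 5).</p><p>
```python
LEXICALIZES = "ontolex:isLexicalizedSenseOf"
SENSE_IN = "ontolex:isSenseIn"  # tentative property, not (yet) part of OntoLex

# Identifiers for happer and miss are hypothetical; only the blander sense
# is given in Listing 5.
triples = [
    ("sense-blander-v-disperseScantily", SENSE_IN, "category-Sowing"),
    ("sense-happer", SENSE_IN, "category-Sowing"),
    ("sense-miss", SENSE_IN, "category-Sowing"),
]


def members(triples, category, predicate):
    """Return the senses linked to a category by the given predicate."""
    return {s for s, p, o in triples if p == predicate and o == category}


# All three senses belong to the semantic domain of sowing ...
domain_members = members(triples, "category-Sowing", SENSE_IN)
# ... but none of them is asserted to lexicalize the category, so no
# synonymy between them may be inferred.
synonyms = members(triples, "category-Sowing", LEXICALIZES)
print(len(domain_members), len(synonyms))
```
</p><p>Keeping the two predicates apart is what allows a consumer of the data to retrieve the senses of a semantic domain without wrongly treating them as synonyms.</p></div>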
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Discussion</head><p>The two case studies have shown that OntoLex is not yet expressive enough to indicate the relation between senses and categories for all topical thesauri. In fact, the lack of a property like the tentative isSenseIn does not just affect conveying content from ScT and the great many existing cumulative thesauri like it. It also affects expressing these very relations found in thesauri such as HTOED. After all, senses in HTOED are not just lexicalizations of a category; they are also members of a number of categories. To illustrate, the assertion that the HTOED sense of freedom is a lexicalization of the category "Freedom/liberty" entails that this sense is a member of not just that category but also of its superordinate categories (see Listing 6).</p><p>Listing 6. HTOED sense of freedom and its relation to the categories of HTOED
&lt;sense-freedom-n-3&gt; a ontolex:LexicalSense ;
    ontolex:isLexicalizedSenseOf &lt;category-FreedomLiberty&gt; ;
    :isSenseIn &lt;category-FreedomLiberty&gt; ,
        &lt;category-LackOfSubjection&gt; ,
        &lt;category-Authority&gt; ,
        &lt;category-Society&gt; .</p><p>In order to truly express how senses are categorized according to topical systems in thesauri, then, additional terminology is required beyond what OntoLex currently offers. Properties from other vocabularies that might fill the gap, such as the subject property from Dublin Core Terms [11], tend to be too generic to be able to infer further knowledge from topical systems of thesauri. Moreover, the relation between such properties and ontolex:isLexicalizedSenseOf is not evident. As such, the required terminology is best captured in an update of the OntoLex vocabulary itself.
The small addition of a single property such as isSenseIn (see Listing 7), then, and asserting its connection to the existing OntoLex property (see Listing 8) would enable onomasiological ordering of lexicons in topical thesauri of all varieties, whether distinctive or cumulative, and regardless of whether synonymy is indicated between senses.</p><p>Listing 7. Suggested OntoLex property isSenseIn
ontolex:isSenseIn a owl:ObjectProperty ;
    rdfs:label "is sense in"@en ;
    rdfs:comment "This property relates a lexical sense to a concept that captures its meaning to some extent (that is, partially or even fully)."@en ;
    rdfs:domain ontolex:LexicalSense ;
    rdfs:range ontolex:LexicalConcept .</p></div>
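<div xmlns="http://www.tei-c.org/ns/1.0"><p>How the memberships of Listing 6 could follow from a single lexicalization statement may be sketched in Python as follows. The sketch rests on two rules. The first is the subproperty relation of Listing 8. The second, that membership propagates along skos:broader, is an assumption made here to mirror the entailment described in the text; the paper does not formalize it. Only the first broader link is given in Listing 2, and the rest of the chain is assumed from the order of the categories in Listing 6. The function name infer_sense_in is hypothetical.</p><p>
```python
LEXICALIZES = "ontolex:isLexicalizedSenseOf"
SENSE_IN = "ontolex:isSenseIn"  # property proposed in the paper
BROADER = "skos:broader"

# Only the first broader link is given in Listing 2; the remaining chain is
# assumed from the order of the categories in Listing 6.
triples = {
    ("sense-freedom-n-3", LEXICALIZES, "category-FreedomLiberty"),
    ("category-FreedomLiberty", BROADER, "category-LackOfSubjection"),
    ("category-LackOfSubjection", BROADER, "category-Authority"),
    ("category-Authority", BROADER, "category-Society"),
}


def infer_sense_in(triples):
    """Derive isSenseIn triples by applying two rules until nothing changes."""
    inferred = set()
    # Rule 1 (Listing 8): isLexicalizedSenseOf is a subproperty of isSenseIn.
    for s, p, o in triples:
        if p == LEXICALIZES or p == SENSE_IN:
            inferred.add((s, SENSE_IN, o))
    broader = {(s, o) for s, p, o in triples if p == BROADER}
    # Rule 2 (assumption): membership propagates to broader categories.
    changed = True
    while changed:
        changed = False
        for sense, _, category in list(inferred):
            for narrow, wide in broader:
                new = (sense, SENSE_IN, wide)
                if narrow == category and new not in inferred:
                    inferred.add(new)
                    changed = True
    return inferred


categories = sorted(o for _, _, o in infer_sense_in(triples))
print(categories)
```
</p><p>Under these assumptions the single lexicalization statement yields membership of all four categories listed in Listing 6. In an RDF setting the same effect would normally be left to a reasoner rather than hand-written loops.</p></div>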
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Conclusion</head><p>This paper has shown, by means of two case studies, to what extent the OntoLex vocabulary currently supports relating lexical senses to the concepts that facilitate an onomasiological ordering. Such an ordering is, by definition, used in the lexicographical works known as topical thesauri. As it stands, the OntoLex vocabulary offers some support for those thesauri that are considered distinctive and that capture synonymy. Such thesauri ensure that lexical senses displayed at a certain category do not just belong to that category, but also express (or lexicalize) that category. Those thesauri that do not have that same level of specificity, but merely use their categories to organize lexical senses into semantic domains, are not yet supported by the terminology in OntoLex.</p><p>The small addition of a single property, as suggested in this paper, would have a big impact on the expressivity of OntoLex. The onomasiological ordering of both distinctive and cumulative thesauri, regardless of whether these thesauri indicate synonymy, could then properly be conveyed on the Semantic Web. As a result, the variety of lexicographical resources that sit comfortably in OntoLex would not be limited to dictionaries and lexical nets, as is presently the case, but would also include thesauri. Increased support in OntoLex for onomasiological ordering, then, would allow all these resources to truly shine on the Web. In short, ordering by meaning through the new ontolex:isSenseIn is both meaningful and sensible.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Example HTOED content based on <ref type="bibr" target="#b8">[9]</ref></figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 2 .</head><label>2</label><figDesc>Fig. 2. Example ScT content</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Listing 8 .</head><label>8</label><figDesc>Connection between existing OntoLex property and the suggested one
ontolex:isLexicalizedSenseOf rdfs:subPropertyOf ontolex:isSenseIn .</figDesc></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Interchanging lexical resources on the Semantic Web</title>
		<author>
			<persName><forename type="first">J</forename><surname>McCrae</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Aguado-de Cea</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Buitelaar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Cimiano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Declerck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Gómez-Pérez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Gracia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Hollink</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Montiel-Ponsoda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Spohr</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Wunner</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Language Resources and Evaluation</title>
		<imprint>
			<biblScope unit="volume">46</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="701" to="719" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<ptr target="http://www.w3.org/2016/05/ontolex/" />
		<title level="m">Lexicon Model for Ontologies: Community report</title>
				<imprint>
			<date type="published" when="2016-05-10">10 May 2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<ptr target="http://globalwordnet.github.io/schemas/" />
		<title level="m">Global WordNet Association: Global Wordnet formats</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">Historical thesaurus of the Oxford English Dictionary: with additional material from &quot;A thesaurus of Old English&quot;</title>
		<editor>Kay, C., Roberts, J., Samuels, M., Wotherspoon, I.</editor>
		<imprint>
			<date type="published" when="2009">2009</date>
			<publisher>Oxford University Press</publisher>
			<pubPlace>Oxford</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">The Scots thesaurus</title>
		<editor>Macleod, I., Cairns, P., Macafee, C., Martin, R.</editor>
		<imprint>
			<date type="published" when="1990">1990</date>
			<publisher>Aberdeen University Press</publisher>
			<pubPlace>Aberdeen</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Diachronic and synchronic thesauruses</title>
		<author>
			<persName><forename type="first">C</forename><surname>Kay</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Alexander</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The Oxford handbook of lexicography</title>
				<editor>
			<persName><forename type="first">P</forename><surname>Durkin</surname></persName>
		</editor>
		<meeting><address><addrLine>Oxford</addrLine></address></meeting>
		<imprint>
			<publisher>Oxford University Press</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="367" to="380" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Open Dutch WordNet</title>
		<author>
			<persName><forename type="first">M</forename><surname>Postma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Van Miltenburg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Segers</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Schoen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Vossen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Eighth Global Wordnet Conference</title>
				<meeting>the Eighth Global Wordnet Conference<address><addrLine>Bucharest, Romania</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<title level="m" type="main">RDF 1.1 Turtle: W3C recommendation</title>
		<author>
			<persName><forename type="first">D</forename><surname>Beckett</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Berners-Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Prud'hommeaux</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Carothers</surname></persName>
		</author>
		<ptr target="http://www.w3.org/TR/turtle/" />
		<imprint>
			<date type="published" when="2014-02-25">25 February 2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<ptr target="http://oed.com/thesaurus" />
		<title level="m">Historical thesaurus of the Oxford English Dictionary</title>
				<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<title level="m" type="main">SKOS Simple Knowledge Organization System reference: W3C recommendation</title>
		<ptr target="http://www.w3.org/TR/skos-reference/" />
		<imprint>
			<date type="published" when="2009-08-18">18 August 2009</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
