<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Subject Fields in Termbases -Their Design, Use and Representation</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Kara</forename><surname>Warburton</surname></persName>
							<email>karacw@illinois.edu</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Illinois at Urbana-Champaign</orgName>
								<address>
									<addrLine>707 S. Mathews Ave</addrLine>
									<postCode>61801</postCode>
									<settlement>Urbana</settlement>
									<region>Illinois</region>
									<country key="US">USA</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="department">International Conference on &quot;Multilingual digital terminology today. Design</orgName>
								<orgName type="institution">representation formats and management systems&quot;</orgName>
								<address>
									<addrLine>June 16 -17</addrLine>
									<settlement>Padova</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Subject Fields in Termbases -Their Design, Use and Representation</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">FB751C1014CF0FA645576E22BE6BABC4</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T18:02+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Terminology</term>
					<term>TBX</term>
					<term>subject fields</term>
					<term>domains</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Subject fields play an essential role in terminological resources by allowing for the creation of semantically-based subdivisions in addition to acting as a conceptual boundary for the principle of univocity. However, due to the lack of guidelines and standards, their application in termbases risks being ad-hoc, which reduces their effectiveness in achieving these goals. ISO TC/37 has published a technical specification (TS) aimed to increase the rigour of subject-field use and the interoperability of the data. This paper describes some issues and challenges relating to subject-fields in termbases and how the TS may resolve them.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Classification is a widely-used ordering mechanism, indispensable for instance in information and library science <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b4">5]</ref>. Philosophers such as Aristotle, taxonomists such as Carl Linnaeus, and documentalists such as Melvil Dewey established principles for the classification of knowledge into categories that are widely used today. It is no surprise then that terminological entries are frequently organized into categories. These categories can be based on semantic properties, or criteria of a more administrative nature such as institutional departments, clients, and so forth. In the former case, the most common type of categorization is referred to as domains or subject fields.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Subject fields in Terminology</head><p>The notion of subject fields is critical to terminology theory and practice. According to convention, terms designate concepts that belong to a language for special purposes (LSP) (as opposed to language for general purposes or LGP) <ref type="bibr" target="#b7">[8]</ref>, and an LSP is the language used by specialists in a subject field <ref type="bibr" target="#b0">[1]</ref>. For many scholars, adherence to a subject field is a requirement for a linguistic unit to be deemed a term <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b1">2]</ref>. Indeed, specifying the subject field that a term belongs to is often considered mandatory for terminological description <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b0">1,</ref><ref type="bibr" target="#b6">7,</ref><ref type="bibr" target="#b4">5]</ref>.</p><p>Univocity, a key principle in classical terminology theory, may also depend on subject fields. According to this principle, a term should have only one meaning. But we maintain that univocity is only achievable if it is applied within the scope of a subject field. This is because "identical" lexical units occur in different subject fields with different meanings (homonyms, homographs) (for example, "port" the strong wine and "port" the computer connection). Consequently, univocity has been defined with domain-specificity as its scope <ref type="bibr" target="#b1">[2]</ref>.</p><p>Scholars have also noted that subject fields should be organized in a hierarchical structure, to include sub-fields and even finer divisions <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b1">2,</ref><ref type="bibr" target="#b4">5]</ref>. Figure <ref type="figure" target="#fig_0">1</ref> provides an example of a three level system showing the top level Education, followed by child levels, three of which are further divided into subordinate values. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Challenges</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.1.">Lack of guidelines and of a universal subject-field classification system</head><p>Guidelines, standards, and representation models for subject fields are lacking in the literature. Given the importance of subject fields which we have demonstrated, this is surprising if not troubling. Consequently, the use of subject fields in today's termbases varies considerably. Some termbases use none at all, others feature a flat list of values 2 . Each termbase that features subject fields employs a unique set, different even from that of other termbases that cover the same or similar spheres of knowledge. The lack of a universal subject-field classificaiton system represents a major obstacle to the interoperability of terminological databases.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.2.">Difficulties when assigning subject field values to concepts</head><p>Deciding which subject field a concept "belongs" to is another challenge. The choice is not always obvious, and terminologists often rely purely on intuition. Under these conditions, subject-field assignments will not be reliable, which raises questions as to the effectiveness of subject fields as a classificatory mechanism.</p><p>There is also the question of whether a concept can be assigned to more than one subject field. Here, terminologists disagree; some say yes, others no. However, if a subject-field value sets a boundary enabling the term to be univocal, then one would assume that it is confined to this subject field. This leads to the possibility that, if a terminologist feels inclined to select two subject fields, perhaps it is their "parent" that should be assigned instead. These are philosophical questions worthy of further debate.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.3.">Lack of models for representing subject fields</head><p>ISO Technical Committee 37, Sub-committee 3, has published a standard for representing terminological resources in an XML markup format, ISO 30042: TermBase eXchange (TBX). TBX also constitutes a model framework for designing a termbase. However, subject fields and their representation is not addressed in any substantive manner. They are loosely modelled in plain text fields (with therefore no control over permissible values), and there is no facility for establishing a taxonomic structure. The standard merely stipulates that subject fields are to be represented in a &lt;descrip&gt; element at the concept level, for example: &lt;descrip type="subjectField"&gt;Nuclear power&lt;/descrip&gt;</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">The response of ISO TC 37</head><p>To address the TBX limitations, in 2021 the committee published a Technical Specification (TS) that provides guidelines for subject fields as well as for concept relations (another important feature of termbases for which guidelines are lacking): ISO/TS 24634 -TBX-compliant representation of concept relations and subject fields. In the following paragraphs, we summarize the contents of this TS.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Constraints</head><p>The TS specifies the following constraints relating to subject fields. The aim is to increase interoperability.</p><p>1. The content of the subject-field data category shall be a picklist (closed list of values). These values form the organization's subject field classification system. 2. Whenever possible, an existing public subject field classification system should be adopted, such as EuroVoc or Lenoch.</p><p>3. The name and source of the subject-field classification must be declared in the TBX header. 4. The full subject-field classification system should be described, either in the backmatter of the TBX document instance, or through an XML namespace. Within this description, the scope, or meaning, of subject-field values, should also be defined. This aims to facilitate a more reliable assignment of subject-field values to concept entries.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">XML representation</head><p>An XML model for representing subject-field classification systems is provided in the TS. The model includes some markup adopted from the RDF-based SKOS.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusion</head><p>The ISO TS should help to increase the interoperability of termbases. However, it will only have an effect if its provisions are adopted by termbase administrators. The uptake of ISO TC37 standards, however, has been slow in the past. Furthermore, full interoperability will not be achieved without a universal classification of subject fields. Whether that is a realistic goal remains open to debate.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1</head><label>1</label><figDesc>Figure 1 Sample hierarchical subject-field classification (courtesy Interverbum Technology AB).</figDesc><graphic coords="2,89.20,75.00,110.74,185.60" type="bitmap" /></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><forename type="first">M</forename></persName>
		</author>
		<author>
			<persName><forename type="first">Teresa</forename><surname>Cabre</surname></persName>
		</author>
		<title level="m">Terminology -Theory, Methods, and Applications</title>
				<meeting><address><addrLine>Amsterdam</addrLine></address></meeting>
		<imprint>
			<publisher>John Benjamins Publishing Co</publisher>
			<date type="published" when="1999">1999</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">R</forename><surname>Dubuc</surname></persName>
		</author>
		<title level="m">Manuel Pratique de Terminologie</title>
				<meeting><address><addrLine>Montreal</addrLine></address></meeting>
		<imprint>
			<publisher>Linguatech</publisher>
			<date type="published" when="1992">1992</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m">International Organization for Standardization, ISO 30042 -TermBase eXchange (TBX)</title>
				<meeting><address><addrLine>Geneva</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m">International Organization for Standardization, ISO/TS 24634 -TBX-compliant representation of concept relations and subject fields</title>
				<meeting><address><addrLine>Geneva</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">Essays on Terminology</title>
		<author>
			<persName><forename type="first">A</forename><surname>Rey</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1995">1995</date>
			<publisher>John Benjamins Publishing Co</publisher>
			<pubPlace>Amsterdam</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title level="m" type="main">Introduction à la terminologie</title>
		<author>
			<persName><forename type="first">G</forename><surname>Rondeau</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1981">1981</date>
			<publisher>Centre Educatif et Culturel Inc</publisher>
			<pubPlace>Montreal</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">A Practical Course in Terminology Processing</title>
		<author>
			<persName><forename type="first">J</forename><surname>Sager</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1990">1990</date>
			<publisher>John Benjamins Publishing Co</publisher>
			<pubPlace>Amsterdam</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<title level="m" type="main">Language as an economic factor: the importance of terminology</title>
		<author>
			<persName><forename type="first">W</forename><surname>Teubert</surname></persName>
		</author>
		<editor>G. Barnbrook, P. Danielsson, M.Mahlberg</editor>
		<imprint>
			<date type="published" when="2005">2005</date>
			<publisher>Continuum</publisher>
			<biblScope unit="page" from="96" to="106" />
			<pubPlace>London</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
