<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Padua, Italy
$ gsperanza@unior.it (G. Speranza); mpdibuono@unior.it (M. P. Di Buono); jmonti@unior.it (J. Monti)</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Tailoring Terminological Resources to the Users' Needs: a Corpus-based Study on Appositive Constructions in Italian and English</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Giulia Speranza</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maria Pia Di Buono</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Johanna Monti</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Naples "L'Orientale" - UNIOR NLP Research Group</institution>
          ,
          <addr-line>Via Chiatamone, 61/62, 80121 Napoli NA</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2022</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>Terminological resources are indispensable tools for accessing a specialized domain of knowledge. Several actors gravitate around a specialized domain and, therefore, the creation of terminological resources for diferent kind of users is a challenging task. In this paper, we propose a methodology for extracting terms and linguistic information useful for diferent receivers, hinging on appositive constructions.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Terminological Resources</kwd>
        <kwd>Users</kwd>
        <kwd>Appositive Constructions</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Terminology, intended as the set of diferent terms composing the vocabulary of a specific
discipline [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], is the core of every Language for Specialized Purposes (LSPs), which convey the
most informative content in written or spoken communication.
      </p>
      <p>
        Terminology gathering often involves the processing of actual texts, since they contain terms
as the users of that specific language would use them [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]; therefore, investigating large collection
of electronic texts, i.e., corpora, constitutes a valuable solution for the analysis of LSP and for
the creation of a terminological resource [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>In this study, we propose a methodology for designing diferent types of terminological
resources intended for diferent target users who gravitate around LSPs, i.e., domain experts,
non-experts, and translators.</p>
      <p>
        Taking into consideration the users’ needs in relation to technical terminology is
fundamental for the creation of a suitable resource [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Indeed, the intended users may resort to a
terminological resource for diferent purposes: laypeople mainly aim at understanding the term
they perceive as new, dificult, and semantically obscure to decipher with the sole background
knowledge at their disposal; on the other hand, translators, interpreters, and language experts
in general, might more be interested in terminological equivalents in diferent languages,
terminological variants, synonyms, part-of-speech, definitions as well as examples of the term in the
context of a sentence; finally, technical writers, who fall under the category of experts in the
domain of knowledge, may use such resources as an aid to better convey technical concepts
from a purely linguistic point of view.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Theoretical Background</title>
      <p>In the communication between experts, terms are used by a restricted niche to commonly
designate and refer to agreed technical concepts.</p>
      <p>
        Nonetheless, communication on a specialized topic can also be addressed to non-experts in
the field, following the vertical dimension of specialized languages [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ], where the degree of
technical specialism is usually mediated and lowered.
      </p>
      <p>Indeed, specialized texts, especially when addressed to a diversified, non-expert audience,
usually show the presence of specific linguistic structures aimed at better conveying technical
concepts, such as appositive constructions.</p>
      <p>
        Appositive constructions have been studied and described by several scholars [
        <xref ref-type="bibr" rid="ref7 ref8 ref9">7, 8, 9</xref>
        ] as the
juxtapposition of (usually) two elements: the anchor and the supplement [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>From a graphical point of view, the supplement can be enclosed between punctuation marks
such as commas, dashes or brackets.</p>
      <p>On the semantic and pragmatic level, appositive constructions are usually employed with the
aim of reformulating, paraphrasing, explaining or adding information by means of supplements
with reference to the anchors.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <p>
        Our hypothesis is that, provided that the domain corpus of reference is composed of specialized
text addressed to non-experts, appositive constructions retrieval can be beneficial for the
extraction of terms [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] and the creation of diferent terminological resources responding to
diferent declared aims and users.
      </p>
      <p>As a case study, we select the domain of archaeology, a sub-field of Cultural Heritage (CH),
which, despite making use of a highly technical language, rich in terminology, is one of the
least investigated among the LSPs, if compared to other fields of knowledge.</p>
      <p>As a linguistic base for our study we compile an Italian-English parallel corpus composed
of texts in the form of brochures, leaflets, and museums and archaeological sites guides, for
a total of 200k tokens for each side of the parallel corpus. The parallel corpus is also chosen
as a suitable source for dealing with terminology from the point of view of translation and
interpreting.</p>
      <p>
        The methodology we follow comes from the corpus linguistics field and makes use of the
Corpus Query Language (CQL) provided in Sketch Engine [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], as it results a powerful tool
enabling the setting of specific criteria for lexical pattern identification and investigation. We
set queries able to retrieve appositional constructions in the parallel corpus and visualize them
as Keyword in Context (KWIC) in the Parallel Concordance section. The queries hing on the
syntactic structure and nature of appositional constructions, especially where brackets are
registered, making use of both Part of Speech (PoS) and Regex.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Results</title>
      <p>Examples of appositive constructions in Italian and English, mainly enclosed between brackets,
retrieved from our corpus can be classified as follows:
• term and its simplified synonym/variant: i.e., fibula (spilla) / fibula (brooch)
• term and its description: i.e., erma (ritratto su pilastro) / herm (portrait on the pilaster)
• term and its description by means of the function: i.e., praefurnium (forno per il
riscaldamento) / praefurnium (oven for heating)
• term and its description by means of the morphology: i.e. rhyton (coppa a forma di corno)
/ rhyton (a horn-shaped cup)</p>
      <p>Making use of only the anchors or the anchor in combination with its supplement we can
obtain diferent resources, i.e., the anchors can be used to create a term base of bilingual
equivalent terms (erma(it) - herm(en)); the supplement can be used to provide a resource
containing explanation of the technical term for non-experts in the field, both from a monolingual
and a bilingual perspective.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions and Future Works</title>
      <p>Retrieved results show that appositive constructions can be extracted and employed to:
• create a terminological resource containing the term and its simplification/reformulation
(the so-called supplement) with the aim of making technical terminology intelligible
for a public of non-experts in the field, following an "informational perspective", i.e.
praefurnium is, more generally, a oven (for heating).
• identify bilingual terminological equivalents (candidate terms) and monolingual
terminological variants in order to create a terminological resource intended for translators,
interpreters, or language experts, according to a "professional perspective", i.e., erma in
Italian is the equivalent of "herm" in English. Furthermore, an alternative term for "fibula"
is "brooch".
• help technical writers by providing them a linguistic and semantic based guide with the
most frequent and productive ways of dealing with terminology, from an "educational
perspective", i.e., technical terms can be exemplified by a description of their function
(purpose), or their morphology (shape) or a more generic synonym.</p>
      <p>As future work it would be interesting to enlarge the domain corpus in order to obtain even
more results. Furthermore, a parallel corpus in more than two languages would also be useful
for the creation of a multilingual resource.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This research has partly been supported by POR Campania FSE 2014/2020 funds and by the
National Operative Program (Programma Operativo Nazionale - PON).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J. C.</given-names>
            <surname>Sager</surname>
          </string-name>
          ,
          <article-title>Practical course in terminology processing</article-title>
          , John Benjamins Publishing,
          <year>1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Cabré</surname>
          </string-name>
          , Terminology: Theory, methods, and applications, volume
          <volume>1</volume>
          , John Benjamins Publishing,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>L.</given-names>
            <surname>Bowker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pearson</surname>
          </string-name>
          ,
          <article-title>Working with specialized language: a practical guide to using corpora</article-title>
          ,
          <source>Routledge</source>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>G.</given-names>
            <surname>Rondeau</surname>
          </string-name>
          , Introduction à la terminologie,
          <year>1984</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Cortelazzo</surname>
          </string-name>
          ,
          <article-title>Lingue speciali: la dimensione verticale</article-title>
          ,
          <source>Unipress</source>
          ,
          <year>1994</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Dardano</surname>
          </string-name>
          ,
          <article-title>Linguaggi settoriali e processi di riformulazione</article-title>
          ,
          <source>Parallela</source>
          <volume>3</volume>
          (
          <year>1987</year>
          )
          <fpage>134</fpage>
          -
          <lpage>145</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>R.</given-names>
            <surname>Quirk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Greenbaum</surname>
          </string-name>
          , G. Leech,
          <string-name>
            <given-names>J.</given-names>
            <surname>Svartvik</surname>
          </string-name>
          ,
          <article-title>A comprehensive grammar of the english language</article-title>
          ,
          <source>Longman</source>
          ,
          <year>1985</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>C. F.</given-names>
            <surname>Meyer</surname>
          </string-name>
          , Apposition in contemporary English, Cambridge University Press; Cambridge [England]; New York,
          <year>1992</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>N.</given-names>
            <surname>Burton-Roberts</surname>
          </string-name>
          , Parentheticals, Keith Brown (ed.),
          <article-title>Encyclopaedia of language and linguistics, 2nd edn</article-title>
          ., vol.
          <volume>9</volume>
          , (
          <year>2006</year>
          )
          <fpage>179</fpage>
          -
          <lpage>182</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>R.</given-names>
            <surname>Huddleston</surname>
          </string-name>
          ,
          <string-name>
            <surname>G. Pullum,</surname>
          </string-name>
          <article-title>The cambridge grammar of the english language</article-title>
          ,
          <source>Zeitschrift für Anglistik und Amerikanistik</source>
          <volume>53</volume>
          (
          <year>2005</year>
          )
          <fpage>193</fpage>
          -
          <lpage>194</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>G.</given-names>
            <surname>Speranza</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. P.</given-names>
            <surname>Di Buono</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Monti</surname>
          </string-name>
          ,
          <article-title>Terms and appositions: What unstructured texts tell us</article-title>
          ,
          <source>in: Formalizing Natural Languages: Applications to Natural Language Processing and Digital Humanities</source>
          , Springer,
          <year>2021</year>
          , pp.
          <fpage>219</fpage>
          -
          <lpage>230</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kilgarrif</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Baisa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bušta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Jakubíček</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Kovář</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Michelfeit</surname>
          </string-name>
          , P. Rychl y`,
          <string-name>
            <given-names>V.</given-names>
            <surname>Suchomel</surname>
          </string-name>
          ,
          <article-title>The sketch engine: ten years on</article-title>
          ,
          <source>Lexicography</source>
          <volume>1</volume>
          (
          <year>2014</year>
          )
          <fpage>7</fpage>
          -
          <lpage>36</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>