<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Models for Dynamic Ontologies in Wind Energy Domain Analysis</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Andrea Lops</string-name>
          <email>andrea.lops@poliba.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Serena Sassi</string-name>
          <email>serena.sassi@uniba.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Workshop</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Politecnico di Bari, Dipartimento di Ingegneria Elettrica e dell'Informazione</institution>
          ,
          <addr-line>Bari, 70125</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Terminology</institution>
          ,
          <addr-line>Natural Language Processing, Large Language Model, Domain Analysis, Domain Tree, Wind Power</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Università degli Studi di Bari Aldo Moro, Dipartimento di Ricerca e Innovazione Umanistica</institution>
          ,
          <addr-line>Bari, 70121</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This study introduces LUMEN (Latent Understanding through Modeling Embeddings in Natural language), a structured approach leveraging Large Language Models (LLMs) and semantic embeddings to construct a hierarchical thesaurus for the wind energy domain. Using a domain-specific corpus generated via Sketch Engine, LUMEN applies a three-step methodology: corpus creation, semantic label identification through LLM-based analysis, and hierarchical term classification via embedding-based semantic similarity calculations. We outline our machine learning configuration, including the embedding techniques and similarity metrics employed. Results indicate that LUMEN can capture nuanced subdomains and semantic interrelations within wind energy, despite occasional misclassification issues. Future research directions include systematic benchmarking against established ontology-building tools and multilingual adaptation to broaden applicability.</p>
      </abstract>
      <kwd-group>
        <kwd>Analysis</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Research</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>The advancement of Natural Language Processing (NLP) techniques, particularly through the
integration of Large Language Models (LLMs) and deep learning models, has opened new avenues for
exploring and structuring knowledge domains. These technologies have demonstrated remarkable
potential in understanding complex linguistic patterns, contextual relationships, and domain-specific
terminologies. In this study, we leverage these advancements to construct a domain tree for wind
energy, derived from a specialized corpus created using the Sketch Engine software. Our approach,
named LUMEN (Latent Understanding through Modeling Embeddings in Natural language), employs
LLMs and contextual embeddings to uncover latent subdomains and their interrelations, ofering a
comprehensive representation of the wind energy domain.</p>
      <p>
        The use of LLMs allows a semantic analysis of the corpus [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], enabling the identification of key labels
that define the domain’s hierarchical structure. These labels are organized in a structured JSON format,
providing a blueprint for subsequent classification tasks. Deep learning techniques, particularly those
involving embeddings, are then utilized to compute semantic similarity between labels and corpus
terms. This process facilitates the automatic distribution of terms in the domain tree, ensuring an
accurate and scalable classification across multiple levels of granularity.
      </p>
      <p>By applying these techniques, the latent structure of the wind energy domain is brought out,
delineating distinct subdomains and highlighting their interconnections. This approach not only facilitates
the construction of data-driven ontologies but also uncovers unexpected relationships within the field,
enriching the understanding of its multidisciplinary nature. This study also aims to demonstrate the
potential of these technologies, ofering a scalable and adaptable framework for domain analysis while
advancing ontological studies and fostering interdisciplinary collaboration in complex fields.
4th International Conference on “Multilingual digital terminology today. Design, representation formats and management systems”
https://sisinflab.poliba.it/people/andrea-lops (A. Lops); https://www.linkedin.com/in/serena-sassi (S. Sassi)</p>
      <p>CEUR</p>
      <p>ceur-ws.org</p>
    </sec>
    <sec id="sec-3">
      <title>2. Genesis of the Issue</title>
      <p>
        This research emerged from a series of theoretical and technical challenges encountered during some
investigations on the terminological field of wind energy. The initial obstacle stemmed from the need
to comprehend the status of the “domain” notion, a central concept to our study. The dificulty lies not
only in defining the boundaries of the wind energy domain but also in understanding how to position
this notion within a broader context characterized by rapidly advancing technologies [
        <xref ref-type="bibr" rid="ref2 ref3 ref4">2, 3, 4</xref>
        ]. The
wind energy sector, like all renewable energy sources, is inherently dynamic and ofers vast potential
for technological and conceptual advancements. For these reasons, its dynamic nature poses significant
challenges in defining clear boundaries. In particular, its fluidity and interdisciplinary nature further
complicate this task, as the domain encompasses a broad range of disciplines, practices, and discourses,
all contributing to a complex and ever-evolving terminological field.
      </p>
      <sec id="sec-3-1">
        <title>2.1. Rethinking the Boundaries of Environmental Domains</title>
        <p>
          The concept of “domain” occupies a central role across various academic fields, from linguistics and
philosophy to sociology and information sciences [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. Within the field of terminology, it serves as a
foundational principle used to organize and classify scientific and technical terms, thereby
structuring knowledge coherently and systematically. However, this traditional conception of the “domain”,
grounded in rigid classifications and boundaries, warrants reconsideration in light of contemporary
shifts in both knowledge production and linguistic practices. Although such classifications may have
once suficed in contexts characterized by less terminological proliferation, the increasing complexity of
science and technology, alongside the hybridization of specialized and general discourse, has rendered
this notion increasingly problematic and open to debate.
        </p>
        <p>
          In this context, it is essential to critically assess the relevance of the “domain” as it is presently
conceived, especially in fields like environmental studies, where technical terms frequently overcome
specialized spheres and infiltrate general language and public discourse. As Delavigne emphasizes, the
attempt to precisely delineate a specific domain often leads to significant challenges, particularly in
addressing complex subjects such as environmental issues [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. The environment, by its very nature, is a
multidimensional field that involves continuous interaction among various disciplines, relying on a vast
network of interconnections with technical, scientific, social, and cultural discourses [
          <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
          ]. As Myerson
and Rydin observe, “in academic terms, ‘environment’ belongs to every discipline and none” [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ].
        </p>
        <p>
          These interactions transcend the connections between various domains and levels of expertise. Even
highly specialized discussions on this matter inevitably draw on knowledge from a broad spectrum of
associated fields [
          <xref ref-type="bibr" rid="ref10 ref11 ref12">10, 11, 12</xref>
          ]. Delavigne highlights not only the vast and interconnected network of
disciplines, practices, discourses, and techniques that constitute environmental studies but also the
considerable disparities in qualitative equivalence across these components [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. Indeed, no domain,
subdomain, or sub-subdomain within the environmental field can be addressed without acknowledging
its intrinsic diversity and interdisciplinary nature. As Sager contends, “In practice no individual or
group of individuals possesses the whole structure of a community’s knowledge; conventionally, we
divide knowledge up into subject areas, or disciplines, which is equivalent to defining subspaces of the
knowledge space” [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ].
        </p>
        <p>
          In a field where the concept of “domain” remains fundamentally ambiguous and open to various
interpretations, several critical questions naturally arise. How can these expansive and complex bodies
of knowledge be segmented in a methodologically sound and pertinent manner? What methodologies
can be adopted to quantify and delineate these domains, which are often fluid and interrelated? How
can one adequately represent the meaning and scope of these difuse networks of knowledge, practices,
and scientific communities that collectively contribute to intellectual production? How can domain
trees be constructed in an era when the boundaries between various fields are increasingly “permeable,”
and their overlap is both evident and significant [
          <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
          ]?
        </p>
        <p>Given these theoretical and technical challenges, we decided to integrate terminology and NLP
to try a diferent approach to terminological analysis that can address the complexities inherent in
domains. By combining traditional terminological methods with NLP techniques, we sought to develop
a more systematic and scalable method to categorize and analyze terminology. This hybrid approach
would enable us to create terminological resources, facilitating a deeper understanding of the semantic
relationships within, in this case, the wind energy domain. NLP techniques, by processing large corpora
of text and identifying complex patterns in language, ofered the potential for potentially eficient
categorization of terms. This methodological shift allowed us not only to enhance the precision of
our terminological cataloging but also to streamline the overall process, making it more adaptable to
ongoing developments within the field.</p>
      </sec>
      <sec id="sec-3-2">
        <title>2.2. Enhancing Terminology through Ontologies</title>
        <p>
          Another key aspect of our study involves the relationship between terminology and ontologies.
Ontologies, as formal representations of knowledge within a particular domain, have long been recognized for
their ability to structure information in a way that reflects both semantic relationships and conceptual
hierarchies [
          <xref ref-type="bibr" rid="ref15 ref16 ref17">15, 16, 17</xref>
          ]. In the context of domain-specific terminological analysis, ontologies play a
critical role in organizing and classifying terms based on their interconnections and shared meanings.
The systematic classification of terms within a particular field has long been essential for
structuring knowledge in a coherent and accessible way [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. An ontology provides a formalized structure
that organizes terms into categories such as domains, subdomains, and concepts, highlighting their
relationships and interdependencies [
          <xref ref-type="bibr" rid="ref19 ref20">19, 20</xref>
          ].
        </p>
        <p>The intersection of NLP and ontological studies presents an exciting opportunity for advancing the
systematic representation of knowledge, especially in complex and evolving domains. The integration
of NLP techniques ofers, in fact, a diferent approach to this challenge. By leveraging LLMs and
embedding-based methods, we were able to uncover latent relationships between terms and construct
a dynamic, data-driven representation of a domain adaptive to future developments. By combining
terminological analysis with ontological principles, we created a structured framework to organize the
wind energy domain. This framework includes domain labels, subdomain classifications, and semantic
relationships, which are essential for the categorization of terms.</p>
        <p>By employing NLP techniques, it is also possible to uncover latent knowledge structures that would
be dificult to identify through manual analysis alone. In fact, in our study, LUMEN demonstrated its
ability to detect and categorize multiple domains beyond the strictly technical scope of wind energy.
Despite the field’s highly specialized nature, LUMEN identified many distinct macro-domains: the
environmental domain (encompassing terminology related to flora, fauna, etc.), the health domain,
the political and economic domain, the social and cultural domain, and the juridical domain, each
one with their subdomains and key terms. The integration of terminology studies with NLP-based
ontological frameworks ofers significant potential for advancing domain analysis, facilitating deeper
interdisciplinary collaboration, and a more comprehensive understanding of emerging fields. We
believe that this methodology can also be extended to other complex domains, allowing researchers
and practitioners to continuously refine and expand their understanding of rapidly changing fields.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>3. Our Approach</title>
      <p>This section outlines the methodology used for latent subdomain identification. Our approach,
illustrated in Figure 1, consists of three main stages: corpus creation using Sketch Engine, domain label
identification via LLM, and term classification based on semantic similarity.</p>
      <p>These stages contribute to the creation of a hierarchical domain tree that facilitates a comprehensive
understanding of the field.</p>
      <sec id="sec-4-1">
        <title>3.1. Step 1: Creation of an English Corpus</title>
        <p>
          The first step of our methodology involves the compilation of a domain-specific corpus [
          <xref ref-type="bibr" rid="ref21 ref22 ref23">21, 22, 23</xref>
          ],
leveraging Sketch Engine’s [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ] advanced text retrieval features. To create this corpus, we employed
Sketch Engine’s “Find texts on the web” functionality, a tool designed to automate the collection of textual
data from online sources based on a predefined set of keywords. This semi-automatic approach ensures
a streamlined and scalable process for assembling a corpus that accurately reflects the terminological
landscape of wind energy.
        </p>
        <p>Figure 2 presents the set of keywords used for corpus generation. The selection of these keywords
was guided by a principle of neutrality to minimize biases in data collection. Specifically, the keywords
chosen—“wind energy”, “wind power”, “onshore wind power”, “onshore wind energy”, “ofshore wind power”,
“ofshore wind energy”, “wind power plant”, and “wind energy power plant”—were primarily two-word
expressions. This decision was made to ensure broad coverage while avoiding over-reliance on highly
specialized terms that could introduce distortions in the corpus composition. Moreover, the selection
was informed by an efort to capture both general and specific terms relevant to the wind energy domain,
ensuring a balance between comprehensiveness and precision.</p>
        <p>As shown in Figure 3, following the keyword-based retrieval process, Sketch Engine compiled a
corpus comprising 287 documents, exclusively in English. This corpus spans a total of 1,586,074 words,
encompassing a diverse array of textual sources relevant to the wind energy sector, ensuring the
capture of a large spectrum of discourse surrounding the field. Through Sketch Engine’s automated
terminology extraction features, a subset of 100,000 terms was identified as domain-specific, serving as
the foundational dataset for subsequent classification and hierarchical organization.</p>
        <p>The creation of this corpus is a critical step in our methodology, as it provides a rich linguistic
dataset from which domain labels can be inferred, and semantic relationships among terms can be
analyzed. The extracted terms represent a comprehensive cross-section of the wind energy domain,
encompassing technical concepts, industry-specific terminology, and related expressions that contribute
to the identification of subdomains within the field.</p>
      </sec>
      <sec id="sec-4-2">
        <title>3.2. Step 2: Domain Label Identification via LLMs</title>
        <p>The second stage involves using a state-of-the-art LLM (specifically ‘ gpt-4o’1) to analyze the corpus
and identify a hierarchical labeling structure. This step translates unstructured textual data into a
structured hierarchical format.</p>
        <p>
          The procedure includes:
1. Pre-processing and Prompt Engineering: The text corpus is cleaned up and segmented into
manageable documents, and prompt engineering techniques [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ] are used to instruct the LLM to
extract representative domain and subdomain labels taking into account the entire distribution of
terms. Figure 4 shows the general structure of the prompt.
2. Semantic Analysis and Label Extraction: ‘gpt-4o’ generates potential labels by recognizing
semantic clusters within the corpus, outputting structured labels hierarchically categorized into
three levels (domain, subdomain, and sub-subdomain).
3. Hierarchical JSON Generation: The extracted labels are serialized into a hierarchical JSON
format, constituting a structured thesaurus ready for semantic classification.
        </p>
        <p>The hyperparameters used for ‘gpt-4o’ were: temperature set at 0 (favoring more deterministic
outputs), top-p sampling at 0.95 (ensuring diversity while maintaining relevance), and a token limit of
128,000 to manage processing constraints.</p>
      </sec>
      <sec id="sec-4-3">
        <title>3.3. Step 3: Term Classification Using Semantic Similarity</title>
        <p>The third step uses embedding-based semantic similarity for term classification in the hierarchical
structure established previously. The workflow involves:</p>
        <p>
          1. Embedding Generation: Terms and hierarchical paths (concatenated labels, e.g., “Energy &gt;
Renewable Sources”) are converted into vector embeddings using Sentence-BERT (SBERT) [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ],
chosen due to its efectiveness in capturing semantic nuances. Specifically, we employed the
SBERT model all-mpnet-base-v22 with a vector dimension of 768, ensuring balanced performance
between accuracy and computational eficiency.
2. Similarity Calculation: Cosine similarity [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ] between the embeddings of each term and each
hierarchical path is computed. This metric quantitatively measures the semantic closeness between
terms and labels. We also explored alternative distance metrics (e.g., Euclidean distance) during
an initial evaluation phase; however, cosine similarity yielded consistently higher alignment with
expert judgments for this domain-specific setting.
3. Threshold-based Classification: Terms exceeding an experimentally determined similarity
threshold (initially set at 0.75, optimized via grid search on a validation set) are automatically
classified under corresponding labels. Terms below this threshold are provisionally categorized
as “noise” and flagged for further human review. In addition to adjusting the threshold, we are
developing a mechanism for incremental reclassification, whereby a term initially labeled as
“noise” can be re-evaluated if the subsequent context or expert feedback indicates it is indeed
relevant to a specific subdomain.
        </p>
        <p>To ensure scientific rigor, a preliminary subset of classified terms was assessed following a
terminological evaluation of the domain, providing initial feedback that enabled the estimation of precision
and recall on a limited portion of the corpus (approximately 500 terms). In this context, precision refers
to the proportion of terms labeled as relevant by the relevant system (i.e., the percentage of correct
classifications), while recall measures the proportion of truly relevant terms that the system successfully
2https://huggingface.co/sentence-transformers/all-mpnet-base-v2</p>
        <p>Prompt Structure
You are an experienced linguist with a deep understanding of natural language processing. You will be
provided with a list of 100,000 lemmas. Your task is to return to me a JSON with all the labels of possible
domains and sub-domains that you can unearth in this list. Be detailed, return me a JSON with as many
domains and as many sub-domains as possible so as to include as many lemmas as possible. Take your
time to answer don’t be rushed, think it through. Always remember that I just want labels, don’t return
me lemmas. Follow this example:
1 {
2 "total_labels":[
3 {
4
5
6
"label": "label domain 1",
"sub-labels":[
"Label sub-domain 1":[ "Label sub-sub-domain 1", "Label sub-sub-domain
↪ 2", ...],
"Label sub-domain 2": ["Label sub-sub-domain 1", "Label sub-sub-domain</p>
        <p>2", ...],
"label": "Label domain 2",
"sub-labes": {[
"Label sub-domain 1": ["Label sub-sub-domain 1", "Label sub-sub-domain
↪ 2", ...],
"Label sub-domain 2": ["Label sub-sub-domain 1", "Label sub-sub-domain</p>
        <p>2", ...],
7
identified (i.e., how many relevant items are captured in total). The results indicated a precision of about
0.82 and a recall of 0.78. Recognizing the preliminary nature of these findings, subsequent evaluations
will expand the sample size, incorporate more diverse terms, and include comparative analysis with
established ontology framework extraction as well as input from additional experts.</p>
      </sec>
      <sec id="sec-4-4">
        <title>3.4. Output</title>
        <p>The final output of the LUMEN methodology is a hierarchical JSON structure enriched with terms
classified under each label node, ofering a comprehensive representation of the wind energy domain
and its subdomains. A selected extract from this structure is shown in Figure 5, this hierarchical
representation allows the continued expansion of the thesaurus as new terms and concepts emerge
within the domain.</p>
        <p>In our experimentation, LUMEN successfully identified nine primary subdomains within wind energy:
1 {
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29 }
"Environment" : [
"Environmental Impact": [
"On Fauna": [
"farm development",
"habitat loss",
"cumulative impact",
...
],
"On Flora": [
"impact on the marine environment",
"perspective on marine environmental impacts",
"public land",
...</p>
        <p>Environment, Energy, Geography, Economy, Technology, Society, Safety, Fauna, and Research. Each
subdomain contains one or more sub-labels, ensuring that users can navigate from broader concepts
(e.g., “Environmental Impact”) to increasingly granular topics (e.g., “On Fauna” or “On Flora”). By
employing the semantic similarity approach described in Section 3.3, the final JSON structure not only
organizes these terms eficiently but also highlights hidden thematic relationships and interdisciplinary
links inherent in the field.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>4. Conclusions and further research</title>
      <p>This study illustrates the implementation of advanced NLP techniques and the construction of a domain
tree for wind energy, based on a specialized corpus created with Sketch Engine. Through the application
of the LUMEN methodology, latent subdomains within the wind energy sector have been identified. The
ifndings highlight the potential of NLP methodologies to navigate the complexities of multidisciplinary
domains. By ofering a tool accessible to researchers, practitioners, and non-specialists, this study also
aims to provide a more dynamic understanding of the renewable energy sector through a diferent
methodology.</p>
      <p>Nevertheless, a new challenge emerged during the testing phase: a substantial portion of terms was
not accurately recognized by the software and was subsequently categorized as “noise.” While some of
these terms were indeed unrelated to the domain (e.g., typographical errors, corpus artifacts), others
were relevant but failed to meet our initial similarity threshold. Future iterations will incorporate a
more adaptive threshold, guided by expert feedback and additional contextual cues extracted from the
corpus.</p>
      <p>To address concerns about the validation of our classification accuracy, we plan to establish a formal
evaluation pipeline, including precision and recall metrics. Through platforms such as Google Forms
or specialized annotation tools, experts and stakeholders will assess classification outcomes on both
large-scale samples and targeted subdomains. The resulting quantitative evaluations will not only
provide transparency and reliability but also guide incremental refinements of our similarity metrics
and the threshold-based classification step.</p>
      <p>Finally, while this case study has primarily focused on English-language data, the methodology is
designed for scalability to multiple languages. Initial multilingual experiments will involve parallel corpora
in languages such as French, Spanish, or Italian, enabling semi-automated extraction of corresponding
term hierarchies. This extension is of particular interest for diferent stakeholders, where multilingual
resources are essential for policy-making, industry collaborations, and cross-border academic research.
By enhancing our approach to classification accuracy and expanding our linguistic scope, LUMEN
aims to serve as an adaptive platform for terminological analysis and ontology construction in rapidly
evolving technical domains.</p>
    </sec>
    <sec id="sec-6">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors used ChatGPT-4 for grammar and spelling checks.
The authors have subsequently reviewed and edited the content and take full responsibility for the
publication’s final version.
The sources of LUMEN are available via https://anonymous.4open.science/r/LUMEN,</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>D.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Su</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Fuoli</surname>
          </string-name>
          ,
          <article-title>Using LLM-assisted Annotation for Corpus Linguistics: A Case Study of Local Grammar Analysis</article-title>
          ,
          <source>CoRR abs/2305</source>
          .08339 (
          <year>2023</year>
          ). URL: https://doi.org/10.48550/arXiv. 2305.08339. doi:
          <volume>10</volume>
          .48550/ARXIV.2305.08339. arXiv:
          <volume>2305</volume>
          .
          <fpage>08339</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>G.</given-names>
            <surname>Bordet</surname>
          </string-name>
          ,
          <article-title>Brouillage des frontières, rencontres des domaines: quelles conséquences pour l'enseignement de la terminologie et de la traduction spécialisée</article-title>
          ,
          <source>ASp. la revue du GERAS</source>
          (
          <year>2013</year>
          )
          <fpage>95</fpage>
          -
          <lpage>115</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>B.</given-names>
            <surname>De Bessé</surname>
          </string-name>
          , Le domaine,
          <source>Le sens en terminologie</source>
          (
          <year>2000</year>
          )
          <fpage>182</fpage>
          -
          <lpage>197</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>N.</given-names>
            <surname>Lechevrel</surname>
          </string-name>
          , L'écolinguistique
          <article-title>: une discipline émergente</article-title>
          ,
          <source>RELQ/QSJL 3</source>
          (
          <year>2008</year>
          )
          <fpage>16</fpage>
          -
          <lpage>38</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Rey</surname>
          </string-name>
          ,
          <article-title>La terminologie: noms et notions, Que sais-je: Le point des connaissances actuelles</article-title>
          ,
          <source>PUF</source>
          ,
          <year>1979</year>
          . URL: https://books.google.it/books?id=7jJKAAAAYAAJ.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>V.</given-names>
            <surname>Delavigne</surname>
          </string-name>
          , La notion de domaine en question-à propos de l'environnement,
          <year>Neologica 2022</year>
          (
          <year>2022</year>
          )
          <fpage>27</fpage>
          -
          <lpage>59</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Hamon</surname>
          </string-name>
          , P. Paissa (Eds.),
          <article-title>Discours environnementaux</article-title>
          , volume
          <volume>20</volume>
          of Lingue d'
          <source>Europa e del Mediterraneo</source>
          ,
          <year>2023</year>
          . URL: https://www.aracneeditrice.eu/anteprime/9791221807769.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>C.</given-names>
            <surname>Grimaldi</surname>
          </string-name>
          ,
          <article-title>Le sfide linguistiche del cambiamento climatico</article-title>
          ,
          <source>AIDAinformazioni</source>
          (
          <year>2021</year>
          )
          <fpage>213</fpage>
          -
          <lpage>216</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>G.</given-names>
            <surname>Myerson</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y. Rydin,</surname>
          </string-name>
          <article-title>The language of environment: A new rhetoric</article-title>
          ,
          <source>Routledge</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>D.</given-names>
            <surname>Pascaline</surname>
          </string-name>
          , Étude en corpus de l'
          <article-title>implantation de quelques emprunts à l'anglais et de leurs concurrents oficiels, dans le domaine de l'environnement, Entre discours, langues et cultures: regards croisés sur le climat, l'environnement, l'énergie et l'écologie (</article-title>
          <year>2017</year>
          )
          <fpage>61</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>J.</given-names>
            <surname>Altmanova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Cartier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Luzzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. N.</given-names>
            <surname>Pinto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Piscopo</surname>
          </string-name>
          , et al.,
          <article-title>Innovations lexicales dans le domaine de l'environnement et de la biodiversité. le cas de bio en français et en italien</article-title>
          ,
          <source>Neologica</source>
          <volume>16</volume>
          (
          <year>2022</year>
          )
          <fpage>85</fpage>
          -
          <lpage>110</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>D.</given-names>
            <surname>Candel</surname>
          </string-name>
          ,
          <article-title>La présentation par domaines des emplois scientifiques et techniques dans quelques dictionnaires de langue, Langue française (</article-title>
          <year>1979</year>
          )
          <fpage>100</fpage>
          -
          <lpage>115</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>V.</given-names>
            <surname>Delavigne</surname>
          </string-name>
          ,
          <article-title>Le domaine aujourd'hui</article-title>
          .
          <source>Une notion à repenser</source>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>J. C.</given-names>
            <surname>Sager</surname>
          </string-name>
          , A Practical Course in Terminology Processing, John Benjamins, Amsterdam/Philadelphia,
          <year>1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>C.</given-names>
            <surname>Roche</surname>
          </string-name>
          , Terminologie et ontologie,
          <source>Langages</source>
          (
          <year>2005</year>
          )
          <fpage>48</fpage>
          -
          <lpage>62</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>C.</given-names>
            <surname>Roche</surname>
          </string-name>
          ,
          <article-title>Ontological definition</article-title>
          , in: Handbook of terminology, John Benjamins Publishing Company,
          <year>2015</year>
          , pp.
          <fpage>128</fpage>
          -
          <lpage>152</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>N.</given-names>
            <surname>Guarino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Oberle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Staab</surname>
          </string-name>
          ,
          <article-title>What is an ontology?</article-title>
          , in: S. Staab, R. Studer (Eds.), Handbook on Ontologies,
          <source>International Handbooks on Information Systems</source>
          , Springer,
          <year>2009</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>17</lpage>
          . URL: https://doi.org/10.1007/978-3-
          <fpage>540</fpage>
          -92673-
          <issue>3</issue>
          _0. doi:
          <volume>10</volume>
          .1007/978- 3-
          <fpage>540</fpage>
          - 92673- 3\_0.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>I.</given-names>
            <surname>Durán-Muñoz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. R.</given-names>
            <surname>Bautista-Zambrana</surname>
          </string-name>
          ,
          <article-title>Applying ontologies to terminology: Advantages and disadvantages, Hermes-Journal of Language and Communication in Business (</article-title>
          <year>2013</year>
          )
          <fpage>65</fpage>
          -
          <lpage>77</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>C.</given-names>
            <surname>Roche</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Calberg-Challot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Damas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rouard</surname>
          </string-name>
          ,
          <article-title>Ontoterminology - A new paradigm for terminology</article-title>
          , in: J. L. G. Dietz (Ed.),
          <source>KEOD 2009 - Proceedings of the International Conference on Knowledge Engineering and Ontology Development</source>
          , Funchal - Madeira, Portugal, October 6-
          <issue>8</issue>
          ,
          <year>2009</year>
          , INSTICC Press,
          <year>2009</year>
          , pp.
          <fpage>321</fpage>
          -
          <lpage>326</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>R.</given-names>
            <surname>Temmerman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Kerremans</surname>
          </string-name>
          , Termontography:
          <article-title>Ontology building and the sociocognitive approach to terminology description</article-title>
          ,
          <source>Proceedings of CIL17 7</source>
          (
          <year>2003</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <article-title>ISO 1087-1, Terminology work-Vocabulary-Part 1: Theory and application</article-title>
          , Standard, International Organization for Standardization, Geneva,
          <string-name>
            <surname>CH</surname>
          </string-name>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <source>[22] ISO 704</source>
          ,
          <article-title>Terminology work-Principles and methods</article-title>
          , Standard, International Organization for Standardization, Geneva,
          <string-name>
            <surname>CH</surname>
          </string-name>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <source>[23] ISO 1087</source>
          ,
          <article-title>Terminology work and terminology science-</article-title>
          <string-name>
            <surname>Vocabulary</surname>
            , Standard, International Organization for Standardization, Geneva,
            <given-names>CH</given-names>
          </string-name>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kilgarrif</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Baisa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bušta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Jakubíček</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Kovář</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Michelfeit</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rychlý</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Suchomel</surname>
          </string-name>
          ,
          <source>The Sketch Engine: ten years on, Lexicography</source>
          <volume>1</volume>
          (
          <year>2014</year>
          )
          <fpage>7</fpage>
          -
          <lpage>36</lpage>
          . URL: https://doi.org/10.1007/ s40607-014-0009-9. doi:
          <volume>10</volume>
          .1007/s40607- 014- 0009- 9.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>P.</given-names>
            <surname>Sahoo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. K.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Saha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Jain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mondal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Chadha</surname>
          </string-name>
          ,
          <article-title>A systematic survey of prompt engineering in large language models: Techniques and applications</article-title>
          ,
          <source>CoRR abs/2402</source>
          .07927 (
          <year>2024</year>
          ). URL: https://doi.org/10.48550/arXiv.2402.07927. doi:
          <volume>10</volume>
          .48550/ARXIV.2402. 07927. arXiv:
          <volume>2402</volume>
          .
          <fpage>07927</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>N.</given-names>
            <surname>Reimers</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Gurevych</surname>
          </string-name>
          ,
          <article-title>Sentence-bert: Sentence embeddings using siamese bert-networks, in: K. Inui</article-title>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Ng</surname>
          </string-name>
          ,
          <string-name>
            <surname>X.</surname>
          </string-name>
          Wan (Eds.),
          <source>Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP</source>
          <year>2019</year>
          ,
          <string-name>
            <given-names>Hong</given-names>
            <surname>Kong</surname>
          </string-name>
          , China, November 3-
          <issue>7</issue>
          ,
          <year>2019</year>
          , Association for Computational Linguistics,
          <year>2019</year>
          , pp.
          <fpage>3980</fpage>
          -
          <lpage>3990</lpage>
          . URL: https://doi.org/10.18653/v1/
          <fpage>D19</fpage>
          -1410. doi:
          <volume>10</volume>
          .18653/V1/D19- 1410.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>A.</given-names>
            <surname>Singhal</surname>
          </string-name>
          ,
          <article-title>Modern information retrieval: A brief overview</article-title>
          ,
          <source>IEEE Data Eng. Bull</source>
          .
          <volume>24</volume>
          (
          <year>2001</year>
          )
          <fpage>35</fpage>
          -
          <lpage>43</lpage>
          . URL: http://sites.computer.org/debull/A01DEC-CD.
          <year>pdf</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>