<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>M. L. Coutinho);</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Ontology-Driven Conceptual Modeling</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Matheus L. Coutinho</string-name>
          <email>matheuslenke@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>João Paulo A. Almeida</string-name>
          <email>jpalmeida@ieee.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giancarlo Guizzardi</string-name>
          <email>g.guizzardi@utwente.nl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Ontology-Driven Conceptual Modeling, Large Language Models, Textual Modeling</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Ontology &amp; Conceptual Modeling Research Group (NEMO), Federal University of Espírito Santo (UFES)</institution>
          ,
          <country country="BR">Brazil</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>SCME</institution>
          ,
          <addr-line>Doctoral Consortium, Tutorials</addr-line>
          ,
          <institution>Project Exhibitions</institution>
          ,
          <addr-line>Posters and Demos</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Semantics, Cybersecurity &amp; Services (SCS), University of Twente</institution>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>Large Language Models (LLMs) have already shown potentially significant capabilities in assisting users with writing and coding tasks. In this paper, we explore how LLM-based assistance can be leveraged in a modeling environment for textual Ontology-Driven Conceptual Modeling. We integrate the UFO-based textual language 'Tonto' with an LLM-powered assistant. We employ detailed UFO-based 'guidance' texts which are included by the modeling environment automatically in the context of user prompts along with the current ontology coding artifacts. The tool can take actions such as creating files, changing code, invoking Tonto syntax verification, while still maintaining the modeler in the loop. Our initial exploration shows that a number of modeling tasks can potentially be automated (such as suggesting new elements, summarizing the model, checking consistency of usage of UFO concepts, model fixing, etc.). The tool is proposed as a testbed for empirical user studies.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        For more than two decades, the Unified Foundational Ontology (UFO) [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ] has been used in a number
of tasks in Ontology-Driven Conceptual Modeling. A key result of this research agenda has been the
OntoUML profile, which brings to UML class diagrams important ontological distinctions based on
UFO. OntoUML employs UFO’s concepts and axioms to enable the creation of high-quality reference
ontologies and is supported by a suite of advanced tools [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] for model editing, verification, simulation,
transformation, database schema generation, and anti-pattern detection.
      </p>
      <p>
        Despite the benefits of using UML class diagrams as a basis, which allows OntoUML to leverage
UML’s well-known visual notation, there are also drawbacks associated with graphical modeling. These
include the manual efort of diagram layout, scalability issues with large diagrams, and the inability to
use mature text-based tools for tasks like version control, comparison, merging, and auto-completion.
Because of this, we have been exploring the use of textual notations for UFO-based conceptual modeling,
which has led to the development of the Tonto (Textual Ontologies) [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] language and tooling.
      </p>
      <p>
        Tonto inherits its core concepts from UFO in line with OntoUML. The language is accompanied by a
fully-featured VS Code extension that performs syntax highlighting, auto-completion and real-time
syntactic verification against UFO rules. The Tonto extension is also integrated into the OntoUML
server [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], and, therefore, can benefit from full interoperability with the OntoUML ecosystem.
      </p>
      <p>
        Tonto’s textual format facilitates its integration with tools based on Large Language Models (LLMs) [
        <xref ref-type="bibr" rid="ref5 ref6">5,
6</xref>
        ]. We explore this synergy to provide advanced modeling support for UFO-based ontologies. We
combine Tonto’s rigorous constraints, aimed at ensuring ontological robustness with the flexibility of
LLM-based assistance. The use of UFO concepts in Tonto allow us to provide rich context in prompts,
which guide automated task execution. We leverage the fact that LLMs incorporate significant empirical
linguistic knowledge acquired during their massive pre-training phase.
      </p>
      <p>CEUR
Workshop</p>
      <p>ISSN1613-0073</p>
      <p>Our ultimate goal is to understand what kinds of ontology-driven conceptual modeling tasks can
be supported with pre-trained LLMs and (zero-shot or few-shot) prompting strategies. A common
barrier in assessing this sort of support empirically is the lack of usable tools for experimentation.
That is the object of this paper. We combine the existing Tonto extension with the AI Cursor tool
(https://cursor.com). Cursor integrates an extensible text-based coding environment with a variety
of LLMs in a seamless way. It allows us to define ‘guidance’ texts which are provided to the LLMs in
tandem with the current coding artifacts. These texts provide instructions to perform specific tasks;
the tool (in its ‘agent mode’) can take actions such as creating files, changing code, etc., while still
maintaining the modeler in the loop. (Similar functionality has recently been implemented in VS Code
natively, and hence, can also be explored with the approach discussed in this paper.)</p>
      <p>We have defined a number of detailed UFO-based ‘guidance’ texts which are used as part of the
context of prompts for the LLMs in the following tasks: (i) enhancing an existing ontology by creating
new elements and relations (while adhering to UFO constraints); (ii) checking the consistency of the
ontology terminology; (iii) checking the usage of UFO concepts; (iv) understanding a package or the
entire ontology in a summarized explanation in natural language; (v) creating documentation for
existing elements in an ontology, labels and descriptions; (vi) adding multi-lingual terminology and
descriptions. Our initial exploration with the tool shows that it is capable of automating these tasks for
Tonto models. We plan to employ the tool as a testbed for empirical user studies.</p>
      <p>This paper is further structured as follows: Section 2 describes the solution we employ, briefly
positioning Tonto and its editor in an overall architecture. Section 3 presents one of the examples used
for demonstration, and Section 4 provides some conclusions and outlines plans for future work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Solution</title>
      <p>
        Tonto The Tonto language was designed to be both human-readable and expressive [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Its syntax is
intentionally familiar to users of mainstream object-oriented languages. Tonto introduces keywords
to reflect the taxonomy of types in UFO. For example, there are keywords for kind, subkind, phase,
role, category, etc., reflecting the metaproperties of rigidity and sortality as well as other ontological
distinctions. There are also keywords to enable the reification of relations and aspects ( relator, mode,
quality), event types (event) and to provide specialized semantics for associations (object participation
in events, aspect inherence, relator mediation, whole-part relations, etc.) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        Architectural Overview Figure 1 illustrates the basic workflow of Cursor’s ‘agent mode’ as used in
this project in integration with the Tonto extension. An agent is an artificial entity that perceives its
environment, makes decisions, and takes actions based on those perceptions in order to achieve specific
goals. Cursor implements an Agent that is able to execute a series of activities in the Code Workspace,
and it leverages an underlying Large Language Model (LLM) to operate within the code workspace,
where it can execute a series of tasks such as file access, folder navigation, and semantic search to
achieve specific modeling goals. The process begins when the user makes a request in the Cursor agent
chat. In response, Cursor prepares the initial input for the selected LLM. This input includes the main
system prompt, any custom rules defined by the user, and the actual content of any referenced Tonto
ifles. The agent then begins a step-by-step loop to complete the request. It invokes various actions such
as reading from files, writing new code, or running Tonto syntax verification. The result of each action
is used to decide the next steps. To ensure transparency, the user can see this entire process—including
the agent’s reasoning and tool usage—in the chat window. This interactive loop continues until the
agent concludes the task is finished or a technical limit is reached (such as the maximum LLM token
count). At the end of each cycle, the agent presents its proposed changes for user validation. The user
retains oversight by accepting or rejecting these modifications, thereby ensuring the model evolves in
alignment with their preferences. (Not unlike Cursor, VS Code also leverages an ‘agent mode’, with a
similar architecture.)
UFO-Based Task Guidance Rules are provided as markdown files to help the LLM understand and
work with Tonto. The following guidance files were defined (and are publicly available in the project
repository1): (i) Tonto Guidance: This is the core instruction set. It provides a concise orientation
for the LLM (establishing ’chains of thoughts’), covering the fundamentals of the Tonto language, its
project structure, and its key capabilities. (ii) LLM Guidance: This document acts as a ‘router’. It gives
the LLM a general workflow and tells it how to use the other more specialized guidance files to handle
specific user requests. (iii) Enhancing Guidance: This guide is for building and extending ontologies.
It instructs the LLM on how to add new classes, attributes, relations, and generalization sets while
maintaining the model’s ontological consistency with UFO’s axioms. (iv) Terminology Guidance:
This document helps the LLM act as a vocabulary expert. It guides the analysis of the names used for
classes, attributes, and relations, allowing the LLM to provide feedback for improving the ontology’s
clarity and precision. (v) Understanding Guidance: This guide helps the LLM summarize a model in
natural language. It is used to generate descriptions of a single package or an entire project to help users
quickly understand its contents. UFO notions are used, e.g., directing the LLM to prioritize ultimate
sortals (kinds) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. (vi) Documentation Guidance: This document provides guidelines for generating
high-quality documentation for elements in a Tonto ontology. It pushes the LLM to describe an element
and also to explain the underlying ontological choices based on UFO principles.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Demonstration</title>
      <p>In this section, we illustrate the use of the UFO-based guidance files. Given an existing ontology about
the university domain, as shown in Figure 2, we have prompted the LLM agent as follows: “Your task is
to help me check the terminology of my ontology and fix semantic errors. Get the errors from @Linter Errors.
Take a look at all packages.” We ran this test under two conditions: with our custom guidance texts, and
without them, in order to establish a comparative baseline. The initial model uses the unspecific class
keyword (as shown in the screenshot shaded in a light red background). Further, it lacks documentation
in the form of labels and descriptions. The complete original and automatically modified specifications
are available on the aforementioned project repository.</p>
      <p>Without guidance, the agent struggled to produce a valid model, and it did not use the best practices
from Tonto (this baseline output is also available in the project repository). The output contained
significant errors, such as: incorrect use of Tonto’s grammar and keywords; failure to apply appropriate
1https://github.com/matheuslenke/tonto-llm
UFO concepts, syntactically incorrect specialization relationships, and labels and descriptions as code
comments instead of using the proper ‘label’ and ‘description’ syntax.</p>
      <p>In stark contrast, when the same prompt was provided in the environment that was customized
with the Tonto guidance files, the agent generated a significantly more robust and ontologically sound
model. It correctly leveraged a wide range of UFO concepts (that correspond to Tonto keywords) and
successfully implemented complex structures, such as the powertype pattern, demonstrating clear
adherence to the well-founded modeling guidelines. In the screenshot shown in Figure 2, the generic
class keyword was properly replaced by the collective keyword; the generic aggregation relation
was properly replaced by the subCollectionOf relation. Space constraints prevent us from describing
other examples here, covering the various supported tasks. Instructions for download of the guidance
ifles and configuration of Cursor for use of Tonto are available in the project repository along with
example prompts for other tasks.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusions</title>
      <p>
        There is a plethora of research addressing the use LLMs for diferent aspects of ontology engineering
and conceptual modeling, as evidenced by a recent survey [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and a discussion paper [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. For example,
LLMs have been employed to to classify domain entities using top-level categories [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], generate SWRL
rules [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], support ontology development from datasets [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], etc. Here, our focus is on LLM assistance
for tasks in ontology capture and formalization [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. We employ production-ready tools and leverage
foundational ontology distinctions both in guidance texts and in the language grammar verification.
      </p>
      <p>This work is currently a proof of concept. While the initial results are promising, the LLM integration
is a prototype that requires further development and rigorous evaluation to measure its efectiveness.
Our immediate future work will focus on conducting controlled experiments to rigorously assess the
impact of the LLM-powered assistant. This study will be designed to measure not only the efect of
using an LLM but also the specific value added by our UFO-based guidance files. Participants will be
asked to perform ontology modeling tasks in Tonto under three diferent conditions: (i) with no AI
support, (ii) with a baseline LLM model, and (iii) with the guided assistant. We’ll measure success
across three key areas: eficiency, by measuring task completion time; model quality, by evaluating
the number of ontological errors and anti-pattern occurrences; and, finally, user satisfaction, measured
using standard usability surveys to gauge the user experience. We also intend to assess whether possible
benefits persist in projects of diferent levels of complexity and involving users with diferent profiles.</p>
      <p>Finally, we plan to explore more sophisticated prompt engineering techniques and expand the range
of supported tasks. A long-term goal is to leverage LLMs to create natural language interfaces for
querying and manipulating ontologies, making them more accessible to a wider audience of domain
experts who may not be familiar with formal modeling languages. One challenge we anticipate is
domain sensitivity. In particular, we expect the agent to struggle with niche domains, which are
underrepresented in pre-training. Although we have observed good initial results in this exploratory
phase with the models Claude 3.7 Sonnet, Gemini 2.5 Pro, and OpenAI o3, an indepth study should
assess their particular merits in these tasks.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This research is funded in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior –
Brasil (CAPES) – Finance Code 001, FAPES (1022/2022) and CNPq (443130/2023-0, 313412/2023-5).</p>
    </sec>
    <sec id="sec-6">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors have employed Gemini 2.5 Pro in refining the guidance
texts as well as in grammar and spelling check for the manuscript. The authors reviewed and edited the
content as needed and take full responsibility for the publication’s content.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>G.</given-names>
            <surname>Guizzardi</surname>
          </string-name>
          ,
          <article-title>Ontological foundations for structural conceptual models, Centre for Telematics and Information Technology</article-title>
          , Enschede, The Netherlands,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>G.</given-names>
            <surname>Guizzardi</surname>
          </string-name>
          , et al.,
          <source>UFO: Unified Foundational Ontology, Applied Ontology</source>
          <volume>17</volume>
          (
          <year>2022</year>
          )
          <fpage>167</fpage>
          -
          <lpage>210</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C. M.</given-names>
            <surname>Fonseca</surname>
          </string-name>
          , et al.,
          <article-title>Ontology-driven conceptual modeling as a service</article-title>
          ,
          <source>in: Proc. Joint Ontology Workshops</source>
          <year>2021</year>
          , CEUR Workshop Proceedings,
          <year>2021</year>
          . URL: http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2969</volume>
          /.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M. L.</given-names>
            <surname>Coutinho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P. A.</given-names>
            <surname>Almeida</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. P.</given-names>
            <surname>Sales</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Guizzardi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A Textual</given-names>
            <surname>Syntax</surname>
          </string-name>
          and
          <article-title>Toolset for Well-Founded Ontologies</article-title>
          ,
          <source>in: Frontiers in Artificial Intelligence and Applications</source>
          , IOS Press,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>C.</given-names>
            <surname>Rafel</surname>
          </string-name>
          , et al.,
          <article-title>Exploring the Limits of Transfer Learning with a Unified Text-to-</article-title>
          <string-name>
            <surname>Text</surname>
            <given-names>Transformer</given-names>
          </string-name>
          ,
          <year>2023</year>
          . URL: http://arxiv.org/abs/
          <year>1910</year>
          .10683.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>H.</given-names>
            <surname>Touvron</surname>
          </string-name>
          , et al.,
          <source>LLaMA: Open and Eficient Foundation Language Models</source>
          ,
          <year>2023</year>
          . URL: http: //arxiv.org/abs/2302.13971, arXiv:
          <fpage>2302</fpage>
          .13971 [cs].
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>D.</given-names>
            <surname>Garijo</surname>
          </string-name>
          , et al.,
          <article-title>LLMs for Ontology Engineering: A landscape of Tasks and Benchmarking challenges</article-title>
          ,
          <source>in: ISWC 2024 Special Session on Harmonising Generative AI and Semantic Web Technologies</source>
          ,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>V.C.</given-names>
            <surname>Storey</surname>
          </string-name>
          et al.,
          <article-title>Large language models for conceptual modeling: Assessment and application potential</article-title>
          ,
          <source>Data &amp; Knowledge Engineering</source>
          <volume>160</volume>
          (
          <year>2025</year>
          )
          <article-title>102480</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.datak.
          <year>2025</year>
          .
          <volume>102480</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Lopes</surname>
          </string-name>
          , et al.,
          <article-title>How to classify domain entities into top-level ontology concepts using large language models: A study across multiple labels, resources, and languages</article-title>
          , Applied Ontology (
          <year>2024</year>
          )
          <fpage>1</fpage>
          -
          <lpage>29</lpage>
          . doi:
          <volume>10</volume>
          .3233/AO- 240032.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>A.</given-names>
            <surname>Soularidis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Kotis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lamolle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Mejdoul</surname>
          </string-name>
          , G. Lortal,
          <string-name>
            <surname>G.</surname>
          </string-name>
          <article-title>Vouros, LLM-Assisted Generation of SWRL Rules from Natural Language</article-title>
          , in: 2024 International Conference on
          <article-title>AI x Data and Knowledge Engineering (AIxDKE)</article-title>
          , IEEE, Tokyo, Japan,
          <year>2024</year>
          , pp.
          <fpage>7</fpage>
          -
          <lpage>12</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M.</given-names>
            <surname>Val-Calvo</surname>
          </string-name>
          , et al.,
          <article-title>Ontogenix: Leveraging large language models for enhanced ontology engineering from datasets</article-title>
          ,
          <source>Information Processing &amp; Management</source>
          <volume>62</volume>
          (
          <year>2025</year>
          )
          <article-title>104042</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.ipm.
          <year>2024</year>
          .
          <volume>104042</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>R.</given-names>
            <surname>Falbo</surname>
          </string-name>
          ,
          <article-title>SABiO: Systematic approach for building ontologies</article-title>
          ,
          <source>in: Proc. 1st Joint Workshop ONTO.COM / ODISE on Ontologies in Conceptual Modeling and Information Systems Engineering</source>
          , volume
          <volume>1301</volume>
          <source>of CEUR Workshop Proceedings</source>
          ,
          <year>2014</year>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>1301</volume>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>