<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Ontology: Supporting Clinical Studies through Semantic Representation of the Speech Pathology Domain</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Lia Draetta</string-name>
          <email>lia.draetta@unito.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nicola Barbaro</string-name>
          <email>nicola.barbaro@unito.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Francesco Petriglia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andrea Meirone</string-name>
          <email>andrea.meirone@centropaideia.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cristina Gena</string-name>
          <email>cristina.gena@unito.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Domain Ontology, FAIR Principles, Speech Therapy, Healthcare</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Fondazione Paideia</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Turin, Italy, Department Of Computer Science</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <abstract>
        <p>In recent years, Semantic Web technologies have gained increasing attention for their ability to enhance data integration, reusability, and interoperability across various domains, including healthcare. In line with the FAIR (Findability, Accessibility, Interoperability, and Reusability) principles, ontologies provide structured frameworks for representing domain-specific knowledge across diferent resources and applications, facilitating clinical research, and improving decision-making processes, but their potential remains underexplored in the domain of speech therapy. Existing digital tools for speech therapy often lack interoperability and full alignment with FAIR principles. To bridge this gap, we propose the Talkidz Ontology, which is designed to support the development of applications that assist speech therapists in detecting and treating speech disorders. The Ontology was developed through a participatory approach involving speech therapists, ontology engineers, and clinical researchers. Its structure was validated through a survey of domain experts assessing the relevance of key classes and competency questions. Additionally, we demonstrate a real-world clinical application, showcasing the ontology's ability to facilitate epidemiological studies, and support evidence-based decision-making. By providing a standardized, expert-validated resource, the Talkidz Ontology aims to enhance the eficiency and efectiveness of speech therapy practices. It serves as a crucial step toward improving knowledge sharing and data-driven research in the field of language disorders.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        In recent years, the Semantic Web and its technologies have gained increasing attention,
particularly for their impact on various domains such as e-learning, social media, healthcare, and
digital humanities [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. In healthcare, semantic technologies play a crucial role in ensuring data
uniformity, reusability, and shareability. Following the FAIR principles [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], they provide a shared
and unambiguous semantics for the representation of data and metadata, bridging fragmented
data sources. Additionally, they facilitate epidemiological and clinical research by uncovering
patterns in medical reports [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ], supporting monitoring systems [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], and enabling ontology-based
decision-making [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        Despite their success in various medical fields, the application of semantic technologies in speech
therapy domain remains underexplored [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ]. Developing an ontology-based tool for sharing
therapy data and analyzing the prevalence of speech disorders in relation to demographic factors
can provide valuable epidemiological insights and assist therapists in selecting optimal treatment
approaches. This is particularly relevant given the significant prevalence of developmental
language disorders, for instance, in Italy, 7.01% of children are diagnosed with Specific language
BY 4.0).
      </p>
      <p>CEUR
Workshop</p>
      <p>
        ISSN1613-0073
disorders [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. In addition, childhood speech disorders can severely impact education and social
development, leading to learning dificulties [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], behavioral and psychiatric issues [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], and
challenges in emotional and social adaptation [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Early identification of at-risk children is
essential for timely interventions that maximize developmental outcomes [
        <xref ref-type="bibr" rid="ref13 ref14">13, 14</xref>
        ].
      </p>
      <p>
        In recent years, it has become increasingly evident that data and knowledge sharing are
powerful tools for advancing research across all fields, contributing to more data-driven and
empirically supported clinical studies and ontologies provide a structured and unambiguous
framework for representing large volumes of domain-specific data. In speech therapy, they have
been applied mainly for clinical data storage and visualization [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], to support early detection
of speech pathologies [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], [17], and as base for educational tools in assisting speech therapists
[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. However, few ontological resources have been rigorously evaluated by experts or fully comply
with FAIR principles while ensuring alignment with others existing resources.
      </p>
      <p>To address these gaps, we propose an ontology1 designed to integrate and analyze data
from diverse medical clinics. The Talkidz Ontology aims to support epidemiological studies,
facilitate clinical research, and enhance diagnostic and therapeutic decision-making for speech
therapists. To do so, the ontology represent speech disorders, therapies, and relationships between
patients, therapists, and conditions while incorporating demographic information. Developed
and validated with input from speech therapists and domain experts, the Ontology has been
assessed through a survey to validate competency questions and ontology classes. In this paper
we describe the ontology and we demonstrate a clinical application of our resource, showcasing its
potential for integrating heterogeneous data sources and assisting speech therapists in obtaining
a comprehensive and standardized view to support evidence-based decision-making.</p>
      <p>The present work is organized as follows: in Section 2, we review the most relevant studies that
have addressed speech disorders using ontologies, highlighting both their practical applications
and the ontology engineering processes, and underscoring whether they comply with the FAIR
principles. In Section 3, we present the multidisciplinary research initiative to which this ontology
belongs. In Sections 4 and 5, we provide an in-depth description of our methodological approach
and the validation process of the ontology’s main classes and competency questions, followed by
the description of the full ontology structure. Finally, we validate the presented work with a
case study, and in Section 6, we discuss its limitations and outline future work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Background and Motivations</title>
      <p>The field of Speech Therapy has been extensively studied for many years by speech therapists and
language experts. It encompasses the prevention, assessment, and treatment of communication
impairments, as well as the scientific study of speech, spoken and written language, and related
disorders. Speech Language Therapy plays a crucial role in addressing dificulties associated
with impairments of sound production by developing efective treatment programs to improve
spoken communication and prevent future literacy challenges [18]. In the following sections,
we first review key works that leverage ontologies to develop applications supporting speech
therapists, highlighting the main addressed tasks and objectives. In subsequent paragraphs,
we focus the review on the ontology engineering process, emphasizing the goals and validation
methods of existing resources while underscoring the diferences and novelties of our resource
compared to others.</p>
      <sec id="sec-2-1">
        <title>2.1. Ontology-based Services in Speech Therapy Domain</title>
        <p>In the healthcare domain in general, several attempts have been made to represent the field
by leveraging semantic representation, aiming to establish a standard for the classification of
diseases and pathologies. For example, the Human Disease Ontology (DO) [19] is a resource
1https://github.com/liadraetta/talkidz-ontology
that plays a central role in classifying rare, common, and complex human diseases, as well as
linking them to genetic information. Although it is a comprehensive and standardized resource
for representing diseases in a general way, it lacks specialization within specific fields.</p>
        <p>In recent years semantic web technologies have been increasingly applied to the field of speech
pathology for various purposes, including data sharing, support for Speech Therapy and speech
therapists in the diagnostic process. Notably, the use of ontologies in guiding Speech Therapy
design has been steadily increasing and is expected to serve as a valuable support for developing
eficient, solution-based therapies. As highlighted by Usip and colleagues [ 20], well-founded
ontologies play a crucial role in accelerating scientific research; the researchers highlighted that
their structured nature ensures efective discovery, automation, integration, and reuse across
diverse application platforms, providing a comprehensive framework for representing relevant
concepts and their relationships.</p>
        <p>
          Exploiting the potential of ontologies, Robles-Bykbaev and colleagues [21] developed a system
to provide several services related with information querying, reports generation, inference of
intervention strategies, with the aim of helping speech therapist in the detection process. To
build and validate the ontology they used clinical information about 152 real patients cases. In
another study Robles-Bykbaev et al. [22] developed Speech Therapy knowledge-based model
tools to support speech and language pathologists, doctors, students, patients and their relatives.
Martín-Ruiz and colleagues [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] developed an ontology-based smart solution to support the
detection of language disorders among young children during pediatricians routine visits. They
built the ontology with the help of pediatricians and speech therapists creating a representation
that helps pediatrician in asking the right clinical questions according to the age of the child at
the time of evaluation. The competency questions (CQs) were built engaging four experts in
neuropediatrics, neonatology and language disorders and the tool finally was validated against
21 cases obtained from a center oriented to the detection and treatment of language disorders.
        </p>
        <p>In the same line, the Martín Ruiz and colleagues [17] leveraged an ontology to develop and
evaluate a web-based Clinical Decision Support System. This web-based system relies on an
ontology and can be used by pediatricians with the aim of detecting communication disorders
in children. The ontology contains information related with acquisition and development of
language in children as well as a rule set to generate alerts when language and communication
disorders are detected. Ontologies were also exploited with the aim of building decision support
system for planning therapy sessions. To do this, Robles-Bykbaev and colleagues [23] built an
ontology-based expert system with the aim of suggesting activities that can be part of the therapy
plan. The resource was developed with classes and relations that describe the speech-language
therapy elements, aiming at answering questions such as “Determine with which speech-language
categories must be carried out the therapy for a given patient.”</p>
        <p>
          Ontology-based applications have also been used to develop tools to assist future speech
therapists during their training. In [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] researchers developed an ontology to model patients’
anamnesis and speech-language milestones (exercises, skills), representing clinical profiles that
include personal data, medical records, and speech-language records. The goal was to create
a system capable of automatically generating tests and educational exercises to enhance skill
development for designing efective therapy plans.
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Speech Disorders Ontologies</title>
        <p>As discussed in the previous section, various ontological resources have been developed in recent
years aiming at representing the speech therapy domain. However, some of these resources fail
to fully meet the principles of Findability, Accessibility, Interoperability, and Reusability (FAIR).
In other cases, a comprehensive validation process of the ontology structure and competency
questions is lacking.</p>
        <p>
          Robles-Bykbaev and colleagues developed [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] an ontological resource to represent the domain
of speech-language therapy with the aim of assisting speech therapists in the detection process.
The resource is built in collaboration with specialists but is not accessible and, therefore, not
reusable. Additionally, no alignments with other resources are provided, nor are case studies
presented for the resource’s evaluation. The authors state that the resource was validated by
collecting 130 real cases of children with speech disorders; however, no further details on the
evaluation results are given, and no competency questions are provided. On the same line García
et al. [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] aiming at developing an ontology as a tool to support therapists in detection and
suggest potential treatment, collected a corpus by crawling web data using specific keywords
and their semantic relations (synonyms, hyponyms, and hyperonyms), ultimately obtaining
1, 097 documents. The authors then constructed a taxonomy based on the collected documents,
focusing on the representation of speech disorder, therapy strategies, individuals, and signs and
symptoms. From this taxonomy, they engineered the ontology by identifying key classes using
both top-down and bottom-up strategies [24]. However, the resulting resource is not publicly
available, and no evaluation of the ontology or application case studies have been reported.
Chuchuca-Mendez and colleagues [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] propose an ontology designed to model key aspects of speech
therapy, with the goal of automatically generating tests for future therapists. The ontology
includes elements such as patient anamnesis, profiles, and speech-language clinical history. The
researchers further populated the ontology by collecting 383 samples of rehabilitation activities
used in therapy. However, no competency questions, evaluation process, or access to the resource
are provided.
        </p>
        <p>The review of existing works clearly highlights persistent gaps in the representation of speech
disorders using ontologies. Firstly, many studies fail to adopt standardized classifications, such
as the International Classification of Diseases (ICD-11) 2. More concerning, however, is that none
of the analyzed resources are openly shared or freely accessible. It appears that these ontologies
are primarily designed to meet specific project goals rather than to adhere to FAIR principles,
which would ensure their reusability by other researchers and institutions. Table 1 summarizes
the key characteristics of the reviewed works. From this brief analysis, it becomes clear that
several gaps remain to be addressed.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. The Talkidz Project</title>
      <p>The ontology presented in this work is part of a larger project that aims to automatically
transcribe children’s pathological language. The recent advancements in Natural Language
Processing (NLP) and Machine Learning (ML) have significantly impacted various sectors,
including the clinical domain. NLP techniques have been applied to the study of both typical
language development [25], [26] and atypical language patterns [27], [28]. These technological
advancements have also facilitated the development of NLP and ML based tools to support
clinical practice [29]. Despite the promising results of these applications, there is still a lack
of models specifically designed for the automatic analysis of pathological language in the
Italian language. Currently, the evaluation of expressive language disorders is often conducted
using paper tests, with no digital support available to help clinicians reduce scoring time
and result analysis. In recent years, the use of software for language analysis has gained
traction, as these tools can significantly decrease correction time, making the process faster,
automated, and more accurate [30]. Starting from these premises, the Talkidz Project3 aims to
automate and accelerate the analysis of language assessment tests by providing an automatic
transcription of pathological language in the International Phonetic Alphabet (IPA) using Deep
Learning Techniques. Additionally, it preserves errors and automatically generates phonetic and
phonological, lexical and morphosyntactical analyses of the processed language. In the context
of the explained Project, the ontology presented in this work aims to support and enhance the
tool ofered to speech therapist, providing a software capable of processing and automatically
analyzing patients’ atypical speech. Additionally, it includes an ontology that integrates data
from various medical studies, aiming to ofer statistical insights and a comprehensive overview
of speech and language disorders and their distribution.</p>
    </sec>
    <sec id="sec-4">
      <title>4. The Talkidz Ontology: Design</title>
      <p>The Talkidz Ontology was developed by a multidisciplinary team of experts across various fields,
including speech therapists, occupational therapists, ontology engineers, machine learning experts,
and human-machine interaction experts. This diverse expertise facilitated a comprehensive
development process, ensuring that the ontology addressed multiple perspectives, from real
clinical needs to ontology alignment, as well as case studies and practical applications. In the
following section, we detail the ontology’s development and validation process, including the
design methodology and the formulation of competency questions.</p>
      <sec id="sec-4-1">
        <title>4.1. Design Methodology</title>
        <p>When developing an ontology in the healthcare domain, a fundamental challenge is deciding
whether to model the main classes “from scratch” or to adhere to international standards. The
3The project “Talkidz: Software for Language Analysis”, funded by Fondazione CRT (Ordinary Grants Call 2023,
RF = 106465 / 2023.1740)
choice between standardized and customized classes remains an open issue in healthcare ontology
development [31], as both approaches ofer distinct advantages and disadvantages. Spoladore
and colleagues [31] found that, in the development of disability-related ontologies, the majority
of reviewed works (26 out of 43) adopted a “from-scratch” approach. This choice was primarily
driven by the need to better address the empirical requirements of the modeled resources. Such
ontologies rarely aim to holistically represent an entire domain, instead focusing on a limited set
of classes, allowing them to narrow down a complex domains into a more well-defined scope.
In addiction the authors highlighted that the reuse of existing ontologies is not hindered by a
“from scratch” approach, indeed it is documented in more than 50% of the articles analyzed.
Aware of this and of the benefits that participatory design can bring to the development of
computational resources [32], we structured our design methodology into multiple steps (Figure
1). First, we discussed the ontology design with two domain experts from a local foundation that
deals with children speech pathology and occupational therapy. Our methodology then consists
in conducting a survey to validate the proposed classes taxonomy. Specifically, we presented
a taxonomy of speech disorders based on ICD-11 and speech therapy classifications based on
possible clinical needs. We then asked experts to assess its acceptability, and if they found
it unsuitable, they were invited to explain why and suggest alternatives so as to expand and
enrich our model. By building on a standardized framework while incorporating expert input,
we tailored our design onto the needs of those who will use the resource.</p>
        <p>The survey was then filled by a total of 21 speech therapist aged between 24 and 66 years old,
85,5% were female and 14,5% male. Regarding the validation of classes we asked experts to
validate the categorization of language disease and therapies (“Regarding the categorization
of language disorders proposed, do you think the presented classification reflects the one most
commonly used in clinical practice?”, “Regarding the categorization of treatment types, do you
think the presented classification reflects the one most commonly used in clinical practice?”). As
for the first question, 73.3% responded positively, while 26.7% answered negatively and suggested
modifications. Regarding the second question, concerning the categorization of treatment, all
respondents confirmed the acceptability of the proposed taxonomy for clinical uses. Given that
the majority of experts we consulted validated it, we decided to retain both classifications as
originally proposed, setting the goal of further analyzing the discrepancy emerged from the
answers to the first question as future work.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Competency Questions</title>
        <p>When developing an ontology from scratch, one of the first challenges is defining its requirements
[33]. In this context, competency questions (CQs) serve to clarify the ontology’s purpose, scope,
and target users. Following an participatory approach, we engaged experts and key stakeholders
to validate a preliminary list of potential uses and applications for the ontology. Trough the
survey we also ask experts to evaluate each presented CQ on a scale from 0 to 4, assessing its
relevance, utility, and potential applications. From the survey, we obtained homogeneous results:
8 out of 9 of the CQs we presented were scored higher than 3. In Table 2 2 we present all the
competency questions with the obtained scores.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Ontology Description</title>
      <p>As specified below, the main aim of this resource is twofold. On one hand, we aim to build a
semantically enriched resource that addresses the real needs of stakeholders, and on the other,
to aimed for compliance with FAIR principles. For this reason, when developing classes and
relations, we incorporate both standard classifications (such as speech and language disorders
from ICD-11) and a taxonomy more tailored to clinical needs. In the following sections, we will
provide a detailed description of the resource’s structure, review the main classes, present a
validation strategy, and discuss a case study.</p>
      <sec id="sec-5-1">
        <title>5.1. Core Model</title>
        <p>The Talkidz Ontology model is structured to provide a clear and comprehensive resource
that supports extensibility and practical application in clinical settings. The Talkidz
Ontology includes six top-level disjoint classes, namely Person, LinguisticArea, SpeechError,
SpeechOrLanguageDisorder, SpeechTherapy, and Severity.</p>
        <p>The classification of persons within the top class Person (Patient, Therapist, and
AdultInCharge with its further specializations), and the relations over these classes, including
caring (inChargeOf) and treatment (isBeingTreatedBy) allow for indexing the responsibilities
and interactions between every part involved in the speech therapy. This class can be aligned
with the Agent class in foundational ontologies such as Dolce [34].</p>
        <p>The top class LinguisticArea, modeled as an enumeration of individuals representing the
various linguistic levels afected by disorders, allows mapping the range of speech language
disorders (SpeechOrLanguageDisorder) onto the specific language areas they afect through the
relation afects.</p>
        <p>The class SpeechOrLanguageDisorder, representing the language disorders, is aligned with the
Language Disorder class of the Disease Ontology [19], which provides a standardized reference
framework for multiscale biomedical data integration and analysis across thousands of clinical,
biomedical, and computational research studies.</p>
        <p>The Severity top class (enumeration class), standardizes disorder classification according
to qualitatively diferent degrees and aids in treatment prioritization. It is linked trough the
relation hasSeverity to the class (SpeechOrLanguage Disorder), thus supporting queries aimed at
understanding the incidence of specific speech disorders in a subset of (filtered and anonymized)
patients (answering a need identified in a CQ, see Table 2).</p>
        <p>SpeechError subclasses (a type of syndrome in Disease Ontology) capture distinct categories
of articulation and cognitive speech issues, and through the property makesErrors (linking each
speech disorder with a specific Patient), facilitates targeted searches for therapeutic purposes.</p>
        <p>With the support of domain experts, we designed the class SpeechTherapy (a Plan in Dolce)
and its subclasses trough a “from-scratch” approach, aiming at creating a taxonomy that could
represent the real clinical needs. The pivotal class is linked to the Therapist through the
property appliesTherapy, to the LanguageDisorder trough the property isTreatedWithTherapy
and to the Patient through the property isFollowedBy. We decided not to introduce specific Role
classes. However, after discussing this issue with the therapists, we chose to handle separately
the cases in which a single individual may be both a patient and a therapist.</p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Ontology Validation</title>
        <p>To ensure that our ontology can efectively support research in child Speech Therapy, and
due to the fact that synthetic datasets in the last years have been used in computer science
domain with diferent applications and promising results [ 35], we created for testing purposes a
synthetic dataset designed to mimic real-world clinical data. The generated data were modeled
for validation purposes based on real data stored in the databases of a speech therapy clinic,
with the aim of providing a data structure that is as realistic as possible while addressing
real needs. This dataset consists of anonymized, artificially generated patient records, with
distributions of speech disorders modeled after incidence rates reported in the literature. To
enhance realism, speech therapists have validated these distributions ensuring they align with
clinical expectations. By testing our SPARQL queries and hypotheses against this dataset, we
assessed the ontology’s ability to retrieve relevant aggregated information and evaluated diferent
therapeutic approaches, all while maintaining full data privacy.</p>
      </sec>
      <sec id="sec-5-3">
        <title>5.3. Use Case: Speech Therapists in Clinical Decision-Making</title>
        <p>To demonstrate the applicability, we present a use case illustrating its role in assisting therapists
with clinical recommendations. This scenario will showcase how the ontology integrates data
to provide a structured view of speech disorders, therapies and patient progress. A speech
therapist from a speech therapy clinic is evaluating a 5-year old child (Patient) who has been
accepted due to concerns about speech development. The child’s caregiver (AdultInCharge) is
recorded, and evidences the child inability of pronouncing such and such words. The therapist
aims to diagnose the condition, determine the severity, and design an intervention plan based
on evidence-based practices. Figure 3 shows the modeling of the following case study through
ontology classes and properties. The structured process followed by the therapist is described as
follows:
1. Assessment And Detection:
2. Therapy Selection:
• The patient undergoes the standardized language assessment test, identifying errors
to record in any subclass of Speech Error.
• The speech therapist then maps the error to the LinguisticArea afected by the
disorder.
• Based on test results, the patient is diagnosed wit a Developmental LanguageDisorder,
a subclass of SpeechOrLanguageDisorder.
• The Severity is classified for each disorder and mapped to the patient’s description.
• The speech therapists suggest possible treatment options and links, through the
AimedToResolve property, the speech disorder to an appropriate SpeechTherapy type
(e.g.: the CognitiveLinguisticTherapy is chosen, which is usually aligned with
treating DevelopmentalLanguage Disorder.</p>
        <p>• The system records that the Therapist appliesTherapy to the Patient.
3. Monitoring and Adaptation:
• Over time, therapy sessions and patient progress are recorded, updating values like the
therapy start and end date data properties (therapyStartDate and therapyEndDate)
or the severity (hasSeverity).
• If new occurrences of SpeechError emerge or the progress is slower than expected,
alternative therapy strategies can be recommended based on the alternatives listed in
the Ontology. However, the adaptation is not currently modeled in the ontology as it
requires an apparatus for dealing with temporal aspects.</p>
        <p>This use case highlights how our Ontology supports speech therapists during the treatment
process, providing comprehensive information for clinical assessments and interventions.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusion and Future Work</title>
      <p>In this work, we presented a semantic resource designed to represent the domain of language
disorders with the dual goal of assisting speech pathologists in the detection phase and providing
large-scale epidemiological insights. We reviewed previous eforts in the semantic representation
of language disorders and highlighted a critical gap: most existing resources lack compliance
with FAIR principles and fail to undergo a comprehensive validation process. To address this
shortcoming, we developed the presented Ontology as part of a larger project aimed at supporting
speech therapists in clinical practice.</p>
      <p>Our approach embraces a community-based methodology, engaging experts throughout
diferent phases, from defining the ontology’s taxonomy to designing the case study. The result is
a semantically enriched resource that not only aligns with standardized frameworks but also
responds to real-world needs, ofering a valuable tool for specialists in the field. As a limitation
we are aware that the pilot validation survey we proposed is not representative and we plan to
further investigate our taxonomy and competency questions on a larger scale, involving more
experts from diferent countries and considering the various approaches that could be taken into
account during language therapy. We also acknowledge that a semantic resource of this kind
cannot claim to be fully representative or universally applicable across all cases. For this reason,
we emphasize the importance of a collaborative approach to minimize the risk of developing
a resource that, while perfectly aligned with international standards and taxonomies, could
ultimately fail to address real-world needs.</p>
      <p>In this line, we strongly support multidisciplinary collaboration and believe that researchers
from diferent fields must continue to work together to develop a realistic and practical resource
for speech therapy. As future work, following a review phase, we plan to update the ontology by
adapting its classes and relations to the needs that will emerge from field use. This alignment
should necessarily involve the integration in the ontology of the capability to express temporal
aspects, needed for the monitoring of therapies. Additionally, aligning our Ontology with other
resources in the healthcare domain, such as those related to cognitive or motor disorders, would
be an interesting direction, aiming to provide an even more comprehensive perspective.</p>
    </sec>
    <sec id="sec-7">
      <title>7. Limitations</title>
      <p>This contribution presents a preliminary phase of a broader project that aims to support speech
therapists in daily work and in patient monitoring. However, this paper has some limitations that
we plan to address in future work. First, regarding alignment with the FAIR principles, while
the ontology is available online, compliance with these principles is not fully detailed. As future
work, we intend to apply and document the FAIR principles more thoroughly. Secondly, we
acknowledge that the current case study may be limited, as the syntactic dataset and SPARQL
queries are not publicly available. We plan to conduct and describe in detail a second case
study with real usage examples to strengthen our validation. As future work, we plan to enrich
this contribution by presenting additional case studies that demonstrate the potential of the
proposed resource.</p>
    </sec>
    <sec id="sec-8">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors used ChatGPT in order to: Grammar and
spelling check, Paraphrase and reword. After using this tool/service, the authors reviewed and
edited the content as needed and takes full responsibility for the publication’s content.
[17] M. L. Martín Ruiz, M. A. Valero Duboy, C. Torcal Loriente, I. Pau de la Cruz, Evaluating
a web-based clinical decision support system for language disorders screening in a nursery
school, Journal of medical Internet research 16 (2014) e139.
[18] S. Crosbie, A. Holm, B. Dodd, Intervention for children with severe speech disorder:
a comparison of two approaches, International Journal of Language &amp; Communication
Disorders 40 (2005) 467–491.
[19] L. M. Schriml, J. B. Munro, M. Schor, D. Olley, C. McCracken, V. Felix, J. A. Baron,
R. Jackson, S. M. Bello, C. Bearer, et al., The human disease ontology 2022 update, Nucleic
acids research 50 (2022) D1255–D1261.
[20] P. U. Usip, M. E. Ekpenyong, Towards ontology-driven application for multilingual speech
language therapy, Human Language Technologies for Under-Resourced African Languages:
Design, Challenges, and Prospects (2018) 85–101.
[21] V. E. Robles-Bykbaev, W. Guamán-Murillo, D. Quisi-Peralta, M. López-Nores, J. J.
PazosArias, J. García-Duque, An ontology-based expert system to generate therapy plans for
children with disabilities and communication disorders, in: 2016 ieee ecuador technical
chapters meeting (etcm), IEEE, 2016, pp. 1–6.
[22] V. Robles-Bykbaev, M. López-Nores, J. Pazos-Arias, D. Quisi-Peralta, J. García-Duque, An
ecosystem of intelligent ict tools for speech-language therapy based on a formal knowledge
model, in: MEDINFO 2015: eHealth-enabled Health, IOS Press, 2015, pp. 50–54.
[23] V. Robles-Bykbaev, M. Guamán-Heredia, Y. Robles-Bykbaev, J. Lojano-Redrován,
F. Pesántez-Avilés, D. Quisi-Peralta, M. López-Nores, J. Pazos-Arias, Onto-speltra: A
robotic assistant based on ontologies and agglomerative clustering to support
speechlanguage therapy for children with disabilities, in: Advances in Computing: 12th Colombian
Conference, CCC 2017, Cali, Colombia, September 19-22, 2017, Proceedings 12, Springer,
2017, pp. 343–357.
[24] M. C. Suárez-Figueroa, A. Gómez-Pérez, E. Motta, A. Gangemi, Introduction: Ontology
engineering in a networked world, Springer, 2012.
[25] B. MacWhinney, Understanding spoken language through talkbank, Behavior research
methods 51 (2019) 1919–1927.
[26] K. Sagae, A. Lavie, B. MacWhinney, Automatic measurement of syntactic development
in child language, in: Proceedings of the 43rd Annual Meeting of the Association for
Computational Linguistics (ACL’05), 2005, pp. 197–204.
[27] J. Boorse, M. Cola, S. Plate, L. Yankowitz, J. Pandey, R. T. Schultz, J. Parish-Morris,
Linguistic markers of autism in girls: evidence of a “blended phenotype” during storytelling,
Molecular autism 10 (2019) 1–12.
[28] I. Chojnicka, A. Wawer, Social language in autism spectrum disorder: A computational
analysis of sentiment and linguistic abstraction, PLoS One 15 (2020) e0229985.
[29] L.-Y. Guo, S. Eisenberg, N. B. Ratner, B. MacWhinney, Is putting sugar (sampling
utterances of grammatical analysis revised) into language sample analysis a good thing? a
response to pavelko and owens (2017), Language, Speech, and Hearing Services in Schools
49 (2018) 622–627.
[30] M. J. Pezold, C. M. Imgrund, H. L. Storkel, Using computer programs for language sample
analysis, Language, Speech, and Hearing Services in Schools 51 (2020) 103–114.
[31] D. Spoladore, M. Sacco, A. Trombetta, A review of domain ontologies for disability
representation, Expert Systems with Applications 228 (2023) 120467.
[32] F. Delgado, S. Yang, M. Madaio, Q. Yang, The participatory turn in ai design: Theoretical
foundations and the current state of practice, in: Proceedings of the 3rd ACM Conference
on Equity and Access in Algorithms, Mechanisms, and Optimization, 2023, pp. 1–23.
[33] M. C. Suárez-Figueroa, A. Gómez-Pérez, M. Fernández-López, The neon methodology for
ontology engineering, in: Ontology engineering in a networked world, Springer, 2011, pp.
9–34.
[34] S. Borgo, R. Ferrario, A. Gangemi, N. Guarino, C. Masolo, D. Porello, E. M. Sanfilippo,
L. Vieu, Dolce: A descriptive ontology for linguistic and cognitive engineering, Applied
ontology 17 (2022) 45–69.
[35] A. Bauer, S. Trapp, M. Stenger, R. Leppich, S. Kounev, M. Leznik, K. Chard, I.
Foster, Comprehensive exploration of synthetic data generation: A survey, arXiv preprint
arXiv:2401.02524 (2024).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S. K.</given-names>
            <surname>Narayanasamy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Srinivasan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.-C.</given-names>
            <surname>Hu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. K.</given-names>
            <surname>Masilamani</surname>
          </string-name>
          , K.-
          <string-name>
            <given-names>Y.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <article-title>A contemporary review on utilizing semantic web technologies in healthcare, virtual communities, and ontology-based information processing systems</article-title>
          ,
          <source>Electronics</source>
          <volume>11</volume>
          (
          <year>2022</year>
          )
          <fpage>453</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M. D.</given-names>
            <surname>Wilkinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumontier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. J.</given-names>
            <surname>Aalbersberg</surname>
          </string-name>
          , G. Appleton,
          <string-name>
            <given-names>M.</given-names>
            <surname>Axton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Baak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Blomberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-W.</given-names>
            <surname>Boiten</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. B. da Silva</given-names>
            <surname>Santos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. E.</given-names>
            <surname>Bourne</surname>
          </string-name>
          , et al.,
          <article-title>The fair guiding principles for scientific data management and stewardship</article-title>
          ,
          <source>Scientific data 3</source>
          (
          <year>2016</year>
          )
          <fpage>1</fpage>
          -
          <lpage>9</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A. B.</given-names>
            <surname>Haque</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Arifuzzaman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. A. N.</given-names>
            <surname>Siddik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kalam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. S.</given-names>
            <surname>Shahjahan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Saleena</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Alam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. R.</given-names>
            <surname>Islam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Ahmmed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Hossain</surname>
          </string-name>
          ,
          <article-title>Semantic web in healthcare: a systematic literature review of application, research gap</article-title>
          , and future research avenues,
          <source>International Journal of Clinical Practice</source>
          <year>2022</year>
          (
          <year>2022</year>
          )
          <fpage>6807484</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>F.</given-names>
            <surname>Okikiola</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ikotun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Adelokun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Ishola</surname>
          </string-name>
          ,
          <article-title>A systematic review of health care ontology</article-title>
          ,
          <source>Asian J Res Comput Sci</source>
          <volume>5</volume>
          (
          <year>2020</year>
          )
          <fpage>15</fpage>
          -
          <lpage>28</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>H. B.</given-names>
            <surname>Elhadj</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Sallabi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Henaien</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chaari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Shuaib</surname>
          </string-name>
          , M. Al Thawadi,
          <article-title>Do-care: A dynamic ontology reasoning based healthcare monitoring system</article-title>
          ,
          <source>Future Generation Computer Systems</source>
          <volume>118</volume>
          (
          <year>2021</year>
          )
          <fpage>417</fpage>
          -
          <lpage>431</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>D.</given-names>
            <surname>Spoladore</surname>
          </string-name>
          , E. Pessot,
          <article-title>Collaborative ontology engineering methodologies for the development of decision support systems: Case studies in the healthcare domain</article-title>
          ,
          <source>Electronics</source>
          <volume>10</volume>
          (
          <year>2021</year>
          )
          <fpage>1060</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>F.</given-names>
            <surname>Chuchuca-Méndez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Robles-Bykbaev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Vanegas-Peralta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lucero-Saldaña</surname>
          </string-name>
          , M. LópezNores, J.
          <string-name>
            <surname>Pazos-Arias</surname>
          </string-name>
          ,
          <article-title>An educative environment based on ontologies and e-learning for training on design of speech-language therapy plans for children with disabilities and communication disorders</article-title>
          ,
          <source>in: IEEE CACIDI 2016-IEEE Conference on Computer Sciences, IEEE</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>V.</given-names>
            <surname>Robles-Bykbaev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Pazos-Arias</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>López-Nores</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>García-Duque</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. OchoaZambrano</surname>
          </string-name>
          ,
          <article-title>Modelling domain knowledge of speech and language therapy with an owl ontology and openehr archetypes</article-title>
          , in: International Conference on Health Informatics, volume
          <volume>2</volume>
          , SCITEPRESS,
          <year>2015</year>
          , pp.
          <fpage>585</fpage>
          -
          <lpage>591</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>P.</given-names>
            <surname>Rinaldi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bello</surname>
          </string-name>
          , I. Simonelli,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Caselli</surname>
          </string-name>
          ,
          <article-title>Is specific learning disorder predicted by developmental language disorder? evidence from a follow-up study on italian children</article-title>
          ,
          <source>Brain Sciences</source>
          <volume>13</volume>
          (
          <year>2023</year>
          )
          <fpage>701</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>G.</given-names>
            <surname>Conti-Ramsden</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>St Clair</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Pickles</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Durkin</surname>
          </string-name>
          ,
          <article-title>Developmental trajectories of verbal and nonverbal skills in individuals with a history of specific language impairment: From childhood to adolescence (</article-title>
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>G.</given-names>
            <surname>Conti-Ramsden</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. L.</given-names>
            <surname>Mok</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Pickles</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Durkin</surname>
          </string-name>
          ,
          <article-title>Adolescents with a history of specific language impairment (sli): Strengths and dificulties in social, emotional and behavioral functioning</article-title>
          ,
          <source>Research in developmental disabilities 34</source>
          (
          <year>2013</year>
          )
          <fpage>4161</fpage>
          -
          <lpage>4169</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>M. C. St Clair</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Pickles</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Durkin</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Conti-Ramsden</surname>
          </string-name>
          ,
          <article-title>A longitudinal study of behavioral, emotional and social dificulties in individuals with a history of specific language impairment (sli)</article-title>
          ,
          <source>Journal of communication disorders 44</source>
          (
          <year>2011</year>
          )
          <fpage>186</fpage>
          -
          <lpage>199</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>A.</given-names>
            <surname>Sansavini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. E.</given-names>
            <surname>Favilla</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Guasti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Marini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Millepiedi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. V.</given-names>
            <surname>Di Martino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Vecchi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Battajon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bertolo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Capirci</surname>
          </string-name>
          , et al.,
          <article-title>Developmental language disorder: Early predictors, age for the diagnosis, and diagnostic tools. a scoping review</article-title>
          ,
          <source>Brain sciences 11</source>
          (
          <year>2021</year>
          )
          <fpage>654</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>T. J.</given-names>
            <surname>Spaulding</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Plante</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. A.</given-names>
            <surname>Farinella</surname>
          </string-name>
          ,
          <article-title>Eligibility criteria for language impairment</article-title>
          , Language, Speech, and Hearing Services in Schools 37 (
          <year>2006</year>
          )
          <fpage>61</fpage>
          -
          <lpage>72</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>M. J. S.</given-names>
            <surname>García</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. V.</given-names>
            <surname>González</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. H. P.</given-names>
            <surname>Torres</surname>
          </string-name>
          ,
          <string-name>
            <surname>C. P. de Celis Herrero</surname>
          </string-name>
          ,
          <article-title>A diagnostic tool for speech disorders based on nlp with ontological reasoning</article-title>
          .,
          <source>Res. Comput. Sci</source>
          .
          <volume>147</volume>
          (
          <year>2018</year>
          )
          <fpage>75</fpage>
          -
          <lpage>86</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>M. L.</given-names>
            <surname>Martín-Ruiz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. Á. V.</given-names>
            <surname>Duboy</surname>
          </string-name>
          , I. P. de la Cruz,
          <article-title>Deployment and validation of a smart system for screening of language disorders in primary care</article-title>
          ,
          <source>Sensors</source>
          <volume>13</volume>
          (
          <year>2013</year>
          )
          <fpage>7522</fpage>
          -
          <lpage>7545</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>