<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>OQuaRE-KG: an OQuaRE inspired framework for knowledge graph quality assessment</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Belén Juanes-Cortés</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Juan Mulero-Hernández</string-name>
          <email>belen.juanesc@um.es</email>
          <email>juan.mulero@um.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jesualdo Tomás Fernández-Breis</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Instituto Murciano de Investigación Biosanitaria Pascual Parrilla</institution>
          ,
          <addr-line>CP 30120, Murcia</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Universidad de Murcia</institution>
          ,
          <addr-line>CEIR Mare Nostrum, CP 30100, Murcia</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2026</year>
      </pub-date>
      <abstract>
        <p>Knowledge graphs play a central role in semantic data integration, knowledge-based applications, and artificial intelligence systems, due to their ability to represent entities, relations, and constraints in a structured and interoperable manner. As knowledge graphs increase in size, complexity, and heterogeneity, ensuring their quality has become a critical challenge for guaranteeing reliability, reuse, and efective reasoning. Existing approaches to knowledge graph quality assurance are largely fragmented, addressing isolated aspects such as schema validation, reasoning, data quality, or ontology evaluation, without ofering an integrated and comprehensive assessment framework. In this work, we introduce OQuaRE-KG, a unified quality evaluation framework inspired by ontology quality models and enriched with established data quality principles. OQuaRE-KG integrates concepts from standards and methodologies in data quality assessment, linked data evaluation, and ontology quality evaluation into a coherent model specifically tailored to the dual nature of knowledge graphs, encompassing both schema-level and instance-level characteristics. We present the conceptual design of the framework, including its quality dimensions and assessment structure, and demonstrate its applicability through an evaluation of several biological Knowledge Graphs. The results illustrate OQuaRE-KG's ability to systematically characterize and diferentiate knowledge graphs quality profiles, highlighting its potential as a foundational step toward standardized, reproducible knowledge graph quality evaluation.</p>
      </abstract>
      <kwd-group>
        <kwd>Knowledge graphs</kwd>
        <kwd>Quality evaluation</kwd>
        <kwd>Quantitative metrics</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Knowledge Graphs (KGs) have become a central piece in semantic data integration, knowledge-based
applications and artificial intelligence systems. Its capacity to represent entities, relations and constraints
makes them suitable for reasoning applications, interoperability or advance analysis [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. However,
as graphs are growing in complexity and heterogeneity, guaranteeing their quality became the main
challenge to ensure their usefulness and reliability.
      </p>
      <p>
        Quality assurance in KGs has traditionally been addressed in a fragmented manner. Existing
approaches and tools typically focus on specific aspects, such as schema validation languages (SHACL,
ShEx) [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ] and ontological reasoners (HermiT) [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. However, current frameworks remain limited
to isolated perspectives and do not provide an integrated process for a comprehensive assessment of
a graph’s overall quality. The work done in related fields such as data engineering, linked data and
ontology quality evaluation should be considered in benefit of KG quality evaluation.
      </p>
      <p>
        The ISO/IEC 25012 standard [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] distinguishes between two categories of data quality: (i) Inherent
Data Quality, which includes characteristics such as completeness and consistency, and (ii)
SystemDependent Data Quality, which includes characteristics such as availability and understandability. This
distinction is useful for classifying diferent types of data quality requirements. In parallel, the FAIR
principles have become a cornerstone for data management and sharing in scientific and industrial
LGOBE
(J. T. Fernández-Breis)
(J. T. Fernández-Breis)
https://webs.um.es/jfernand (J. T. Fernández-Breis)
      </p>
      <p>CEUR
Workshop</p>
      <p>
        ISSN1613-0073
contexts [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. While the focus of FAIR is on enabling data discoverability and reuse, its implementation
in KGs is closely tied to quality characteristics such as accessibility, interoperability, and provenance.
      </p>
      <p>
        From the ontology quality evaluation perspective, OQuaRE [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] provides a quality evaluation model
based on the SQuaRE standard. Its strength lies in providing a systematic and reproducible structure;
nevertheless, its scope is confined to ontological artefacts, without considering instances typical from
KGs [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>Our hypothesis is that KG evaluation can benefit from the application of a systematic and reproducible
framework, and that it makes sense adapting the OQuaRE principles to KGs given the semantic nature
of both ontologies and KGs. Therefore, in this work, we introduce OQuaRE-KG, an OQuaRE-inspired
framework that integrates principles from ontology quality and data quality assessment into a coherent
model tailored for KGs. This article introduces the design of the framework and describes the application
to a set of biological KGs to illustrate its capability to detect diferences between KGs.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>The quality evaluation in KGs has been addressed from multiple perspectives in the literature, and this
section contextualize the approach of OQuaRE-KG regarding previous works.</p>
      <p>
        The framework proposed by Zaveri et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] is considered to be one of the most comprehensive
taxonomies, with 18 quality dimensions and 69 metrics. This framework categorizes dimensions such as
completeness, availability, accessibility, consistency and provenance. This taxonomy has been essential
for comprehending the complexity of evaluating data distributed on the web [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] a methodology for assessing Linked Datasets is proposed. It introduces Luzzu, which provides
an extensible framework for the automated calculation of Linked Data metrics. However, it should be
noted that Luzzu ofers only an execution environment and does not include a conceptual model.
      </p>
      <p>
        Additionally, the empirical study conducted by Debattista et al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] evaluated 130 datasets from
the LOD Cloud, using 27 metrics. This study builds on the work undertaken in the survey of Zavery
et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. The quality metadata for each assessed dataset is published as Linked Data enabling data
consumers to search and filter datasets based on diferent quality criteria.
      </p>
      <p>
        Farber et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] proposed a framework to determine the most suitable KG for applications based on
data quality criteria and the analysis of DBpedia, Freebase, OpenCyc, Wikidata, and YAGO. That work
presented 34 data quality criteria classified into 11 dimensions and 4 categories for evaluating KGs.
      </p>
      <p>
        The framework developed [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] is based on the framework proposed by Zaveri et al., in which a
comprehensive study of existing frameworks was conducted to evaluate the quality of KGs. The authors
selected a set of quality dimensions and their corresponding metrics, which were mapped to 18 quality
requirements and recommended an approach for evaluating each metric considering its feasibility and
scalability.
      </p>
      <p>
        FAIR principles [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] have gained increasing significance in the domain of research data management
and reproducibility. To facilitate the adoption of best practices in FAIR by researchers, the scientific
community has developed self-assessment tools and validation tools that help to assess the FAIRness
of their resource. Self-assessment tools consist of online self-report questionnaires and checklists.
The objectives of the questions are to reflect each of the FAIR principles. The questions are grouped
according to each principle. An example of this kind of tool is Australian Research Data Commons
(ARDC)1 The FAIR Maturity Evaluation Service (FAIR evaluator) [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] is a fully automated tool based on
community-driven eforts to compile discipline specific FAIR maturity indicators (MIs). FAIR Checker
[
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] also utilises the reference FAIR MIs.
1https://ardc.edu.au/resource/fair-data-self-assessment-tool/
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Materials and Methods</title>
      <sec id="sec-3-1">
        <title>3.1. The OQuaRE-KG Framework</title>
        <p>
          OQuaRE-KG is a quality framework for assessing KGs inspired on the OQuaRE framework for ontology
quality [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. OQuaRE is a systematic framework designed to evaluate the quality of ontologies by
adapting the ISO/IEC 25000:2005 SQuaRE standard, for Software Product Quality Requirements and
Evaluation (SQuaRE).
        </p>
        <p>The OQuaRE Quality Model Division addresses the internal and external quality of the ontology by
defining key characteristics and sub-characteristics. The OQuaRE Quality Metrics Division establishes
both base and derived measures for quality evaluation. The OQuaRE quality model consists of 7
characteristics, 39 subcharacteristics, and 19 metrics. Each characteristic has a set of subcharacteristics
associated which, in turn, have a set of metrics associated.</p>
        <p>
          In OQuaRE, the raw scores of the metrics are normalized to quality scores on a Likert scale [
          <xref ref-type="bibr" rid="ref1 ref5">1,5</xref>
          ], with
1 being the lowest quality score, and 5 the highest quality score. Then, a quality score can be calculated
for each subcharacteristic and characteristic by averaging the scores of the diferent associated items.
The detailed specification of OQuaRE can be found at https://github.com/tecnomod-um/oquare.
        </p>
        <p>OQuaRE-KG follows the same design principles and adapts them to the evaluation of the quality of
KGs. The next subsections describe the OQuaRE-KG quality model, whereas the whole framework can
be found at https://github.com/tecnomod-um/oquare-kg.</p>
        <p>The OQuaRE KG framework includes 7 characteristics found in the OQuaRE quality model: structural,
functional adequacy, compatibility, transferability, operability, reliability, and maintainability. Next, we
provide the definition adapted to KGs.</p>
        <p>• Structural: The structural dimension accounts for the intrinsic design and structure of the KG,
independent of the user’s specific context.
• Functional adequacy: Functional adequacy refers to the capability of the KG to provide concrete
functions.
• Compatibility: The ability of a KG to function correctly and be integrated with other KGs, data
sources, or systems by adhering to common technical standards for data representation and
exchange.
• Transferability: The degree to which a KG can be deployed, reused, or adapted across diferent
platforms, domains, or technical environments with minimal modification, preserving its data,
schema, and semantic definitions.
• Operability: The level of efort required by users (e.g., developers, data scientists, domain experts)
to efectively access, query, navigate, and update the information stored within the KG to perform
their tasks.
• Reliability: It refers to the capability of the KG to maintain its level of performance under stated
conditions for a given period of time.
• Maintainability: The capability of a KG to be modified or extended in response to changes in
environments, data sources, requirements, or functional specifications.</p>
        <p>
          The current version of OQuaRE-KG includes 16 quality subcharacteristics. Each subcharacteristic is
associated with one characteristic. Table 1 provides the set of subcharacteristics and their association
with characteristics. Some subcharacteristics have been adapted from OQuaRE, but some have been
adapted from works on Linked Data quality [
          <xref ref-type="bibr" rid="ref10 ref15">15, 10</xref>
          ] or inspired by the FAIR principles [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].
        </p>
        <p>The current version of OQuaRE-KG also includes 28 quality metrics. Each metric is associated with
one or more subcharacteristics. The metrics for the consistency subcharacteristic are shown in table 2.
The description of the metrics, and the associations between metrics and the subcharacteristics, are
also available in the OQuaRE-KG GitHub repository.</p>
        <p>
          OQuaRE-KG scales the values of the metrics into quality scores in the range [
          <xref ref-type="bibr" rid="ref1 ref5">1,5</xref>
          ] (see Figure 1). We
can classify the OQuaRE-KG metrics into three groups based on their range of values:
        </p>
        <p>
          Description
The capability of a knowledge graph’s ontology or schema to
enable reasoning processes
Structural accuracy refers to the correctness of the use of
ontological terms in the graph
Consistency means that two or more entities do not conflict with
respect to knowledge representation and inference mechanisms
Syntactic validity is defined as the degree to which an RDF
document conforms to the specification of the serialization format
Redundancy refers to the degree of avoidance of duplicate
entities or relationships that could confuse the reasoning and
querying of the graph
The extent to which the information within the knowledge graph
is machine-readable, semantically unambiguous and consistent,
enabling automated systems to perform valid reasoning
The degree to which the ontology or schema of a knowledge
graph can be used by reasoners to make implicit knowledge
explicit within the graph
Understandability refers to the ease with which data can be
comprehended without ambiguity and be used by a human
information consumer
Trustworthiness is defined as the degree to which the
information is accepted to be correct, true, real and credible
Provenance refers to the provision of information regarding the
origin of the dataset and of the resources within the dataset
itself
Clustering refers to the degree to which entities representing
the same or closely related concepts are accurately identified,
grouped and linked within or across data sources
Interoperability in knowledge graphs refers to the use of formal,
accessible, and shared ontologies, controlled vocabularies, and
machine-readable formats in the representation of data and
metadata, enabling seamless integration, automated reasoning,
and cross-system understanding
Versatility refers to the availability of the data in diferent
representations and in an internationalized way
Licensing is defined as the granting of permission for a consumer
to re-use a dataset under defined conditions
Accessibility is the degree to which a knowledge graph and its
metadata can be reliably retrieved by humans and machines
through open, standardized protocols, with clear access
conditions and persistent availability of metadata
Reusability is the degree to which components of a knowledge
graph can be efectively used in multiple contexts to build new
systems or enrich other knowledge graphs
• Binary value: They are scaled to 1 or 5.
• Values in the range [
          <xref ref-type="bibr" rid="ref1">0,1</xref>
          ] are scaled [
          <xref ref-type="bibr" rid="ref1 ref5">1,5</xref>
          ] by intervals of 20%.
        </p>
        <p>
          • Values in other ranges are scaled into [
          <xref ref-type="bibr" rid="ref1">0,1</xref>
          ] and then the previous scaling to [
          <xref ref-type="bibr" rid="ref1 ref5">1,5</xref>
          ] was applied.
        </p>
        <p>
          In case high values of the metric are a sign of high quality, those high values are mapped onto
the higher scores in the range [
          <xref ref-type="bibr" rid="ref1 ref5">1,5</xref>
          ]. Otherwise, the high values are mapped onto the lower scores.
Once the quality scores of the metrics are calculated, the quality scores of the subcharacteristics and
characteristics are obtained by averaging the scores of their corresponding associated metrics and
        </p>
        <p>Checks whether classes appear incorrectly in
predicate position or properties appear incorrectly in
object position.</p>
        <p>Assesses correct usage of predicates according to Score from 0 to 1, where 0
owl:DatatypeProperty and owl:ObjectProperty ax- indicates no misused
propioms. Detects erroneous triples where literals are erties.
linked to object properties or entities to datatype
properties.</p>
        <p>Checks if entities in the graph use terms not de- Score from 0 to 1, where 0
infined in any ontology. dicates no undefined terms.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Use case</title>
        <p>
          In this work, we have applied OQuaRE-KG to evaluate gene regulation KGs. In particular, we have used
data from the BioGateway knowledge network available at https://github.com/juan-mulero/cisregEA
[
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. This resource integrates data related to diferent domains of relevance in the biomedical context,
such as proteins, genes and diseases. From the gene regulation perspective, it has a particular focus on
enhancers, which are the most widely studied cis-regulatory modules (CRM) [17]. These sequences were
modelled using the CisReg schema, which was used in BioGateway to integrate data from 25 diferent
databases, modelling information from enhancer sequences and their relations with other entities. For
the evaluation of OQuaRE-KG, we have used five graphs ofered by the BioGateway knowledge network,
which are described in Table 3.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results</title>
      <p>We have calculated the values of the OQuaRE-KG quality metrics for the five aforementioned graphs and
scaled the values into score at the level of metrics, subcharacteristics and characteristics. In this section
we mainly focus on the main results at the level of characteristics and subcharacteristics, whereas the
complete ones are available at https://github.com/tecnomod-um/oquare-kg, including the raw value of
the metrics, and the quality score (scaled value) of the metrics, subcharacteristics and characteristics.</p>
      <sec id="sec-4-1">
        <title>4.1. Characteristics</title>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Subcharacteristics</title>
        <p>In this section we analyse the results of subcharacteristics which exhibit diferences between graphs,
namely, structural and functional adequacy. Regarding the structural category (see Figure 3), all graphs
get the same score for formalisation, structural accuracy, syntactic validity, and redundancy. The
diferences between graphs are exhibited for consistency and interpretability. Further analysis requires
to work at the level of metrics. In this case, one metric is responsible for the diferences:
• Entities with no type metric: this metric is associated with both consistency and interpretability. It
accounts for the ratio of nodes lacking an rdf:type in the graph. The crm and all graphs have the
highest score, whereas the other three graphs exhibit the general pattern observed at the level of
characteristics, that is, crm2tfac and crm2gene have the same score.</p>
        <p>It should be noted that having the same quality score for a metric does not mean having the same
metric value. For example, the value of the metric Entities with no type metric for crm and all are 0.0239
and 0.0756 respectively. Since they are in the interval 0-0.20, they are scaled to the same score (5).</p>
        <p>Regarding functional adequacy (see Figure 4), the diference in quality scores happen for the following
subcharacteristics: inference, trustworthiness and clustering. The pattern is like in the previous cases
except for trustworthiness, where all graphs obtain a quality score 3 except for crm2phen, which gets a
score 1.</p>
        <p>The variation for inference and clustering is due to the entities with no type metric, which reveals as
a distinctive metric for these graphs (see Figures 5, 6). These results are consistent with the fact that
these graphs contain untyped entities, because the definition of these entities is carried out in other
graphs such as crm or gene.</p>
        <p>The diference in trustworthiness is due to the evidence metric, which verifies whether graph’s
assertions have terms for capturing evidence. This is calculated by considering a predetermined list of
annotation properties, which includes properties from Dublin Core (source, references), RDFs
(isDeifnedBy), OWL (sameAs), schema.org (evidenceLevel, evidenceOrigin), and from GO-LEGO (evidence,
evidence-with).</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Discussion</title>
      <p>According to the results of the experiments, the graphs may be classified in three groups by their quality
scores. Group 1 includes crm and all, which have the same and highest quality scores. Group 2 includes
crm2phen and Group 3 include crm2tfac and crm2gene, which also have the same but lowest quality
scores.</p>
      <p>The analysis at the level of characteristics shows that all the graphs have the same quality score
for the characteristics more related to the methodological process, which is an expected and desired
result, independently of the concrete score. That means that the homogeneity is captured by those
characteristics. The diferences are in characteristics more related to the modelling of the data and the
functional adequacy, which may vary more between graphs, since it may have also links to the quality
of the data itself. If we interpret the quality scores for the characteristics, the graphs obtained scores
≥3 for the structural and compatibility characteristics, whereas the scores are ≤3 for the rest of the
characteristics. This information can be used by the developers of the KGs to identify strengths and
weaknesses.</p>
      <p>The analysis at the level of subcharacteristics has shown that the pattern identified at the level of
characteristics. Either all the graphs have the same scores or the same three groups are identified. In
the current version of the framework, five characteristics have only one subcharacteristic associated,
but the structural characteristic and the functional adequacy ones have 6 and 5 subcharacteristics,
respectively. This permits to a more detailed analysis for those characteristics.</p>
      <p>Regarding the structural category, for which we have shown the radar chart, the diference in the
quality scores for the subcharacteristics is due to aspects related to interpretability and consistency.
This can be further inspected at the level of metrics, which reveals that it is due to the ratio of missing
type definitions. These findings can be used by the developers of the knowledge graph to review their
decisions.</p>
      <p>Additional findings come from the analysis of the functional adequacy subcharacteristics. The
evidence metric score for crm2phen is 1, whereas the score of this metric is 5 for the rest of graphs. This
may be due to the nature of the data of this graph. In these graphs, the evidence properties are used
for asserting from which database the data comes from and for associating experimental methods as
evidence. The latter is not applicable to crm2phen, since this information is absent in the original data
sources, and this may justify this diference.</p>
      <p>These results also underscore the relevance of the approach adopted in this work on KGs, since the
schema used for graph modelling can include information that is subsequently present or absent in the
instantiated KGs according to the sources used. Therefore, the metrics and results derived from the
ontology evaluation are not directly transferred to KG quality. The results also suggest that improving
the quality of the graphs would require greater efort in metadata. For example, including licenses and
data formats, such as language, would be helpful.</p>
      <p>Table 4 summarizes the levels of the quality scores of the metrics for each graph. The graphs obtain
a very similar number of metrics with the same quality scores. The same three groups of graphs can be
made based on the values observed in the table. At the levels of characteristics and subcharacteristics,
crm2phen got higher scores than crm2tfac and crm2gene. However, the analysis of the levels of the
metric reveals that crm2phen has more metrics with 1 and less with 5, although the diference is small.
This is another example of how the framework permits to analyse the KGs at diferent levels, which
provide complementary information.</p>
      <p>In the present article, we do not intend to make a strong interpretation of the quality scores, because
we have used a simple scaling function to map metrics values into quality scores. This is a limitation of
the study. Future work should improve the mapping function. For example, our reference framework
for ontology evaluation, OQuaRE, proposed two diferent scaling functions, one static function based
on best practices and another dynamic function based on the distribution of real data [18, 19]. For
this purpose, the Evaluome framework developed by our research group will be used [20]. We will
also carry out further experiments with diferent KGs to optimize the framework, including resources
such as Wikidata [21] or DBPedia [22]. Research will also be done to extend the framework to include
additional aspects that impact on the quality of the KG, such as the quality of the ontologies used
or the quality of the data sources employed for the generation of the KG. Finally, we plan to apply
OQuaRE-KG to perform a metrics-based comparison of KGs developed by traditional processes against
KGs generated by LLMs or automated methods, as done with OQuaRE for ontologies [23].</p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusions</title>
      <p>In this work we have presented OQuaRE-KG, a framework for assessing the quality of knowledge
graphs that follows principles previously applied for the assessment of the quality of ontologies. The
experiments reported in this article show that it is able to detect diferences in the quality scores of
diferent knowledge graphs from the gene regulation domain. Further experiments in diferent domains
will help to define levels of quality based on the scores of metrics, subcharacteristics, and characteristics.</p>
    </sec>
    <sec id="sec-7">
      <title>7. Acknowledgments</title>
      <p>This research is part of the grant PID2024-155257OB-I00 funded by MICIU/AEI/10.13039/501100011033/
and by ERDF,EU (FONDOS FEDER).</p>
    </sec>
    <sec id="sec-8">
      <title>Declaration on Generative AI</title>
      <p>The author(s) have not employed any Generative AI tools.
for the development of interoperable gene regulation knowledge graphs, IEEE Access 14 (2025)
792–814.
[17] J. Mulero-Hernández, V. Mironov, J. A. Miñarro-Giménez, M. Kuiper, J. T. Fernández-Breis,
Integration of chromosome locations and functional aspects of enhancers and topologically associating
domains in knowledge graphs enables versatile queries about gene regulation, Nucleic Acids
Research 52 (2024) e69–e69.
[18] A. Duque-Ramos, M. Quesada-Martínez, M. Iniesta-Moreno, J. T. Fernández-Breis, R. Stevens,
Supporting the analysis of ontology evolution processes through the combination of static and
dynamic scaling functions in oquare, Journal of biomedical semantics 7 (2016) 63.
[19] M. Franco, J. M. Vivo, M. Quesada-Martínez, A. Duque-Ramos, J. T. Fernández-Breis, Evaluation of
ontology structural metrics based on public repository data, Briefings in bioinformatics 21 (2020)
473–485.
[20] J. A. Bernabé-Díaz, M. Franco, J.-M. Vivo, J. T. Fernández-Breis, Optimizing clustering-based
analytical methods with trimmed and sparse clustering, Computers in Biology and Medicine 194
(2025) 110436.
[21] D. Vrandečić, M. Krötzsch, Wikidata: a free collaborative knowledgebase, Communications of the</p>
      <p>ACM 57 (2014) 78–85.
[22] S. Auer, C. Bizer, G. Kobilarov, J. Lehmann, R. Cyganiak, Z. Ives, Dbpedia: A nucleus for a web of
open data, in: international semantic web conference, Springer, 2007, pp. 722–735.
[23] M. Val-Calvo, M. E. Aranguren, J. Mulero-Hernández, G. Almagro-Hernández, P. Deshmukh,
J. A. Bernabé-Díaz, P. Espinoza-Arias, J. L. Sánchez-Fernández, J. Mueller, J. T. Fernández-Breis,
Ontogenix: Leveraging large language models for enhanced ontology engineering from datasets,
Information Processing &amp; Management 62 (2025) 104042.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>C.</given-names>
            <surname>Peng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Xia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Naseriparsa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Osborne</surname>
          </string-name>
          ,
          <article-title>Knowledge graphs: Opportunities and challenges</article-title>
          ,
          <source>Artificial intelligence review 56</source>
          (
          <year>2023</year>
          )
          <fpage>13071</fpage>
          -
          <lpage>13102</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>H.</given-names>
            <surname>Knublauch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kontokostas</surname>
          </string-name>
          ,
          <article-title>Shapes Constraint Language (SHACL)</article-title>
          ,
          <source>W3C Recommendation, World Wide Web Consortium (W3C)</source>
          ,
          <year>2017</year>
          . URL: https://www.w3.org/TR/2017/ REC-shacl-
          <volume>20170720</volume>
          /.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>E.</given-names>
            <surname>Prud</surname>
          </string-name>
          <article-title>'hommeaux</article-title>
          ,
          <string-name>
            <given-names>J. E. Labra</given-names>
            <surname>Gayo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Solbrig</surname>
          </string-name>
          ,
          <article-title>Shape expressions: an rdf validation and transformation language</article-title>
          ,
          <source>in: Proceedings of the 10th International Conference on Semantic Systems</source>
          ,
          <year>2014</year>
          , pp.
          <fpage>32</fpage>
          -
          <lpage>40</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>B.</given-names>
            <surname>Glimm</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Horrocks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Motik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Stoilos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Hermit: an owl 2 reasoner</article-title>
          ,
          <source>Journal of automated reasoning 53</source>
          (
          <year>2014</year>
          )
          <fpage>245</fpage>
          -
          <lpage>269</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>I. O.</given-names>
            for
            <surname>Standardization</surname>
          </string-name>
          <string-name>
            <given-names>ISO</given-names>
            /IEC 25012:
            <surname>2008 (E),</surname>
          </string-name>
          <article-title>Software engineering-software product quality requirementsand evaluation (square)-data quality model</article-title>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M. D.</given-names>
            <surname>Wilkinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumontier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. J.</given-names>
            <surname>Aalbersberg</surname>
          </string-name>
          , G. Appleton,
          <string-name>
            <given-names>M.</given-names>
            <surname>Axton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Baak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Blomberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-W.</given-names>
            <surname>Boiten</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. B. da Silva</given-names>
            <surname>Santos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. E.</given-names>
            <surname>Bourne</surname>
          </string-name>
          , et al.,
          <article-title>The fair guiding principles for scientific data management and stewardship</article-title>
          ,
          <source>Scientific data 3</source>
          (
          <year>2016</year>
          )
          <fpage>1</fpage>
          -
          <lpage>9</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>A.</given-names>
            <surname>Duque-Ramos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. T.</given-names>
            <surname>Fernández-Breis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Stevens</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Aussenac-Gilles</surname>
          </string-name>
          ,
          <article-title>Oquare: A square-based approach for evaluating the quality of ontologies</article-title>
          ,
          <source>Journal of research and practice in information technology 43</source>
          (
          <year>2011</year>
          )
          <fpage>159</fpage>
          -
          <lpage>176</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A.</given-names>
            <surname>Zaveri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rula</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Maurino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Pietrobon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          ,
          <article-title>Quality assessment for linked data: A survey: A systematic literature review and conceptual framework</article-title>
          ,
          <source>Semantic web 7</source>
          (
          <year>2015</year>
          )
          <fpage>63</fpage>
          -
          <lpage>93</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>J.</given-names>
            <surname>Debattista</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Lange</surname>
          </string-name>
          ,
          <article-title>Luzzu-a framework for linked data quality assessment</article-title>
          ,
          <source>in: 2016 IEEE Tenth International Conference on Semantic Computing (ICSC)</source>
          , IEEE,
          <year>2016</year>
          , pp.
          <fpage>124</fpage>
          -
          <lpage>131</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J.</given-names>
            <surname>Debattista</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Lange</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Cortis</surname>
          </string-name>
          ,
          <article-title>Evaluating the quality of the lod cloud: An empirical investigation</article-title>
          ,
          <source>Semantic Web</source>
          <volume>9</volume>
          (
          <year>2018</year>
          )
          <fpage>859</fpage>
          -
          <lpage>901</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M.</given-names>
            <surname>Färber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Bartscherer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Menne</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rettinger</surname>
          </string-name>
          ,
          <article-title>Linked data quality of dbpedia, freebase, opencyc, wikidata, and yago</article-title>
          ,
          <source>Semantic Web</source>
          <volume>9</volume>
          (
          <year>2018</year>
          )
          <fpage>77</fpage>
          -
          <lpage>129</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>H.</given-names>
            <surname>Chen</surname>
          </string-name>
          , G. Cao,
          <string-name>
            <given-names>J.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ding</surname>
          </string-name>
          ,
          <article-title>A practical framework for evaluating the quality of knowledge graph, in: Knowledge Graph and Semantic Computing: Knowledge Computing and Language Understanding: 4th China Conference</article-title>
          ,
          <string-name>
            <surname>CCKS</surname>
          </string-name>
          <year>2019</year>
          , Hangzhou, China,
          <source>August 24-27</source>
          ,
          <year>2019</year>
          ,
          <source>Revised Selected Papers 4</source>
          , Springer,
          <year>2019</year>
          , pp.
          <fpage>111</fpage>
          -
          <lpage>122</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>M. D. Wilkinson</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Dumontier</surname>
            ,
            <given-names>S.-A.</given-names>
          </string-name>
          <string-name>
            <surname>Sansone</surname>
            ,
            <given-names>L. O.</given-names>
          </string-name>
          <string-name>
            <surname>Bonino da Silva Santos</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Prieto</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Batista</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>McQuilton</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Kuhn</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Rocca-Serra</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Crosas</surname>
          </string-name>
          , et al.,
          <article-title>Evaluating fair maturity through a scalable, automated, community-governed framework</article-title>
          ,
          <source>Scientific data 6</source>
          (
          <year>2019</year>
          )
          <fpage>174</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A.</given-names>
            <surname>Gaignard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Rosnet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>De Lamotte</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Lefort</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.-D. Devignes</surname>
          </string-name>
          ,
          <article-title>Fair-checker: supporting digital resource findability and reuse with knowledge graphs and semantic web standards</article-title>
          ,
          <source>Journal of Biomedical Semantics</source>
          <volume>14</volume>
          (
          <year>2023</year>
          )
          <article-title>7</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Zaveri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rula</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Maurino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Pietrobon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          ,
          <article-title>Quality assessment for linked data: A survey</article-title>
          .,
          <source>Semantic Web (1570-0844) 7</source>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>J.</given-names>
            <surname>Mulero-Hernández</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Almagro-Hernández</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. T.</given-names>
            <surname>Fernández-Breis</surname>
          </string-name>
          , Graph alignment methods
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>